搜索资源列表
txt
- 该程序实现文本中总词数的统计,各个词之间用空格隔开,并实现各个词频的统计。-Implementation of the program text, the total number of statistical terms, separated by spaces between each word, and to achieve the various word frequency statistics.
Trie
- Trie,又称单词查找树、前缀树,是一种哈希树的变种。应用于字符串的统计与排序,经常被搜索引擎系统用于文本词频统计-Trie, also known as the word search tree, the prefix tree, is a variant of the hash tree. Applied statistics and sorting of strings, often by the search engine system for text word frequency s
Word-Segmentation
- 分词程序,用于文本分词,可以进行分词,统计词频-Segmentation procedure, used for text segmentation
lex_new
- c语言词法分析,判断是关键字,还是其他的,并且统计词频-c language lexical analysis to determine the keyword or the other, and word frequency statistics
word
- 单词的词频统计 基本功能要求: (1) 从文件中读出一篇英文文章,将不同的单词存入堆中, 建立相应的索引表确定各单词在堆中存储的位置及出现的次数。 (2)按出现的频率递减的次序输出。-The basic functions of word frequency statistics requirements: (1) read from a file an article in English, into the heap of different words, to establ
Static_words
- 词频统计 词频统计 词频统计 词频统计-Statistical word frequency word frequency word frequency word frequency statistics statistics statistics statistics word frequency word frequency word frequency statistics Statistics
javaTermFreq
- java中对英文分词后,去词干后进行词频统计的代码-stastic the frequency of english words after steming and looking up stopword list
word-statics
- 词频统计是一个重要的工具,你可以统计文章中词出现的频率来真正实现每个词的含义-word statics,you can use the soft ware to check the number of the article.such as a,b.c.d.
obtaining-statistics-programs
- 建模\有用\词频统计程序.doc,高效有序,读取文件,可以自己添加文件-Modeling, useful, obtaining statistics programs. Doc, disciplined, read documents, can add your own files
tfidf
- TF-IDF算法,用于统计词频,并找出关键字,以及计算出权重值。-TF-IDF algorithm, used for statistical word frequency, and find out the key, and calculates a weight value.
editpad
- 文本文件编辑器,具有词频分析功能,此外还具有文件菜单和编辑菜单常用的操作功能。-Text file editor with word frequency analysis, in addition to the operation of the file and edit menus.
dictionarytree
- 又称单词查找树,Trie树,是一种树形结构,是一种哈希树的变种。典型应用是用于统计,排序和保存大量的字符串(但不仅限于字符串),所以经常被搜索引擎系统用于文本词频统计-Also known as word search tree, the trie is a tree structure, which is a variant of the hash tree. Typical applications are used for statistical, sort and save a lot
mian
- 又称单词查找树,Trie树,是一种树形结构,是一种哈希树的变种。典型应用是用于统计,排序和保存大量的字符串(但不仅限于字符串),所以经常被搜索引擎系统用于文本词频统计。它的优点是:利用字符串的公共前缀来节约存储空间,最大限度地减少无谓的字符串比较,查询效率比哈希表高。 -Text of obtaining statistics its advantage is: use the public to save string prefix storage space, the maximum r
UserTree-Map
- 针对特定的英文文本,计算其中每个单词的词频。-Count the frequency of words in certain english text.
CountOccurenceOfWords
- 对指定文本中的单词进行处理,输出文本中所有单词按照首字母顺序排列的结果,并对每个单词输出其出现的词频。-Output the words in a certain english text in alphabet sequence and count the frequency of each word.
Main2
- 对指定的文本文件进行处理,计算其中所有单词的词频,并按照词频高低依次输出。-For certain text file,count the frequency of each word and output the outcome in order of frequency in decrease order.
English-word-frequency-statistics
- 很实用的英文单词的词频统计程序简单实用的matlab的版本-Very useful English words, word frequency statistics program is simple and practical Matlab version
KeyWordGet
- 提取关键词程序,中文信息处理,能够提取一段正文的关键词及词频-Keyword Extraction
The-code-of-VSM-java
- 利用java代码实现向量空间模型,通过词频,文档频率计算相似度的值。-Using the language to get the value of VSM
Complete-Training-of-TC
- 用贝叶斯模型实现文本分类,;里面包含分词,词频统计,去除停用词等模块,目前完成的是分类的训练阶段。-realize text categorization by using the NaiveBayes Model