搜索资源列表
main
- 统计文档中出现某个人名的频率 适合于词频统-Statistics document the frequency of a person s name for the word frequency statistics
zz
- C程序,统计文本词频, 分动词名词,文本中包涵了词性 可以将名词动词分类输出 。-C program, statistics text word frequency, verb nouns., The text includes the part of speech noun verb classification output.
tongjicip
- 文本分类的程序,讲解详细,内容丰富,利于查找,统计词频-Text classification procedures, explain in detail, rich in content, which will help find the word frequency statistics
Java
- 能实现分词,去除停用词,统计词频的Java的源代码-To achieve segmentation, removal of stop words, word frequency statistics Java source code
WORDS_COUNT
- 统计单词频数,并按频数由多到少输出. 欢迎大家下载阅读-Statistical word frequency, according to the frequency of the output from more to less
smile
- 文件读入英语文章,并统计英语文章各个单词出现的词频。并用生成新的文件进行存储。-The file is read into English articles, and word frequency statistics of each word of the English article. And generate a new file is stored.
parallel
- 并行计算实验代码,分别有计算pi,并行求卷积的两种方法和mapreduce的统计词频-Parallel computing experiment code, respectively, calculated pi, parallel convolution of two methods of statistical, frequency and mapreduce
HOMEWORK_3_1252998_ZhangXueqin
- 统计一个文本文件的词频,并按词典序输出到一个文本里-count the word frequency
WordFrequency
- 文本处理过程中,对词频进行统计,以便于进一步将文本表示成向量-Text processing process, the word frequency statistics, in order to further text representation to a vector
SearsScraper
- 利用java的html分析包jsoup,编的网络爬虫,自动从sear网站上搜寻产品信息并归类,统计词频等。-Java using the html analysis package jsoup, compiled web crawler to automatically search for products on the website from the sear and classified information, statistical, frequency and so on.
wordCount
- python代码,利用hadoop分布式框架处理文本内容重的统计词频问题 -python code, use hadoop distributed framework for handling text heavy question word frequency statistics
lab_3
- 统计txt文件中单词的词频,要输入的txt文件inputdata,和要输出的文件outputdata在工作空间中,程序从inputdata文件中统计词频,并在outputdata中输出统计结果-Statistical txt file word frequency, to enter txt file inputdata, and to the output file outputdata in the workspace, the program files from inputdata wo
Perceptron
- 机器学习中Perceptron算法统计词频-The statistics of words of perceptron algorithm in Machine Learning
ansj_seg-master
- 一个功能非常全面的分词程序,内部有许多测试类可以使用,包含了词频的统计功能在其中,可以-A very comprehensive segmentation procedures, internal classes can use many tests, including word frequency statistics function in which you can see under the next
huffmancode
- 哈夫曼编码是一种常用的数据压缩技术,对数据文件进行哈夫曼编码可大大缩短文件的传输长度,提高信道利用率及传输效率。要求采用哈夫曼编码原理,统计文本文件中字符出现的词频,以词频作为权值,对文件进行哈夫曼编码以达到压缩文件的目的,再用哈夫曼编码进行译码解压缩。 统计待压缩的文本文件中各字符的词频,以词频为权值建立哈夫曼树,并将该哈夫曼树保存到文件中。 -Huffman coding is a commonly used data compression technology, Huffman
Counting
- 该程序用于实现统计词频功能,从文件读取内容,将统计结果输出到文件 -count the frequent of the words a file,and write down the result in a file.
sort
- 利用插入排序和首字母归类统计英文单词的词频,经过一些优化-Use insertion sort and classify the first letter of the English word word frequency statistics, after some optimization
word
- 统计一个txt文本中的词频。使用方法是将要统计的文本文件放在py文件的同一目录下,并根据py文件注释更改文本文件名-calculate the tf within a*.txt
Java
- 实现读取TXT文本文件,并统计文本中出现的单词的词频,并讲输出结果存入到另外一个文本文件中。- Achieve read TXT text files, and word frequency statistics appear in the text, and talk to the other output is stored in a text file.
SplitWords
- 基于lucene的文档分词程序,去停用词,统计词频,计算词的权重-Lucene-based document segmentation procedures, to stop words, word frequency statistics