搜索资源列表
Hadoop
- 使用hadoop开发,可以对输入文件中出现的关键词统计词频并进行不同文本词频统计高低的排序,本代码需要用户自行定义关键词和输入文件-Use hadoop development, can appear in the input file keyword statistics word frequency and low frequency statistics different sort of text, the code requires a user-defined keywords an
suanfayushujujiegou
- 算法与数据结构基础程序,最后一个课程设计是一篇文章的词频统计-data structure
0896170052
- 实现指定路径下的所有文件词频统计,基于命令行的。文件包括cpp和release版的exe-The achievement of designated trails of all documents under the frequency statistics, based on the command line. Documents include cpp and release version of the exe
THULAC_lite_java_v1
- 中文文本分词 词频统计,分词,去掉停词。 仅支持UTF-8编码-Chinese text segmentation To get the word frequency, word segmentation, remove stop words. Support only UTF-8 encoding
SumWords
- 数据结构大作业的成果,词频统计程序,JAVA语言写的一个类,可以用来统计一段文本中各个单词出现的次数。-Data structure operation results, statistical procedures, a kind of JAVA language, can be used to the number of each word of a text in statistics.
My_TDIF2
- Mapreduce实现的TF-IDF词频统计分析,可以直接运行于HADOOP环境下-Analysis of TF-IDF statistical Mapreduce to achieve, can be directly run in HADOOP environment
WordCount
- 关于storm的词频统计,是学习storm打好基础的第一步-Word Frequency Statistics of Storm Is to learn the storm to lay the foundation of the first step
dictionary1
- 采用python的字典功能,通过读取文件,获得文件中的词频统计。-count the words in a file by the dictionary function in python
wordcount-project
- 用JAVA语言编写的词频统计以及根据词的共现确定文章关键词的程序-JAVA language word frequency statistics and procedures in accordance with article keyword co-occurrence of the word is determined
WordCount
- 基于Hadoop的词频统计并行计算,输入输出控制在readMe.md中- U57FA u4E8EHadoop u7684 u8BCD u9891 u7EDF u8BA1 u5E76 u884C u8BA1 u7B97
svmcls
- 李荣陆老师做的文本分类器,特征选择方式包括全局和按类别选取,概率估算方法支持基于文档(布尔)统计和基于词频统计,支持三种特征加权方式,特征评估函数包括信息增益、互信息、期望交叉熵、X^2统计,文本证据权重,右半信息增益,分类方法包括支持向量机SVM和K近邻KNN,(text classifier that was written by Li Ronglu)
词频分析
- 通过关键字的查找,统计出某篇文档中相应关键字的个数,以及关键字的排序(word frequency analysis)
EMR
- 使用的是贝叶斯算法,进行文本的分类和词频统计(Using the Bayesian algorithm, the text classification and word frequency statistics)
单词词频
- 初学者课程设计,能够统计一个句子的词频。(A beginner's course design can count the word frequency of a sentence.)
课程设计作业
- 用分词包进行分词,并通过分词统计每个词频出现次数(use to seperate an article, and use the dictionary to find the frequency of each word)
jiebacut.py
- 通过结巴分词处理中文分词问题【对文本进行分词以及词频统计处理】。(The problem of Chinese participle is dealt with by the branch word segmentation.)
情感分析用词语集·知网hownet词典
- 该软件可以实现微博分析、聊天分析、全网分析、网站分析、浏览分析、分词、词频统计、英文词频统计、流量分析、聚类分析等一系列文本分析(The software can realize micro-blog analysis, chat analysis, whole network analysis, website analysis, browse analysis, word segmentation, word frequency statistics, English word freque
wordseg
- 运用R语言进行中文分词处理,得到词频统计,并绘制词云图直观表示(Chinese word segmentation and word cloud drawing)
project04
- 统计中英文字幕的英文单词数,也能够统计词频(Count the number of English words in Chinese and English subtitles, and also count the word frequency)
哈夫曼编码与字母频率统计
- 统计词频并构建哈夫曼树,左边为0,右边为1。(HuffmanTree ,calculate every the number of every zee and using '1' and '0' create a tree and printf the string.)