搜索资源列表
Hadoop
- 使用hadoop开发,可以对输入文件中出现的关键词统计词频并进行不同文本词频统计高低的排序,本代码需要用户自行定义关键词和输入文件-Use hadoop development, can appear in the input file keyword statistics word frequency and low frequency statistics different sort of text, the code requires a user-defined keywords an
keyword_find
- 实现了将pdf转换为txt,并且进行分模块的关键词抽取算法-Realized convert pdf to txt, and dividing module keyword extraction algorithm
tfidf
- TFIDF algorithm to achieve, through the input data sets and keywords to get each keyword corresponding weights to achieve text classification TFIDF算法的实现,通过输入数据集和关键词来获取每个关键词对应的权重,实现文本分类-TFIDF algorithm to achieve, through the input data sets and
