搜索资源列表
data_mining
- 这是一个R的程序 文本挖掘的作业,用来统计新闻类别再进行分类 -This is an R program text mining operations for Statistical News category then classified
NaiveBayes-master
- 对文本信息进行分类,训练和学习,利用朴素贝叶斯算法实现。-Text information on the classification, training and learning, with Naive Bayes algorithm.
Enhancedtextmining
- 强化版本文本挖掘流程,包含分词,分类聚类,分词结果评估等-Enhanced version of the text mining process, including word segmentation, classification clustering, segmentation results uation, etc.
Bias_algorithm_java
- 贝叶斯算法java实现,在贝叶斯算法思想基础上做改进,提供文本分类效率-Bias algorithm java implementation, based on the idea of Bias algorithm to improve the efficiency of text classification
InfoRetri
- 基于朴素贝叶斯的文本分类,包含去停用词,分词,特征提取,分类等-Text classification, based libsvm, included to stop words, segmentation, feature extraction and classification
pyspark_process
- 使用pyspark进行文本分类算法实现,其中使用了tf-idf表示-Use pyspark text classification algorithm, which uses the tf-idf representation
kNN
- 使用python编写kNN算法,包括生成数据集,简单分类器,文本转换等简单算法。-Using python write kNN algorithms, including generating a data set, a simple classification, text conversion simple algorithm.
tfidf
- TFIDF algorithm to achieve, through the input data sets and keywords to get each keyword corresponding weights to achieve text classification TFIDF算法的实现,通过输入数据集和关键词来获取每个关键词对应的权重,实现文本分类-TFIDF algorithm to achieve, through the input data sets and
fenciledebeiyesi
- 中文文本分词系统+基于贝叶斯算法的文本分类源码,用matlab实现。-Chinese word segmentation system+ based on Bayes text classification source code, using matlab implementation.
文本深度挖掘
- 用于分析文档,分析情感指数,正负面情绪,及新闻分类(Used to analyze documents, analyze sentiment, positive and negative emotions, and classify news)
专利文本分类
- 对大量的专利文本文件进行处理,然后利用机器学习的算法进行挖掘,实现对专利的分类(A large number of patent text files are processed, and then machine learning algorithm is used for mining.)
