搜索资源列表
reuters21578.tar
- 一个著名的文本分类数据集,用于测试分类器的性能。是写论文的同志不可或缺的东西。-A famous dataset for Text Classification, which is essencial for thesis writing.
bow-20020213
- 卡内基梅隆大学MaCallum开发的文本分类系统,可方便在其中嵌入自己的模块-text categorization system developed by maccallum of cmu
tztqjsydm
- 贝叶斯公式,文本分类,中文分词,VC++开发,方便实用和开发-beyes,text classify
reuters21578
- 这是一个英文的语料库,可以用于进行文本的分类与聚类。是文本分类领域共用的一个语料库。-This is a corpus of English, can be used for text classification and clustering. The field of text classification is a common corpus.
tc-corpus-train
- 语料库训练集 , 适用于文本分类中的训练-ts-corpus-training
SupportVectorMachinesTextClassification
- 支持向量机(support vector machine)用来解决复杂的文本分类问题-support vector machine& text classification
Test Class By SVM
- 支持向量机实现的文本分类程序,过程如下,首先使用分词工具分词,这里使用的是计算所的分词工具,从而保证分词是最优秀的,接下来使用国际效率最高的文本IFIDF向量生成工具生成文本相量,最后使用台湾林智恒的效率最高的SVM实现软件包libsvm实现训练和分类,可以这么说,该文本分类是同类中效率最高最准确的-text classfication source code use 3 technology.words sementation,vector gerneration,and libsvm too
TestICTCLAS
- 文本挖掘,文本分类源代码.包括贝叶斯分类,信息抽取以及抽取之后的关联规则挖掘等功能-source code of text mining and text classification
SupportVectorMachine
- 支撑状态向量机matlab代码,用于文本分类等-Support state vector machine matlab code for text classification, etc.
WebAutomaticTextClassificationTechnologyResearch.r
- Web自动文本分类技术研究综述 基于WEB的文本分类技术论文-Web Research on automatic text classification techniques for text classification based on technical papers WEB
libsvm-mat-2.9-1
- libsvm工具箱,用于分类的绝佳工具,也可用于非线性回归及预测,或拟合,其中文本分类是其长项,回归性能非常好-libsvm toolbox, an excellent tool for the classification can also be used for non-linear regression and forecasting, or fitting, of which text categorization is its long entry and return to a ve
wenben
- 文本分类文本的选择似乎是根据内容来的, 而非一般所采用的语体分类-Text classification is based on the text of the choice seems to be content, rather than the general language used in Classification
Bayesianclassifierfortextclassificationalgorithms.
- 用于文本分类的朴素贝叶斯分类算法,包括代码和测试数据-Naive Bayesian Text Classification for classification algorithms, including the code and test data
text_data_mining
- java编写的数据挖掘方面的代码,里面包含有文本分类,作者身份识别方面的java源码,本人亲自参与编写-java code about data mining;include:text cluster ,authorship identification,
naive_baysian_classify
- 朴素贝叶斯公式文本分类 把一片文章读入一个矩阵 分别计算每个词对应训练网络出现的概率 -Bayesian text classification to an article in the formula to read a matrix were calculated for each word corresponds to the training network the probability
PLStextclass
- 基于PLS的文本分类技术研究,和潜在语义索引联系密切,研究文本分类中特征抽取的重要参考。-PLS-based text classification technology, and closely linked to latent semantic indexing, feature extraction of text classification an important reference.
Wordfrequencystatistics
- 对英文文章的单词进行统计词频 并输出 主要应用文本分类中的对文章的处理-Word article on the English word frequency and the output of the main statistical application of text categorization of the articles deal with
bb
- 中文文本分类相关算法的研究与实现,介绍文本分类方法-Chinese text classification research and implementation of related algorithms, text classification introduced
NBClassify
- 人工智能。基于朴素贝叶斯的文本分类器,测试正确率较高。-Artificial intelligence. Naive Bayes text classification based on, test accuracy is higher.
11MyClassify
- 用中科院的分词系统实现文本分类,文本分类的方法为K-With the CAS system to achieve the sub-word text classification, text classification method for KNN