搜索资源列表
gmeans
- gmeans-- Clustering with first variation and splitting 文本聚类算法Gmeans ,使用了3种相似度函数,cosine,euclidean ,KL.文本数据使用的是稀疏矩阵形式. -gmeans clustering with first variation and splitting Gmeans,a text clustering algorithm, uses 3 functions,cosine,euclidean and
多种分词方法
- 含有多个分词算法。毕业设计的时候获得的,希望对大家在汉字处理中能用的到。 ICTCLAS算法,中科院,对名字识别能力很强。VC开发。 CSharp分词,向前匹配加向后最大匹配,C#开发,容易扩展。 小叮咚分词,由后向前最大匹配,C#开发。 xerdoc分词,基于ICTCLAS的Java版本分词。 文本分词词典,分词的词典,可以提供分词数据源。-containing more than Segmentation. Graduation was the time of hope
TextCategorization
- 基于朴素贝叶斯算法实现的中文文本分类程序。可以对中文文本进行分类识别,使用时先对分类器进行训练,然后进行识别。该Beta版本仅支持对3类文本进行分类,使用简单的中文分词方法,本程序尚不具备实用性,用于算法研究和改进。-based on Bayesian algorithms to achieve the Chinese text classification procedure. Can the Chinese text classification identification, the us
Incremental_Document_Clustering
- 一个增量文本聚类的算法。 参考文献: Wai-chiu Wong, Ada Wai-chee Fu, Incremental Document Clustering for Web Page Classification. -an incremental version of the clustering algorithm. References : Wai-chiu Wong, Ada Wai-CHEE Fu, Incremental Document Clustering for
BPinVB
- vb 编写的bp算法实现代码 包括:神经网络样板 文本格式 可执行神经网络程序-vb bp prepared by the algorithm code include : neural network model text format executable neural network programming
kmeans
- kmeans文本聚类算法的简单实现,是用JAVA实现的-kmeans a simple realization of the text clustering algorithm is implemented using JAVA
Text-Classification-CPP
- 利用贝叶斯算法进行文本分类,用VC++实现的源代码 -Bayesian text classification algorithm, implemented using VC++ source code
svm-EM
- SVM(支持向量机)和EM(最大熵)文本分类算法,压缩包中包括了测试文本(环境类和计算机类),词典,停用词表等。-SVM (support vector machine) and EM (maximum entropy) text classification algorithm, compressed package includes test text (environmental and computer), dictionary, thesaurus, such as disabled.
RepeatedForms
- 根据相似度去重,把文本很相似的删除掉,基于VSM的算法的实现。-According to the similarity to heavy, very similar to the text removed, the algorithm based on VSM realize.
text_classification
- 这是一个文本分类的简单算法,基于weka的,实现文本的分类。-This is a easy classification code.
pj
- 一种基于语义内积空间模型的文本聚类算法,这种算法有很好的效率,拿出来和大家分享下-A semantic-based model of inner product space the text clustering algorithm, this algorithm has good efficiency, and to share out under the
MyCluster
- 聚类算法中的k均值算法,里面已经包含一个文本聚类的实验。-Clustering algorithm k means algorithm, which already contains a text clustering experiments.
shotdetect
- 文本聚类算法的好东西文本聚类算法的好东西-Text clustering algorithm for text clustering algorithm good things good things
supermarket
- 大四上学期做的一个人工智能里有关数据挖掘的课程设计,里面包括数据挖掘算法和数据库文件。。。对于想学好人工智能的非常有用-Last semester to do a senior artificial intelligence in the design of data mining programs, which include data mining algorithms and database files. . . The artificial intelligence is very u
FCM
- 基于遗传FCM算法的文本聚类,FCM算法运用到文本中-Based on genetic FCM clustering algorithm, FCM algorithm is applied to the text
文本分类
- 文本分类算法的详解过程,可以帮助更好的理解文本分类(The detailed process of text categorization)
text_classification.tar
- 用python实现的问题分类算法,包括贝叶斯,svm,决策树,xgboost,对入门文本分类的同学有一定的帮助(text classification algrithom,include svm,dt,xgboost,bayes,that important to learner about text classification)
data_process_xgboost.tar
- 文本分类算法,重点实现了xgboost,支持python3,对文本分类入门同学有一定的帮助。以下是语料格式。前边为文本类型,后边为文本内容 0,千山万水… 5,QQ推广是在QQ演示吗 8,怎么满脸认证?(text classification,include xgboost,is is important to new learner.)
cnn-text-classfication
- 通过机器学习中的卷积神经网络实现对文本进行分类的算法。(The text is classified by the convolution neural network.)
lstm-text-classfication
- 通过机器学习中的长短时记忆单元实现对文本进行分类的算法。(The long short term memory unit realizes the algorithm of classifying text by machine learning.)