Search resource list
gmeans
- gmeans: clustering with first variation and splitting. Gmeans is a text clustering algorithm that supports three similarity functions: cosine, Euclidean, and KL. The text data is stored as a sparse matrix.
countTexture
- Computes the similarity of image sub-regions; the image pixel data must first be saved to a text file.
072832
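- This article describes how to evaluate the relevance between a query and a text under the vector space model, using two different similarity measures.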
simalar
- Python-based word similarity analysis: analyzes several large texts to measure how accurate the word similarity judgments given in a test file are.
knn
- Text classification with the KNN method: computes document similarity to find the K documents closest to a given document.
wordsimilar
- Word classification, similarity computation, text corpus analysis, categorization, and HowNet data classification.
wenbenxiangsidujisuan
- Code for a text similarity calculation tool. This algorithm is much needed when building search engines, and it is a useful reference for anyone who wants to develop applications in this area.
similarity
- Cosine similarity code for computing text similarity, intended for use in search engines.
xsimilarity-master
- A summary of several text similarity methods, offered for reference in the hope that it is helpful.
TextSimilarity
- Vector-based text similarity calculation program with a graphical interface.
Kl
- Computes text similarity and outputs the KL distance and JS distance between texts.
CosineSimilarAlgorithmzf
- Draws on mathematical and statistical techniques: TF/IDF weights, the cosine of the angle between vectors to compute text similarity, the variance to compute the Euclidean distance between two data items, and k-means for data clustering.
WIP3
- Code for the Kaggle contest "Can your AI smarter than a 8th grade student?". It uses text similarity calculation to automatically answer questions from US eighth-grade science exams (multiple choice, one answer out of four).
DocDistance
- Text similarity system implemented in Java, using the vector space model and the cosine similarity formula. In testing it computes the similarity of two texts with reasonable results.
xsd
- Easy Language (易语言) source example for fast text similarity computation. The routine demonstrates how to compare texts and compute their similarity.
English
- Contains the original English documents plus the documents produced by each text preprocessing step: special-symbol removal, tokenization, stemming, and similarity computation. The set covers 500 English documents in total.
Chinese
- 500 Chinese documents collected with a crawler for text preprocessing, including the word segmentation output, the output after special-symbol removal, and the final similarity computation.
EnglishChuLi
- Text preprocessing program written in Python, with implementation code for every step: punctuation removal, stop-word removal, similarity computation, PCA dimensionality reduction, clustering, and visualization. It was developed with Python 3 and runs in PyCharm.
文本相似度计算方法研究综述.pdf
- Text similarity; semantic similarity; ontology; bag-of-words model; neural network; literature review
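Python Chinese text preprocessing
- Includes punctuation removal, word segmentation, stop-word removal, similarity computation, and text clustering.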
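
Several of the resources above combine the same few steps: remove punctuation, split into words, drop stop words, then compute cosine similarity. As a rough illustration only, and not the code of any listed resource, those steps might be sketched as follows. The stop-word set, the function names, and the use of whitespace splitting (in place of real Chinese word segmentation) are all assumptions made for the example.

```python
import math
import string
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "on"}


def preprocess(text):
    """Lowercase, strip ASCII punctuation, split on whitespace, drop stop words."""
    cleaned = text.lower().translate(str.maketrans("", "", string.punctuation))
    return [tok for tok in cleaned.split() if tok not in STOP_WORDS]


def cosine_similarity(tokens_a, tokens_b):
    """Cosine of the angle between two term-count vectors."""
    a, b = Counter(tokens_a), Counter(tokens_b)
    # Only terms present in both documents contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document has no direction to compare
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    doc1 = preprocess("The cat sat on the mat.")
    doc2 = preprocess("A cat sat on a rug.")
    print(cosine_similarity(doc1, doc2))
```

Raw term counts keep the sketch short; replacing them with TF/IDF weights, as some of the listed resources do, would change only how the two vectors are built.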