搜索资源列表
gmeans
- gmeans-- Clustering with first variation and splitting 文本聚类算法Gmeans ,使用了3种相似度函数,cosine,euclidean ,KL.文本数据使用的是稀疏矩阵形式. -gmeans clustering with first variation and splitting Gmeans,a text clustering algorithm, uses 3 functions,cosine,euclidean and
difference3
- In this paper, we propose a method of text retrieval from document images using a similarity measure based on an N-Gram algorithm.
simtexth
- text Similarity Source Code for Linux.
English-sentence-sim
- 英文文本的相似度计算,分别从词形、词序、词义等进行权重计算,得到相似度结果-English text similarity calculation were re-calculated from the word form, word order, meaning, etc. right, the similarity results
IS
- It s tf/idf track :) based on text similarity
wenbenleiju
- 基于文本相似度计算的文本聚类算法研究与实现,这是中文信息处理的重要分支。-The text clustering algorithm based on text similarity computing research and implementation, this is an important branch of Chinese information processing.
Program1
- 这是中文信息处理的分词算法实现,具有很高的正确率和使用价值!这是分词算法的核心内容!具有很高的参考价值。-The text clustering algorithm based on text similarity computing research and implementation, this is an important branch of Chinese information processing.
SimHash
- simhash算法的使用,可以进行网页去重,文本的相似度计算等-Use simhash algorithm can go heavy pages, such as text similarity calculation
TextSimilarity
- 余弦幅度法计算两个向量的相,求余弦相似度 -Consine Text Similarity
TextSimilarity
- 文本相似度计算程序,有图形界面,基于向量-text similarity
Kl
- 计算文本相似度,输出文本间KL距离,JS距离-Calculate the text similarity, the KL distance and JS distancetext of text。
CosineSimilarAlgorithmzf
- 这里会用到TF/IDF权重,用余弦夹角计算文本相似度,用方差计算两个数据间欧式距离,用k-means进行数据聚类等数学和统计知识。-Here will use the TF/IDF weight, with cosine angle calculation of text similarity, with the variance of the two data between the data of the European distance, with K-means data cluste
WIP3
- Kaggle竞赛“Can your AI smarter than a 8th grade student?”的代码。 使用文本相似度计算的方法,对美国八年级学生的科学考试卷(4选1选择题)进行自动回答。-Kaggle contest Can your AI smarter than a 8th grade student? Code. The method of using text similarity calculation, the US eighth-grade stude
DocDistance
- java实现的文本相似度系统,使用向量空间模型以及余弦相似度距离公式,实测可以实现2篇文本的相似度计算且有一定的效果。-Java text similarity system, using the vector space model and the cosine similarity distance formula, the measured results can be achieved two similarity of text and have some effect.
java-string-similarity-master
- code for finding similarity between programs the text that has been entered so can find similar items
xsd
- 易语言快速计算文本相似度源码例程程序演示了文本相似度的对比计算方法。 -Easy language to quickly calculate the similarity of the text source routine procedures to demonstrate the text similarity calculation method.
VSM
- 利用向量空间模型对两个文本的相似度进行计算的Java代码-The use of vector space model for the two text similarity calculation of the Java code
文本查重
- 类说明: 名称:Contrast 描述:用于两文本进行各种方法的相似度对比。 相似度对比方法: 1.EditDistance编辑距离 2.CosineSimilarAlgorithm余弦定理 3.JianDanMoHu模糊匹配 4.综合对比,三种方法皆对比一遍,取平局值 方法:String getDegree(文本1,文本2,使用方法id)返回值为:字符串型的,相似度百分比(Class descr iption: Name: Contrast Desc
相似度检测
- 可以计算文本相似度,任何语言!!!!!!!!!!!!!!!(Can calculate text similarity, any language!!!!!!!!!!!!!!!!!)
btm-master
- BTM模型,短文本相似度的处理模型,计算短文本相似度(BTM model, processing model of short text similarity)