Resource search results
gmeans
- gmeans: clustering with first variation and splitting. Gmeans is a text clustering algorithm that supports three similarity functions: cosine, Euclidean, and KL. The text data is stored in sparse matrix form.
difference3
- In this paper, we propose a method of text retrieval from document images using a similarity measure based on an N-Gram algorithm.
main. Text similarity calculation program
- A program that calculates the similarity between texts, for use in text clustering. It works from the already-known feature vector of each text and uses the cosine value as the similarity.
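The cosine calculation this entry describes can be sketched as follows. This is a minimal illustration, not the listed program's code; the function name `cosine_similarity` and the choice to return 0.0 for a zero vector are assumptions.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length feature vectors."""
    if len(u) != len(v):
        raise ValueError("feature vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Assumption: a zero vector is treated as having no similarity to anything.
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)
```

Values near 1 mean the two texts point in almost the same direction in feature space, which is why the measure suits clustering texts of different lengths.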
TFIDF
- C# source code for a TF-IDF algorithm that computes text vectors, together with source code for calculating text similarity using cosine distance.
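For orientation, a TF-IDF weighting followed by a cosine comparison might look like the sketch below. It is written in Python rather than C#, and it is not the listed source code: the function names, the dictionary-based sparse vectors, and the particular weighting (term frequency divided by document length, times the natural log of N over document frequency) are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """docs is a list of token lists; returns one {term: weight} dict per document."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    vectors = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def sparse_cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

With this weighting, a term that occurs in every document gets weight 0, so only distinguishing terms contribute to the similarity.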
MyIM
- Source code for a LAN instant messaging program modeled on QQ, with an interface about 90% similar to the original. The package includes both the server and the client. The server uses the SELECT model, text chat is implemented, the database is ACCESS, and offline messages are supported.
RepeatedForms
- Deduplicates by similarity, deleting texts that are very similar to one another. An implementation of a VSM-based algorithm.
63535316jiqishijue
- Video text recognition plays an important role in video analysis and retrieval. This algorithm applies an N-level wavelet decomposition to the subtitle image to obtain the low-frequency component of the video text, then uses a similarity measure to recognize the text.
ssdeep-2.2
- Computes hashes of computer text to judge the similarity between files. Intended for computer forensics.
090211
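- An image texture feature extraction algorithm based on a log-polar transform with refined parameters. The proposed algorithm effectively eliminates the effects of geometric deformations such as rotation, scaling, and translation. During feature extraction, the autocorrelation image removes the effect of translation, and the log-polar transform with refined parameters removes the effects of rotation and scaling. During image retrieval, the feature vectors obtained from an incomplete tree-structured wavelet transform are compared by Euclidean distance to measure the similarity between images. Experiments show an average retrieval accuracy of about 81.05% on geometrically deformed texture images, a better retrieval result than traditional algorithms.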
duibiliangggewenbenxiangtongfou
- Create two text files yourself, then judge how similar their text content is (the degree of duplication). If the words the two files have in common make up 80% of the total word count, the two files are considered duplicates.
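One way to read the 80% rule is sketched below. The entry does not say exactly how the "total" is counted, so this sketch assumes distinct words and divides the shared words by the union of both word sets; the function names and the lowercase, `\w+`-based tokenization are likewise assumptions.

```python
import re


def word_set(text):
    """Lowercase the text and return its set of distinct words."""
    return set(re.findall(r"\w+", text.lower()))


def overlap_ratio(text_a, text_b):
    """Shared distinct words divided by all distinct words in both texts."""
    a, b = word_set(text_a), word_set(text_b)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def is_duplicate(text_a, text_b, threshold=0.8):
    """Assumed rule: texts count as duplicates at or above the threshold."""
    return overlap_ratio(text_a, text_b) >= threshold
```

To compare two files, read each with `open(path, encoding="utf-8").read()` and pass the contents to `is_duplicate`.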
stex
- Performs string match searches across the text files in an entire folder and reports the corresponding similarity.
simalar
- Python-based word similarity analysis. It analyzes several large texts to determine the accuracy of the word similarity judgments given in a test file.
x8p2riyj
- Association rule implementation source code. The source code comes from the Internet; any resemblance is purely coincidental. Rename the Word text file with a doc suffix.
wordsimilar
- Vocabulary classification, similarity calculation, text corpus analysis, categorization, HowNet data classification.
CompareText
- Compares the similarity of two texts or strings using the LD matrix algorithm.
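Assuming "LD" here stands for Levenshtein distance, the matrix approach can be sketched as below. The function names and the normalization of the distance into a 0 to 1 similarity score are illustrative assumptions, not taken from the listed program.

```python
def levenshtein_matrix(a, b):
    """Build the full edit-distance matrix for strings a and b."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete i characters from a
    for j in range(cols):
        d[0][j] = j  # insert j characters of b
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
    return d


def ld_similarity(a, b):
    """Assumed normalization: 1 minus distance over the longer length."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    distance = levenshtein_matrix(a, b)[-1][-1]
    return 1.0 - distance / longest
```

The bottom-right cell of the matrix holds the edit distance; dividing by the longer string's length keeps the score comparable across string pairs of different sizes.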
Character-recognition
- A self-made program that uses a Euclidean-distance-based algorithm to measure the similarity of characters and thereby recognize handwritten text. The development environment is MATLAB. Handwritten characters must first be added to the character library before they can be recognized.
WordSimilarity
- Calculates the similarity of Chinese words based on HowNet. It implements the algorithm from the paper "Word Semantic Similarity Calculation Based on HowNet" (基于《知网》的词汇语义相似度计算).
xsd
- Easy Language (易语言) source code for quickly calculating text similarity. The sample program demonstrates a comparison method for computing the similarity of texts.
Similarity detection
- Can calculate text similarity, in any language.
btm-master
- BTM model, a model for handling short text similarity. Calculates the similarity of short texts.