Search results: resource list
ParseWord
- Segments Chinese text using the leftmost-maximum (forward maximum matching) method from natural language processing.
Chinesewordsegmentationprocedureforthecarving
- A Chinese word segmentation program.
fenci
- PHP implementation of the reverse maximum matching algorithm for Chinese word segmentation, from DedeCms.
WordSegmentation
- A Java-based word segmentation program; fairly fast and fairly accurate.
beiyes
- A Chinese word segmentation algorithm based on Bayesian network probabilities; a probability-based approach to segmentation.
KaiYuanIctclas
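- Open-source ICTCLAS code for Chinese text processing. It performs Chinese word segmentation and part-of-speech tagging for information extraction, with a stated accuracy above 85%.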
WordSegment
- A very simple command-line Chinese word segmentation program; debugged in Visual Studio 2008, with a test document included.
chinese_word_segment
- Fairly simple C++ code for Chinese word segmentation, supplied as a Word document. The C++ code was debugged in Visual Studio 2008 and comes with a detailed description.
Al_Chinese_Word_Segment
- Concise but far from trivial Chinese word segmentation code. The author has clearly thought the problem through and provides very detailed explanations; worth studying.
Segmentation
- A Chinese word segmentation program based on an HMM, implemented in C#.
src
- A simple search engine built with Lucene that supports Chinese word segmentation.
qygl
- A shared Lucene component for Chinese word segmentation that neatly wraps the segmentation step needed in search engine development.
word_split
- A word segmentation program based on reverse maximum matching; the corpus is fairly small.
WordSegment
- A Chinese word segmentation program based on string matching, C++ version. Results are written to a file.
FreeICTCLAS
- Word segmentation: one of the better-known open-source segmenters in China. ICTCLAS was written by two professors at the Chinese Academy of Sciences and is a classic.
worddivision
- A file I wrote on an intelligent word segmentation algorithm; essential for anyone who wants to study search engines.
tztqjsydm
- Bayes' formula, text classification, and Chinese word segmentation; developed in VC++, practical and convenient to use and to build on.
softwarecode
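- Chinese word segmentation is a key step in Chinese information processing. It is widely used in machine translation, text retrieval, speech recognition, proofreading, artificial intelligence, and search engine technology. The choice of segmentation algorithm, the way the Chinese lexicon is built, and the completeness of its entries all bear heavily on the performance of a segmentation system.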
imdict-chinese-analyzer
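- imdict-chinese-analyzer is the intelligent Chinese word segmentation module of the imdict intelligent dictionary. Its algorithm is based on the Hidden Markov Model (HMM). It is a Java reimplementation of the ICTCLAS Chinese word segmentation program from the Institute of Computing Technology, Chinese Academy of Sciences, and can directly provide Simplified Chinese segmentation support for the Lucene search engine.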
SearchEngine
- A custom web search engine developed with C# and Lucene.Net. The project implements word segmentation and fuzzy indexing, which together with Lucene.Net's internal core functionality make up the search mechanism.
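
Several entries above name forward (leftmost) and reverse maximum matching without showing what the technique does. The following is a minimal Python sketch of both, not the code from any listed package. The function names, the `max_len` parameter, and the four-word sample dictionary are assumptions made up for illustration.

```python
def forward_max_match(text, dictionary, max_len=4):
    """Scan left to right, always taking the longest dictionary word."""
    tokens = []
    start = 0
    while start < len(text):
        # Try the longest candidate first and shrink until a word matches.
        for size in range(min(max_len, len(text) - start), 0, -1):
            candidate = text[start:start + size]
            # A single character is always accepted as a fallback token.
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                start += size
                break
    return tokens


def reverse_max_match(text, dictionary, max_len=4):
    """Scan right to left, always taking the longest dictionary word."""
    tokens = []
    end = len(text)
    while end > 0:
        for size in range(min(max_len, end), 0, -1):
            candidate = text[end - size:end]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                end -= size
                break
    tokens.reverse()  # tokens were collected from the end of the text
    return tokens


# Hypothetical toy dictionary, for illustration only.
SAMPLE_DICT = {"研究", "研究生", "生命", "起源"}

print(forward_max_match("研究生命起源", SAMPLE_DICT))  # ['研究生', '命', '起源']
print(reverse_max_match("研究生命起源", SAMPLE_DICT))  # ['研究', '生命', '起源']
```

On this sample the forward scan greedily takes 研究生 and strands 命, while the reverse scan recovers 研究 / 生命 / 起源, which is one reason reverse matching is often preferred for Chinese.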