搜索资源列表
KeyWordTree
- 一个c#编写的中文分词系统,有一个附带的baidu词库-fen ci c#
fenci
- 正向最大匹配算法分词,采用容器储存字典可提高一定的效率。-Forward Maximum Matching algorithm for word segmentation, using containers for storage dictionaries can improve a certain efficiency.
UseHLSSplit(Fix)
- 中文分词处理,delphi调用海量智能分词库,修改了网上另一个版本的错误。-Chinese word processing, delphi call the massive intelligence points thesaurus, revised the online version of the error to another.
KWICcase
- 实现关键词检索的一个程序,中文分词,自动检索-To achieve keyword search of a program, Chinese word segmentation, automatic search
111
- 中文分词 词库 分次字典中文分词 词库 分次字典- IHTMLDocument3* pHTMLDoc3 HRESULT hr = m_pHTMLDocument2->QueryInterface(IID_IHTMLDocument3, (LPVOID*)&pHTMLDoc3) ASSERT(SUCCEEDED(hr))
www.myworld.net.cn
- 客采集系统是由工作在顶级门户网站的几名资深高级工程师利用爬虫技术(蜘蛛机器人,spider)、分词技术和网页萃取技术,利用URL重写技术、缓存技术,使用PHP语言开发的一套能根据设置的关键词自动抓取互联网上的相关信息、自动更新的WEB智能建站系统。利用 博客采集系统-Customer acquisition system is working in top-level portal site crawler technology, the use of several senior engine
WordSegTest
- 该软件的主要功能是可以帮助我们进行中文的分词和对一些词性的分析。-this software is used for Chinese sorted.
max_pro_seg
- 详细介绍了最大概率分词方法的理论和步骤, 帮大家轻松搞定简单的中文分词-the word to introduce the max propability int segmenting
fenci
- 一个简单的基于词典分词的程序,lucene的分词程序不少,但有时候并不需要复杂的功能,只是需要简单的根据指定的词典分词。代码简单,可以作为学习参考-A simple dictionary-based word process, lucene procedures for sub-word a lot, but sometimes does not require complex functions, but only require a simple dictionary word accord
WindowsApplication1
- 处理的对象是:完成分词和词性标注的语料,实现的结果是:统计出现词频完成降序排列。-Dealing with the object are: the completion of word segmentation and POS tagging of the corpus, the results achieved are: the completion of word frequency statistics appear in descending order.
zdfc
- 对中文语句进行分词的一个源码,利用正向匹配算法进行处理-Statements for the Chinese word for a source
IKAnalyzer3.1.1_userguide
- java分词程序,能够精确分词,包含词库等-java word program, word accurately, including the thesaurus, etc.
Dictory
- 用B-树作为查找存储结构,对中文单词进行哈希,用最长逆向匹配算法进行中文句段分词,实现中文词典以及分词。中文词典规模为十万八千多词汇。分词的正确率在90 以上。-Use the B-tree as storage structure , and hash the Chinese word while storing or searching. Use the longest reverse matching algorithm to split Chinese sentence to word
WordPartation2
- 中文分词程序 利用最大匹配算法 支持GB2312编码格式的文件-Chinese word segmentation procedure using the maximum matching algorithm to support GB2312 encoding format of the file
yinwenfenci
- 英文分词,根据字典的词对英文无空格的词序列进行分解-The English word, according to word dictionary of English words without spaces in the decomposition sequence
SegDll
- windows 编程 mfc 实现的分词程序-windows programming mfc program to achieve the sub-word
HLDLL
- C#调用天津海量分词进行分词,调用DLL进行实现-C# call Tianjin massive word for word, call the DLL to achieve the
YurowAnalyzer_001
- 中文分词软件,.netframework3.5-Chinese word segmentation software,. Netframework3.5
WordFrequencyStatistics
- 字频统计,根据词库 实现汉字分词,效果不错; 另外还能够统计出汉字词出现的频率;-Word frequency statistics, according to Chinese characters thesaurus, word, In addition to the Chinese characters of statistics.
IKAnalyzer3.1.1StableAllInOne
- Lucene 中文分词,很好的 可以随便下压,加油-Lucene Java