搜索资源列表
SplitWord_Java
- java制作的中文分词DLL文件,是根据中科院中文分词系统C++改写的-produced by the Chinese word DLL files, under the Chinese Academy of Sciences is the Chinese word rewrite the C system
SQLET_split
- 另外一个中文分词程序,采用的是可显示的字典,可供大家分析-another Chinese word segmentation procedures, the show is the dictionary for our analysis
segmentor_Perl
- 中文分词算法。Perl语言编写。wordlist.txt为词库。-Chinese Segmentation. Perl language. Wordlist.txt for the thesaurus.
xerdoc
- 这些都是关于中文分词的,一定会对很多人有帮助的!-these are on the Chinese word, and you will help a lot of people!
SplitWord
- 中文分词组件,?形姆执首榧?中文分词组件,-Chinese word components, the Chinese word components, the Chinese word components,
propsource
- 这是句法分析系统的原代码,可以用于人工智能的各各方面,比如输入法、段词分词程序的进一步补充、语音识别等各方面!-This is the syntactic analysis system source code, which can be used across a diversity of artificial intelligence, such as the input method, word of the Word of the procedures further added, vo
MySeg
- 最短路径法分词程序。将中文句子经过原子切分后生成一个有向无环图,然后使用Dijkstra算法求出由起点到终点的最短路径以实现中文分词。-shortest path method participle procedures. Chinese Sentence will be read after splitting atoms generated a directed acyclic graph. then use the Dijkstra algorithm derived from the s
findkey.c
- 此程序解决的问题:较好的, 并适应短字符串的中文分词算法.根据词库 发现以换行符分隔的众多标题中的 top N 关键字并以此更新词库.是一个分类分词算法 -this procedure to solve the problem : better, and adapt to the short string of Chinese Segmentation. According thesaurus found in the many separate newline heading the to
5271615762
- 中文分词技术 从别的网上摘的 感觉还不错 请大家-Chinese word technology from other online pick feeling quite well please try
ictclas10
- 基于中科院的ICTCLAS实现中文分词系统 开发工具是JAVA.经测试,效果很好.-ICTCLAS based on the realization of the Chinese Academy of Sciences Chinese word segmentation system is the Java development tools. Tested, good results.
Win32Cut
- 分词程序,Win32窗口界面程序,含设计文档,具有打开文档,显示分词结果,保存结果等功能,欢迎讨论。- The participle procedure, the Win32 window contact surface procedure, contains the design documents, has opens the documents, demonstrated the participle result, preserves function and so on resu
wordppl
- 本程序采用正向 逆向最大匹配才实现汉字分词-the procedures being used in reverse to get the maximum matching Chinese Word
framework
- 基于动态规划的中文分词程序,用vc写的,便于扩展。-based on dynamic programming of the Chinese word segmentation procedures using vc write, easy expansion.
seg_greedy
- 对大量文本进行分词。支持linux下的很多命令。如cat,输入输出重定向等。-text of a large number of sub-word. Linux support of many orders. If the cat, redirect input and output, etc..
CSW50
- 是一个很好的分词组件,里面有具体的说明文档。-is a good segmentation components, there are specific documentation.
chentian.nutch
- 实现了基于词库的nutch中文分词,主要修改了其中的.jj文件等-realized based on the thesaurus nutch Chinese word, the main change of them. Jj documents
chentian.fenci
- 实现了基于词库的nutch中文分词,这一部分是其中的dll文件-realized based on the thesaurus nutch Chinese word, this part is one of the dll file
firtex_beta102_src
- FirteX介绍 功能: 支持增量索引,差量索引,多字段索引,提供了3种前向索引方式; 支持纯文本,HTML,PDF等文件格式; 提供快速中文分词; 从底层到高层,提供了多种索引访问接口,灵活自由地使用索引文件; 提供丰富的检索语法,支持多字段检索,日期范围检索,检索结果自定义排序等。 性能: 在Pentium 4 2.8G 2GRAM的机器上超过200Mb每分钟的索引速度 在近7G的索引文件(100G网页,11G纯文本的索引)上检索,仅使用十几M内存在数毫
PWSWNRCODE
- 最大概率法分词。这种技术的分词效率极高。大家共享了。-greatest probability method segmentation. This segmentation of the very efficient. Share of.
BiHZFreqCode
- 汉字二字组频度统计。可以统计汉字文本中二字组的频度。很好用。中文文本分词很有用的工具。-Chinese word frequency statistics group. Chinese statistics can text the word frequency group. Good use. Chinese text segmentation useful tool.