搜索资源列表
0000000000
- 中文分词: 输入一个汉字的文本文件,可将里面的汉字分解成一个个符合人们心理的词-Chinese word : a Chinese character input text files can be inside of a Chinese character is divided into 000 people with mental word
SplitCNWord
- 一个中文分词的实现及演示程序,可用与中文和英文词组的分割.-a Chinese word achieve and demonstrate the procedure can be used with the Chinese and English phrase segmentation.
FreeICTCLAS
- 中科院的分词动态连接库,不用我说了哈,在国内是大名顶顶哈-CAS-term dynamic link library, I do not have said Kazakhstan, in Kazakhstan after another is Daming
Xerdoc XDMF
- 一个比较好用的中文分词模块,是车东等人开发的,比较有名,java语言实现-a better quality of the Chinese word module, East cars were developed, more famous, java language!
语料库
- 一份很重要的语料库,为你的分词程序是一个很好用的资料库文件-a very important corpus, as your segmentation procedure is a very good use of the database file
lzsearch
- 用javascrip编写的分词系统 可以解决现在许多网站中文搜索支持不好的问题 无解压密码 -javascrip prepared with the sub-term system can solve many Web sites now support Chinese search the problems without extracting passwords
wordseg
- 中文文本分类中对文本进行分词处理,以降温本表示称词的集合,然后用词向量来进行分类。-Chinese text classification of text-word processing, said to cool the pool said the words, then word vector to classify.
hlssplit
- 海天智能分词系统源码,可以根据需要设置分词的粒度,另外提供了词性标注,关键字提取等功能-sky Intelligent Word system source code, according to the need for segmentation of granularity, the provision of tagging, keyword extraction functions
JavaSplitWord
- 国内最好的开源分词系统-中科院分词系统的Java版,可以直接在JSP下调用-the best domestic revenue segmentation system-CAS-word version of Java, JSP can be directly under the Call
TextCategorization
- 基于朴素贝叶斯算法实现的中文文本分类程序。可以对中文文本进行分类识别,使用时先对分类器进行训练,然后进行识别。该Beta版本仅支持对3类文本进行分类,使用简单的中文分词方法,本程序尚不具备实用性,用于算法研究和改进。-based on Bayesian algorithms to achieve the Chinese text classification procedure. Can the Chinese text classification identification, the us
textsegment
- 中文分词程序,用java写的,有gui界面-Chinese word segmentation procedures using java write a gui interface
xdgf
- 字符处理这是一个基于Java的分词、N-gram统计、分段 、分句等功能的程序,支持多种语-characters to deal with this is a Java-based segmentation, N-gram to statistics, subparagraph Clauses function procedures, multiple language support
segment
- 基于文本文件的分词程序,可以将指定文件名的文本文档按照词典自动划分出单词.-text-based documents, the sub-term process, which would be designated the File Name text documents with automatic divided dictionary word.
chsegc
- chseg c语言写的汉字分词模块,调用方法和使用方法见程序内说明readme.txt。-chseg write Chinese characters Word module, called methods and procedures for use within see Note readme.txt.
ChineseSegmenter
- 中文分词java版 基本词典,分次效果很不错的-Chinese word java version of the basic dictionary, graded very good results
segment.tar
- java的分词操作的jar文件,很不错的-the sub-term operation of the jar, a very good
ictclas_Source_Code
- 计算所汉语词法分析系统ICTCLAS介绍 词是最小的能够独立活动的有意义的语言成分。 但汉语是以字为基本的书写单位,词语之间没有明显的区分标记,因此,中文词语分析是中文信息处理的基础与关键。为此,我们中国科学院计算技术研究所在多年研究基础上,耗时一年研制出了汉语词法分析系统ICTCLAS(Institute of Computing Technology, Chinese Lexical Analysis System),该系统的功能有:中文分词;词性标注;未登录词识别。分词正确率高达97
CDevideSentence
- 用c++写的分词算法,简单,实用,详情看里面的帮助文件!-using c + + to write the sub-term algorithm is simple, practical, inside look at the details of the help files!
darts-0.2.tar
- 双数组辞典生成程序。利用双数组实现trie算法,对于不定长度共同前缀查询情况,比哈希方法更为有效。经常用于分词辞典的制作。-array dictionary-generation procedures. Using two arrays to achieve Trie algorithm for the indefinite length of the inquiry common prefix than Hash methods more effective. Frequently used
PosTagger.exe
- 这是一个文本标注的软件,点击它后可以对输入文本进行分词、标注-This is a markup software, it hits right after importation of text-term Tagging