搜索资源列表
WordSegmentation
- 中文分词划分,包含标点符号,也适用于英文-chinese word segmentation
ictclas4j
- 中文分词器ictclas4j的源码,含有分词器的算法源码,以及案例-Chinese tokenizer ictclas4j source containing the word algorithm source code, as well as case
elasticsearch-analysis-ik-master
- ik是一个中文分词较为成熟的分词器,该文件是分词器的源码-ik is a Chinese word segmentation is more mature, the file is the code word
FenCi
- NLPIR2015中文分词 java 可以加入自定义词典 -chinese segmentation by NLPIR2015 . JAVA source code
nlpir_ictclas2013_release
- 中科院发布的分词系统,能很好的进行中文分词,词性标注。-Chinese Academy of Sciences released a segmentation system that can be very good for Chinese word segmentation, POS tagging.
909aae2c-4f2c-4771-83e4-6894516f14e1
- 一个中文分词算法,可以实现将分词文本切分成自定义字典中的单词-A Chinese word segmentation algorithm, you can achieve the word segmentation text into a dictionary of words
ExtractChinese
- Java编写的一个中文分词功能的代码,能实现中文分词功能-A Chinese word function written in Java code, to achieve the Chinese word function
IK-src
- ik 中文分词功能,使用中文分词,可以自己设置词库。区分于二元分出法-ik Chinese word function, using the Chinese word, you can set your own thesaurus. Separation method to distinguish two yuan
Twitter-LDA-master
- twitter-LDA算法的JAVA实现,LDA算法针对于微博短文本的改进算法,目前只是简单的英文分词功能,没有中文分词功能,-twitter-LDA algorithm JAVA implementation, LDA algorithm for improved algorithm for short text microblogging, now just a simple English word function, there is no Chinese word function,
ictclas
- 用java语言实现中文分词去停用词,中科院分词软件ICTCLAS-To achieve the Chinese word to stop word
New-folder
- 自然语言处理中的隐尔可夫马中文分词方法,利用java实现-NLP, using HMM to automatic word segmentation
fenci
- 中文分词算法双向最大匹配算法基于词典匹配的分词算法-Chinese word segmentation algorithm bidirectional maximum matching algorithm based on dictionary word matching algorithm
cws_theano-master
- 中文分词在theano的deep learning的运用,-chinese word segmentetion
FileDemo
- 对文件进行分词的例子.输出带词性的中文分词,已经去掉了停用词.-Examples of the file segmentation output of the Chinese word with POS, has been removed stop words.
Divide
- 使用Java语言,用前向匹配算法与后向匹配算法实现中文分词- The use of Java language, with the forward matching algorithm to achieve the Chinese word segmentation
lucene-unit
- 可以反射自定义索引类型,自定义索引路径-默认类路为上两级下的indexWrite目录,中文分词,自定义搜索Query,分页搜索并缓存一部分数据-Can reflect the custom index type, the index of the custom path- the default class on the road to indexWrite directory, under the two levels of Chinese word segmentation, custom
IKAnalyzer
- IKAnalyzer中文分词,是一种有效的中文分词API-IKAnalyzer Chinese divide
TFIDF
- 经典的中文分词算法 亲测可行,效果一般般,可供小白学习。(Classical Chinese word segmentation algorithm, pro test feasible)
IKAnalyzer2012_u6
- java 搜索引擎中文分词包,拆分中文词组(Java search engine Chinese word segmentation package)
ansj_seg-master
- 基于java语言的ansj中文分词程序,适合语义识别学习者研究用(Ansj Chinese word segmentation program based on Java language, which is suitable for semantic recognition learners to study)