搜索资源列表
ChineseSegmenter
- 中文分词java版 基本词典,分次效果很不错的-Chinese word java version of the basic dictionary, graded very good results
segmenter
- 一个实现简单分词java程序,附有源代码,大家可以参考学习交流一下。
HLjava
- 海量中文分词java接口,原海量分词是C/C++平台开发的,这是java版的接口,如果提示过期,修改系统时间即可试用
src_seg(java)
- 一个用java语言编写的中文文本分词算法
ictclas.rar
- Java中lucene分词需要的ICTCLAS.dll文件和data词库,Java Lucene participle in need ICTCLAS.dll documents and data thesaurus
je-analysis-1.5.3
- 在java环境下开发的分词源代码,本代码可以通过lucene,nutch调用,实现对中文的分词-Java development environment in the sub-etymology code, this code can be used with lucene, nutch call, the aim is to achieve the Chinese word
lingpipe-3.6.0
- 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能,包括主题分类(Top Classification)、命名实体识别(Named Entity Recognition)、词性标注(Part-of Speech Tagging)、句题检测(Sentence Detection)、查询拼写检查(Query Spell Checking)、兴趣短语检测(Interseting Phrase Detection)、聚类(Clustering)、字符语言建模(Character
LTP
- 哈工大LTP自然语言处理工具的java调用实例,利用jni调用dll,实现中文的分词,词性标注,建立依存树等-HIT LTP natural language processing tool called an instance of java using jni call the dll, to achieve in Chinese word segmentation, POS tagging, the establishment of dependency trees, etc.
text_category
- 中文自动分类。使用spider抓取网络信息,利用lucene的分词及KNN方法。-Chinese automatic classification. The use of spider crawl network information, the use of Lucene sub-word and KNN methods.
ChineseWordsDemo
- LingPipe(开源自然语言处理的Java开源工具包) 中文分词java程序-LingPipe (open source natural language processing toolkit in Java open source) Chinese word segmentation procedure java
Lucene_demo
- java使用lucene的demo、包含分词函数、搜索函数-java使用lucene的demo
IKAnalyzer3.1.1_userguide
- java分词程序,能够精确分词,包含词库等-java word program, word accurately, including the thesaurus, etc.
IKAnalyzer3.1.1StableAllInOne
- Lucene 中文分词,很好的 可以随便下压,加油-Lucene Java
ShuzhenAnalyzer-1.1.8-jdk1.6.0
- 中文分词 ShuzhenAnalyzer 可用于将文档中词进行划分,比较好用-Word cut using java
ICTCLAS50_Linux_RHAS_64_JNI
- 中科院中文分词程序,国内相关领域的的权威.这是Java(JNI)64位版-Institute of Chinese word segmentation program, the domestic authority of the relevant fields, which is Java (JNI) 64-bit version
jieba分词
- jieba 的java分词包,一般都是python的包,这个可用于java的jieba分词(Jieba Java word segmentation package, generally Python package, this can be used for the Java Jieba participle)
FMM
- java源码分词器,导入eclipse即可使用,无需修改代码,分词效果还行(Java source code word segmentation, import eclipse can use, without modifying the code, the word segmentation effect is OK)
Models_v1_v2
- 对中文文本进行分词,词性标注。训练模型,根据模型训练学习分词。(participle Part of speech tagging)
CSATP
- 汉语文章的自动分词系统,带界面,java编写(Automatic word segmentation system for Chinese articles, with interface, Java writing)
JNA
- 中文的分词,包括词性标注、关键词提取,Java文件(word segmentation and part of speech tagging)