Resource search results
ChineseWordsDemo
- LingPipe (an open-source Java toolkit for natural language processing): a Java program for Chinese word segmentation.
Lucene_demo
- A Java demo of Lucene, including a word segmentation function and a search function.
fenci
- Chinese Academy of Sciences word segmenter: automatic word segmentation implemented in Java, with usage notes included.
SunMap
- A small GIS project implemented from the ground up. It supports common map operations such as zoom in, zoom out, and pan, as well as queries, and can read shapefile files containing several common geometry types. The project also includes an MMSeg Chinese word segmenter. Suitable for research use by GIS students.
MySearch
- lucene htmlparser paoding customSpider webservice: a complete search engine built on the Lucene toolkit and the Paoding segmenter, with a custom crawler that collects and analyzes data. Usable with minor changes.
java
- Forward maximum matching algorithm in Java, intended to help readers understand how word segmentation works.
Index_Query
- Full-text search over pdf, doc, txt, and html files. Uses the Chinese Academy of Sciences ictloc segmenter, which segments fairly efficiently.
PaoDing
- Paoding Chinese word segmentation software, latest version. Can be used to implement word segmentation and related functions in text retrieval.
ICTCLAS_JAVA
- Chinese word segmentation and part-of-speech tagging using the ICTCLAS_JAVA version of the ICTCLAS Chinese word segmentation system.
MyWordSpliter1
- A word segmentation program implemented in Java; Chinese word segmentation for Nutch.
fenci
- Fudan University's Chinese word segmentation package for Java. With Eclipse installed, import the project and it is ready to use.
Java
- Java source code that performs word segmentation, stop-word removal, and word-frequency counting.
Split
- Reverse maximum matching algorithm for Chinese word segmentation, implemented in Java. The program performs fairly simple Chinese word segmentation.
ictclas4j
- ictclas4j: Java version of the Chinese Academy of Sciences word segmenter, rewritten from the C version.
fenci
- Java code for Chinese word segmentation based on IKAnalyzer2012; can remove stop words.
FenCi
- Chinese word segmentation with NLPIR2015 in Java; a custom dictionary can be added. Java source code.
WordSplit.java
- Dictionary-based word segmentation implemented in Java. Effectively removes stop words and punctuation, and can recognize personal names.
jieba-analysis-master
- jieba word segmentation (Java version). Only the original project's search-engine segmentation functions (cut_for_index, cut_for_search) are kept; part-of-speech tagging and keyword extraction are not implemented (they may be added later if needed).
CatDemo
- Java article search. The archive is not encrypted and contains source code that runs without errors. Features: 1. word segmentation; 2. new dictionaries can be added. The uploader hopes it is helpful to those who download it.
IKAnalyzer2012_u6
- Chinese word segmentation package for Java search engines; splits Chinese text into words.
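
Two of the entries in this list describe forward and reverse maximum matching, the simplest dictionary-based segmentation methods. The following is a minimal sketch of both, not code taken from any of the listed packages. The class name `MaxMatchDemo`, the method names, the tiny dictionary, and the `maxLen` parameter are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Set;

// Illustrative only: names and dictionary are hypothetical, not from the listed projects.
class MaxMatchDemo {

    // Forward maximum matching: scan left to right and, at each position,
    // take the longest dictionary word (up to maxLen chars) starting there.
    // If nothing matches, emit a single character and move on.
    static List<String> forward(String text, Set<String> dict, int maxLen) {
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            int end = Math.min(text.length(), i + maxLen);
            while (end > i + 1 && !dict.contains(text.substring(i, end))) {
                end--; // shrink the candidate from the right
            }
            out.add(text.substring(i, end));
            i = end;
        }
        return out;
    }

    // Reverse maximum matching: the same idea, scanning right to left.
    static List<String> backward(String text, Set<String> dict, int maxLen) {
        List<String> out = new ArrayList<>();
        int end = text.length();
        while (end > 0) {
            int start = Math.max(0, end - maxLen);
            while (start < end - 1 && !dict.contains(text.substring(start, end))) {
                start++; // shrink the candidate from the left
            }
            out.add(text.substring(start, end));
            end = start;
        }
        Collections.reverse(out); // words were collected from the end of the text
        return out;
    }

    public static void main(String[] args) {
        // Toy dictionary; a real segmenter would load a large word list.
        Set<String> dict = Set.of("研究", "研究生", "生命", "起源");
        String text = "研究生命起源";
        System.out.println(forward(text, dict, 3));
        System.out.println(backward(text, dict, 3));
    }
}
```

On this input the two directions disagree: the forward scan greedily takes 研究生 and strands 命 as a single character, while the reverse scan yields 研究 / 生命 / 起源. The sketch indexes by `char`, so it assumes text within the Basic Multilingual Plane.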