搜索资源列表
classifier-1.12
- 能对从Google中搜索出来的文本进行聚类,提供了Java包,及调用源代码.-can right from the Google Search for the text clustering, a Java package, source code and call.
xdgf
- 字符处理这是一个基于Java的分词、N-gram统计、分段 、分句等功能的程序,支持多种语-characters to deal with this is a Java-based segmentation, N-gram to statistics, subparagraph Clauses function procedures, multiple language support
ICTCLASCaller
- ICTCLAS的JNI调用接口文件: Title:ICTCLAS Caller * <p>Descr iption:do chinese word segmentation.don t change the pakage and CLASS name, orelse you can t use it. * 请不要改变包名、类名以及native的方法名,否则调用将失效。 * 由于ICTCLAS本身存在很多鲁棒性问题,调用segSentence时,strin
SplitWord_Java
- java制作的中文分词DLL文件,是根据中科院中文分词系统C++改写的-produced by the Chinese word DLL files, under the Chinese Academy of Sciences is the Chinese word rewrite the C system
FindChinese
- check chinese in source code file-check in source code file
xiaojishiben
- 这是用java开发的一个记事本,现把它的源程序公布给大家分享,希望多提意见-This is a java development of the notebook, it is released to the source code to share with you, to speak up
gerenqiuzhiguanlixitong
- 好东西望管理员给本人多加几分!每个人上传个文件不容易!-good things, I hope more managers to say! Everyone upload documents is not easy! Thank you
maxent
- 最大熵模型源代码,使用java编写,可以用来进行分类。-maximum entropy model source code, the use of java preparation, can be used for classification.
simmetrics_src_v1_5_d06_06_06
- SimMetrics is a Similarity Metric Library, e.g. from edit distance s (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapman). Work provided by UK Sheffield University funded by (AKT) an IRC sponsored by EPSRC, grant number GR/N15764/0
Classifier4J-0.6-dist
- 可用于文本分类的贝叶斯分类器,java开源项目-can be used for text classification Bayesian classifier, java open source projects
KNNSOURCECODE
- java 实现KNN文件分类器的源码 java 实现KNN文件分类器的源码-java achieve KNN classifier document java achieve the source document KNN classifier of the source code
JSP__Partition
- 利用java实现jsp的分页,可以作为实际的组块,应该是比较有用的,欢迎下载!-use java achieve jsp Pagination, as the actual block, it should be a more useful, welcome to download!
Paoding
- 中文分词得小系统,基本功能已实现,但还有很多地方有待改进,没有实现自动学习,人名识别等功能。-Chinese word in the smaller system, the basic functions have been achieved, but there is much room for improvement, no automatic learning, name identification, and other functions.
changname
- 批量改名程序,将文件进行批量改名.改名后可以还原!有文件的搜索,查询功能!-batch renamed procedures for batch file renaming. Renamed after Reduction! Document search, query!
guozhong
- 褒贬评价,可以对文章中的公司名及产品进行褒贬评价,即可以对他们进行打分。-evaluation, the article on the names of these companies and products different evaluations, which can for their scoring.
mmseg-v0.1
- 基于词典和最大匹配算法的的中文分词组件,达到很好的分词准确率-Dictionary and the largest based on the matching algorithm of the Chinese word segmentation components, to achieve good word accuracy rate
getSpell
- 实用的简繁体中文转换成拼音全拼Java类,可以转换GBK字符集中的所有汉字,使用非常简单,只需按照main()函数中的测试例子调用即可。目前,对于多音字的处理还有待完善。-practical Jane English phonetic spelling converted into Java classes, GBK characters can be converted concentrate all the characters, using very simple, only in acc
personNER
- 基于CRF(conditional random fields)统计模型的文本人名识别工具源代码,是Mallet开放源码项目的一部分-based on CRF (conditional random fields) statistical model of text my name recognition tools source code, open source Mallet is part of the project
abner
- 一个命名实体识别工具,是Mallet开放源码项目的一部分,可用于识别文本中的人名、地名等信息-a named entity recognition tools, Mallet OSS part of the project, Text can be used to identify the names, places and other information
ex1
- hhe example may be useflu