搜索资源列表
participle
- 这是关于句子相似度处理前的一个分词处理,希望可以帮到你!-This is about common sentence similarity processing of a points before word processing, not write the source code, is just a algorithm model!
worddiv
- 中文分词算法,用正向最大匹配算法实现的中文分词,包裹dao操作数据库。-Chinese word segmentation algorithm, the forward maximum matching algorithm of Chinese word segmentation, wrapped dao operations database.
ICTCLAS50_Windows_64_JNI
- 中科院权威分词系统源码,ICTCLAS50_Windows_64_JNI-ICTCLAS50_Windows_64_JNI,coming from The Chinese Academy of Sciences Authority origin code
ProbWordSeg
- 汉语分词 最大概率 分词 最大概率分词-failed to translate
A5bbs4.0
- 1.采用php小偷技术自动与A5论坛同步更新! 2.动态浏览与静态后台自由切换! 3.全站伪原创.全站可生成页面缓存,减轻服务器负担,读取速度加快! 4.四种采集方式,兼容98 空间服务器,独立模板风格. 5.搜索引擎蜘蛛访问记录! 6.keyword自动分词获取。内容过滤功能。包含自定义词语的内容将不采集 7.关键词内链, 文章内容包含指定关键词将自动添加链接!后台自定义关键词! 8.可随机打乱帖子顺序 账号: admin 密码: admin-Using
FastSeg
- 搜索引擎相关,中文分词算法,java编写-Search engines related, Chinese word segmentation algorithm, written in Java
IKAnalyzer2012
- IKAnalyzer2012,一个以lucene为基础的非常好用的中文分词器,有两种分词模式,智能分词模式和最细粒度分词模式。-IKAnalyzer2012 very easy to use a lucene-based Chinese Word Breaker, there are two sub-word mode, intelligent word patterns and most fine-grained segmentation model.
Frequency-Estimates-Word-Similarity-
- 统计分词的相似性措施的频率估计 ,词汇相似性的频率算法。-Ourbestcombinationofsimilaritymea-sureandfrequencyestimationmethodanswers 6-8 morequestionsthan the bestresultspre-viouslyreportedforthesamequestionsets.
WordSeg
- 此系统是用MFC编写的正向最大匹配的汉语分词系统,代码详尽,经本人调试能运行且正确。-The system is written in MFC forward maximum matching Chinese word segmentation system, a detailed source, I debug run and correct.
ZNFC
- 基于MFC智能分词程序及代码,不需要原始词库,通过训练原理实现忆词和分词-MFC smart segmentation procedures and code, without the original thesaurus, recalling words and word segmentation by training principles
ChineseWordSegmentation
- 中文分词处理,复旦大学FudanNLP中的中文分词处理程序-chinese word segmentation
Cs
- 中文分词 chinese word segmentation-chinese word segmentation
chinese-_segmentation
- 中文分词算法介绍,正向最大匹配。word-word for chinese segmentation algrithm
ICTCLAS50_Windows_32_C
- 中科院分析系统 ICTCLAS的主要功能有:中文分词;词性标注;命名实体识别;新闻识别;用户词典-ICTCLAS segementword
ChineseSeg_CSharp
- 该程序实现简单的中文分词,也可以直接使用。但不建议。做为开发中文分词的参考,相信还是有一定价值的。 项目基于.net(C#)平台下开发。-Chinese word segmentation is the Chinese word segmentation procedure based on matching the pattern of development, but also can be used directly. But is not recommended. Because t
Sfenciie
- 分词程序,HMM模型训练,维特特比解码,有说明文档。可直接使用。 -Segmentation process, HMM model training, Viterbi decoding, and documentation. Complete source code can be used directly.
Lucene.PaodingSrc.jar
- 最新的开源的中文分词paoding ,包含jar包和源码 可以给设计搜索的人一些帮助-The latest open-source Chinese the word paoding, contains the jar files and source code to the design search some help
Tokenizer
- opennlp是自然语言处理的开源工具,它是JAVA写的,可以再Eclipse中直接调用。上传的这写代码实现了英文分词代码的功能。-Opennlp is an open tool for natural language processing. It is written in JAVA. It can be used in Eclipse directly . The code uploaded is used to token English words.
Complete-Training-of-TC
- 用贝叶斯模型实现文本分类,;里面包含分词,词频统计,去除停用词等模块,目前完成的是分类的训练阶段。-realize text categorization by using the NaiveBayes Model
SplitWords
- 中文分词系统,给定一个文档,生成另一个内容已经被分割的文档-The Chinese word segmentation system, given a document, generating another content has been the division of the document