搜索资源列表
MainFrm
- 是设计并实现一个汉语自动分词系统。在分析了自动分词面临的主要困难和难点的基础上,旨在降低分词难度和提高分词精度,设计并实现了一个基于正向最大匹配的汉语自动分词系统。-The design and implementation of a Chinese word segmentation system. In the analysis of word segmentation and difficulty of the major difficulties facing based on the
bin
- 中科院分词系统,非常好用的分词工具,现下比较流行,对你会有帮助的-ICTCLAS,a useful tool, it will be useful for you
N-gram
- N-gram中文分词系统,通过前后项切分,计算概率,进而获得最佳的切分-N-gram Chinese segment,by FMM and RMM,we can Calculate the probabilities,then,we can get the best segment.
HZ_Freq
- java中文分词系统,可供大家学习,祝成功路上越走越远!-Java wordseg program
SharpICTCLAS
- SharpICTCLAS分词系统,是一种应用较为广泛的分词系统,适用于网络采集文字后的分词。-SharpICTCLAS word segmentation system is a sub-word is widely used systems for network segmentation after the text collection.
Linux_C_32
- ICTCLAS linux环境下借口和源文件,同时包含例子。是国内一个非常好的开源中文分词系统。-ICTCLAS linux
ChineseWordsDemo
- 中文分词系统的java源代码,中文分词中文分词中文分词中文分词-Chinese word java
11MyClassify
- 用中科院的分词系统实现文本分类,文本分类的方法为K-With the CAS system to achieve the sub-word text classification, text classification method for KNN
ICTCLAS_JAVA
- 使用汉语分词系统ICTCLAS_JAVA版本进行中文分词、词性标注-Use of Chinese word segmentation system ICTCLAS_JAVA version of Chinese word segmentation, POS tagging
InPutTextFile
- java中文分词系统,很好用的。欢迎下载与修改,并提出宝贵意见。-chinese words splitting system
forictclas
- 1.在vs2008下,解压缩即可运行 2.该代码为中科院的中文分词系统ictclas源码,本人修改部分bug后上传 3.运行后输入 中文字符串就可以-1. In vs2008, the extract to run 2. The code word for the Chinese Academy of Sciences of the sub-system ictclas source, I modified some bug and upload 3. Run and enter the
ictclas4j
- 本代码用java实现了分词功能,包括分词和词性标注,里面有具体的说明文档,包括数据结构的设计,分词步骤,分词系统研究等。-The code is implemented using java segmentation features, including word segmentation and POS tagging, which have specific documentation, including data structure design, word steps, such a
WordSeg
- 分词系统,主要是对中文进行分词处理,对初学者有一定的帮助。-Segmentation system is mainly deal with the Chinese word segmentation, there is some help for beginners.
partition
- 分词系统的实现和测试 基于字符串的分词,根据分词标记提取单个词组-Segmentation system implementation and testing of the sub-string based on word segmentation based on extracting a single phrase marker
Bayes
- 用bayes实现的聚类算法,分词采用的是SharpICTCLAS分词系统 1.0-Achieved using bayes clustering algorithm, word segmentation is used SharpICTCLAS System 1.0
ICTCLAS50_Linux_RHAS_32_C
- 中科院发布的中文分词系统,为国内水平最高的中文分词软件,这是最新版-Chinese Academy of Sciences released a Chinese word segmentation system, the highest level for the domestic Chinese word segmentation software, the latest version of the
shooter_seg
- 开源分词系统,可以自己更改词库词典,加载后即可正常使用-shooter seg
ICTCLAS5.0
- ICTCLAS50 是目前最新版本 该文档是一个分词系统的接口文档
imdict-report.ppt
- 中科院imdict中文分词系统的学习报告,剖析系统流程以及对隐性马尔可夫模型的学习。-A study of imdict-chinese analyzer.
WordSegment
- 用C++开发的分词系统 运用基于哈希的逆向最大匹配算法 基于词典-Word in C development system uses a hash-based reverse maximum matching algorithm is based on dictionary