搜索资源列表
多种分词方法
- 含有多个分词算法。毕业设计的时候获得的,希望对大家在汉字处理中能用的到。 ICTCLAS算法,中科院,对名字识别能力很强。VC开发。 CSharp分词,向前匹配加向后最大匹配,C#开发,容易扩展。 小叮咚分词,由后向前最大匹配,C#开发。 xerdoc分词,基于ICTCLAS的Java版本分词。 文本分词词典,分词的词典,可以提供分词数据源。-containing more than Segmentation. Graduation was the time of hope
ProbWordSeg
- 最大概率分词法,这种分词算法能够较好的解决汉语分词中的歧义问题,但分词效率比最大匹配分词算法要低-greatest probability points accidence, Segmentation algorithm can be used to solve the Chinese word segmentation of Ambiguity, but Word efficient than the largest matching segmentation algorithm lower
最大匹配算法
- 中文分词正反向最大匹配算法
worddiv
- 中文分词算法,用正向最大匹配算法实现的中文分词,包裹dao操作数据库。
pymmseg.用python写的分词程序
- 用python写的分词程序,实现的是最大匹配方法,简单易用,Using python to write the sub-term process of implementation is the largest matching method, easy-to-use
segmentation
- 基于hashmap的首字哈希查找法,正向最大匹配法分词系统。代码用c++编写,本系统很好的实现了分词功能。-Based on the first word hash hashmap Find law, being the largest sub-word matching system. Code using c++ development, the system achieved a very good word function.
hanyufenci
- 分别用最大匹配和最大熵的方法实现了汉语分词-Respectively maximum matching and maximum entropy methods to achieve the Chinese word
FMM
- 刚上文机器翻译课,自己写的一个简单的正向最大匹配分词-Just above machine translation classes, wrote it myself being a simple maximum matching segmentation
maxseg
- 最大匹配分词系统,分词中最见大的,效果也是很好的系统。-maximum matching segmentation
MaximumMatching
- 利用最大匹配法进行分词。需要一个词典。就可以进行分词。性能很好-Carried out using the maximum matching word segmentation. Need a dictionary. Segmentation can be carried out. Good performance
keyword-chouqu
- 基于逆向最大匹配算法的分词及基于HMM模型的词性标注系统,包括了未登录词的识别、数据库的添加等内容。(需要手动修改数据库的路径才可以运行)-Reverse Maximum Matching Algorithm Based on the sub-word HMM-based model and part of speech tagging system, including the unknown word identification, such as the contents of the d
word_split
- 这个一个基于逆向最大匹配的分词程序,语料规模比较小。-The maximum matching based on the reverse of the sub-term process, relatively small-scale corpus.
wordSegment
- 汉语分词 隐马尔科夫模型 最大匹配法 最大概率法-HMM MaxMatch
fenci
- 正向最大匹配算法分词,采用容器储存字典可提高一定的效率。-Forward Maximum Matching algorithm for word segmentation, using containers for storage dictionaries can improve a certain efficiency.
ZuiDaPiPeiSuanFa
- 对一个字符串进行最大匹配法分词的算法,在VC环境下运行即可。-A string of maximum matching word segmentation algorithm can be run in the VC environment.
CLucene
- clucene 源码,并且增加了自己写的正向最大匹配算法的分词程序。-clucene source code, and increase their own to write the forward maximum matching algorithm for the sub-word program.
RMM
- 逆向最大匹配算法实现分词,分词结果在另一个txt里呈现(Reverse maximum matching algorithm to achieve the word segmentation, word segmentation results in another TXT presentation)
Chinese Word Segementation
- 根据所给中文词典,对中文文本进行正向最大匹配分词,其中包括读取文本,并对其分词(According to the Chinese dictionary, the Chinese text is given a positive maximum matching word, which includes the reading of the text and the participle of the Chinese text.)
双向最大匹配.py
- 用双向最大匹配算法实现汉语分词。准确率颇高。(Two way maximum matching algorithm is used to realize Chinese word segmentation. The rate of accuracy is quite high.)
Python逆向最大匹配实现-代码-文档-截图等
- 对于文本进行自然语言处理,中文分词。北邮计算机院NLP课的作业。(This is a homework for BUPT student, implementing the word segment of natural language processing.)