Search results: resource list
mmseg-v0.1
- A Chinese word segmentation component based on a dictionary and the maximum matching algorithm; described as achieving good segmentation accuracy.
cidian
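- A Chinese word segmentation component based on the maximum matching algorithm; described as achieving good segmentation accuracy.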
wordseg
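- Word segmentation based on forward maximum matching. Uses a hash table to segment a continuous passage of text against the supplied dictionary and outputs the result.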
Superyy
- Source code for a word segmenter that uses the forward maximum matching strategy; it has been tested.
dedesplit
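- Uses a hybrid segmentation algorithm that combines minimum segmentation with backward maximum matching. Ambiguity recognition is still at a preliminary research stage.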
segmenter
- A simple and effective Chinese word segmentation algorithm, including forward maximum matching, implemented in Java.
WordSeg
- Segments Chinese sentences using the maximum matching method. Maximum matching is the most commonly used segmentation algorithm; it is simple and practical, and its accuracy can exceed 80%.
ChineseSplit
- A Chinese word segmentation and keyword extraction system developed in VB.NET, implemented with bidirectional maximum matching, word frequency statistics, quicksort, and other algorithms.
proWordSegment
- C++ source code for Chinese word segmentation by forward maximum matching; debugged successfully in Visual Studio 2008.
KSeg4J.1.0
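- A mechanical (dictionary-based) word segmentation module for Simplified Chinese. It uses forward and backward maximum matching to resolve ambiguity and is packaged as a JAR that can be imported and used directly.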
Lucene
- A Lucene analyzer for Chinese word segmentation based on forward maximum matching (FMM); said to improve precision.
segment
- Word segmentation based on n-gram models. After training, it produces an initial segmentation with forward and backward maximum matching, then resolves ambiguity with a bigram model.
ChineseWordSegmentation1
- Chinese word segmentation using forward maximum matching; implemented in a VS2008 environment.
MainFrm
- Design and implementation of an automatic Chinese word segmentation system. Starting from an analysis of the main difficulties facing automatic segmentation, and aiming to reduce segmentation difficulty and improve accuracy, it implements an automatic Chinese word segmentation system based on forward maximum matching.
fencisuanfa
- Segments sentences using forward maximum matching, a dictionary-based segmentation algorithm. The algorithm is described as fast and accurate.
SF
- Chinese word segmentation is the most fundamental step in Chinese information processing. Whether for machine translation, information retrieval, or other related applications, anything involving Chinese depends on it, which makes it extremely important. The simplest entry point to Chinese word segmentation is probably the maximum matching method.
Web-Chinese
- Automatic Chinese word segmentation is the foundation of Chinese Web information processing. This work builds on the maximum matching method (MM), takes context fully into account (MMC), and uses binary search in memory for dictionary matching, effectively improving segmentation accuracy and speed.
Chinese_wordspliter
- A Chinese word segmenter using forward maximum matching. It reads the text file to be segmented and the dictionary file, then writes the processed result to another file. The sample input file contains about 1.84 million characters, and processing takes about 11 s. Written in C++.
WordSegment
- A dictionary-based word segmentation system developed in C++ that uses a hash-based backward maximum matching algorithm.
freqci2009
- An automatic word segmentation program that uses forward maximum matching. It can be used directly after unpacking the archive; source code is included.
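
Most of the entries above rely on forward, backward, or bidirectional maximum matching against a dictionary. As a rough illustration only (not taken from any of the listed packages), the three variants could be sketched in Python as follows. The function names, the toy dictionary, and the tie-breaking rule in `bidirectional_max_match` (fewer tokens, then fewer single-character tokens, then the backward result) are assumptions chosen for the example; real implementations differ in dictionary storage and disambiguation.

```python
def _max_word_len(dictionary):
    # Longest dictionary entry; at least 1 so the scan always advances.
    return max(1, max((len(w) for w in dictionary), default=1))


def forward_max_match(text, dictionary):
    """Scan left to right, taking the longest dictionary word at each position."""
    max_len = _max_word_len(dictionary)
    tokens, i = [], 0
    while i < len(text):
        # Try the longest candidate first and shrink until a match is found.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


def backward_max_match(text, dictionary):
    """Scan right to left, taking the longest dictionary word ending at each position."""
    max_len = _max_word_len(dictionary)
    tokens, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            candidate = text[j - size:j]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                j -= size
                break
    tokens.reverse()  # tokens were collected from the end of the text
    return tokens


def bidirectional_max_match(text, dictionary):
    """Run both scans and pick one result with a simple heuristic (an assumption)."""
    fwd = forward_max_match(text, dictionary)
    bwd = backward_max_match(text, dictionary)
    if len(fwd) != len(bwd):
        return fwd if len(fwd) < len(bwd) else bwd
    singles_fwd = sum(1 for t in fwd if len(t) == 1)
    singles_bwd = sum(1 for t in bwd if len(t) == 1)
    return fwd if singles_fwd < singles_bwd else bwd


if __name__ == "__main__":
    # Toy dictionary stored in a hash set, so each lookup is O(1) on average.
    words = {"研究", "研究生", "生命", "的", "起源"}
    sentence = "研究生命的起源"
    print(forward_max_match(sentence, words))
    print(backward_max_match(sentence, words))
    print(bidirectional_max_match(sentence, words))
```

On this toy sentence the forward scan greedily takes 研究生 and leaves 命 stranded, while the backward scan yields 研究 / 生命 / 的 / 起源, which is the kind of ambiguity the bidirectional entries in the list try to resolve.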