搜索资源 - 分词算法 - 搜珍网

CDN加速镜像 | 设为首页 | 加入收藏夹

热门搜索： 源码 Android 整站插件识别 p2p OpenCV 网络编程游戏源码算法更多...

登陆 | 会员注册

当前位置：

搜索资源 - 分词算法

下载资源主分类

源码下载

Web源码

开发工具

文档下载

其它资源

资源分类

Windows编程

Internet/网络编程

系统编程

通讯/手机编程

游戏

多媒体

嵌入式/单片机编程

图形图象

数值算法/人工智能

行业应用软件

数据库系统

其它

搜索资源列表

cidian

0下载：
基于最大匹配算法的的中文分词组件，达到很好的分词准确率
所属分类：中文信息处理
- 发布日期：2008-10-13
- 文件大小：31957
- 提供者：大宝

bogofilter-0.93.4.tar

0下载：
开源项目BOGOFILTER的源码.该项目实现BAYES文本分类算法,不过该源码只支持英文的分词
所属分类：中文信息处理
- 发布日期：2008-10-13
- 文件大小：831874
- 提供者：zhou

Qiyi

0下载：
最大概率法分词的数据结构与算法，用这样的方法分词可以提高分词中歧义词的辨别率，非常经典啊-greatest probability - term data structure and algorithm, this approach can increase Word word-of ambiguous words in the identification rate, very classic ah
所属分类：中文信息处理
- 发布日期：2008-10-13
- 文件大小：1165
- 提供者：月月

EasyXSpider

2下载：
EasyXSpider不仅仅是一个简单的Linux下的爬虫程序。更包括了，索引制作，检索，分词（英文及中文二元法切词），以及Google PageRank算法和CGI查询界面的实现。可以看做是一个完整的小型搜索引擎。
所属分类：搜索引擎
- 发布日期：2009-06-12
- 文件大小：4149505
- 提供者：shootsoft

中文网页自动分类器

1下载：
利用knn算法实现了一个中文网页自动分类器，包括网页预处理，ictclas中文分词，基于tf-idf的文本特征表示，基于df的特征选取和基于knn的分类算法，最后通过struts2框架web发布
所属分类：源码下载
- 发布日期：2011-04-25
- 文件大小：177271
- 提供者：doris_227@163.com

SentenceSimilar.rar

0下载：
先对句子分词，然后根据词来比较句子的相似度，这个算法清晰易懂，欢迎下载！,The first word of the sentence, and then to compare the sentence the word similarity, this algorithm is clear and easy to understand, welcome to download!
所属分类：MultiLanguage
- 发布日期：2017-06-17
- 文件大小：27302095
- 提供者：xumancheer

Chinese-Word-Segment-And-POS-Tagger

1下载：
实现了中文分词和词性标注程序。分词方法采用“三词正向最长匹配”。词性标注使用HMM方法，用Viterbi算法实现。“三词正向最长匹配”保持了“正向最长匹配算法”快速的特点，同时提高了分词的准确性。-Chinese word segmentation and implemented procedures for POS tagging. Segmentation Methods, " the longest three-match positive words." POS tag
所属分类：Speech/Voice recognition/combine
- 发布日期：2017-05-14
- 文件大小：4034615
- 提供者：paul

ChineseSplit

0下载：
一个基于VB.NET开发的中文分词及关键词提取系统，采用双向最大匹配、词频统计、快速排序等算法实现。-VB.NET developed based on Chinese word segmentation and Key Extraction System, the largest two-way matching, word frequency statistics, such as quick sort algorithm.
所属分类：MultiLanguage
- 发布日期：2017-05-21
- 文件大小：6472935
- 提供者：张莉娟

Auto_WordSeg

0下载：
自动分词程序演示。包括最大、最小，正向、逆向等分词算法。-Automatic word segmentation procedure demonstrates. Including the largest, smallest, positive, reverse algorithm.
所属分类：Search Engine
- 发布日期：2017-05-12
- 文件大小：2822236
- 提供者：Li

77433632mmseg-v0.1

0下载：
使用最大正向匹配算法，实现中文自动分词，对于初学者有很高的参考价值。-The biggest being the use of matching algorithm, the realization of Chinese word segmentation, for beginners, has high reference value.
所属分类：MultiLanguage
- 发布日期：2017-04-08
- 文件大小：607478
- 提供者：jicheng

DTWspeech

1下载：
本文首先介绍了语音识别的研究和发展状况，然后循着语音识别系统的处理过程，介绍了语音识别的各个步骤，并对每个步骤可用的几种方法在实验基础上进行了分析对比。研究了语音信号的预处理和特征参数提取，包括语音信号的数字化、分帧加窗、预加重滤波、端点检测及时域特征向量和变换域特征向量.其中端点检测采用双门限法.通过实验比对特征参数的选取，采用12阶线性预测倒谱系数作为识别参数。详细分析了特定人孤立词识别算法，选定动态时间弯折为识别算法，并重点介绍其设计实现。在
所属分类：Speech/Voice recognition/combine
- 发布日期：2017-05-10
- 文件大小：2491034
- 提供者：周文超

segthewords

0下载：
它是文本分词程序代码的核心算法，可以为语言学者提供强大的分词功能。-Is a code word the text of the core algorithm
所属分类：MultiLanguage
- 发布日期：2017-04-01
- 文件大小：4494
- 提供者：baron

theshortest

1下载：
最短路径法分词程序.将中文句子经过原子切分后生成一个有向无环图,然后使用Dijkstra算法求出由起点到终点的-The shortest path segmentation process. After the Chinese sentence after splitting atoms to generate a directed acyclic graph, and then use the Dijkstra algorithm derived from the point of origi
所属分类：Other windows programs
- 发布日期：2017-04-02
- 文件大小：426500
- 提供者：wangzhen

phpsojiqidll

0下载：
自己写的一些PHP扩展EXT DLL，包括搜集齐分词扩展，imagick动态处理图片类，搜集齐核心排序算法DLL（www.sojiqi.com）-PHP to write some of their expansion EXT DLL, including the expansion of the collection of word Qi, imagick dynamic picture categories, the core sorting algorithm to collect Qi
所属分类：Internet-Socket-Network
- 发布日期：2017-05-07
- 文件大小：1315535
- 提供者：逍遥

PExtractor

0下载：
地名抽取算法，利用开源分词组件，并增加了地名前词识别，地名组合等逻辑。-Names extraction algorithm, using open-source word components, and increases the names before the word recognition, place names and combination logic.
所属分类：Other systems
- 发布日期：2017-05-14
- 文件大小：3205737
- 提供者：chensd

ycsfwordseg

0下载：
基于遗传算法的分词论文基于遗传算法的分词论文-Segmentation Based on Genetic Algorithms PapersSegmentation Based on Genetic Algorithms PapersSegmentation Based on Genetic Algorithms Papers
所属分类：AI-NN-PR
- 发布日期：2017-04-24
- 文件大小：195967
- 提供者：racheldo

code

0下载：
这其中涉及了黑名单、文本分类算法、短信内容分词、特征向量选取等关键技术-That involves a black list, text classification algorithm, SMS is divided into words, feature vector selected key technologies such as
所属分类：Data structs
- 发布日期：2017-04-04
- 文件大小：89622
- 提供者：汪浩

CLucene

0下载：
clucene 源码，并且增加了自己写的正向最大匹配算法的分词程序。-clucene source code, and increase their own to write the forward maximum matching algorithm for the sub-word program.
所属分类：AI-NN-PR
- 发布日期：2017-03-27
- 文件大小：440298
- 提供者：yimi

chiword_seg

0下载：
本代码含有逆向最大，正向最大，最短路径等自动识别算法，实现文本中分词的自动识别。-This code contains the largest reverse, forward the most, such as automatic identification shortest path algorithm to achieve automatic recognition of text carved words.
所属分类：Graph Recognize
- 发布日期：2017-04-07
- 文件大小：14070
- 提供者：zhang

Chinese-text-categorization-Study

1下载：
本文通过对Bayes、KNN、SVM 应用于中文文本分类进行比较实验研究。应用ICTCLAS 对中文文档进行分词，在大维数，多数据情况下应用TFIDF 进行特征选择，并同时利用它实现了对特征项进行加权处理，使文本库中的每个文本具有统一的、可处理的结构模型。然后通过三类分类算法实现了对权值数据进行训练和分类。-Based on the Bayes, KNN, SVM applied to compare the Chinese text ca
所属分类：Mathimatics-Numerical algorithms
- 发布日期：2017-03-29
- 文件大小：442391
- 提供者：wulili

« 1 2 3 4 5 6 7 89 10 11 »

搜珍网 www.dssz.com

本网站为编程资源及源代码搜集、介绍的搜索网站，版权归原作者所有！　　粤ICP备11031372号

1999-2046 搜珍网 All Rights Reserved.