搜索资源 - 分词 - 搜珍网

CDN加速镜像 | 设为首页 | 加入收藏夹

热门搜索： 源码 Android 整站插件识别 p2p OpenCV 网络编程游戏源码算法更多...

登陆 | 会员注册

当前位置：

数值算法/人工智能

搜索资源 - 分词

下载资源主分类

源码下载

Web源码

开发工具

文档下载

其它资源

资源分类

压缩解压

STL

数据结构常用算法

数学计算/工程计算

人工智能/神经网络/遗传算法

matlab例程

生物技术

密码/编码算法

mathematica

Maple

数据挖掘

大数据

comsol

物理计算

化学计算

仿真建模

搜索资源列表

WordSeg

0下载：
利用最大匹配法进行汉语句子的分词最大匹配算法是最常用的分词算法，简单实用正确率可达到80%以上-the maximum matching method for the Chinese Sentence Word maximum matching algorithm is the most commonly used word segmentation algorithm, simple and practical accuracy rate can reach more than 80%
所属分类：人工智能/神经网络/遗传算法
- 发布日期：2008-10-13
- 文件大小：74212
- 提供者：廖剑

ProbWordSeg

0下载：
最大概率分词法,这种分词算法能够较好的解决汉语分词中的歧义问题,但分词效率比最大匹配分词算法要低-greatest probability points accidence, Segmentation algorithm can be used to solve the Chinese word segmentation of Ambiguity, but Word efficient than the largest matching segmentation algorithm lower
所属分类：人工智能/神经网络/遗传算法
- 发布日期：2008-10-13
- 文件大小：87747
- 提供者：廖剑

CRFPP0[1].53

0下载：
条件随机域，主要用于标记序列，可以进行分词，词性标注，句法分析，以及文本抽取等。-condition random field
所属分类：AI-NN-PR
- 发布日期：2017-05-07
- 文件大小：1266084
- 提供者：王刚

FreeICTCLAS

0下载：
中科院自动化所的ICTCLAS，C++编写。用于中文文本分词-Automation of the Chinese Academy of Sciences ICTCLAS, C++ to prepare. For the Chinese text word segmentation
所属分类：Mathimatics-Numerical algorithms
- 发布日期：2017-05-24
- 文件大小：8037981
- 提供者：adrian

RostNat

2下载：
很不错的语料分析工具，有分词、分析等等。最主要的还有TF/IDF的分析结果。很是实用-Very good tool for corpus analysis, took part in word analysis, and so on. The main TF/IDF analysis of the results. Is practical
所属分类：AI-NN-PR
- 发布日期：2017-05-15
- 文件大小：3645159
- 提供者：lizhiyong

ycsfwordseg

0下载：
基于遗传算法的分词论文基于遗传算法的分词论文-Segmentation Based on Genetic Algorithms PapersSegmentation Based on Genetic Algorithms PapersSegmentation Based on Genetic Algorithms Papers
所属分类：AI-NN-PR
- 发布日期：2017-04-24
- 文件大小：195967
- 提供者：racheldo

Bayes_1

1下载：
首先，对CATEGORY中的txt文件分类；其次，对多个txt文件中的英文文本进行分词；最后，通过贝叶斯公式进行分类；-First, in the txt file CATEGORY classification Secondly, multiple txt files in English text word Finally, by Bayes formula to be classified
所属分类：Algorithm
- 发布日期：2017-04-03
- 文件大小：411491
- 提供者：guangyu

code

0下载：
这其中涉及了黑名单、文本分类算法、短信内容分词、特征向量选取等关键技术-That involves a black list, text classification algorithm, SMS is divided into words, feature vector selected key technologies such as
所属分类：Data structs
- 发布日期：2017-04-04
- 文件大小：89622
- 提供者：汪浩

vb

0下载：
连接数据库分词去除停用词计算权重值-Connect to the database to remove stop words word weighted value
所属分类：Algorithm
- 发布日期：2017-04-16
- 文件大小：32360
- 提供者：眭亚键

CLucene

0下载：
clucene 源码，并且增加了自己写的正向最大匹配算法的分词程序。-clucene source code, and increase their own to write the forward maximum matching algorithm for the sub-word program.
所属分类：AI-NN-PR
- 发布日期：2017-03-27
- 文件大小：440298
- 提供者：yimi

dict

0下载：
已处理过的中文分词词典Chinese Word Segment Dictionary,you may need to use it in your CWS program-Chinese Word Segment Dictionary,you may need to use it in your CWS program
所属分类：AI-NN-PR
- 发布日期：2017-03-31
- 文件大小：635225
- 提供者：zhaoyilin

YH_zhizhu_1.0

0下载：
军长搜索是一款基于 Microsoft .NET 2.0 开发的垂直搜索引擎。系统有着强大的文件和数据库引索能力，支持中英文分词，文件相似度分析排序，文件数据时实监控与更新，恐龙级的引索速度和毫秒级的搜索速度，搜索结果高亮显示，系统分两部分组成第一部分是Ｃ/s的搜索蜘蛛，第二部分是Ｂ/s的ＷＥＢ用户搜索显示界面，其整个系统的工作过程完全模仿了超级搜索引擎的工作原理。系统支持对站内和全网的引索。产品适用范围：行业垂直搜索引擎、大型新闻门户网站站内搜索、大型行业门户网站
所属分类：Compress-Decompress algrithms
- 发布日期：2017-03-29
- 文件大小：135074
- 提供者：彭晓

ICTCLASV1.2

0下载：
中科院计算所的分词工具，可以进行分词工作-ICT tools by the word, the work can be sub-word
所属分类：AI-NN-PR
- 发布日期：2017-05-11
- 文件大小：2289507
- 提供者：sh

sample

0下载：
中文分词，中文词法分析是中文信息处理的基础与关键-Chinese word
所属分类：AI-NN-PR
- 发布日期：2017-05-30
- 文件大小：12574058
- 提供者：jingwei

segChnWord

0下载：
中文分词评测系统，用于评测中文分词的质量，给出准确率等-Chinese word segmentation evaluation system for evaluating the quality of Chinese word segmentation, given the accuracy of such
所属分类：AI-NN-PR
- 发布日期：2017-04-08
- 文件大小：3452
- 提供者：miaoer

WebPages_WordSplitting

0下载：
自动提取网页内容（附带简单的 HTTPAnalyzer 类），并根据词典进行分词。-Automatically get the content from webpages, and split the words based on the internal Chinese dictionary.
所属分类：Data structs
- 发布日期：2017-05-13
- 文件大小：3475196
- 提供者：王啊

WebPages_InvertedFile

0下载：
根据中文分词结果生成倒排文档，并将结果输出到文本文件中。-Generate the inverted file based on the result of word-splitting, and output to a text file.
所属分类：Data structs
- 发布日期：2017-05-17
- 文件大小：4790673
- 提供者：王啊

fencisuanfa

0下载：
用正向最大匹配发实现句子的分词。是基于词典的分词算法。该算法的特点是速度快，准确率高。-Made to achieve a positive match with a maximum sentence segmentation. Dictionary-based segmentation algorithm. The algorithm is characterized by fast and accurately.
所属分类：Data structs
- 发布日期：2017-04-01
- 文件大小：901429
- 提供者：张喜

liaotianfenci

0下载：
一种基于国标2312（GB2312）汉字编码标准的分词算法，实现的分词效果是分成单个的汉字，可以识别英文、空格、中英文符号和数字等。也称原子分词算法。-Based on GB 2312 (GB2312) Chinese character coding standard segmentation algorithm to achieve the segmentation effect is divided into individual characters, can be identified
所属分类：Data structs
- 发布日期：2017-03-30
- 文件大小：137964
- 提供者：张喜

Chinese-text-categorization-Study

1下载：
本文通过对Bayes、KNN、SVM 应用于中文文本分类进行比较实验研究。应用ICTCLAS 对中文文档进行分词，在大维数，多数据情况下应用TFIDF 进行特征选择，并同时利用它实现了对特征项进行加权处理，使文本库中的每个文本具有统一的、可处理的结构模型。然后通过三类分类算法实现了对权值数据进行训练和分类。-Based on the Bayes, KNN, SVM applied to compare the Chinese text ca
所属分类：Mathimatics-Numerical algorithms
- 发布日期：2017-03-29
- 文件大小：442391
- 提供者：wulili

« 1 2 34 5 6 7 8 »

搜珍网 www.dssz.com

本网站为编程资源及源代码搜集、介绍的搜索网站，版权归原作者所有！　　粤ICP备11031372号

1999-2046 搜珍网 All Rights Reserved.