搜索资源 - 语料 - 搜珍网

CDN加速镜像 | 设为首页 | 加入收藏夹

热门搜索： 源码 Android 整站插件识别 p2p OpenCV 网络编程游戏源码算法更多...

登陆 | 会员注册

当前位置：

搜索资源 - 语料

下载资源主分类

源码下载

Web源码

开发工具

文档下载

其它资源

资源分类

Windows编程

Internet/网络编程

系统编程

通讯/手机编程

游戏

多媒体

嵌入式/单片机编程

图形图象

数值算法/人工智能

行业应用软件

数据库系统

其它

搜索资源列表

generate_wordlist

0下载：
一个生成词典的程序，从语料中抽取每一个不同的词按格式要求组成词典。-a program for generating wordlist,the detail is to get every word from corper and form a wordlist.
所属分类：Windows Develop
- 发布日期：2017-05-18
- 文件大小：4742436
- 提供者：糊涂虫

viterbi

0下载：
NLP中viterby算法的实现,对语料进行处理，建模，然后可以对新的语料进行句法标注-NLP algorithm implementation in viterby
所属分类：Communication-Mobile
- 发布日期：2017-05-02
- 文件大小：723453
- 提供者：skyxiang

072282

0下载：
提出了一种自动构造特定领域本体的方法，该方法应用术语抽取和多重聚类技术。在术语抽取阶段，通过术语在专业语料与背景语料中出现概率的对比，采用LLR公式对术语进行评分，取得了更好的抽取效果。在层级关系发现过程中，采用上下文共现信息结合HowNet中词语的语义相似度，进行术语间相似度度量，力求获得术语间最合理的相关状况。同时改进了k-medoids聚类算法，更准确地发现术语的层级关系，进而构造出特定领域的本体。-This paper presents an approach to mining dom
所属分类：Other systems
- 发布日期：2017-04-17
- 文件大小：100753
- 提供者：xiaobai

reuters

0下载：
路透社预处理工具，简单方便实用快捷，可把语料集按类别分类-Reuters Preprocessing tools, fast and simple and practical, can be classified according to the corpus set
所属分类：Special Effects
- 发布日期：2017-04-05
- 文件大小：762225
- 提供者：zxj

word_split

0下载：
这个一个基于逆向最大匹配的分词程序，语料规模比较小。-The maximum matching based on the reverse of the sub-term process, relatively small-scale corpus.
所属分类：MultiLanguage
- 发布日期：2017-04-09
- 文件大小：1517543
- 提供者：nancy

segword

0下载：
segword训练语料处理程序，针对人民日报199801训练语料进行训练的程序-segword
所属分类：MultiLanguage
- 发布日期：2017-05-12
- 文件大小：2726561
- 提供者：weiwei

PU123ACorpora.tar

0下载：
这是一个供做垃圾邮件方面东西的朋友的语料库，很好用的，望对大家有帮助-This is a place for things to do in junk e-mail a friend corpus, well used, hope helpful to everyone
所属分类：MultiLanguage
- 发布日期：2017-05-21
- 文件大小：6427967
- 提供者：王嘉琪

clcl

0下载：
关于语音识别中语料库的建立与整理，以及分析统计-Speech Recognition Corpus on the establishment and finishing, as well as the analysis of statistical
所属分类：Speech/Voice recognition/combine
- 发布日期：2017-04-24
- 文件大小：163370
- 提供者：comma

bigram1

1下载：
根据从语料库中统计出的词表建立二元文法法语言模型-According to statistics from the corpus vocabulary out of the establishment of the dual language model grammar France
所属分类：Other systems
- 发布日期：2017-04-16
- 文件大小：127978
- 提供者：liujianfei

SogouT.mini.tar

0下载：
百度搜索引擎具有响应速度快、查找结果准确全面、时效性强、无效链接少、符合中文语言特点和中国人使用习惯等优点。 1...这种方法只需对语料中的字组频度进行统计,不需要切分词典,因而又叫做无词典分词法或统计取词方法。但这种方法也有一定- IHTMLDocument3* pHTMLDoc3 HRESULT hr = m_pHTMLDocument2->QueryInterface(IID_IHTMLDocument3, (LPVOID*)&pHTMLDoc3)
所属分类：Search Engine
- 发布日期：2017-03-29
- 文件大小：62317
- 提供者：xuhaifan

WindowsApplication1

0下载：
处理的对象是：完成分词和词性标注的语料，实现的结果是：统计出现词频完成降序排列。-Dealing with the object are: the completion of word segmentation and POS tagging of the corpus, the results achieved are: the completion of word frequency statistics appear in descending order.
所属分类：MultiLanguage
- 发布日期：2017-03-29
- 文件大小：36724
- 提供者：陈烨彬

yuyinchulichengxv

1下载：
对给定语料估计其基音周期。要求用MATLAB或C语言实现有关基音检测算法，并给出检测结果。 -For a given corpus to estimate the pitch period. Required to use MATLAB or C language realization of the pitch detection algorithm, and gives test results.
所属分类：matlab
- 发布日期：2017-03-29
- 文件大小：7167
- 提供者：飞扬

HtmlAgilityPack20

0下载：
HtmlAgilityPack20 对从网站上爬去的新闻语料抽取出标题，时间，正文等-HtmlAgilityPack20 right from the Web Paqu news corpus extracted title, time, text, etc.
所属分类：Windows Develop
- 发布日期：2017-04-24
- 文件大小：186292
- 提供者：wony

ChinesePronominalCoreferenceResolution

0下载：
基于决策树的汉语代词共指消解提出一种统计与规则相结合的决策树算法进行汉语代词共指消解 ,利用规则过滤掉属性冲突的反例 ,一定程度上弥补了决策树算法忽略属性关联性的缺点. 采用 Chinese Treebank 作为语料进行测试 ,手工标注其中的共指关系和特征向量首先用规则过滤 ,然后采用 C415 决策树算法选择先行语. 实验结果显示 ,消解成功率为 82159 ,其中人称代词和指示代词的成功率分别为 87160 和 75121 .-A total based on de
所属分类：AI-NN-PR
- 发布日期：2017-04-02
- 文件大小：109887
- 提供者：pahran

LJParser

0下载：
聚类算法相关知识，有语料和训练文本集，可供大家学习。-AppWizard has created this application for you. This application not only demonstrates the basics of using the Microsoft Foundation classes but is also a starting point for writing your application.
所属分类：Other systems
- 发布日期：2017-06-11
- 文件大小：18475109
- 提供者：杨婷

fenci

0下载：
分词时，可以使用的词典及其语料。语料是北大1998年语料，已经分好词，并且标好词性。-Word, you can use the dictionary and corpus. Corpus is a corpus of Beijing University in 1998, has been divided into many words, and marked a good part of speech.
所属分类：Windows Develop
- 发布日期：2017-05-11
- 文件大小：2296972
- 提供者：王宏

IDFCal

1下载：
tf-idf程序，朋友写的，很好。对中文句子进行相似度计算，有计算句子权值、排序、两两句子之间的相似度计算。有语料，可以直接运行-tf-idf program, friends wrote, very good. Similarity calculation for Chinese sentences, the sentence weights are calculated, sort, twenty-two similarity between sub-calculation. A corpu
所属分类：Other windows programs
- 发布日期：2017-04-05
- 文件大小：16245
- 提供者：Shirley

1998renminribaodaiyoucixingbiaozhu

0下载：
语音合成训练用语料，分词并带有词性标注。文档性质不是源码。-TTS language training materials, word and with part of speech tagging. The nature of the document is not a source.
所属分类：Speech/Voice recognition/combine
- 发布日期：2017-05-10
- 文件大小：2220975
- 提供者：wulang

MM2

1下载：
利用隐马尔可夫模型实现词性标注。此为无监督模型。内含语料库和测试集。方便大家学习。--Transition Matrix and Emission Matrix of Hidden Markov Model
所属分类：Windows编程
- 发布日期：2014-01-17
- 文件大小：9519447
- 提供者：ken

VoxForge

0下载：
高级语音识别语料库，英语专用，HTK必备资料-Advanced speech recognition corpus, English dedicated, HTK essential information
所属分类：AI-NN-PR
- 发布日期：2017-05-06
- 文件大小：1309259
- 提供者：Fatso Ding

« 1 2 3 4 56 7 8 9 10 11 »

搜珍网 www.dssz.com

本网站为编程资源及源代码搜集、介绍的搜索网站，版权归原作者所有！　　粤ICP备11031372号

1999-2046 搜珍网 All Rights Reserved.