搜索资源列表
POSTagger
- (1)从已经标注好词性的语料中统计得到词性标记的二元转移矩阵,以及每个词以确定的词性标记出现的次数等数据(训练阶段) (2)利用动态规划算法快速选取词性标记路径,得到词性标记结果 (3)可以选择不同的词性标记集 -(1) has been marked from the Corpus POS good statistical POS be labeled binary transfer matrix, and every word to determine the POS m
POSTagger
- (1)从已经标注好词性的语料中统计得到词性标记的二元转移矩阵,以及每个词以确定的词性标记出现的次数等数据(训练阶段) (2)利用动态规划算法快速选取词性标记路径,得到词性标记结果 (3)可以选择不同的词性标记集 -(1) from the good part-of-speech tagging has been the Corpus statistics to be part of speech marking the transfer of binary matrix, a
natural-language-processing
- 统计自然语言处理PPT-刘挺 中科院自动化研究所、模式识别国家重点实验室的 介绍的内容有统计机器翻译、词法分析与词性标注、语料库与词汇知识库-Statistical Natural Language Processing PPT-Ting Liu Institute of Automation, Chinese Academy of Sciences, State Key Laboratory of Pattern Recognition content presentation of
nlpir_ictclas2013_release
- 中科院发布的分词系统,能很好的进行中文分词,词性标注。-Chinese Academy of Sciences released a segmentation system that can be very good for Chinese word segmentation, POS tagging.
HmmPos
- 基于HMM的中文词性标注代码,内有详细注释,并附有练习样本-Chinese POS tagging HMM-based code, with detailed notes, along with sample exercises
CodesPandPApplication
- 中科院中文词法分析器,对输入文本分词,词性标注,未登陆词识别功能,正确率高。-Chinese Academy of Sciences Chinese lexical analyzer, to enter text segmentation, POS tagging, not landing word recognition, correct rate.
HanLP-1.2.7
- HanLP是一个致力于向生产环境普及NLP技术的开源Java工具包,支持中文分词(N-最短路分词、CRF分词、索引分词、用户自定义词典、词性标注),命名实体识别(中国人名、音译人名、日本人名、地名、实体机构名识别),关键词提取,自动摘要,短语提取,拼音转换,简繁转换,文本推荐,依存句法分析(MaxEnt依存句法分析、神经网络依存句法分析)。-HanLP is a dedicated to popularize NLP technology to production environment of
acopost_note
- acopost是Ingo Schroder于02年在德国汉堡大学完成的一个词性标注工具包。主要实现了基于实例、最大熵、2元隐马、基于转换规则等4种词性标注算法,以及评价和算法融合等。采用的语言是perl和c,代码比较短小,非常适于学习。-acopost Ingo Schroder is a speech in 2002 at the University of Hamburg, Germany marked the completion of the toolkit. The main achi
jieba-analysis-master
- 结巴分词(java版)只保留的原项目针对搜索引擎分词的功能(cut_for_index、cut_for_search),词性标注,关键词提取没有实现(今后如用到,可以考虑实现)。-Stammer participle (Java version) to retain only the original project for search engine participle (cut for index, cut for search), part of speech tagging, keyw
ltp_code
- 哈工大语言云LTP的C++集成代码,能够实现自然语言的处理。能够进行分词、词性标注、 命名实体识别、依存句法分析、语义角色标注 语义依存分析等功能。注:读者需要自己到哈工大官网注册KEYS使用。-Harbin Institute of technology language cloud LTP C integrated code, can realize natural language processing. Segmentation, part of speech tagging,
HanLP-1.2.10.tar
- 汉语自然语言处理,包括分词,词性标注,命名实体,及句法依存-chinese netrual solve
NLP-speech-tagging
- 基于隐马尔可夫模型的中文分词、词性标注、命名实体识别-Based on Chinese word hidden Markov model, speech tagging, named entity recognition
ytgfc.tar
- 用python实现对文档的分词,并进行词性标注-Use python to achieve the word on the document, and voice tagging
ltp-3.3.2
- 哈工大信息检索实验室进行文本的依存分析、命名实体识别、词性标注、分词、语义依存分析、语义角色标注(dependency parse of text)
ltp-3.4.0
- 自然语言处理开源项目源代码,中文分词,词性标注等功能介绍(Natural language processing open source project source code, Chinese word segmentation, speech tagging and other functions)
hmm机器学习
- HMM(隐马尔科夫模型)是自然语言处理中的一个基本模型,用途比较广泛,如汉语分词、词性标注及语音识别等,在NLP中占有很重要的地位(HMM (hidden Markov model) is a basic model in Natural Language Processing, which is widely used, such as Chinese segmentation, part of speech tagging and speech recognition, and plays
汉语分词20140928
- cltclas中文分词工具包,可以进行分词,词性标注等等(Cltclas Chinese word segmentation kit, can be participle, part of speech tagging, and so on)
ansj_seg-master
- 一个很好的中文分词工具,其中使用了CRF做词性标注以及新词发现(A good Chinese word segmentation tool, in which CRF is used for part of speech tagging and new word discovery.)
JNA
- 中文的分词,包括词性标注、关键词提取,Java文件(word segmentation and part of speech tagging)
download_tweets
- 能够进行词性标注、词典匹配、否定词匹配,能够进行CRF之前的模型准备工作(Can do part of speech tagging, dictionary matching, negative word matching)