文件名称:segment
介绍说明--下载内容来自于网络,使用问题请自行百度
segment,一个简单的中文分词程序,命令行如下:
java -jar segmenter.jar [-b|-g|-8|-s|-t] inputfile.txt
-b Big5, -g GB2312, -8 UTF-8, -s simp. chars, -t trad. chars
Segmented text will be saved to inputfile.txt.seg
java -jar segmenter.jar [-b|-g|-8|-s|-t] inputfile.txt
-b Big5, -g GB2312, -8 UTF-8, -s simp. chars, -t trad. chars
Segmented text will be saved to inputfile.txt.seg
(系统自动生成,下载前可以参看下载内容)
下载文件列表
META-INF/MANIFEST.MF
bothlexu8.txt
segmenter.class
segmenter.java
simplexu8.txt
tradlexu8.txt
data/sforeign_u8.txt
data/snotname_u8.txt
data/snumbers_u8.txt
data/ssurname_u8.txt
data/tforeign_u8.txt
data/tnotname_u8.txt
data/tnumbers_u8.txt
data/tsurname_u8.txt
META-INF
data
www.dssz.com.txt
bothlexu8.txt
segmenter.class
segmenter.java
simplexu8.txt
tradlexu8.txt
data/sforeign_u8.txt
data/snotname_u8.txt
data/snumbers_u8.txt
data/ssurname_u8.txt
data/tforeign_u8.txt
data/tnotname_u8.txt
data/tnumbers_u8.txt
data/tsurname_u8.txt
META-INF
data
www.dssz.com.txt
1999-2046 搜珍网 All Rights Reserved.
本站作为网络服务提供者,仅为网络服务对象提供信息存储空间,仅对用户上载内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。
