Search resource list
MSWord
- An MS Word document information extraction program written in Perl, intended to support later full-text search.
PDFTest_bemjh
- Written in C++. Reads a PDF file, analyzes its contents, extracts the text, and writes it to a text file.
GetUnicode
- See whether this is useful to you: font library generation, glyph dot-matrix extraction, Unicode lookup and conversion, binary file to text file conversion, and text file to binary file conversion (restoring an array to a BIN file).
WordPadTest
- A DOC document text extraction class implemented without OLE Automation. It also demonstrates a mechanism for inter-process communication using remote hooks.
Select-Chinese-from-the-web
- Web page text extraction. It has been tested and is mainly intended for features such as spam web page filtering.
dataFile
- A file text extraction program based on the KMP algorithm. It extracts the desired text from a file, reassembles it, and writes the output to another file.
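
For readers unfamiliar with the KMP (Knuth-Morris-Pratt) approach mentioned in the dataFile entry, here is a minimal Python sketch of how such an extractor might work. It is not the code from that package: the function names (`build_failure`, `kmp_find`, `extract_between`) and the idea of selecting text between a begin marker and an end marker are assumptions made purely for illustration.

```python
def build_failure(pattern):
    """Return the KMP failure table: fail[i] is the length of the longest
    proper prefix of pattern[:i + 1] that is also a suffix of it."""
    fail = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    return fail


def kmp_find(text, pattern, start=0):
    """Return the index of the first match of pattern at or after start, or -1."""
    if not pattern:
        return start if start <= len(text) else -1
    fail = build_failure(pattern)
    k = 0  # number of pattern characters matched so far
    for i in range(start, len(text)):
        while k > 0 and text[i] != pattern[k]:
            k = fail[k - 1]  # fall back without re-reading the text
        if text[i] == pattern[k]:
            k += 1
        if k == len(pattern):
            return i - k + 1
    return -1


def extract_between(text, begin, end):
    """Collect every span between a begin marker and the next end marker.
    The marker-based selection is a hypothetical stand-in for whatever
    rule the original program uses to pick the desired text."""
    if not begin or not end:
        raise ValueError("markers must be non-empty")
    pieces = []
    pos = 0
    while True:
        b = kmp_find(text, begin, pos)
        if b == -1:
            break
        content_start = b + len(begin)
        e = kmp_find(text, end, content_start)
        if e == -1:
            break
        pieces.append(text[content_start:e])
        pos = e + len(end)
    return pieces
```

To mirror the file-to-file workflow in the description, one could read the input with `open(path, encoding="utf-8").read()`, pass the result to `extract_between`, and write the joined pieces to a second file. The paths and encoding are left to the caller.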
