文件名称:WebCollector-master
-
所属分类:
- 标签属性:
- 上传时间:2016-06-22
-
文件大小:10.46mb
-
已下载:0次
-
提 供 者:
-
相关连接:无下载说明:别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容来自于网络,使用问题请自行百度
爬虫
支持表单爬取,增加分布式支持。hadoop-
Crawler
Support form to climb, increase distributed support. Hadoop
支持表单爬取,增加分布式支持。hadoop-
Crawler
Support form to climb, increase distributed support. Hadoop
(系统自动生成,下载前可以参看下载内容)
下载文件列表
WebCollector-master/
WebCollector-master/.gitignore
WebCollector-master/Lazy/
WebCollector-master/Lazy/.idea/
WebCollector-master/Lazy/.idea/.name
WebCollector-master/Lazy/.idea/compiler.xml
WebCollector-master/Lazy/.idea/copyright/
WebCollector-master/Lazy/.idea/copyright/profiles_settings.xml
WebCollector-master/Lazy/.idea/encodings.xml
WebCollector-master/Lazy/.idea/libraries/
WebCollector-master/Lazy/.idea/libraries/Maven__com_googlecode_juniversalchardet_juniversalchardet_1_0_3.xml
WebCollector-master/Lazy/.idea/libraries/Maven__com_sleepycat_je_5_0_73.xml
WebCollector-master/Lazy/.idea/libraries/Maven__junit_junit_4_11.xml
WebCollector-master/Lazy/.idea/libraries/Maven__log4j_log4j_1_2_17.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_hamcrest_hamcrest_core_1_3.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_json_json_20140107.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_jsoup_jsoup_1_8_3.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_mongodb_mongo_java_driver_3_2_0_rc0.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_slf4j_slf4j_api_1_7_9.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_slf4j_slf4j_log4j12_1_7_9.xml
WebCollector-master/Lazy/.idea/misc.xml
WebCollector-master/Lazy/.idea/modules.xml
WebCollector-master/Lazy/.idea/uiDesigner.xml
WebCollector-master/Lazy/.idea/workspace.xml
WebCollector-master/Lazy/Lazy.iml
WebCollector-master/Lazy/README.md
WebCollector-master/Lazy/demo_task.json
WebCollector-master/Lazy/demo_task1.json
WebCollector-master/Lazy/pom.xml
WebCollector-master/Lazy/src/
WebCollector-master/Lazy/src/main/
WebCollector-master/Lazy/src/main/java/
WebCollector-master/Lazy/src/main/java/cn/
WebCollector-master/Lazy/src/main/java/cn/edu/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/KMeans.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/StopWords.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/WebpageKmeans.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/WordsBag.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/LazyConfig.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/LazyCrawler.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/Main.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/util/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/util/MongoHelper.java
WebCollector-master/Lazy/src/main/resources/
WebCollector-master/Lazy/src/main/resources/stopwords.txt
WebCollector-master/NewsCrawler.java
WebCollector-master/README.md
WebCollector-master/README.zh-cn.md
WebCollector-master/WebCollector-Hadoop/
WebCollector-master/WebCollector-Hadoop/README.md
WebCollector-master/WebCollector-Hadoop/build.sh
WebCollector-master/WebCollector-Hadoop/conf/
WebCollector-master/WebCollector-Hadoop/conf/crawler-default.xml
WebCollector-master/WebCollector-Hadoop/conf/hadoop/
WebCollector-master/WebCollector-Hadoop/conf/hadoop/core-site.xml
WebCollector-master/WebCollector-Hadoop/conf/hadoop/hdfs-site.xml
WebCollector-master/WebCollector-Hadoop/conf/hadoop/mapred-site.xml
WebCollector-master/WebCollector-Hadoop/conf/regex
WebCollector-master/WebCollector-Hadoop/pom.xml
WebCollector-master/WebCollector-Hadoop/src/
WebCollector-master/WebCollector-Hadoop/src/main/
WebCollector-master/WebCollector-Hadoop/src/main/java/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/DBReader.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/DBUpdater.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/Generator.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/Injector.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/Merge.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/SegmentUtil.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawler/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawler/Crawler.java
W
WebCollector-master/.gitignore
WebCollector-master/Lazy/
WebCollector-master/Lazy/.idea/
WebCollector-master/Lazy/.idea/.name
WebCollector-master/Lazy/.idea/compiler.xml
WebCollector-master/Lazy/.idea/copyright/
WebCollector-master/Lazy/.idea/copyright/profiles_settings.xml
WebCollector-master/Lazy/.idea/encodings.xml
WebCollector-master/Lazy/.idea/libraries/
WebCollector-master/Lazy/.idea/libraries/Maven__com_googlecode_juniversalchardet_juniversalchardet_1_0_3.xml
WebCollector-master/Lazy/.idea/libraries/Maven__com_sleepycat_je_5_0_73.xml
WebCollector-master/Lazy/.idea/libraries/Maven__junit_junit_4_11.xml
WebCollector-master/Lazy/.idea/libraries/Maven__log4j_log4j_1_2_17.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_hamcrest_hamcrest_core_1_3.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_json_json_20140107.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_jsoup_jsoup_1_8_3.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_mongodb_mongo_java_driver_3_2_0_rc0.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_slf4j_slf4j_api_1_7_9.xml
WebCollector-master/Lazy/.idea/libraries/Maven__org_slf4j_slf4j_log4j12_1_7_9.xml
WebCollector-master/Lazy/.idea/misc.xml
WebCollector-master/Lazy/.idea/modules.xml
WebCollector-master/Lazy/.idea/uiDesigner.xml
WebCollector-master/Lazy/.idea/workspace.xml
WebCollector-master/Lazy/Lazy.iml
WebCollector-master/Lazy/README.md
WebCollector-master/Lazy/demo_task.json
WebCollector-master/Lazy/demo_task1.json
WebCollector-master/Lazy/pom.xml
WebCollector-master/Lazy/src/
WebCollector-master/Lazy/src/main/
WebCollector-master/Lazy/src/main/java/
WebCollector-master/Lazy/src/main/java/cn/
WebCollector-master/Lazy/src/main/java/cn/edu/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/KMeans.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/StopWords.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/WebpageKmeans.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/dm/example/WordsBag.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/LazyConfig.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/LazyCrawler.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/Main.java
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/util/
WebCollector-master/Lazy/src/main/java/cn/edu/hfut/dmic/webcollector/lazy/util/MongoHelper.java
WebCollector-master/Lazy/src/main/resources/
WebCollector-master/Lazy/src/main/resources/stopwords.txt
WebCollector-master/NewsCrawler.java
WebCollector-master/README.md
WebCollector-master/README.zh-cn.md
WebCollector-master/WebCollector-Hadoop/
WebCollector-master/WebCollector-Hadoop/README.md
WebCollector-master/WebCollector-Hadoop/build.sh
WebCollector-master/WebCollector-Hadoop/conf/
WebCollector-master/WebCollector-Hadoop/conf/crawler-default.xml
WebCollector-master/WebCollector-Hadoop/conf/hadoop/
WebCollector-master/WebCollector-Hadoop/conf/hadoop/core-site.xml
WebCollector-master/WebCollector-Hadoop/conf/hadoop/hdfs-site.xml
WebCollector-master/WebCollector-Hadoop/conf/hadoop/mapred-site.xml
WebCollector-master/WebCollector-Hadoop/conf/regex
WebCollector-master/WebCollector-Hadoop/pom.xml
WebCollector-master/WebCollector-Hadoop/src/
WebCollector-master/WebCollector-Hadoop/src/main/
WebCollector-master/WebCollector-Hadoop/src/main/java/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/DBReader.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/DBUpdater.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/Generator.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/Injector.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/Merge.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawldb/SegmentUtil.java
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawler/
WebCollector-master/WebCollector-Hadoop/src/main/java/cn/edu/hfut/dmic/webcollector/crawler/Crawler.java
W
本网站为编程资源及源代码搜集、介绍的搜索网站,版权归原作者所有! 粤ICP备11031372号
1999-2046 搜珍网 All Rights Reserved.