CDN加速镜像 | 设为首页 | 加入收藏夹
当前位置: 首页 资源下载 源码下载 Windows编程 C#编程

文件名称:PACHONG

  • 所属分类:
  • 标签属性:
  • 上传时间:
    2012-11-16
  • 文件大小:
    780.31kb
  • 已下载:
    0次
  • 提 供 者:
  • 相关连接:
  • 下载说明:
    别用迅雷下载,失败请重下,重下不扣分!

介绍说明--下载内容来自于网络,使用问题请自行百度

网络爬虫程序源码



这是一款用 C# 编写的网络爬虫

主要特性有:

可配置:线程数、线程等待时间,连接超时时间,可爬取文件类型和优先级、下载目录等。

状态栏显示统计信息:排入队列URL数,已下载文件数,已下载总字节数,CPU使用率和可用内存等。

有偏好的爬虫:可针对爬取的资源类型设置不同的优先级。

健壮性:十几项URL正规化策略以排除冗余下载、爬虫陷阱避免策略的使用等、多种策略以解析相对路径等。

较好的性能:基于正则表达式的页面解析、适度加锁、维持HTTP连接等。



今后有空可能加入的特性:

新特性 介绍

爬取文件用Berkeley DB存储 提高性能: 常用操作系统不善于处理大量小文件

基于URL Ranking的优先级队列 主题爬虫: 机器学习算法对链接与主题相关度进行评估,并按照得出的优先级顺序进行爬取

爬虫礼仪 遵循爬虫禁止协议、以及避免对服务器资源的过度使用等

性能优化 用UDP取代封装好的HttpWebRequest/Response

DNS缓存

异步的DNS地址解析

硬盘缓存或内存数据库以避免频繁的磁盘寻道

分布式爬虫以扩展单机能力(CPU、内存和硬盘访问) -GreySky source personal accounting system, management of daily accounting classification of report management user management built several sets of beautiful skin for beginners learning to use.
(系统自动生成,下载前可以参看下载内容)

下载文件列表

NWebCrawler/config.ini
NWebCrawler/MainForm.cs
NWebCrawler/MainForm.Designer.cs
NWebCrawler/MainForm.resx
NWebCrawler/NWebCrawler.csproj
NWebCrawler/obj/Debug/NWebCrawler.csproj.FileListAbsolute.txt
NWebCrawler/obj/Debug/NWebCrawler.csproj.GenerateResource.Cache
NWebCrawler/obj/Debug/NWebCrawler.exe
NWebCrawler/obj/Debug/NWebCrawler.MainForm.resources
NWebCrawler/obj/Debug/NWebCrawler.pdb
NWebCrawler/obj/Debug/NWebCrawler.Properties.Resources.resources
NWebCrawler/obj/Debug/NWebCrawler.SettingsForm.resources
NWebCrawler/obj/Debug/ResolveAssemblyReference.cache
NWebCrawler/Program.cs
NWebCrawler/Properties/AssemblyInfo.cs
NWebCrawler/Properties/Resources.Designer.cs
NWebCrawler/Properties/Resources.resx
NWebCrawler/Properties/Settings.Designer.cs
NWebCrawler/Properties/Settings.settings
NWebCrawler/SettingsForm.cs
NWebCrawler/SettingsForm.Designer.cs
NWebCrawler/SettingsForm.resx
NWebCrawlerLib/Common/Logger.cs
NWebCrawlerLib/Common/PriorityQueue.cs
NWebCrawlerLib/CrawleHistroyEntry.cs
NWebCrawlerLib/CrawlerThread.cs
NWebCrawlerLib/Downloader.cs
NWebCrawlerLib/NWebCrawlerLib.csproj
NWebCrawlerLib/obj/Debug/NWebCrawlerLib.csproj.FileListAbsolute.txt
NWebCrawlerLib/obj/Debug/NWebCrawlerLib.exe
NWebCrawlerLib/obj/Debug/NWebCrawlerLib.pdb
NWebCrawlerLib/Parser.cs
NWebCrawlerLib/Program.cs
NWebCrawlerLib/Properties/AssemblyInfo.cs
NWebCrawlerLib/Settings.cs
NWebCrawlerLib/UrlFrontierQueueManager.cs
NWebCrawlerLib/Utility.cs
NWebCrawler.sln
NWebCrawler.suo
51aspx源码必读.txt
from.gif
最新Asp.Net源码下载.url
bin/config.ini
bin/download/0003be8238c8302e17c799d9f5d65876.gif
bin/download/0718ad68487fa12de0cc75b20f7be03c.html; charset=utf-8
bin/download/082e9d970f371da4f6e74dbe2c97f6e2.html; charset=utf-8
bin/download/132949602460dfebc35da092329cba0c.gif
bin/download/1695505243ceaa9c68e5a00061d1763f.javascript
bin/download/1df7133090a0d07c5cec8fccbf6fd8dd.html; charset=utf-8
bin/download/203557adfb69f0b4da4e237df2c0899a.html; charset=gb2312
bin/download/23e5f50b0b42662c6694e574e74835cd.html; charset=utf-8
bin/download/24eebf7019dc355f064372d6a889c60a.html; charset=gb2312
bin/download/27439efce81b9ca84182d54aa411418e.html; charset=gb2312
bin/download/2a2f02ca86459cde185fc8e8e9045bed.html; charset=utf-8
bin/download/349427e49e96cbca35651e55ef94353d.gif
bin/download/3891570720e771c847e5ac23e28aa6cc.html
bin/download/3ff2932f670fc24203b1290df195dabf.gif
bin/download/417d9e708c95da24b75705338598087f.html
bin/download/44b19dec343bee7540d2e563399518f6.html; charset=gb2312
bin/download/46e1c646c9965ce2581be0e2baa182cf.html; charset=utf-8
bin/download/48bfe5c4818bc6d7d0a86b7c5d5a963a.javascript
bin/download/4cef95f512517e118d0427cdf40d8d91.javascript
bin/download/54cd270476c08dc49137cc587d5420e7.html; charset=utf-8
bin/download/5ae7c8b442091b3c740b5f89f2202977.gif
bin/download/5f194c03340af2c82af0806b4cd95f44.html; charset=gb2312
bin/download/6a78a05748d064e4491b674a391174c7.javascript
bin/download/6ba086f85f3602a364dae60f740138c5.html; charset=gb2312
bin/download/73e9259e079ac68519bd2cf67af06c13.html; charset=utf-8
bin/download/753a67d9417f20f83e1dce17d6146f85.gif
bin/download/767223508f1bd57304d84720065f9ee8.x-javascript
bin/download/7780c2d0134fad8b7a05a95d0f7b3378.html; charset=gb2312
bin/download/7a6721fd05029de13a9df0e2a0948f25.html; charset=UTF-8
bin/download/7eedab1d5fa988b034a32f14e08a97c0.gif
bin/download/84675a6817fc8715e33bc1c631154b5d.html
bin/download/857c3c382495ba1593a316498236e4f8.html; charset=gb2312
bin/download/8769fd41800599144d3fffb49173cf71.x-icon
bin/download/89253cefeda362f9b403341ccec22420.gif
bin/download/8d52d7ccdc272a6bcaf36ae22d856dfc.html; charset=utf-8
bin/download/9339d79eed585c1e0b126588c50477a8.javascript
bin/download/93c0e58661019bd4a98aa3790a400cdf.x-javascript
bin/download/94f1e7adbd48cf364b19771319db6b3f.gif
bin/download/956119ce46fe84d5c1e240ef7d417bdb.html; charset=gb2312
bin/download/9d71e4ab781e1b9bf3eccf2a47568d6e.html; charset=utf-8
bin/download/a2418875c3955a694b18cf795764164a.html; charset=gb2312
bin/download/a490c2a29b5986e5cd4e114a0b50d394.html; charset=gb2312
bin/download/a6275663cfbb6142241df064c6f249f9.html; charset=gb2312
bin/download/a776c9fb2eafab1f75def2a07a40c6ff.html; charset=gb2312
bin/download/b49950b51a7090372fa275d86a0bbae6.html; charset=gb2312
bin/download/baaba63486a5eaa09b34f56b5ffbfe99.html; charset=gb2312
bin/download/bbcff706ddc752ee730069aa036a390b.html; charset=gb2312
bin/download/c6b7e4c627243167faa5495e3aa583ec.html
bin/download/cb8c4ddd3d55475825bf08ed71e11da7.gif
bin/download/d37c07e10a22a9698fe474154fecaef1.html; charset=utf-8
bin/download/d646baff77dd8a709ef5b83ab084dfa1.html; charset=gb2312
bin/download/d655e24ee8c02fb2ef11764588cec317.x-javascript
bin/download/d700dde53028a9f4cb83aaed8df7ab23.x-javascript
bin/download/da78a112d1275115651d236d9c42ee97.html
bin/download/dce581b36d215edeae8f9fdc9c07529e.html; charset=gb2312
bin/download/dfcb93920e639c9f7963e66ad84c9a46.gif
bin/download/e030fe253f6880680bdd7dec04fbf67d.html; charset=gb2312
bin/download/e0b5fdbe393b18e9d9f30feb89c3e695.html
bin/download/e1b0f26b9a2eb96cbcfbe8c6d88d0344.html; charset=utf-8
bin/download/e2ab7c468bc700b

相关说明

  • 本站资源为会员上传分享交流与学习,如有侵犯您的权益,请联系我们删除.
  • 搜珍网是交换下载平台,只提供交流渠道,下载内容来自于网络,除下载问题外,其它问题请自行百度。更多...
  • 本站已设置防盗链,请勿用迅雷、QQ旋风等下载软件下载资源,下载后用WinRAR最新版进行解压.
  • 如果您发现内容无法下载,请稍后再次尝试;或换浏览器;或者到消费记录里找到下载记录反馈给我们.
  • 下载后发现下载的内容跟说明不相乎,请到消费记录里找到下载记录反馈给我们,经确认后退回积分.
  • 如下载前有疑问,可以通过点击"提供者"的名字,查看对方的联系方式,联系对方咨询.

相关评论

暂无评论内容.

发表评论

*快速评论: 推荐 一般 有密码 和说明不符 不是源码或资料 文件不全 不能解压 纯粹是垃圾
*内  容:
*验 证 码:
搜珍网 www.dssz.com