diff options
author | Andreas Baumann <abaumann@yahoo.com> | 2012-07-08 20:53:48 +0200 |
---|---|---|
committer | Andreas Baumann <abaumann@yahoo.com> | 2012-07-08 20:53:48 +0200 |
commit | ceae68b94f60005bd3a8a6704320abf2c8e18728 (patch) | |
tree | 42228f3d652b8871bb8e740745f6a506ccdd26a2 | |
parent | e43ac4c3e6e7695208e54318e01d16cf1b6bf374 (diff) | |
download | crawler-ceae68b94f60005bd3a8a6704320abf2c8e18728.tar.gz crawler-ceae68b94f60005bd3a8a6704320abf2c8e18728.tar.bz2 |
some doc links
-rw-r--r-- | docs/LINKS | 11 |
1 files changed, 11 insertions, 0 deletions
@@ -5,8 +5,19 @@ http://mercator.comm.nsdlib.org/ authors working for Microsoft now :-) heritrix +crawler4j mainly dead or unusable: jspider websphinx + +Javascript support + +phantomjs http://code.google.com/p/phantomjs/ +https://github.com/mikeal/spider +https://github.com/joshfire/node-crawler + +Php + +http://www.makeuseof.com/tag/build-basic-web-crawler-pull-information-website/ |