summaryrefslogtreecommitdiff
path: root/docs
diff options
context:
space:
mode:
authorAndreas Baumann <abaumann@yahoo.com>2012-07-08 20:53:48 +0200
committerAndreas Baumann <abaumann@yahoo.com>2012-07-08 20:53:48 +0200
commitceae68b94f60005bd3a8a6704320abf2c8e18728 (patch)
tree42228f3d652b8871bb8e740745f6a506ccdd26a2 /docs
parente43ac4c3e6e7695208e54318e01d16cf1b6bf374 (diff)
downloadcrawler-ceae68b94f60005bd3a8a6704320abf2c8e18728.tar.gz
crawler-ceae68b94f60005bd3a8a6704320abf2c8e18728.tar.bz2
some doc links
Diffstat (limited to 'docs')
-rw-r--r--docs/LINKS11
1 files changed, 11 insertions, 0 deletions
diff --git a/docs/LINKS b/docs/LINKS
index 568183f..afa1082 100644
--- a/docs/LINKS
+++ b/docs/LINKS
@@ -5,8 +5,19 @@ http://mercator.comm.nsdlib.org/
authors working for Microsoft now :-)
heritrix
+crawler4j
mainly dead or unusable:
jspider
websphinx
+
+Javascript support
+
+phantomjs http://code.google.com/p/phantomjs/
+https://github.com/mikeal/spider
+https://github.com/joshfire/node-crawler
+
+Php
+
+http://www.makeuseof.com/tag/build-basic-web-crawler-pull-information-website/