Diffstat (limited to 'docs/LINKS')
-rw-r--r-- docs/LINKS | 11
1 files changed, 11 insertions, 0 deletions
diff --git a/docs/LINKS b/docs/LINKS
index 568183f..afa1082 100644
--- a/docs/LINKS
+++ b/docs/LINKS
@@ -5,8 +5,19 @@ http://mercator.comm.nsdlib.org/
authors working for Microsoft now :-)
heritrix
+crawler4j
mainly dead or unusable:
jspider
websphinx
+
+Javascript support
+
+phantomjs http://code.google.com/p/phantomjs/
+https://github.com/mikeal/spider
+https://github.com/joshfire/node-crawler
+
+Php
+
+http://www.makeuseof.com/tag/build-basic-web-crawler-pull-information-website/