author Andreas Baumann <abaumann@yahoo.com> 2014-04-30 16:46:00 +0200
committer Andreas Baumann <abaumann@yahoo.com> 2014-04-30 16:46:00 +0200
commit 12c50867c04b2c2a11f5026466bbea02d5406b70
tree 4008a8d5e3660d823197f97b3c0b244fa37d3ea1 /TODOS
parent eb3771cafb98451116a4f0ec0e7a371800770de1
started a robots.txt parser
Diffstat (limited to 'TODOS')
-rwxr-xr-x TODOS 4
1 files changed, 4 insertions, 0 deletions
diff --git a/TODOS b/TODOS
index 949dc8a..6d28b36 100755
--- a/TODOS
+++ b/TODOS
@@ -12,3 +12,7 @@
- content based type detection on Windows
- port of libmagic?
- something from Microsoft (around the index service)?
+- robots.txt
+ - handle Sitemap
+- Parse URLs from sitemaps
+
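The two new TODO items (handling Sitemap lines in robots.txt, then parsing URLs from the sitemaps they name) could be sketched roughly as follows. This is not the parser started in this commit: the function names `sitemap_urls` and `sitemap_locs` are hypothetical, Python is used only for illustration, and the sketch assumes plain `<urlset>` or `<sitemapindex>` documents in the standard sitemaps.org namespace.

```python
import xml.etree.ElementTree as ET

# Namespace used by sitemaps.org <urlset> and <sitemapindex> documents.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_urls(robots_txt: str) -> list:
    """Return the URLs named by Sitemap lines in a robots.txt body (hypothetical helper)."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop a trailing comment and surrounding whitespace.
        # Simplification: a '#' inside the URL would also be cut off here.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the "https:" of the URL survives.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            value = value.strip()
            if value:
                urls.append(value)
    return urls


def sitemap_locs(sitemap_xml: str) -> list:
    """Return the text of every <loc> element in a sitemap document (hypothetical helper)."""
    root = ET.fromstring(sitemap_xml)
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc") if loc.text]
```

Sitemap lines are independent of any User-agent group, so the first function scans the whole file instead of tracking groups. Fetching the sitemap over HTTP and following nested sitemap index files are left out.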