path: root/src
Age | Commit message | Author
2012-08-04 | brutal testing and normalization of Google URL, must refactor most things the... | Andreas Baumann
2012-08-04 | rearranged google test1 and added a GoogleUrlNormalizer | Andreas Baumann
2012-08-03 | tamed some debug output | Andreas Baumann
2012-08-03 | basic normalization | Andreas Baumann
2012-08-03 | fighting with reverse iterators for url normalization | Andreas Baumann
2012-07-29 | - | Andreas Baumann
2012-07-29 | somewhat working again | Andreas Baumann
2012-07-29 | temporarily removed domain, domain filter is a host filter now | Andreas Baumann
2012-07-29 | started to add simple parseUrl implementation | Andreas Baumann
2012-07-28 | heavy redesign of URL class, must not contain any parsing logic as | Andreas Baumann
2012-07-28 | started to add URL normalizers and testing environment for URLs | Andreas Baumann
2012-07-19 | some interface fixes | Andreas Baumann
2012-07-18 | fixed memory frontier | Andreas Baumann
2012-07-18 | added URLSeen component | Andreas Baumann
2012-07-15 | some investment in URL parsing | Andreas Baumann
2012-07-15 | started to add URL filters | Andreas Baumann
2012-07-14 | some pseudo URL normalization | Andreas Baumann
2012-07-14 | first working crawler | Andreas Baumann
2012-07-14 | added streamhtmlparser | Andreas Baumann
2012-07-14 | first fetch works | Andreas Baumann
2012-07-13 | - | Andreas Baumann
2012-07-13 | added a test for a libfetch_streambuf | Andreas Baumann
2012-07-12 | - | Andreas Baumann
2012-07-12 | added basic structure | Andreas Baumann
2012-07-12 | a restart | Andreas Baumann
2010-01-17 | checkin | Andreas Baumann
2009-12-29 | more reading and abstract interfacing | Andreas Baumann
2009-12-25 | continue | Andreas Baumann
2009-12-25 | some starting here | Andreas Baumann