summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2012-08-11google url normalization works on Windows, test1 must be improved:Andreas Baumann
2012-08-11added a file spooling buffer in libfetch rewind input stream, a littleAndreas Baumann
2012-08-11fixed memory buffering in libfetch fetcherAndreas Baumann
2012-08-10started adapting googleurl on windows, icu intergration pendingAndreas Baumann
2012-08-10fixed simle url normalizer and it's testsAndreas Baumann
2012-08-10some more windows modulesAndreas Baumann
2012-08-10module loader works on Windows, simple URL normalizer test worksAndreas Baumann
2012-08-10fixed wrong initlializtion order in RewindInputStreamAndreas Baumann
2012-08-10first porting attempts to Windows:Andreas Baumann
2012-08-09-Andreas Baumann
2012-08-09added spooling to LibFetchRewindInputStream in order to support rewindAndreas Baumann
2012-08-09better libmagic buffer detection with increasing buffer on streamAndreas Baumann
2012-08-08added a file rewind input streamAndreas Baumann
2012-08-08handle sigintAndreas Baumann
2012-08-08-Andreas Baumann
2012-08-08modularized all other modulesAndreas Baumann
2012-08-08chain filter and modules with one ctor param work nowAndreas Baumann
2012-08-08more testing and docu aroung Type*Andreas Baumann
2012-08-07started modularization of URL filtersAndreas Baumann
2012-08-07allow modules to be linked as static libraries, mainly to be able toAndreas Baumann
2012-08-07combined the two url normalizer testsAndreas Baumann
2012-08-07more reduction of module code and fixed dependency problem when buildingAndreas Baumann
2012-08-07cleaned up url normalizer tests and made them use module loaderAndreas Baumann
2012-08-07reduced some code duplication when registering modulesAndreas Baumann
2012-08-06removed some debug printsAndreas Baumann
2012-08-06using typeinfo to find correct destruction function for loadable module objectsAndreas Baumann
2012-08-06first steps to make URL loader loadableAndreas Baumann
2012-08-04cleaned up interface of GoogleURLNormalizer APIAndreas Baumann
2012-08-04brutal testing and normalization of Google URL, must refactor most things the...Andreas Baumann
2012-08-04rearanged google test1 and added a GoogleUrlNormalizerAndreas Baumann
2012-08-03tamed some debug outputAndreas Baumann
2012-08-03basic normalizationAndreas Baumann
2012-08-03fighting with reverse iterators for url normalizationAndreas Baumann
2012-07-29-Andreas Baumann
2012-07-29somewhat working againAndreas Baumann
2012-07-29temporarily removed domain, domain filter is a host filter nowAndreas Baumann
2012-07-29started to add simple parseUrl implementationAndreas Baumann
2012-07-28heavy redesign of URL class, must not contain any parsing logic asAndreas Baumann
2012-07-28started to add URL normalizers and testing environment for URLsAndreas Baumann
2012-07-19some interface fixesAndreas Baumann
2012-07-18fixed memory frontierAndreas Baumann
2012-07-18added URLSeen componentAndreas Baumann
2012-07-15some investemnet in URL parsingAndreas Baumann
2012-07-15started to add URL filtersAndreas Baumann
2012-07-14some pseudo URL normalizationAndreas Baumann
2012-07-14first working crawlerAndreas Baumann
2012-07-14added streamhtmlparserAndreas Baumann
2012-07-14first fetch worksAndreas Baumann
2012-07-13-Andreas Baumann
2012-07-13added a test for a libfetch_streambufAndreas Baumann