Historique des commits

Auteur SHA1 Message Date
  Nick Sweeting a680724367 Merge branch 'dev' into search_index_extract_html_text il y a 2 ans
  Ross Williams 310b4d1242 Add htmltotext extractor il y a 2 ans
  Ross Williams b44f7e68b1 Add URL-specific method allow/deny lists il y a 2 ans
  Nick Sweeting bd6d9c165b enforce utf8 on literally all file operations because windows sucks il y a 4 ans
  Cristian 62ed11a5ca fix: Improve headers handling il y a 5 ans
  Angel Rey ee6caca3ca Added more asserts il y a 5 ans
  Angel Rey 1cce786d6d Added test headers extractor il y a 5 ans
  ttimasdf e3329be291 tests: add test for mercury-parser il y a 5 ans
  Cristian cc0fa747ce feat: Add options to ease management of node related extractors il y a 5 ans
  Cristian 2a68af1b94 tests: Add readability tests il y a 5 ans
  Cristian 5429096c30 tests: Add mechanism to avoid using extractors that we are not testing il y a 5 ans
  Nick Sweeting 5b6eb5e4ad make filenames consistent with program name il y a 5 ans
  Cristian 37df00a08b tests: Add basic singlefile test il y a 5 ans
  Cristian e6c571beb2 fix: Remove title from extractors for oneshot il y a 5 ans
  Cristian 23e6803f02 fix: Add change to calculate wget folder when there is a port present il y a 5 ans