Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Ben Muthalaly | d8cf09c21e | Remove unnecessary variable length args for dedupe | 1 year ago |
| Ben Muthalaly | d74ddd42ae | Flip dedupe precedence order | 1 year ago |
| Ben Muthalaly | 4e69d2c9e1 | Add `EXTRA_*_ARGS` for wget, curl, and singlefile | 1 year ago |
| Nick Sweeting | 6a4e568d1b | new archivebox update speed improvements | 1 year ago |
| Nick Sweeting | db2984e47b | prefer dom dump to singlefile for generating readability output | 1 year ago |
| Ben Muthalaly | 77917e9b55 | Fix HTML title parsing bugs. | 2 years ago |
| papersnake | de8e22efb7 | improve title extractor | 3 years ago |
| Nick Sweeting | 04c951cdd5 | fix alerts | 4 years ago |
| Nick Sweeting | 385daf9af8 | save the url as title for staticfiles or non html files | 4 years ago |
| Dan Arnfield | 5420903102 | Refactor `should_save_extractor` methods to accept `overwrite` parameter | 4 years ago |
| Cristian | 81d766aba1 | refactor: Remove setup_django from title.py | 5 years ago |
| Cristian | e7e33ea7a5 | tests: Add tests for several different ways to extract the title | 5 years ago |
| Nick Sweeting | f727ece7b3 | add regex fallback back to title parser | 5 years ago |
| Nick Sweeting | 79bef1384e | Merge pull request #493 from ttimasdf/feat-ogtitle | 5 years ago |
| Cristian | c12fe0e3d7 | feat: Use CURL_ARGS on title extractor | 5 years ago |
| ttimasdf | eda3836dee | feat: add og:title metadata as alternative title | 5 years ago |
| Cristian | b18bbf8874 | test: Fix tests post-rebase | 5 years ago |
| Nick Sweeting | 032c2458de | add missing setup_django import | 5 years ago |
| Nick Sweeting | 55a237a435 | also set snapshot title inside of fetch_title directly | 5 years ago |
| Nick Sweeting | 273059f054 | accept gzipped responses when using curl | 5 years ago |
| Cristian | a5550b2105 | fix: Rename logging folder to avoid naming conflicts (and circular import issues) | 5 years ago |
| Cristian | f4d1b5121e | refactor: Move logging.py to main module to avoid circular import issues | 5 years ago |
| Nick Sweeting | 5c2bbe7efe | bufixes | 5 years ago |
| Nick Sweeting | 95007d9137 | split up utils into separate files | 6 years ago |
| Nick Sweeting | 1b8abc0961 | move everything out of legacy folder | 6 years ago |