Commit History

Author SHA1 Message Date
  Nick Sweeting 457c42bf84 load EXTRACTORS dynamically using importlib.import_module 1 year ago
  Nick Sweeting c1fd2cfa42 tag URLs immediately once added instead of waiting until archival completes 1 year ago
  Nick Sweeting 78d942ac22 show more detail in readabiliity error messages 1 year ago
  Nick Sweeting 5b07a1126c add comment about why DOM is preferred over singlefile for readability parsing 1 year ago
  Nick Sweeting 2c54e55697 prefer dom dump to singlefile for generating readability output 1 year ago
  Nick Sweeting 82d8662c74 add more readability error output 2 years ago
  prnake 011bd104cb remove unused import 3 years ago
  papersnake de8e22efb7 improve title extractor 3 years ago
  Nick Sweeting eb4d3bca9d Update readability.py 4 years ago
  Nick Sweeting a9986f1f05 add timezone support, tons of CSS and layout improvements, more detailed snapshot admin form info, ability to sort by recently updated, better grid view styling, better table layouts, better dark mode support 4 years ago
  Nick Sweeting bd6d9c165b enforce utf8 on literally all file operations because windows sucks 4 years ago
  Nick Sweeting acb932ba12 improve readability and mercury error handling and fix output path to be relative 4 years ago
  Nick Sweeting d0f8a5e710 change mercury atomic_write output order 4 years ago
  Dan Arnfield 5420903102 Refactor `should_save_extractor` methods to accept `overwrite` parameter 4 years ago
  JDC b1f70b2197 Initial implementation 5 years ago
  Nick Sweeting a645f36b87 add comment about fake cmd 5 years ago
  Cristian 66037535fd feat: Add curl command on readability as default command to debug 5 years ago
  Cristian bf3ea42141 fix: Add a default cmd value to handle case where the html cannot be retrieved 5 years ago
  Nick Sweeting a2c158e43e catch OSErrors due to missing path 5 years ago
  Nick Sweeting 7144e0bdce search for node dependencies in output dir first 5 years ago
  Nick Sweeting 92de20af15 better detect missing dependencies on startup 5 years ago
  Cristian 05c71fc302 fix: Organize readability extractor so a timeout does not break the whole process 5 years ago
  Nick Sweeting 03b73bfe77 Update archivebox/extractors/readability.py 5 years ago
  Cristian 5dc7e63792 feat: Update dockerfile to support readability 5 years ago
  Cristian 2a68af1b94 tests: Add readability tests 5 years ago
  Cristian 8aa7b34de7 tests: Add readability to ignored methods in tests 5 years ago
  Cristian dc87d8b68c tests: Update failing tests 5 years ago
  Cristian 0ec747f64e feat: Look in wget, singlefile or dom outputs before attempting to download the information again 5 years ago
  Cristian a14762640e feat: Avoid running readability when the target is a file 5 years ago
  Cristian 61e08a7c43 docs: Update docs link 5 years ago