Nick Sweeting c1fd2cfa42 tag URLs immediately once added instead of waiting until archival completes 1 year ago
..
__init__.py f0033f75d0 config.py lint fixes 2 years ago
archive_org.py bd6d9c165b enforce utf8 on literally all file operations because windows sucks 4 years ago
dom.py 603ce7ec10 After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2 years ago
favicon.py 1e50ca243e Add FAVICON_PROVIDER option for custom favicon service 2 years ago
git.py 5420903102 Refactor `should_save_extractor` methods to accept `overwrite` parameter 4 years ago
headers.py 5420903102 Refactor `should_save_extractor` methods to accept `overwrite` parameter 4 years ago
htmltotext.py 310b4d1242 Add htmltotext extractor 2 years ago
media.py b864c38d9e Don't be strict on unicode errors 3 years ago
mercury.py acb932ba12 improve readability and mercury error handling and fix output path to be relative 4 years ago
pdf.py 603ce7ec10 After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2 years ago
readability.py c1fd2cfa42 tag URLs immediately once added instead of waiting until archival completes 1 year ago
screenshot.py 603ce7ec10 After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2 years ago
singlefile.py d77c770c47 add CHROME_TIMEOUT args 2 years ago
title.py db2984e47b prefer dom dump to singlefile for generating readability output 1 year ago
wget.py a9986f1f05 add timezone support, tons of CSS and layout improvements, more detailed snapshot admin form info, ability to sort by recently updated, better grid view styling, better table layouts, better dark mode support 4 years ago