| .. |
|
__init__.py
|
f0033f75d0
config.py lint fixes
|
2 лет назад |
|
archive_org.py
|
bd6d9c165b
enforce utf8 on literally all file operations because windows sucks
|
4 лет назад |
|
dom.py
|
603ce7ec10
After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file.
|
2 лет назад |
|
favicon.py
|
1e50ca243e
Add FAVICON_PROVIDER option for custom favicon service
|
2 лет назад |
|
git.py
|
5420903102
Refactor `should_save_extractor` methods to accept `overwrite` parameter
|
4 лет назад |
|
headers.py
|
5420903102
Refactor `should_save_extractor` methods to accept `overwrite` parameter
|
4 лет назад |
|
htmltotext.py
|
310b4d1242
Add htmltotext extractor
|
2 лет назад |
|
media.py
|
b864c38d9e
Don't be strict on unicode errors
|
3 лет назад |
|
mercury.py
|
acb932ba12
improve readability and mercury error handling and fix output path to be relative
|
4 лет назад |
|
pdf.py
|
603ce7ec10
After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file.
|
2 лет назад |
|
readability.py
|
c1fd2cfa42
tag URLs immediately once added instead of waiting until archival completes
|
1 год назад |
|
screenshot.py
|
603ce7ec10
After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file.
|
2 лет назад |
|
singlefile.py
|
d77c770c47
add CHROME_TIMEOUT args
|
2 лет назад |
|
title.py
|
db2984e47b
prefer dom dump to singlefile for generating readability output
|
1 год назад |
|
wget.py
|
a9986f1f05
add timezone support, tons of CSS and layout improvements, more detailed snapshot admin form info, ability to sort by recently updated, better grid view styling, better table layouts, better dark mode support
|
4 лет назад |