Nick Sweeting
|
457c42bf84
load EXTRACTORS dynamically using importlib.import_module
|
1 year ago |
Ben Muthalaly
|
d8cf09c21e
Remove unnecessary variable length args for dedupe
|
1 year ago |
Ben Muthalaly
|
d74ddd42ae
Flip dedupe precedence order
|
1 year ago |
Ben Muthalaly
|
4e69d2c9e1
Add `EXTRA_*_ARGS` for wget, curl, and singlefile
|
1 year ago |
Nick Sweeting
|
6a4e568d1b
new archivebox update speed improvements
|
1 year ago |
Nick Sweeting
|
db2984e47b
prefer dom dump to singlefile for generating readability output
|
1 year ago |
Ben Muthalaly
|
77917e9b55
Fix HTML title parsing bugs.
|
2 years ago |
papersnake
|
de8e22efb7
improve title extractor
|
3 years ago |
Nick Sweeting
|
04c951cdd5
fix alerts
|
4 years ago |
Nick Sweeting
|
385daf9af8
save the url as title for staticfiles or non html files
|
4 years ago |
Dan Arnfield
|
5420903102
Refactor `should_save_extractor` methods to accept `overwrite` parameter
|
4 years ago |
Cristian
|
81d766aba1
refactor: Remove setup_django from title.py
|
5 years ago |
Cristian
|
e7e33ea7a5
tests: Add tests for several different ways to extract the title
|
5 years ago |
Nick Sweeting
|
f727ece7b3
add regex fallback back to title parser
|
5 years ago |
Nick Sweeting
|
79bef1384e
Merge pull request #493 from ttimasdf/feat-ogtitle
|
5 years ago |
Cristian
|
c12fe0e3d7
feat: Use CURL_ARGS on title extractor
|
5 years ago |
ttimasdf
|
eda3836dee
feat: add og:title metadata as alternative title
|
5 years ago |
Cristian
|
b18bbf8874
test: Fix tests post-rebase
|
5 years ago |
Nick Sweeting
|
032c2458de
add missing setup_django import
|
5 years ago |
Nick Sweeting
|
55a237a435
also set snapshot title inside of fetch_title directly
|
5 years ago |
Nick Sweeting
|
273059f054
accept gzipped responses when using curl
|
5 years ago |
Cristian
|
a5550b2105
fix: Rename logging folder to avoid naming conflicts (and circular import issues)
|
5 years ago |
Cristian
|
f4d1b5121e
refactor: Move logging.py to main module to avoid circular import issues
|
5 years ago |
Nick Sweeting
|
5c2bbe7efe
bufixes
|
5 years ago |
Nick Sweeting
|
95007d9137
split up utils into separate files
|
6 years ago |
Nick Sweeting
|
1b8abc0961
move everything out of legacy folder
|
6 years ago |