Nick Sweeting
|
c6f8a33a63
Update util.py
|
1 year ago |
longzai
|
e4dc2701ef
fix URL_REGEX 2
|
1 year ago |
longzai
|
4ae765ec27
fix the URL_REGEX used in generic_html parsers
|
1 year ago |
spresse1
|
603ce7ec10
After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file.
|
2 years ago |
Ross Williams
|
c039ef05b3
Fix hyphen placement in util.URL_REGEX
|
2 years ago |
Ross Williams
|
d0e65eba7f
More reliably detect Google Chrome version number
|
2 years ago |
ふぁ
|
44a5a5ed7e
add explicitly specify --headless=new
|
2 years ago |
ふぁ
|
d77c770c47
add CHROME_TIMEOUT args
|
2 years ago |
Nick Sweeting
|
606fa397a4
disable passing timeout arg to chrome because v111 is crashing when passed
|
2 years ago |
Nick Sweeting
|
1f1c70a8b1
remove --single-process from chrome args and add some rendering optimization args
|
2 years ago |
Nick Sweeting
|
49faec8f6d
add no-zygote and single-process args to try and prevent orphan chrome processes after exit
|
4 years ago |
Nick Sweeting
|
9f05cf8283
virtual-time-budget doesnt work with some chrome stuff
|
4 years ago |
Nick Sweeting
|
0c321a06d0
hide scrollbars in screenshots
|
4 years ago |
Nick Sweeting
|
a9986f1f05
add timezone support, tons of CSS and layout improvements, more detailed snapshot admin form info, ability to sort by recently updated, better grid view styling, better table layouts, better dark mode support
|
4 years ago |
Nick Sweeting
|
5a9f27204a
dont use chrome when its not available on windows systems
|
4 years ago |
Nick Sweeting
|
3e26ae4a66
support finding multiple urls as substrings in text
|
4 years ago |
Nick Sweeting
|
c089501073
add response status code to headers.json
|
4 years ago |
Nick Sweeting
|
a0a79cead8
move utils and vendored libs into subfolders
|
5 years ago |
Nick Sweeting
|
104553489f
remove redundant utils file
|
5 years ago |
Nick Sweeting
|
83693a5c03
add packaging setup with stdeb for debian and apt
|
5 years ago |
Nick Sweeting
|
c47398851b
nicer timeout hints
|
5 years ago |
Cristian
|
62ed11a5ca
fix: Improve headers handling
|
5 years ago |
Angel Rey
|
f0915a56aa
Replaced get method
|
5 years ago |
Angel Rey
|
a8a8fd14ac
Fixed indent headers.json
|
5 years ago |
Angel Rey
|
852e3c9cff
Added headers extractor
|
5 years ago |
Cristian
|
b18bbf8874
test: Fix tests post-rebase
|
5 years ago |
apkallum
|
008769d296
add support for Paths in json encoder
|
5 years ago |
Nick Sweeting
|
3658153cf8
fix url parsing through quotes
|
5 years ago |
Cristian
|
d0d2991c69
fix: Change import that was not working
|
5 years ago |
Cristian
|
6006b4f93b
refactor: Organize code to remove flake8 issues
|
5 years ago |