Commit History

Author SHA1 Message Date
  Nick Sweeting c6f8a33a63 Update util.py 1 year ago
  longzai e4dc2701ef fix URL_REGEX 2 1 year ago
  longzai 4ae765ec27 fix the URL_REGEX used in generic_html parsers 1 year ago
  spresse1 603ce7ec10 After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2 years ago
  Ross Williams c039ef05b3 Fix hyphen placement in util.URL_REGEX 2 years ago
  Ross Williams d0e65eba7f More reliably detect Google Chrome version number 2 years ago
  ふぁ 44a5a5ed7e add explicitly specify --headless=new 2 years ago
  ふぁ d77c770c47 add CHROME_TIMEOUT args 2 years ago
  Nick Sweeting 606fa397a4 disable passing timeout arg to chrome because v111 is crashing when passed 2 years ago
  Nick Sweeting 1f1c70a8b1 remove --single-process from chrome args and add some rendering optimization args 2 years ago
  Nick Sweeting 49faec8f6d add no-zygote and single-process args to try and prevent orphan chrome processes after exit 4 years ago
  Nick Sweeting 9f05cf8283 virtual-time-budget doesnt work with some chrome stuff 4 years ago
  Nick Sweeting 0c321a06d0 hide scrollbars in screenshots 4 years ago
  Nick Sweeting a9986f1f05 add timezone support, tons of CSS and layout improvements, more detailed snapshot admin form info, ability to sort by recently updated, better grid view styling, better table layouts, better dark mode support 4 years ago
  Nick Sweeting 5a9f27204a dont use chrome when its not available on windows systems 4 years ago
  Nick Sweeting 3e26ae4a66 support finding multiple urls as substrings in text 4 years ago
  Nick Sweeting c089501073 add response status code to headers.json 4 years ago
  Nick Sweeting a0a79cead8 move utils and vendored libs into subfolders 5 years ago
  Nick Sweeting 104553489f remove redundant utils file 5 years ago
  Nick Sweeting 83693a5c03 add packaging setup with stdeb for debian and apt 5 years ago
  Nick Sweeting c47398851b nicer timeout hints 5 years ago
  Cristian 62ed11a5ca fix: Improve headers handling 5 years ago
  Angel Rey f0915a56aa Replaced get method 5 years ago
  Angel Rey a8a8fd14ac Fixed indent headers.json 5 years ago
  Angel Rey 852e3c9cff Added headers extractor 5 years ago
  Cristian b18bbf8874 test: Fix tests post-rebase 5 years ago
  apkallum 008769d296 add support for Paths in json encoder 5 years ago
  Nick Sweeting 3658153cf8 fix url parsing through quotes 5 years ago
  Cristian d0d2991c69 fix: Change import that was not working 5 years ago
  Cristian 6006b4f93b refactor: Organize code to remove flake8 issues 5 years ago