Commit History

Author SHA1 Message Date
  Nick Sweeting 33e82736f9 Merge branch 'dev' into plugins-browsertrix 1 year ago
  Nick Sweeting 98c5e69203 bump lockfiles 1 year ago
  Nick Sweeting 17f40f3ada Merge branch 'dev' into fix-URL_REGEX 1 year ago
  Nick Sweeting c6f8a33a63 Update util.py 1 year ago
  longzai e4dc2701ef fix URL_REGEX 2 1 year ago
  longzai 4ae765ec27 fix the URL_REGEX used in generic_html parsers 1 year ago
  Nick Sweeting c22df0b63a Merge branch 'dev' into plugins-browsertrix 1 year ago
  Nick Sweeting c5bb99dce1 explicitly use Default profile inside user data dir 1 year ago
  Nick Sweeting 11b067a1ae Merge branch 'dev' into plugins-browsertrix 1 year ago
  Nick Sweeting ca2c484a8e Add `_EXTRA_ARGS` for various extractors (#1360) 1 year ago
  Ben Muthalaly d8cf09c21e Remove unnecessary variable length args for dedupe 1 year ago
  Ben Muthalaly 4686da91e6 Fix cookies being set incorrectly 1 year ago
  Ben Muthalaly d74ddd42ae Flip dedupe precedence order 1 year ago
  Ben Muthalaly 68326a60ee Add cookies file to http request in `download_url` 1 year ago
  Ben Muthalaly 4d9c5a7b4b Add `CHROME_EXTRA_ARGS` 1 year ago
  Ben Muthalaly 4e69d2c9e1 Add `EXTRA_*_ARGS` for wget, curl, and singlefile 1 year ago
  Nick Sweeting 1ea7ac168a Merge branch 'dev' into plugins-browsertrix 1 year ago
  Nick Sweeting 6a4e568d1b new archivebox update speed improvements 1 year ago
  Nick Sweeting c6faa9ab76 add extra information to headers extractor output 1 year ago
  Nick Sweeting 8c07b7e127 disable automatic chrome selfupdating 1 year ago
  Nick Sweeting 6184f659dc improve window size chrome cli handling 1 year ago
  spresse1 603ce7ec10 After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2 years ago
  Ross Williams c039ef05b3 Fix hyphen placement in util.URL_REGEX 2 years ago
  Ross Williams d0e65eba7f More reliably detect Google Chrome version number 2 years ago
  ふぁ 44a5a5ed7e add explicitly specify --headless=new 2 years ago
  ふぁ d77c770c47 add CHROME_TIMEOUT args 2 years ago
  Nick Sweeting 606fa397a4 disable passing timeout arg to chrome because v111 is crashing when passed 2 years ago
  Nick Sweeting 1f1c70a8b1 remove --single-process from chrome args and add some rendering optimization args 2 years ago
  Nick Sweeting 49faec8f6d add no-zygote and single-process args to try and prevent orphan chrome processes after exit 4 years ago
  Nick Sweeting 9f05cf8283 virtual-time-budget doesnt work with some chrome stuff 4 years ago