Nick Sweeting
|
dd2302ad92
new jsonl cli interface
|
1 month ago |
Claude
|
69965a2782
fix: correct CLI pipeline data flow for crawl -> snapshot -> extract
|
1 month ago |
Claude
|
ae648c9bc1
refactor: move remaining JSONL methods to models, clean up jsonl.py
|
1 month ago |
Claude
|
bc273c5a7f
feat: add schema_version to JSONL outputs and remove dead code
|
1 month ago |
Claude
|
a5206e7648
refactor: move to_jsonl() methods to models
|
1 month ago |
Claude
|
d36079829b
feat: replace index.json with index.jsonl flat JSONL format
|
1 month ago |
Nick Sweeting
|
f0aa19fa7d
wip
|
1 month ago |
Nick Sweeting
|
bd265c0083
rename extractor to plugin everywhere
|
2 months ago |
Nick Sweeting
|
50e527ec65
way better plugin hooks system wip
|
2 months ago |
Claude
|
b632894bc9
Update views, API, and exports for new ArchiveResult output fields
|
2 months ago |
Claude
|
c3acadd528
Remove extractor field from Crawl model and fix tests
|
2 months ago |
Nick Sweeting
|
bb53228ebf
remove Seed model in favor of Crawl as template
|
2 months ago |
Nick Sweeting
|
1915333b81
wip major changes
|
2 months ago |