|
@@ -14,7 +14,7 @@ Project-URL: Roadmap, https://github.com/ArchiveBox/ArchiveBox/wiki/Roadmap
|
|
|
Project-URL: Community, https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-Community
|
|
Project-URL: Community, https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-Community
|
|
|
Project-URL: Donate, https://github.com/ArchiveBox/ArchiveBox/wiki/Donations
|
|
Project-URL: Donate, https://github.com/ArchiveBox/ArchiveBox/wiki/Donations
|
|
|
Description: <div align="center">
|
|
Description: <div align="center">
|
|
|
- <img src="https://i.imgur.com/4nkFjdv.png" height="80px">
|
|
|
|
|
|
|
+ <em><img src="https://i.imgur.com/5B48E3N.png" height="90px"></em>
|
|
|
<h1>ArchiveBox<br/><sub>The open-source self-hosted web archive.</sub></h1>
|
|
<h1>ArchiveBox<br/><sub>The open-source self-hosted web archive.</sub></h1>
|
|
|
|
|
|
|
|
▶️ <a href="https://github.com/ArchiveBox/ArchiveBox/wiki/Quickstart">Quickstart</a> |
|
|
▶️ <a href="https://github.com/ArchiveBox/ArchiveBox/wiki/Quickstart">Quickstart</a> |
|
|
@@ -41,7 +41,7 @@ Description: <div align="center">
|
|
|
<hr/>
|
|
<hr/>
|
|
|
</div>
|
|
</div>
|
|
|
|
|
|
|
|
- ArchiveBox is a powerful self-hosted internet archiving solution written in Python 3. You feed it URLs of pages you want to archive, and it saves them to disk in a varitety of formats depending on the configuration and the content it detects. ArchiveBox can be installed via [Docker](https://docs.docker.com/get-docker/) or [`pip3`](https://wiki.python.org/moin/BeginnersGuide/Download).
|
|
|
|
|
|
|
+ ArchiveBox is a powerful self-hosted internet archiving solution written in Python 3. You feed it URLs of pages you want to archive, and it saves them to disk in a varitety of formats depending on the configuration and the content it detects. ArchiveBox can be installed via [Docker](https://docs.docker.com/get-docker/) (recommended) or [`pip`](https://www.python.org/downloads/). It works on macOS, Windows, and Linux/BSD (both armv7 and amd64).
|
|
|
|
|
|
|
|
Once installed, URLs can be added via the command line `archivebox add` or the built-in Web UI `archivebox server`. It can ingest bookmarks from a service like Pocket/Pinboard, your entire browsing history, RSS feeds, or URLs one at a time.
|
|
Once installed, URLs can be added via the command line `archivebox add` or the built-in Web UI `archivebox server`. It can ingest bookmarks from a service like Pocket/Pinboard, your entire browsing history, RSS feeds, or URLs one at a time.
|
|
|
|
|
|
|
@@ -51,19 +51,27 @@ Description: <div align="center">
|
|
|
#### Quickstart
|
|
#### Quickstart
|
|
|
|
|
|
|
|
```bash
|
|
```bash
|
|
|
- docker run -d -it -v ~/archivebox:/data -p 8000:8000 archivebox/archivebox server --init 0.0.0.0:8000
|
|
|
|
|
- docker run -v ~/archivebox:/data -it archivebox/archivebox manage createsuperuser
|
|
|
|
|
- docker run -v ~/archivebox:/data -it archivebox/archivebox add 'https://example.com'
|
|
|
|
|
|
|
+ # 1. Create a folder somewhere to hold your ArchiveBox data
|
|
|
|
|
+ mkdir ~/archivebox && cd ~/archivebox
|
|
|
|
|
+ docker run -v $PWD:/data -it archivebox/archivebox init
|
|
|
|
|
+
|
|
|
|
|
+ # 2. Archive some URLs to get started
|
|
|
|
|
+ docker run -v $PWD:/data -t archivebox/archivebox add https://github.com/ArchiveBox/ArchiveBox
|
|
|
|
|
+ docker run -v $PWD:/data -t archivebox/archivebox add --depth=1 https://example.com
|
|
|
|
|
|
|
|
- open http://127.0.0.1:8000/admin/login/ # then click "Add" in the navbar
|
|
|
|
|
|
|
+ # 3. Then view the snapshots of the URLs you added via the self-hosted web UI
|
|
|
|
|
+ docker run -v $PWD:/data -it archivebox/archivebox manage createsuperuser # create an admin acct
|
|
|
|
|
+ docker run -v $PWD:/data -p 8000:8000 archivebox/archivebox # start the web server
|
|
|
|
|
+ open http://127.0.0.1:8000/ # open the interactive admin panel
|
|
|
|
|
+ ls archive/*/index.html # or just browse snapshots on disk
|
|
|
```
|
|
```
|
|
|
|
|
|
|
|
<div align="center">
|
|
<div align="center">
|
|
|
<img src="https://i.imgur.com/lUuicew.png" width="400px">
|
|
<img src="https://i.imgur.com/lUuicew.png" width="400px">
|
|
|
<br/>
|
|
<br/>
|
|
|
|
|
|
|
|
- [DEMO: archivebox.zervice.io/](https://archivebox.zervice.io)
|
|
|
|
|
- For more information, see the [full Quickstart guide](https://github.com/ArchiveBox/ArchiveBox/wiki/Quickstart), [Usage](https://github.com/ArchiveBox/ArchiveBox/wiki/Usage), and [Configuration](https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration) docs.
|
|
|
|
|
|
|
+ <a href="https://archivebox.zervice.io">DEMO: archivebox.zervice.io/</a>
|
|
|
|
|
+ For more information, see the <a href="https://github.com/ArchiveBox/ArchiveBox/wiki/Quickstart">full Quickstart guide</a>, <a href="https://github.com/ArchiveBox/ArchiveBox/wiki/Usage">Usage</a>, and <a href="https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration">Configuration</a> docs.
|
|
|
</div>
|
|
</div>
|
|
|
|
|
|
|
|
---
|
|
---
|
|
@@ -82,7 +90,7 @@ Description: <div align="center">
|
|
|
open http://127.0.0.1:8000
|
|
open http://127.0.0.1:8000
|
|
|
```
|
|
```
|
|
|
|
|
|
|
|
- The CLI is considered "stable", and the ArchiveBox Python API and REST APIs are in "beta".
|
|
|
|
|
|
|
+ The CLI is considered "stable", the ArchiveBox Python API and REST APIs are in "beta", and the [desktop app](https://github.com/ArchiveBox/desktop) is in "alpha" stage.
|
|
|
|
|
|
|
|
At the end of the day, the goal is to sleep soundly knowing that the part of the internet you care about will be automatically preserved in multiple, durable long-term formats that will be accessible for decades (or longer). You can also self-host your archivebox server on a public domain to provide archive.org-style public access to your site snapshots.
|
|
At the end of the day, the goal is to sleep soundly knowing that the part of the internet you care about will be automatically preserved in multiple, durable long-term formats that will be accessible for decades (or longer). You can also self-host your archivebox server on a public domain to provide archive.org-style public access to your site snapshots.
|
|
|
|
|
|
|
@@ -331,7 +339,7 @@ Description: <div align="center">
|
|
|
|
|
|
|
|
▶ **Check out our [community page](https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-Community) for an index of web archiving initiatives and projects.**
|
|
▶ **Check out our [community page](https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-Community) for an index of web archiving initiatives and projects.**
|
|
|
|
|
|
|
|
- <img src="https://i.imgur.com/4nkFjdv.png" width="10%" align="left"/> The aim of ArchiveBox is to go beyond what the Wayback Machine and other public archiving services can do, by adding a headless browser to replay sessions accurately, and by automatically extracting all the content in multiple redundant formats that will survive being passed down to historians and archivists through many generations.
|
|
|
|
|
|
|
+ <img src="https://i.imgur.com/4nkFjdv.png" width="10%" align="left" alt="comparison"/> The aim of ArchiveBox is to go beyond what the Wayback Machine and other public archiving services can do, by adding a headless browser to replay sessions accurately, and by automatically extracting all the content in multiple redundant formats that will survive being passed down to historians and archivists through many generations.
|
|
|
|
|
|
|
|
#### User Interface & Intended Purpose
|
|
#### User Interface & Intended Purpose
|
|
|
|
|
|
|
@@ -362,7 +370,7 @@ Description: <div align="center">
|
|
|
_A collection of the most active internet archiving communities and initiatives._
|
|
_A collection of the most active internet archiving communities and initiatives._
|
|
|
- Check out the ArchiveBox [Roadmap](https://github.com/ArchiveBox/ArchiveBox/wiki/Roadmap) and [Changelog](https://github.com/ArchiveBox/ArchiveBox/wiki/Changelog)
|
|
- Check out the ArchiveBox [Roadmap](https://github.com/ArchiveBox/ArchiveBox/wiki/Roadmap) and [Changelog](https://github.com/ArchiveBox/ArchiveBox/wiki/Changelog)
|
|
|
- Learn why archiving the internet is important by reading the "[On the Importance of Web Archiving](https://parameters.ssrc.org/2018/09/on-the-importance-of-web-archiving/)" blog post.
|
|
- Learn why archiving the internet is important by reading the "[On the Importance of Web Archiving](https://parameters.ssrc.org/2018/09/on-the-importance-of-web-archiving/)" blog post.
|
|
|
- - Or reach out to me for questions and comments via [@theSquashSH](https://twitter.com/thesquashSH) on Twitter.
|
|
|
|
|
|
|
+ - Or reach out to me for questions and comments via [@ArchiveBoxApp](https://twitter.com/ArchiveBoxApp) or [@theSquashSH](https://twitter.com/thesquashSH) on Twitter.
|
|
|
|
|
|
|
|
---
|
|
---
|
|
|
|
|
|
|
@@ -494,7 +502,7 @@ Description: <div align="center">
|
|
|
<a href="https://www.patreon.com/theSquashSH"><img src="https://img.shields.io/badge/Donate_to_support_development-via_Patreon-%23DD5D76.svg?style=flat"/></a>
|
|
<a href="https://www.patreon.com/theSquashSH"><img src="https://img.shields.io/badge/Donate_to_support_development-via_Patreon-%23DD5D76.svg?style=flat"/></a>
|
|
|
<br/>
|
|
<br/>
|
|
|
|
|
|
|
|
- <a href="https://twitter.com/thesquashSH"><img src="https://img.shields.io/badge/Tweet-%40theSquashSH-blue.svg?style=flat"/></a>
|
|
|
|
|
|
|
+ <a href="https://twitter.com/ArchiveBoxApp"><img src="https://img.shields.io/badge/Tweet-%40ArchiveBoxApp-blue.svg?style=flat"/></a>
|
|
|
<a href="https://github.com/ArchiveBox/ArchiveBox"><img src="https://img.shields.io/github/stars/ArchiveBox/ArchiveBox.svg?style=flat&label=Star+on+Github"/></a>
|
|
<a href="https://github.com/ArchiveBox/ArchiveBox"><img src="https://img.shields.io/github/stars/ArchiveBox/ArchiveBox.svg?style=flat&label=Star+on+Github"/></a>
|
|
|
|
|
|
|
|
<br/><br/>
|
|
<br/><br/>
|