And here I'm still looking for a way to create, with one click, an offline backup of the webpage each of my bookmarks points to, such that the offline version looks and works exactly like the online version in (say) Google Chrome (e.g. CTRL+F still works fine), and such that I can use some key combo and click a bookmark in my bookmarks manager (in Chrome) to open the page from the backup (or the backup can have its own copy of the bookmarks manager... it needs a catalog of some sort or it won't be useful).
I love ArchiveBox, but the headless Chromium it uses has some annoying "will break randomly, and good luck figuring out why or how to fix it" problems. For example, it'll just randomly stop working because the profile is supposedly locked, except the lock file isn't there, and even if you tweak things to make 100% sure the profile lock is removed before and after every archive request, it'll still randomly fail on a locked profile. WHAT THE HELL IS GOING ON?!
Although, to be fair, running it in Docker seems less fraught and breaks less often (and it's a lot easier to restart when it does break).
(I've got a pipeline from Instapaper -> {IFTTT -> {Pinboard -> Linkhut, Dropbox, Webhook -> ArchiveBox}}, which works well most of the time for archiving random pages. It used to start from Pocket, until Mozilla decided to be evil.)
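The webhook leg is the only part that needed anything custom. Here's a rough sketch of the idea; the port, path, and JSON shape are arbitrary choices of mine, not anything IFTTT or ArchiveBox mandates, and only the `archivebox add` CLI call is the real interface:

```python
# Minimal sketch of a webhook receiver that feeds ArchiveBox.
# Assumptions: the endpoint, port, and JSON payload {"url": "..."} are
# arbitrary illustrative choices; only `archivebox add <url>` is real.
import json
import subprocess
from http.server import BaseHTTPRequestHandler, HTTPServer

ARCHIVEBOX_DIR = "/data/archivebox"  # hypothetical path to the ArchiveBox collection

class Hook(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length) or b"{}")
        url = payload.get("url", "")
        if url:
            # Hand the URL to ArchiveBox; it does the actual snapshotting.
            subprocess.run(["archivebox", "add", url], cwd=ARCHIVEBOX_DIR, check=False)
            self.send_response(202)
        else:
            self.send_response(400)
        self.end_headers()

if __name__ == "__main__":
    HTTPServer(("0.0.0.0", 8008), Hook).serve_forever()
```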
I used SingleFile for a while but now I've switched to WebScrapBook, because a lot of the pages I save share the same images and WebScrapBook can keep assets as separate files instead of baking everything into one HTML. I then run rdfind to hard link all the identical files and save space.
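The rdfind step is essentially "hash everything, hard-link the duplicates to one copy." For anyone curious, here's roughly what `rdfind -makehardlinks true <dir>` boils down to in Python; the real tool is much faster since it filters by size and first/last bytes before hashing, and this is only to show the idea:

```python
# Rough illustration of the rdfind -makehardlinks step: find byte-identical
# files under a directory and replace duplicates with hard links to a single
# copy. rdfind itself is faster and safer; this only shows the concept.
import hashlib
import os
import sys

def file_hash(path, chunk=1 << 20):
    h = hashlib.sha256()
    with open(path, "rb") as f:
        while block := f.read(chunk):
            h.update(block)
    return h.hexdigest()

def hardlink_duplicates(root):
    seen = {}  # (size, sha256) -> canonical path
    for dirpath, _, names in os.walk(root):
        for name in names:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            key = (os.path.getsize(path), file_hash(path))
            if key in seen:
                original = seen[key]
                if not os.path.samefile(original, path):
                    os.unlink(path)          # drop the duplicate...
                    os.link(original, path)  # ...and hard-link the original in its place
            else:
                seen[key] = path

if __name__ == "__main__":
    hardlink_duplicates(sys.argv[1])
```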
Anecdotally (not to diminish any bug the parent had), SingleFile is one of my favorite extensions. Been using it for years and it's saved my ass multiple times. Thank you!
Edit: What's the best way to support the project? I see there's an option through the Mozilla store and through GitHub. Is there a preference?
I have SingleFile configured to post full archives to Karakeep with an HTTP POST; this enables archiving pages from my browser that Karakeep cannot scrape and bookmark due to paywalls or bot protection.
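If you'd rather push an already-saved page into a service from a script instead of having the extension do the POST, the client side is tiny. In this sketch the endpoint and auth header are placeholders, not Karakeep's actual API, so check its docs for the real route and payload:

```python
# Hedged sketch: pushing an already-saved SingleFile page to an archive
# service over HTTP POST. The endpoint path and auth header below are
# hypothetical placeholders, not Karakeep's documented API.
import urllib.request
from pathlib import Path

API_URL = "https://karakeep.example.com/api/upload"  # placeholder endpoint
TOKEN = "..."                                        # placeholder API token

def push_archive(html_path: str) -> int:
    data = Path(html_path).read_bytes()
    req = urllib.request.Request(
        API_URL,
        data=data,
        headers={
            "Content-Type": "text/html",
            "Authorization": f"Bearer {TOKEN}",
        },
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return resp.status

if __name__ == "__main__":
    print(push_archive("saved-page.html"))
```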
I've been using SingleFile for five years and, for what it's worth, I've never had this issue. I keep a directory called Archives on my Synology that I expose with copyparty, and I routinely back up web pages and then drop the result into my copyparty instance for safekeeping.
I would look into what happened with the SingleFile copies you made that didn't work, because that is highly unusual.
WebRecorder [0] is the best implementation of this that I've tested. It runs as an extension in your browser, intercepting HTTP streams, so as long as you open a page in your browser the data is captured to reproduce it exactly. It outputs WARC files that are (in theory) compatible with the rest of the web archiving ecosystem, and it has a WARC explorer interface to browse captured archives.
For pages with dynamic content that can't be trivially reproduced from their HTTP streams (e.g., opening the archive triggers GETs with a mismatched timestamp, even if the file it's looking for is in the WARC under a different URI), there's always SingleFile [1], or Chromium's built-in MHTML Ctrl+S export, both of which "bake" the content into a static page.
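One nice thing about ending up with WARCs is that you can inspect them outside the explorer UI too; Webrecorder's warcio library will walk a capture and tell you what it contains. A small sketch (the file name is just an example):

```python
# Small sketch: list the URLs captured in a WARC file using warcio,
# Webrecorder's Python library (pip install warcio).
# "capture.warc.gz" is just an example file name.
from warcio.archiveiterator import ArchiveIterator

def list_responses(warc_path):
    with open(warc_path, "rb") as stream:
        for record in ArchiveIterator(stream):
            if record.rec_type == "response":
                uri = record.rec_headers.get_header("WARC-Target-URI")
                ctype = record.http_headers.get_header("Content-Type") if record.http_headers else ""
                print(uri, ctype)

if __name__ == "__main__":
    list_responses("capture.warc.gz")
```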
I'm on Firefox, but I still feel the need to reply; you might find this handy, or other readers here might like it. Maybe it's also available for Chrome, I don't know.
I've been using an extension called WebScrapBook to locally save copies of interesting webpages. I use the basic functionality, but it comes with tons of options and settings.
I happened upon a bit of an unconventional approach to this with Zotero. It's obviously more focused on academic research, but it takes snapshots and works really well as a more general-purpose archiving tool.
FWIW I've had success with self-hosted [LinkDing](https://github.com/sissbruecker/linkding) and the Firefox SingleFile plugin (so it archives what I'm seeing / gets around logins, etc.). LinkDing also links directly to the Internet Archive for any URL.
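linkding also exposes a small REST API, so you can push URLs in from scripts as well as from the browser. A quick sketch; the host and token are placeholders, and I'm writing the endpoint and payload from memory, so double-check the linkding API docs before relying on it:

```python
# Quick sketch: adding a bookmark to a self-hosted linkding instance via
# its REST API. Host and token are placeholders; the /api/bookmarks/ route
# and Token auth scheme are from memory, so verify against the docs.
import json
import urllib.request

LINKDING_URL = "https://linkding.example.com"  # placeholder host
API_TOKEN = "..."                              # placeholder token

def add_bookmark(url, tags=()):
    payload = json.dumps({"url": url, "tag_names": list(tags)}).encode()
    req = urllib.request.Request(
        f"{LINKDING_URL}/api/bookmarks/",
        data=payload,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Token {API_TOKEN}",
        },
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

if __name__ == "__main__":
    print(add_bookmark("https://example.com/some-article", tags=["archive"]))
```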