
The database is being reverse engineered and published anyway, as per the article.


I think the Archive is just rehydrating shortened links in webpages that have already been archived. I doubt they're discovering previously unknown URLs.


No, they really are trying to enumerate all 230 billion possible shortlinks; that's why they need so many people to help crawl everything.


Got a source? I don't see details one way or the other.


From the article:

> there are about 230 billion* links that need visiting

> * Thanks to arkiver on the Archive Team IRC for correcting this number.

Also, when running the Warrior project you could see it iterating through the range. I don't have any logs handy since the project is finished, but they looked a bit like:

  https://goo.gl/gEdpoS: 404 Not Found
  https://goo.gl/gEdpoT: 404 Not Found
  https://goo.gl/gEdpoU: 302 Found -> https://...
  https://goo.gl/gEdpoV: 404 Not Found
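
For a sense of what that iteration amounts to, here is a rough sketch in Python. It's purely illustrative and not the actual Warrior code; the alphabet order, the six-character code length, and the request handling are all assumptions on my part.

  # Rough sketch of iterating a goo.gl code range and recording the result.
  # Purely illustrative: alphabet order, code length, and request handling
  # are assumptions, not Archive Team's actual Warrior code.
  import itertools
  import urllib.error
  import urllib.request

  ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ"

  class NoRedirect(urllib.request.HTTPRedirectHandler):
      # Treat 3xx responses as final so we can record where they point
      # instead of following them.
      def redirect_request(self, req, fp, code, msg, headers, newurl):
          return None

  opener = urllib.request.build_opener(NoRedirect())

  def codes(length=6):
      # Yield every possible code of the given length in lexicographic order.
      for chars in itertools.product(ALPHABET, repeat=length):
          yield "".join(chars)

  def check(code):
      # Return (status, redirect target or None) for one shortlink.
      req = urllib.request.Request(f"https://goo.gl/{code}", method="HEAD")
      try:
          resp = opener.open(req, timeout=10)
          return resp.status, resp.headers.get("Location")
      except urllib.error.HTTPError as e:
          # 404s and (because of NoRedirect) 302s both land here.
          return e.code, e.headers.get("Location")

  if __name__ == "__main__":
      for code in itertools.islice(codes(), 4):  # just a few, to mimic the log above
          status, target = check(code)
          print(f"https://goo.gl/{code}: {status}" + (f" -> {target}" if target else ""))

A single machine grinding through 230 billion codes like this would take far too long, which is presumably why the work was split into ranges and handed out to many volunteers running the Warrior.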



