Not to be a jerk, but 'hundreds of devs and dozens of MR per day' is not 'huge r...

ffsm8 · 2025-10-30T20:59:59 1761857999

> terabytes of source code

You sure that exists?

Git repositories that contain terabytes of source code?

I could imagine a repo that is terabytes but has binaries committed or similar... But source code?

CBLT · 2025-10-30T21:12:49 1761858769

Google's monorepo is in fact terabytes with no binaries. It does stretch the definition of source code though - a lot of that is configuration files (at worst, text protos) which are automatically generated.

packetslave · 2025-10-30T21:15:49 1761858949

Google had 86TB of sourcecode data in Piper way back in 2016.

ffsm8 · 2025-10-31T04:15:10 1761884110

Dang, that's mind boggling - especially if I keep in mind that a book series like lord of the rings is mere kilobytes if saved as plain text.

Having 86 TB of plain text/source code - I can't fathom the scale, honestly

Are you absolutely sure there aren't binaries in there (honestly asking, the scale is just insane from my perspective - even the largest book compilation like Anna's isn't approaching that number - if you strip out images ... And that's pretty much all books in circulation - with multiple versions per title)

phyrex · 2025-10-31T10:07:42 1761905262

Each snapshot of the repo isn't that big, but all the snapshots together, plus all the commit metadata and such, are

jeffbee · 2025-10-30T21:14:25 1761858865

git could never, but piper at google is way over that figure. Way, way over.

phyrex · 2025-10-31T00:16:57 1761869817

Microsoft has actually done a lot of work to scale got to large repos

p_l · 2025-10-31T11:29:14 1761910154

It's why there's special Microsoft Git VFS (a lot like VFS at google that is also referenced in the talk).

It was made to make working on Windows source code possible with Git.

phyrex · 2025-10-30T21:53:22 1761861202

Very sure, i work in one