No - it's the reproduction, not the memorization.

wiseowise · 2025-04-12T14:06:15 1744466775

So training AI on copyrighted data isn’t a problem unless it spits out the data verbatim. Correct?

lelanthran · 2025-04-12T14:15:04 1744467304

Close.

It isn't a problem until it spits out a similar enough copy.

Copyright violation doesn't have to be verbatim; taking a 4K movie and re-encoding it to 320x200 before distribution isn't legal.

greyface- · 2025-04-12T14:02:25 1744466545

It's reproduced in the memorizer's neural connectome.