Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

No - it's the reproduction, not the memorization.


So training AI on copyrighted data isn’t a problem unless it spits out the data verbatim. Correct?


Close.

It isn't a problem until it spits out a similar enough copy.

Copyright violation doesn't have to be verbatim; taking a 4k movie and reencoding it to 320x200 before distribution isn't legal.


It's reproduced in the memorizer's neural connectome.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: