Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's "unauthorized use" rather than "stealing", since the original work is not moved anywhere. It's more like using your creative work to train a software system that generates similar-looking, competing works, for pennies, at industrial scale and speed.




Obtaining without payment or consent and then using to create derivative works at scale?

And the pedantry matters only because the entities criming are too big and rich and financed by the right people.

It is basically a display of the societal threshold beyond which laws are not enforced.


> Obtaining without payment or consent

Usually "obtaining" is just making a bunch of HTTP requests - which is kind of how the web is designed to work. The "consent" (and perhaps "desired payment" when there is no paywall) issue is the important bit and ultimately boils down to the use case. Is it a human viewing the page, a search engine updating its index, or OpenAI collecting data for training? It is annoying when things like robots.txt are simply ignored, even if they are not legally or technically binding.

The legal situation is unsurprisingly murky at the moment. Copyright law was designed for a different use case, and might not be the right tool or regulatory framework to address GenAI.

But as I think you are suggesting, it may be an example of regulatory entrepreneurship, where (AI) companies try to move forward quickly before laws and regulations catch up with them, while simultaneously trying to influence new laws and regulations in their favor.

[Copyright law itself also has many peculiarities, for example not applying to recipes, game rules, or fashion designs (hence fast fashion, knockoffs, etc.) Does it, or should it, apply to AI training and GenAI services? Time will tell.]


The pedantry matters for the same reason it mattered when the music industry did this to Napster: because the truth is important.

Accurate terminology can help to clarify discussions, including discussions of appropriate activity related to creative works.

My suspicion is also that what many artists are objecting to is less the training itself (though AI web crawlers are aggressive and annoying) and more the use of the trained model for large-scale generation of similar (and possibly competing) works ("derivative" in an artistic if not legal sense), in the artist's style, especially by large companies for commercial gain.


Ok Mr. (Or Ms.) Pedant you know what the intended meaning was.

everybody knows this, you are being uselessly pedantic



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: