Certainly you can. You can audit Bill Header’s streaming history to see exactly what he watched and when.
But I think that’s immaterial as I think the answer is that it doesn’t matter if you train on copyrighted work. Or maybe better that you don’t need a special license.
If I steal books and train on them, then I think that’s copyright infringement not because of the training but because I made an infringing copy. However, if I have a license already to read those books (ie, I bought a copy at a book store) then it’s not infringement to train an AI, or loan it to a million people or whatever I like with it as I bought a copy.
Yes but you’re not allowed to copy and distribute so you’re never going to actually do that. Also you’ll pass away with the knowledge you gained unlike the AI you’re training it on which will contain remnants of the work in all copies of itself, however abstracted away you consider it.
But I think that’s immaterial as I think the answer is that it doesn’t matter if you train on copyrighted work. Or maybe better that you don’t need a special license.
If I steal books and train on them, then I think that’s copyright infringement not because of the training but because I made an infringing copy. However, if I have a license already to read those books (ie, I bought a copy at a book store) then it’s not infringement to train an AI, or loan it to a million people or whatever I like with it as I bought a copy.