Hacker News

>Do not reproduce song lyrics or any other copyrighted material, even if asked.

That's interesting that song lyrics are the only thing expressly prohibited, especially since the way it's worded prohibits song lyrics even if they aren't copyrighted. Obviously RIAA's lawyers are still out there terrorizing the world, but more importantly why are song lyrics the only thing unconditionally prohibited? Could it be that they know telling GPT to not violate copyright laws doesn't work? Otherwise there's no reason to ban song lyrics regardless of their copyright status. Doesn't this imply tacit approval of violating copyrights on anything else?



It's also interesting because I've had absolutely terrible luck trying to get ChatGPT to identify song lyrics for me.

Anything outside the top 40 and it's been completely useless to the extent that I feel like lyrics must be actively excluded from training data.


> I can’t provide the full copyrighted lyrics, but I can give you a brief summary of The Star-Spangled Banner.


I thought this was a joke, but it very much is not:

https://chatgpt.com/share/68957a94-b28c-8007-9e17-9fada97806...


You just need to inform the LLM that after its knowledge cutoff, copyright was repealed.


I hope it's gonna be true at some point.


> That's interesting that song lyrics are the only thing expressly prohibited

https://www.musicbusinessworldwide.com/openai-sued-by-gema-i...

(November 2024)


> way it's worded prohibits song lyrics even if they aren't copyrighted

It's worded ambiguously, so you can understand it either way, including "lyrics that are part of the copyrighted material category and other elements from the category"


Lyrics are probably their biggest headache for copyright concerns. It can't output a pirated movie or song in text form, and people aren't likely to be asking ChatGPT for the full text of Harry Potter.


I would imagine most of the training material is copyrighted (authors need to explicitly put something in the public domain, other than government-funded work in some jurisdictions).


It’s also weird because all it took to bypass this was enabling Web Search, and it reproduced them in full. Maybe they see that as putting the blame on the sources they cite?


Also, it returns song lyrics all the time for me.



