What do AI models depend on? Data. But what happens when the data you use isn’t data you own or have the rights to use in that way?
Commercial LLMs seem like an amazing capability, offering users ways to ask questions and get answers without ever looking at the original source material. In a sense, they are a modern search engine, but instead of directing a user to the document that best matches their prompt, a document is synthesized with no link to the original material.
This fundamental issue disconnects the owner of the material from the ability to earn from it, especially as most AI firms have been willy nilly with their ethics.
But copyright holders, creators, and publishers are taking notice and putting the commercial LLM providers on notice. In some cases, they are already winning court cases based on this.
The key to AI today is to focus on using it effectively and ethically. The legal risks of not doing so become great as Open AI found out.
