Controversy over OpenAI's use of YouTube videos to train GPT-4. Google stated that it had seen unconfirmed reports about OpenAI using YouTube content without permission.

This discovery underscores the significant challenge that AI companies face in obtaining high-quality training data for their models. This behavior raises questions about the legality and ethics of using copyrighted material without prior and explicit permission from the platform that owns the content. It reflects a broader trend in this field, where the appetite of developers of artificial intelligence systems is close to exceeding the limits of the available resources of this data.