Nothing new, really. It is open secret that every commercial LLM out there was trained to some extent on copyrighted data without permission and it would otherwise be deemed as copyright infringement.