AI copyright could lead to new marketing opportunities
Kevin Indig
JANUARY 31, 2023
Aggregators like Google would lose power, since content platforms could train models on their own data and provide superior AI experiences. Of the 45 terabytes of text GPT-3 was trained on, 60% came from Common Crawl*, 22% from WebText 2 (a dataset built from outgoing links on Reddit), 8% from books, and 3% from Wikipedia.