<img height="1" width="1" style="display:none" src="https://www.facebook.com/tr?id=2006193252832260&amp;ev=PageView&amp;noscript=1">

1 Min Read

Reddit IPO Reveals That Your User Data Is Training AI

Featured Image

Wondering how to get started with AI? Take our on-demand Piloting AI for Marketers Series.

Learn More

Reddit is going public. And, as part of its IPO filing, the company revealed it has data licensing agreements with AI companies.

The agreements, worth more than $200 million, presumably give these companies' AI models the ability to train legally on Reddit data.

Reddit hasn't revealed all the AI companies involved. But Reuters found out that one of the deals is with Google to the tune of $60 million per year.

What's going on here?

I got the answers on Episode 85 of The Artificial Intelligence Show from Marketing AI Institute founder/CEO Paul Roetzer.

The future of AI models is licensed and synthetic data

"This is going to be worth a lot of money," says Roetzer. "The future of these models is going to be licensed and synthetic data."

The most powerful AI models learn by ingesting enormous quantities of information. How they get at that information is the source of controversy. (Some companies like OpenAI are being sued for their use of copyright material.)

But by training on licensed content, or training on AI-generated content, AI companies avoid legal issues and get unique datasets that give them an advantage over other models. That makes proprietary datasets like Reddit's worth a lot of money to AI companies.

Companies with proprietary data are sitting on a goldmine

Access to unique data is how AI models stay competitive, says Roetzer. So it's no wonder Google and others are inking these types of deals. 

It's also why Elon Musk turned off access to the API of X, formerly Twitter. Now, only his AI model, Grok, can train on that data.

This likely impacts media companies and online sites with proprietary data, says Roetzer. Because this data represents a goldmine to AI companies.

"If you have a bunch of proprietary data, licensing of data is gonna be huge moving forward."

Related Posts

New CB Insights Report Reveals Top AI Trends To Watch in 2018

Ashley Sams | February 22, 2018

Using a database of 1,000+ global AI companies, CB Insights released the top 13 AI trends their analysts will be watching in 2018. Here are the takeaways.

Turn Google Analytics Data into Actionable Recommendations with AI

Paul Roetzer | June 20, 2021

Company Spotlight: PaveAI. Use machine learning and data science to turn Google Analytics data into actionable recommendations and reports.

How to Analyze Emotional Responses to Content and Ads Using AI

Paul Roetzer | October 4, 2022

Emotiva uses computer vision and machine learning to assess a person's emotional response to content and ads. We spoke with the company to learn how.