Building an AI product without understanding data quality is like flying blind.
#Day80 of #HelpingAspiringPMs100Days
Yes, Data is crucial for AI products because without it, how can you pre-train the LLM?
3 Ways Product Managers Source Data for Pre-training Their LLMs.
3. Companies do web scraping of the website, like Wikipedia, Blogs, new websites, etc and from there they train the LLM.
2. They use APIs from big companies that have a lot of data.
1. If they are building the tools for the internal Company, then they have internal data from which they train their users.
Data quality is like the unsung hero of AI...ignore it, and the whole thing crumbles. I’ve seen internal data be a goldmine, but only when teams actually invest in cleaning it up. Curiouswhat’s your go-to approach for ensuring clean data? 🤔 #AI #ProductManagement