Data Mesh Radio Patreon – get access to interviews well before they are released
Episode list and links to all available episode transcripts (most interviews from #32 on) here
Provided as a free resource by DataStax AstraDB; George Trujillo’s contact info: email (email@example.com) and LinkedIn
For more great content from Zhamak, check out her book on data mesh, a book she collaborated on, her LinkedIn, and her Twitter.
This episode is part of the greater AI/ML conversation I had with Zhamak. To start, Zhamak recognizes we aren’t where we want to be in terms of capabilities – ways of working or tooling – to make this a reality just yet. But, if we can make it so data scientists can trust and easily consume from data products – that we create data products that don’t care what use case type – regular analytics or AI/ML – can we remove a lot of the complexity they face? Do they need feature stores for data they aren’t transforming? If they can get continued access and know the quality, why create a separate process that has fragility instead of trust the data product owners upstream?
I wasn’t smart enough in the moment to talk about do we need to have a copy of the training data itself for reproducibility but folks smarter on ML than I am can answer that one, probably in the affirmative. But overall, there is a lot of complexity in the way we do AI/ML because data scientists can’t trust the sources of their data and they feel the need to take control because if they don’t, their models break. So we need to earn their trust and show them a better way. But again, we aren’t there yet, so let’s work to make this a reality in the future.
Data Mesh Radio is hosted by Scott Hirleman. If you want to connect with Scott, reach out to him at community at datameshlearning.com or on LinkedIn: https://www.linkedin.com/in/scotthirleman/
If you want to learn more and/or join the Data Mesh Learning Community, see here: https://datameshlearning.com/community/
If you want to be a guest or give feedback (suggestions for topics, comments, etc.), please see here
All music used this episode was found on PixaBay and was created by (including slight edits by Scott Hirleman): Lesfm, MondayHopes, SergeQuadrado, ItsWatR, Lexin_Music, and/or nevesf
Data Mesh Radio is brought to you as a community resource by DataStax. Check out their high-scale, multi-region database offering (w/ lots of great APIs) and use code DAAP500 for a free $500 credit (apply under “add payment”): AstraDB