Download - Synthetic Data for AI | Podbean

Discover

Podcast Features
Your all-in-one podcasting solution.

Blog to Podcast
Turn your blog into an engaging podcast.
Livestream
High-performing audio live, without limits.

Podcast Studio
Easy-to-use audio recorder app.
Podbean AI
AI-Enhanced Audio Quality and Content Generation.

Podcast App
The best podcast player & podcast app.

Ads Marketplace
Join Ads Marketplace to earn money
through sponsorship on your podcast.

PodAds
Manage your ads with dynamic ad insertion capability.
Apple Podcasts Subscriptions Integration
Effortlessly publish and manage exclusive episodes for your
Apple Podcasts subscribers directly from Podbean.
Live Streaming
Receive livestream rewards from your audience and earn
recurring income from your Fan Club membership.

All Arts Business Comedy Education
Fiction Government Health & Fitness History Kids & Family
Leisure Music News Religion & Spirituality Science
Society & Culture Sports Technology True Crime TV & Film
Live

How to Start a Podcast
How to Start a Live Podcast
How to Monetize a podcast
How to Promote Your Podcast
How to Use Group Recording

Log in
Start your podcast for free

Podcasting
Monetization
Advertisers
Enterprise
Pricing
Discover

Log in

Sign up free

Technology

Synthetic Data for AI

2024-04-17

Download Right click and do "save link as"

Kalyan Veeramachaneni (@kveeramac, CEO/Founder @DataCebo) discusses the generation and value proposition of synthetic data for GenAI.

SHOW: 813

CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw

NEW TO CLOUD? CHECK OUT OUR OTHER PODCAST - "CLOUDCAST BASICS"

SHOW NOTES:

DataCebo (homepage)
Synthetic Data Vault - SDV
TechCrunch Article
MIT News Article

Topic 1 - Our topic for today is synthetic data. While the concept and need for synthetic data has been around for a long time, it isn’t a topic that typically comes to the forefront and something we haven’t talked about until today. Today is a bit of crossing the streams between developers and testing data and using GenAI to achieve this goal. For this, we’re joined by Kalyan, CEO and Co-Founder of DataCebo. Welcome to the show

Topic 2 - First, for those not familiar, what is synthetic data? What is the use case and need? What problem is it solving today?

Topic 2a - Hopefully, listeners out there are making the connection to the advantages of GenAI for synthetic data, but take us through your original concept at MIT and the history of Synthetic Data Vault (SDV).

Topic 3 - We recently did a show on the security and privacy of training LLMs where we covered the need to mask PII for the training of models for compliance. I can also see bias issues coming into play or maybe training data that doesn’t exist in the real world (weather models example). What are some of the use cases that you’ve seen require synthetic data sets. Are there certain industries (healthcare, financials, etc.) that benefit?

Topic 4 - You were designing this based on GenAI before GenAI was “cool”. How has the rise of LLMs impacted this space?

Topic 5 - If I understand this correctly, organizations would put generative AI on a problem to describe a need for a data set, the model would then evaluate the available data and create a quality synthetic or “fake” dataset. How would the organization verify the quality of the dataset? How would they validate that a synthetic data set is as good as the original data?

Topic 6 - Let’s talk about resources for a bit. When I think of GenAI and training, I think of large amounts of hardware and in particular GPU’s that might have limited availability. Is that true here? Also, is this on-prem or in the cloud, or both?

FEEDBACK?

Email: show at the cloudcast dot net
Twitter: @cloudcastpod
Instagram: @cloudcastpod
TikTok: @cloudcastpod

view more

More Episodes

Will Enterprise AI adoption patterns follow Enterprise Cloud adoption?

2024-04-07

Cloud News of the Month - March 2024

2024-04-03

The $69B bet against replacement

2024-03-31

LLM Security and Privacy

2024-03-27

Cloud Fundamentals needed for AI

2024-03-24

Building an AI Product Company

2024-03-20

What if the CNCF was private equity?

2024-03-17

Integration and Observability of 3rd Party APIs

2024-03-13

The End of the Free Tiers

2024-03-10

Cloud News of the Month - February 2024

2024-03-06

Anatomy of a Side Hustle

2024-03-03

Observability and Visualizing Data with Grafana

2024-02-28

Building a Keynote Presentation

2024-02-25

Validation and Guardrails for LLMs

2024-02-21

Experiencing The Sphere in Las Vegas

2024-02-18

APIs and AI Gateways

2024-02-14

The Curious Case of AI, Funding and Cloud Credits

2024-02-11

Cloud News of the Month

2024-02-07

Bundling, Unbundling and Ensh*tification

2024-02-04

VCs and Open Source and Funding

2024-01-31

←
1
2
3
4
5
6
7
8
9
10
→

012345678910111213141516171819

Get this podcast on your
phone, FREE

Download Podbean app on App Store

Download Podbean app on Google Play

Create your
podcast in
minutes

Full-featured podcast site
Unlimited storage and bandwidth
Comprehensive podcast stats
Distribute to Apple Podcasts, Spotify, and more
Make money with your podcast

Get started

It is Free

Podcast Services
MONETIZATION & MORE
KNOWLEDGE BASE
Support
Podbean

Privacy Policy
Cookie Policy
Terms of Use
Consent Preferences
Copyright © 2015-2024 Podbean.com