The Changelog: Software Development, Open Source

Technology

Bringing Whisper and LLaMA to the masses (Interview)

2023-03-22


This week we’re talking with Georgi Gerganov about his work on whisper.cpp and llama.cpp. Georgi first crossed our radar with whisper.cpp, his port of OpenAI’s Whisper model in C and C++. Whisper is a speech recognition model enabling audio transcription and translation, something we’re paying close attention to here at Changelog, for obvious reasons. Between the invite and the show’s recording, he had a new hit project on his hands: llama.cpp, a port of Facebook’s LLaMA model in C and C++. whisper.cpp made a splash, but llama.cpp is growing in GitHub stars faster than Stable Diffusion did, which was a rocket ship itself.


Changelog++ members get a bonus 12 minutes at the end of this episode and zero ads. Join today!

Sponsors:

Postman – Build APIs together — More than 20 million developers use Postman for building and using APIs. Postman simplifies each step of the API lifecycle and streamlines collaboration so you can create better APIs—faster.
Sentry – Session Replay! Rewind and replay every step of the user’s journey before and after they encountered an issue. Eliminate the guesswork and get to the root cause of an issue, faster. Use the code CHANGELOG and get the team plan free for three months.
Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You literally can’t get any faster!

Featuring:

Georgi Gerganov – Mastodon, Twitter, GitHub, Website
Adam Stacoviak – Mastodon, Twitter, GitHub, LinkedIn, Website
Jerod Santo – Mastodon, Twitter, GitHub, LinkedIn

Show Notes:

ggerganov/whisper.cpp
examples/main
Arm Neon technology
Apple’s secret M1 coprocessor
ggerganov/llama.cpp
Introducing LLaMA: A foundational, 65-billion-parameter large language model
facebookresearch/llama
Ludacris Llama Llama Red Pajama Freestyle
The Changelog #506: Stable Diffusion breaks the internet with Simon Willison
Large language models are having their Stable Diffusion moment


Timestamps:

(00:00) - This week on The Changelog
(01:20) - Sponsor: Postman
(04:09) - Start the show!
(12:03) - Why is Whisper interesting to us?
(17:04) - What's involved in making a port?
(22:55) - Sponsor: Sentry
(24:51) - One layer deeper
(27:57) - Examples of Whisper.cpp
(31:49) - Whisper.cpp and speaker detection
(39:25) - What did you learn about Apple Silicon?
(42:26) - Apple's secret M1 coprocessor
(44:56) - GPU support on the roadmap
(47:06) - Cultivating contributions
(48:49) - Ludacris Llama Llama Red Pajama
(52:57) - Why is llama.cpp so interesting?
(57:01) - Where are you going from here?
(58:22) - How can this be extended?
(1:01:22) - How did you learn this stuff?
(1:08:48) - Wrapping up
(1:10:09) - Outro

