Podcast Archives - Software Engineering Daily
News:Tech News
The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops.
Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting larger, but the smallest models that are commercially relevant are getting smaller. The company was started in 2023 and has raised money from General Catalyst and other industry leaders.
Atila Orhon is the founder of Argmax and he previously worked at Apple and NVIDIA. He joins the show to talk about working in computer vision, building ML tooling at Apple, optimizing ML models, and more.
Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from information visualization to quantum computing. Currently, Sean is Head of Marketing and Developer Relations at Skyflow and host of the podcast Partially Redacted, a podcast about privacy and security engineering. You can connect with Sean on Twitter @seanfalconer .
Please click here to see the transcript of this episode.
Sponsorship inquiries: sponsor@softwareengineeringdaily.com
The post Scaling Large ML Models to Small Devices with Atila Orhon appeared first on Software Engineering Daily.
The Vesuvius Challenge with Juli Schilliger and Youssef Nader
Shift-Left Security and Code Scanning with Amjad Afanah and Sudipta Mukherjee
Uber’s LedgerStore and its Trillions of Indexes with Kaushik Devarajaiah
Frontend Observability with Purvi Kanal
AI Tools for Game Development with Igor Poletaev and Nathan Yu
C++ Static Analysis with Abbas Sabra
Climate Tech Investing with Tom Biegala
Luma AI with Barkley Dai and Karan Ganesan
AI at Redis with Brian Sam-Bodden
Dusk and the Art of Making Short Games with David Szymanski
Fast Frontend Development with David Hsu
One Year of ChatGPT with Christian Hubicki
Hyperscaling SQL with Sam Lambert
Google Ventures with Erik Nordlander
The Challenge of API Design with Lauren Long
Shopify’s Hydrogen Framework with Ben Sehl
Celeste and Platform Game Engineering with Noel Berry
DataStax with Ed Anuff
It’s APIs All the Way Down with Marco Palladino
Bitwarden with Matt Bishop
Create your
podcast in
minutes
It is Free
Cyber Security Headlines
The WAN Show
The 404 Media Podcast
Cybersecurity Today
Techmeme Ride Home