Download - #600: Amazon SageMaker Multi Model Endpoints | Podbean

News:Tech News

#600: Amazon SageMaker Multi Model Endpoints

2023-07-03


Amazon SageMaker Multi-Model Endpoint (MME) is a fully managed capability of SageMaker Inference that allows customers to deploy thousands of models on a single endpoint and save costs by sharing the instances on which the endpoint runs across all the models. Until recently, MME was only supported for machine learning (ML) models that run on CPU instances. Now, customers can use MME to deploy thousands of ML models on GPU-based instances as well, and potentially reduce costs by up to 90%. MME dynamically loads and unloads models from GPU memory based on incoming traffic to the endpoint, so the same GPU instances are shared by thousands of models. Customers can run ML models from multiple ML frameworks, including PyTorch, TensorFlow, XGBoost, and ONNX. Customers can get started by using the NVIDIA Triton™ Inference Server and deploying models on SageMaker’s GPU instances in “multi-model” mode. Once the MME is created, customers specify the ML model from which they want to obtain inference when invoking the endpoint. Multi-Model Endpoints for GPU are available in all AWS Regions where Amazon SageMaker is available.

To learn more, check out:
Our launch blog: https://go.aws/3NwtJyh
Amazon SageMaker website: https://go.aws/44uCdNr
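As a rough illustration of "specifying the ML model while invoking the endpoint", the sketch below builds a request for the SageMaker Runtime `invoke_endpoint` call, whose `TargetModel` parameter selects the model on a multi-model endpoint. The endpoint name, model artifact name, payload shape, and the helper names `build_invoke_request` and `invoke_model` are hypothetical and not from the episode. The JSON content type is also an assumption, since the format a Triton-served model expects depends on how that model is configured.

```python
import json


def build_invoke_request(endpoint_name, target_model, payload):
    """Build keyword arguments for SageMaker Runtime invoke_endpoint.

    target_model names the model artifact to route the request to;
    the multi-model endpoint loads it on demand if it is not in memory.
    """
    return {
        "EndpointName": endpoint_name,
        "TargetModel": target_model,
        # Assumption: the model accepts a JSON body.
        "ContentType": "application/json",
        "Body": json.dumps(payload).encode("utf-8"),
    }


def invoke_model(runtime_client, endpoint_name, target_model, payload):
    """Invoke one model on the endpoint and return the raw response bytes.

    runtime_client is expected to behave like
    boto3.client("sagemaker-runtime").
    """
    request = build_invoke_request(endpoint_name, target_model, payload)
    response = runtime_client.invoke_endpoint(**request)
    return response["Body"].read()


# Example usage (requires boto3, AWS credentials, and a deployed endpoint;
# all names below are placeholders):
#
#   import boto3
#   runtime = boto3.client("sagemaker-runtime")
#   raw = invoke_model(runtime, "my-gpu-mme", "model-a.tar.gz", {"inputs": []})
```

Passing the client in as an argument keeps the request-building logic testable without AWS access. Switching to a different model on the same endpoint means changing only the `target_model` argument.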


More Episodes

#693: AWS News Updates, November 4, 2024

2024-11-04

#692: A Discussion About Serverless and How to Make the Most of It

2024-10-28

#691: [MIGRATION SPECIAL SERIES] AJE Group's Cloud Transformation Journey to AWS with CloudHesive

2024-10-24

#690: AWS News Updates, October 21, 2024

2024-10-21

#689: Diving Deep into AWS Transit Gateway

2024-10-14

#688: AWS News Updates, October 7, 2024

2024-10-07

#687: Graph Analytics Breakdown

2024-09-30

#686: AWS News Updates, September 23, 2024

2024-09-23

#685: [Beyond the API] Colm MacCárthaigh

2024-09-16

#684: AWS News Updates, September 9, 2024

2024-09-09

#683: Scaling at Speed: Totogi’s AWS-Powered Emergency Response

2024-09-02

#682: AWS News Updates, August 26, 2024

2024-08-26

#681: Amazon DynamoDB Deep Dive

2024-08-19

#680: AWS News Updates, August 12, 2024

2024-08-12

#679: [Beyond the API] Becky Weiss

2024-08-05

#678: AWS News Updates, July 29, 2024

2024-07-29

#677: High performance computing in FSI

2024-07-22

#676: AWS News Updates, July 15, 2024

2024-07-15

#675: Unravel Internet Ingress and Egress - A Deep Dive into Application Access

2024-07-08

#674: AWS News Updates, July 1, 2024

2024-07-01

Copyright © 2015-2024 Podbean.com