Download - Token Merging: Your ViT But Faster | Podbean

Papers Read on AI

News:Tech News

Token Merging: Your ViT But Faster

2023-02-21


We introduce Token Merging (ToMe), a simple method to increase the throughput of existing ViT models without needing to train. ToMe gradually combines similar tokens in a transformer using a general and light-weight matching algorithm that is as fast as pruning while being more accurate. Qualitatively, we find that ToMe merges object parts into one token, even over multiple frames of video. Overall, ToMe’s accuracy and speed are competitive with state-of-the-art on images, video, and audio.

2022: Daniel Bolya, Cheng-Yang Fu, Xiaoliang Dai, Peizhao Zhang, Christoph Feichtenhofer, Judy Hoffman
https://arxiv.org/pdf/2210.09461v1.pdf
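The idea in the abstract, combining similar tokens with a lightweight matching step, can be illustrated with a rough sketch. This is not the authors' implementation. It makes several assumptions: tokens are plain lists of floats, similarity is cosine similarity on the token vectors themselves, tokens are split into two alternating sets, and merged tokens are combined by an unweighted average. The names `cosine`, `merge_tokens`, and the parameter `r` (the number of pairs to merge) are hypothetical and chosen for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def merge_tokens(tokens, r):
    """Merge up to r pairs of similar tokens and return a shorter token list.

    Illustrative sketch only: the output does not preserve the original
    token order, and merged tokens are combined by a plain average.
    """
    # Assumption: split tokens into two alternating sets, A (even) and B (odd).
    a_idx = list(range(0, len(tokens), 2))
    b_idx = list(range(1, len(tokens), 2))
    if not b_idx or r <= 0:
        return [list(t) for t in tokens]

    # For each token in A, find its most similar token in B.
    edges = []
    for i in a_idx:
        best_j = max(b_idx, key=lambda j: cosine(tokens[i], tokens[j]))
        edges.append((cosine(tokens[i], tokens[best_j]), i, best_j))

    # Keep only the r most similar (A, B) pairs.
    edges.sort(key=lambda e: e[0], reverse=True)
    merged_edges = edges[:r]
    merged_from = {i for _, i, _ in merged_edges}

    # Group each merged A token with its B destination.
    groups = {j: [tokens[j]] for j in b_idx}
    for _, i, j in merged_edges:
        groups[j].append(tokens[i])

    # Unmerged A tokens pass through; each B group is averaged into one token.
    out = [list(tokens[i]) for i in a_idx if i not in merged_from]
    for j in b_idx:
        members = groups[j]
        out.append([sum(col) / len(members) for col in zip(*members)])
    return out


tokens = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0], [0.1, 1.0]]
reduced = merge_tokens(tokens, r=2)
print(len(tokens), "->", len(reduced))
```

Calling a step like this once per transformer block with a small `r` would shrink the token count gradually, which is one way to read "gradually combines similar tokens". A fuller version would probably also track how many original tokens each merged token represents, so that later averages can be weighted accordingly.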


More Episodes

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

2024-11-01

449

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

2024-10-31

156

Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation

2024-10-30

119

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

2024-10-18

224

LightRAG: Simple and Fast Retrieval-Augmented Generation

2024-10-17

195

Aria: An Open Multimodal Native Mixture-of-Experts Model

2024-10-16

107

AgentKit: Structured LLM Reasoning with Dynamic Graphs

2024-10-15

136

PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

2024-10-14

112

Diffusion Models are Evolutionary Algorithms

2024-10-10

165

Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering

2024-10-09

108

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

2024-10-08

144

Internal Consistency and Self-Feedback in Large Language Models: A Survey

2024-10-07

115

On the Diagram of Thought

2024-10-02

136

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

2024-10-01

106

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

2024-09-30

106

On the limits of agency in agent-based models

2024-09-24

168

Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization

2024-09-23

108

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

2024-09-22

98

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

2024-09-21

134

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

2024-09-20

94

