DAMO-YOLO: A Report on Real-Time Object Detection Design
TorchScale: Transformers at Scale
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
OneFormer: One Transformer to Rule Universal Image Segmentation
Large Language Models Are Human-Level Prompt Engineers
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
On the Versatile Uses of Partial Distance Correlation in Deep Learning
SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning
What Makes Convolutional Models Great on Long Sequence Modeling?
Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second
Long Range Graph Benchmark
Taming Transformers for High-Resolution Image Synthesis
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
GLM-130B: An Open Bilingual Pre-trained Model
Elucidating the Design Space of Diffusion-Based Generative Models
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
DigiFace-1M: 1 Million Digital Face Images for Face Recognition
Human Motion Diffusion Model
TranAD: Deep Transformer Networks for Anomaly Detection in Multivariate Time Series Data