GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
DigiFace-1M: 1 Million Digital Face Images for Face Recognition
Human Motion Diffusion Model
TranAD: Deep Transformer Networks for Anomaly Detection in Multivariate Time Series Data
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
YOLOX-PAI: An Improved YOLOX, Stronger and Faster than YOLOv6
Transformers are Sample Efficient World Models
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
Explaining Explanations: Axiomatic Feature Interactions for Deep Networks
Musika! Fast Infinite Waveform Music Generation
Representation Learning for the Automatic Indexing of Sound Effects Libraries
Fast Sampling of Diffusion Models with Exponential Integrator
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning
Architecture-Agnostic Masked Image Modeling - From ViT back to CNN
Learning the Beauty in Songs: Neural Singing Voice Beautifier
LiT: Zero-Shot Transfer with Locked-image Text Tuning
Large-Scale Intelligent Microservices
Collaborative Neural Rendering using Anime Character Sheets