The Forward-Forward Algorithm: Some Preliminary Investigations
Cramming: Training a Language Model on a Single GPU in One Day
TorchGeo: deep learning with geospatial data
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition
Editing Models with Task Arithmetic
What do Vision Transformers Learn? A Visual Exploration
Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot Learning
Programming Is Hard - Or at Least It Used to Be: Educational Opportunities And Challenges of AI Code Generation
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
DAMO-YOLO : A Report on Real-Time Object Detection Design
TorchScale: Transformers at Scale
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
OneFormer: One Transformer to Rule Universal Image Segmentation
Large Language Models Are Human-Level Prompt Engineers
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
On the Versatile Uses of Partial Distance Correlation in Deep Learning
SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning
What Makes Convolutional Models Great on Long Sequence Modeling?
Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
Babbage from The Economist
Cyber Security Headlines
The WAN Show
The 404 Media Podcast
Techmeme Ride Home