Robust Speech Recognition via Large-Scale Weak Supervision
High-Resolution Image Synthesis with Latent Diffusion Models
Segment Everything Everywhere All at Once
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Understanding INT4 Quantization for Transformer Models: Latency Speedup, Composability, and Failure Cases
Consistency Models
OpenAGI: When LLM Meets Domain Experts
Instruction Tuning with GPT-4
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data
Segment Anything
A Survey of Large Language Models
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
Sparks of Artificial General Intelligence: Early experiments with GPT-4
X-Risk Analysis for AI Research
Zero-1-to-3: Zero-shot One Image to 3D Object
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Zero-Shot Information Extraction via Chatting with ChatGPT
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
AI Deep Dive
The WAN Show
Cyber Security Headlines
Cybersecurity Today
Babbage from The Economist