Improving Diffusion Models for Virtual Try-on
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
KAN: Kolmogorov-Arnold Networks
Make Your LLM Fully Utilize the Context
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Dynamic Generation of Personalities with Large Language Models
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples
AutoCodeRover: Autonomous Program Improvement
TrustLLM: Trustworthiness in Large Language Models
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Fast Timing-Conditioned Latent Audio Diffusion
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
ReFT: Representation Finetuning for Language Models
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
AI Deep Dive
The WAN Show
Cyber Security Headlines
Big Technology Podcast
Babbage from The Economist