Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Sapiens: Foundation for Human Vision Models
OctFusion: Octree-based Diffusion Models for 3D Shape Generation
Writing in the Margins: Better Inference Pattern for Long Context Retrieval
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
ControlNeXt: Powerful and Efficient Control for Image and Video Generation
OpenResearcher: Unleashing AI for Accelerated Scientific Research
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
FinanceBench: A New Benchmark for Financial Question Answering
Stable-Hair: Real-World Hair Transfer via Diffusion Model
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
AI Deep Dive
Cyber Security Headlines
Cybersecurity Today
The WAN Show
Techmeme Ride Home