Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Ring Attention with Blockwise Transformers for Near-Infinite Context
Premise Order Matters in Reasoning with Large Language Models
Generative Representational Instruction Tuning
DoRA: Weight-Decomposed Low-Rank Adaptation
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
World Model on Million-Length Video And Language With RingAttention
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Fractal Patterns May Unravel the Intelligence in Next-Token Prediction
Precise Zero-Shot Dense Retrieval without Relevance Labels
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Relevance-guided Supervision for OpenQA with ColBERT
PLAID: An Efficient Engine for Late Interaction Retrieval
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Corrective Retrieval Augmented Generation
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
A Comprehensive Survey on 3D Content Generation
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
AI Deep Dive
The WAN Show
Cyber Security Headlines
Big Technology Podcast
Babbage from The Economist