Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
Exponentially Faster Language Modelling
Orca 2: Teaching Small Language Models How to Reason
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
A Survey on Language Models for Code
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Learning to Filter Context for Retrieval-Augmented Generation
GraphCast: Learning skillful medium-range global weather forecasting
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
CogVLM: Visual Expert for Pretrained Language Models
How Can Recommender Systems Benefit from Large Language Models: A Survey
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Generative Pretraining in Multimodality
Evaluating Large Language Models: A Comprehensive Survey
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
iTransformer: Inverted Transformers Are Effective for Time Series Forecasting
Zephyr: Direct Distillation of LM Alignment
Large Language Models for Software Engineering: Survey and Open Problems
Eureka: Human-Level Reward Design via Coding Large Language Models
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
Babbage from The Economist
Cyber Security Headlines
Cybersecurity Today
The WAN Show
Software Engineering Daily