The Platonic Representation Hypothesis
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
A decoder-only foundation model for time-series forecasting
Autonomous LLM-driven research from data to human-verifiable research papers
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Improving Diffusion Models for Virtual Try-on
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
KAN: Kolmogorov-Arnold Networks
Make Your LLM Fully Utilize the Context
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Dynamic Generation of Personalities with Large Language Models
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
Cyber Security Headlines
The WAN Show
The 404 Media Podcast
Cybersecurity Today
Techmeme Ride Home