Submitted by akhaliq 53 Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models · 17 authors 5
Submitted by akhaliq 50 Beyond Language Models: Byte Models are Digital World Simulators · 6 authors 4
Submitted by akhaliq 33 Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers · 11 authors 3
Submitted by akhaliq 24 MOSAIC: A Modular System for Assistive and Interactive Cooking · 17 authors 1
Submitted by akhaliq 21 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models · 10 authors 1
Submitted by akhaliq 19 Simple linear attention language models balance the recall-throughput tradeoff · 9 authors 12
Submitted by akhaliq 14 ViewFusion: Towards Multi-View Consistency via Interpolated Denoising · 6 authors 1