Submitted by ShawLiu 48 AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents · 10 authors 3
Submitted by ekurtic 47 "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization · 5 authors 3
Submitted by ShawLiu 35 WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning · 13 authors 1
Submitted by bykang 33 How Far is Video Generation from World Model: A Physical Law Perspective · 8 authors 2
Submitted by Franck-Dernoncourt 27 DynaSaur: Large Language Agents Beyond Predefined Actions · 12 authors 2
Submitted by xxzcc 24 Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent · 106 authors 1
Submitted by wchengad 23 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D · 11 authors 1
Submitted by jinjh0123 23 Survey of Cultural Awareness in Language Models: Text and Beyond · 10 authors 2
Submitted by kumarak 23 Adaptive Caching for Faster Video Generation with Diffusion Transformers · 7 authors 1
Submitted by haoyuhsu 17 AutoVFX: Physically Realistic Video Editing from Natural Language Instructions · 5 authors 3
Submitted by farewellthree 11 PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance · 7 authors 1
Submitted by Raincleared 11 Sparsing Law: Towards Large Language Models with Greater Activation Sparsity · 7 authors 1
Submitted by seyedhamidreza 8 SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF · 8 authors 2
Submitted by akhaliq 8 IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI · 8 authors 2
Submitted by aashiqmuhamed 7 Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models · 3 authors 2
Submitted by dxlong2000 6 Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models · 6 authors 2
Submitted by Franck-Dernoncourt 4 LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding · 9 authors 2
Submitted by gagan3012 3 Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks · 5 authors 2