Submitted by myownskyW7 57 VideoRoPE: What Makes for Good Video Rotary Position Embedding? · 12 authors 2
Submitted by akhaliq 40 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach · 9 authors 9
Submitted by d-alistarh 35 QuEST: Stable Training of LLMs with 1-Bit Weights and Activations · 6 authors 3
Submitted by yulunliu 24 AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting · 11 authors 3
Submitted by ydeng9 18 DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails · 5 authors 2
Submitted by akhaliq 17 FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation · 10 authors 3
Submitted by akhaliq 15 Generating Symbolic World Models via Test-time Scaling of Large Language Models · 8 authors 2
Submitted by akhaliq 14 Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models · 8 authors 2
Submitted by Eleven-P 10 CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference · 8 authors 2
Submitted by danielm1405 9 No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces · 6 authors 2
Submitted by akhaliq 9 On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices · 6 authors 3
Submitted by akhaliq 9 Linear Correlation in LM's Compositional Generalization and Hallucination · 5 authors 3
Submitted by nielsr 8 Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More · 7 authors 2
Submitted by zhaoyue-zephyrus 8 QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation · 9 authors 2
Submitted by akhaliq 8 CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance · 5 authors 3
Submitted by rohitsaxena 6 Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs · 3 authors 4
Submitted by yuweiyin 5 ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning · 2 authors 2
Submitted by amanchadha 5 YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment · 8 authors 2
Submitted by XiaotingQin 3 MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf · 8 authors 3
Submitted by sinatayebati 2 SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs · 5 authors 2
Submitted by sinatayebati - Intelligent Sensing-to-Action for Robust Autonomy at the Edge: Opportunities and Challenges · 12 authors 2