CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers Paper • 2502.06527 • Published 1 day ago • 5
Dual Caption Preference Optimization for Diffusion Models Paper • 2502.06023 • Published 2 days ago • 6
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 1 day ago • 34
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper • 2502.06394 • Published 1 day ago • 69
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 1 day ago • 54
Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation Paper • 2502.05415 • Published 4 days ago • 11
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering Paper • 2502.03628 • Published 6 days ago • 9
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models Paper • 2502.06788 • Published 1 day ago • 9
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding Paper • 2502.05431 • Published 4 days ago • 5
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile Paper • 2502.06155 • Published 1 day ago • 5
Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT Paper • 2502.06782 • Published 1 day ago • 7
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization Paper • 2502.04370 • Published 6 days ago • 4
No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces Paper • 2502.04959 • Published 4 days ago • 9
Linear Correlation in LM's Compositional Generalization and Hallucination Paper • 2502.04520 • Published 5 days ago • 9
MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf Paper • 2502.04376 • Published 6 days ago • 3