Submitted by AJZhou 45 BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices · 22 authors 5
Submitted by akhaliq 22 AnimateAnything: Consistent and Controllable Animation for Video Generation · 6 authors 2
Submitted by Tigerph 20 Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering · 11 authors 2
Submitted by mrdrozdov 17 Drowning in Documents: Consequences of Scaling Reranker Inference · 6 authors 4
Submitted by akhaliq 13 FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on · 10 authors 2
Submitted by Franck-Dernoncourt 12 SlimLM: An Efficient Small Language Model for On-Device Document Assistance · 6 authors 2
Submitted by AlonzoLeeeooo 11 StableV2V: Stablizing Shape Consistency in Video-to-Video Editing · 5 authors 5
Submitted by akhaliq 10 Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts · 7 authors 2
Submitted by stefan-it 8 LLäMmlein: Compact and Competitive German-Only Language Models from Scratch · 3 authors 3
Submitted by akhaliq 8 SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers · 5 authors 2
Submitted by Franck-Dernoncourt 7 Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering · 4 authors 2
Submitted by ambean 5 Evaluating the role of `Constitutions' for learning from AI feedback · 3 authors 2