Submitted by Ryan1122 273 MiniMax-01: Scaling Foundation Models with Lightning Attention · 90 authors 6
Submitted by Johanan0528 56 MangaNinja: Line Art Colorization with Precise Reference Following · 10 authors 3
Submitted by sanaka87 34 3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering · 4 authors 2
Submitted by akhaliq 32 Diffusion Adversarial Post-Training for One-Step Video Generation · 6 authors 4
Submitted by cmhungsteve 31 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks · 8 authors 2
Submitted by tokeron 31 Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models · 7 authors 2
Submitted by Ningyu 24 A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following · 8 authors 2
Submitted by Yabo 18 FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors · 6 authors 2
Submitted by s-emanuilov 17 HALoGEN: Fantastic LLM Hallucinations and Where to Find Them · 4 authors 2
Submitted by turkeyju 16 Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens · 7 authors 3
Submitted by akhaliq 15 Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding · 5 authors 2
Submitted by akshat57 15 PokerBench: Training Large Language Models to become Professional Poker Players · 6 authors 2
Submitted by gsarti 10 Enhancing Automated Interpretability with Output-Centric Feature Descriptions · 5 authors 2
Submitted by yuyijiong 8 OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training · 6 authors 2
Submitted by stefan-it 6 AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages · 27 authors 2
Submitted by amanchadha 6 Potential and Perils of Large Language Models as Judges of Unstructured Textual Data · 10 authors 2
Submitted by nielsr 5 MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training · 7 authors 3
Submitted by mjbuehler 5 In-situ graph reasoning and knowledge expansion using Graph-PReFLexOR · 1 author 2