Submitted by philschmid 91 The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale · 8 authors 5
Submitted by oindrila13saha 41 YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals · 3 authors 1
Submitted by alexgambashidze 27 Aligning Diffusion Models with Noise-Conditioned Perception · 4 authors 1
Submitted by Zuxin 24 APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets · 17 authors 1
Submitted by zhangysk 23 LongIns: A Challenging Long-context Instruction-based Exam for LLMs · 10 authors 1
Submitted by iiiiwis 17 Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA · 14 authors 1
Submitted by markus583 16 Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation · 5 authors 3
Submitted by jcyk 12 On the Transformations across Reward Model, Parameter Update, and In-Context Prompt · 14 authors 1
Submitted by jiho283 11 DialSim: A Real-Time Simulator for Evaluating Long-Term Dialogue Understanding of Conversational Agents · 8 authors 1
Submitted by MoonQiu 11 FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models · 6 authors 4
Submitted by Liangbin 9 Image Conductor: Precision Control for Interactive Video Synthesis · 8 authors 3
Submitted by aashiqmuhamed 5 Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients · 5 authors 3
Submitted by theryanliu 4 Large Language Models Assume People are More Rational than We Really are · 5 authors 4
Submitted by gsarti 4 Multi-property Steering of Large Language Models with Dynamic Activation Composition · 3 authors 1