Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM Paper β’ 2502.06635 β’ Published 1 day ago β’ 4
Generating Symbolic World Models via Test-time Scaling of Large Language Models Paper β’ 2502.04728 β’ Published 4 days ago β’ 15
Generating Symbolic World Models via Test-time Scaling of Large Language Models Paper β’ 2502.04728 β’ Published 4 days ago β’ 15
Demystifying Long Chain-of-Thought Reasoning in LLMs Paper β’ 2502.03373 β’ Published 6 days ago β’ 48
MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies Paper β’ 2501.15384 β’ Published 17 days ago
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper β’ 2502.01100 β’ Published 9 days ago β’ 14
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper β’ 2501.17703 β’ Published 13 days ago β’ 51
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper β’ 2501.13826 β’ Published 19 days ago β’ 23
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper β’ 2501.12326 β’ Published 21 days ago β’ 49
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models Paper β’ 2406.20015 β’ Published Jun 28, 2024 β’ 1
HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing Paper β’ 2406.11683 β’ Published Jun 17, 2024
HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing Paper β’ 2406.11683 β’ Published Jun 17, 2024
Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models Paper β’ 2410.03212 β’ Published Oct 4, 2024
Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models Paper β’ 2410.03212 β’ Published Oct 4, 2024
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective Paper β’ 2501.11110 β’ Published 23 days ago β’ 2
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective Paper β’ 2501.11110 β’ Published 23 days ago β’ 2