view post Post 1797 GRPO reasoning embedded in a custom Prem-1B model ucalyptus/prem-663ff8769efa4d3700ba14e5 ucalyptus/prem-1B-grpo See translation 🔥 4 4 + Reply
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published 18 days ago • 29
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published 22 days ago • 63
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 29 days ago • 90
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published Dec 16, 2024 • 54
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation Paper • 2312.04484 • Published Dec 7, 2023
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes Paper • 2501.04004 • Published Jan 7 • 1
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published Jan 7 • 25
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Paper • 2501.04005 • Published Jan 7
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Paper • 2501.00326 • Published Dec 31, 2024 • 1
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding Paper • 2501.00712 • Published Jan 1 • 6
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published Jan 2 • 48
DateLogicQA: Benchmarking Temporal Biases in Large Language Models Paper • 2412.13377 • Published Dec 17, 2024 • 2
view post Post 9166 Google drops Gemini 2.0 Flash Thinkinga new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and morenow available in anychat, try it out: akhaliq/anychat See translation 2 replies · 🚀 8 8 🔥 5 5 👀 2 2 👍 2 2 + Reply
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Paper • 2406.02347 • Published Jun 4, 2024 • 3