SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published 12 days ago • 16
Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots • Running on CPU Upgrade • 12.4k
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25, 2024 • 17