AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents Paper โข 2410.24024 โข Published Oct 31, 2024 โข 49
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Paper โข 2411.02337 โข Published Nov 4, 2024 โข 35
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Paper โข 2409.15278 โข Published Sep 23, 2024 โข 24
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion Paper โข 2409.11406 โข Published Sep 17, 2024 โข 26
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation Paper โข 2409.09214 โข Published Sep 13, 2024 โข 52
Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro Text-to-Image โข Updated Aug 29, 2024 โข 39.8k โข 439
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper โข 2408.06292 โข Published Aug 12, 2024 โข 118
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent Paper โข 2407.21646 โข Published Jul 31, 2024 โข 18
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper โข 2407.03320 โข Published Jul 3, 2024 โข 93