Submitted by akhaliq 26 Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression · 17 authors 1
Submitted by akhaliq 25 Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning · 10 authors 3
Submitted by akhaliq 18 LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching · 6 authors 1
Submitted by akhaliq 17 Memory Augmented Language Models through Mixture of Word Experts · 5 authors 1
Submitted by akhaliq 15 AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort · 6 authors 3
Submitted by akhaliq 9 ProAgent: From Robotic Process Automation to Agentic Process Automation · 12 authors 1
Submitted by akhaliq 7 TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems · 12 authors 2
Submitted by akhaliq 5 M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models · 4 authors 1
Submitted by akhaliq 5 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration · 5 authors 1