Submitted by akhaliq 51 WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation · 8 authors 5
Submitted by akhaliq 19 InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks · 15 authors 1
Submitted by akhaliq 16 VCoder: Versatile Vision Encoders for Multimodal Large Language Models · 3 authors 1
Submitted by akhaliq 14 Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning · 16 authors 4
Submitted by akhaliq 11 DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models · 9 authors 1
Submitted by akhaliq 11 PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar · 7 authors 1
Submitted by akhaliq 7 Parameter Efficient Tuning Allows Scalable Personalization of LLMs for Text Entry: A Case Study on Abbreviation Expansion · 3 authors 1
Submitted by akhaliq 6 Generative AI Beyond LLMs: System Implications of Multi-Modal Generation · 11 authors 1