MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases · 12 authors · submitted by akhaliq · 126 upvotes · 13 comments
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition · 4 authors · submitted by akhaliq · 19 upvotes · 6 comments
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models · 3 authors · submitted by akhaliq · 18 upvotes · 6 comments
Seamless Human Motion Composition with Blended Positional Encodings · 3 authors · submitted by akhaliq · 14 upvotes · 1 comment
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning · 16 authors · submitted by akhaliq · 14 upvotes · 3 comments
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs · 10 authors · submitted by akhaliq · 13 upvotes · 3 comments