Submitted by akhaliq 33 I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models · 9 authors 3
Submitted by akhaliq 17 Relax: Composable Abstractions for End-to-End Dynamic Machine Learning · 19 authors 1
Submitted by akhaliq 11 Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs · 7 authors 2
Submitted by akhaliq 5 CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding · 7 authors
Submitted by akhaliq 4 Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video · 5 authors 1
Submitted by akhaliq 4 Co-training and Co-distillation for Quality Improvement and Compression of Language Models · 7 authors 1
Submitted by akhaliq 4 Attention or Convolution: Transformer Encoders in Audio Language Models for Inference Efficiency · 7 authors 1