new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Jan 16

Submitted by

stefan-it

Towards Best Practices for Open Datasets for LLM Training

·
39 authors

Submitted by

s-emanuilov

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

·
6 authors

Submitted by

hzxie

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

·
4 authors

Submitted by

akhaliq

RepVideo: Rethinking Cross-Layer Representation for Video Generation

·
6 authors

Submitted by

s-emanuilov

Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion

·
7 authors

Submitted by

s-emanuilov

Multimodal LLMs Can Reason about Aesthetics in Zero-Shot

·
2 authors

Submitted by

akhaliq

XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework

·
5 authors

Submitted by

wzk1015

Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding

·
11 authors

Submitted by

iliashum

Trusted Machine Learning Models Unlock Private Inference for Problems Currently Infeasible with Cryptography

·
7 authors

Submitted by

nielsr

MINIMA: Modality Invariant Image Matching

·
6 authors

Submitted by

nielsr

Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding

·
6 authors