new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Aug 26

Submitted by

HugoLaurencon

Building and better understanding vision-language models: insights and future directions

·
4 authors

Submitted by

yifanzhang114

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

·
13 authors

Submitted by

akhaliq

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

·
8 authors

Submitted by

JamesSand

Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time

·
5 authors

Submitted by

kz919

Memory-Efficient LLM Training with Online Subspace Descent

·
4 authors

Submitted by

kpzhang996

T3M: Text Guided 3D Human Motion Synthesis from Speech

·
3 authors

Submitted by

akhaliq

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

·
8 authors

Submitted by

hasanar1f

HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments

·
6 authors

Submitted by

IAMJB

A Web-Based Solution for Federated Learning with LLM-Based Automation

·
3 authors

Submitted by

akhaliq

FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering

·
4 authors

Submitted by

amanchadha

RoundTable: Leveraging Dynamic Schema and Contextual Autocomplete for Enhanced Query Precision in Tabular Question Answering

·
4 authors

Submitted by

tommymarto

CODE: Confident Ordinary Differential Editing

·
3 authors