Daily Papers

by AK and the research community

Dec 21

Submitted by

akhaliq

StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

·
10 authors

Submitted by

akhaliq

VideoPoet: A Large Language Model for Zero-Shot Video Generation

·
31 authors

Submitted by

akhaliq

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

·
4 authors

Submitted by

akhaliq

Generative Multimodal Models are In-Context Learners

·
11 authors

Submitted by

akhaliq

DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation

·
10 authors

Submitted by

akhaliq

DreamTuner: Single Image is Enough for Subject-Driven Generation

·
6 authors

Submitted by

akhaliq

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

·
5 authors

Submitted by

akhaliq

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis

·
9 authors

Submitted by

akhaliq

InstructVideo: Instructing Video Diffusion Models with Human Feedback

·
10 authors

Submitted by

akhaliq

Splatter Image: Ultra-Fast Single-View 3D Reconstruction

·
3 authors

Submitted by

akhaliq

TinySAM: Pushing the Envelope for Efficient Segment Anything Model

·
8 authors

Submitted by

akhaliq

Cached Transformers: Improving Transformers with Differentiable Memory Cache

·
6 authors

Submitted by

akhaliq

Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

·
12 authors

Submitted by

akhaliq

Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models

·
5 authors

Submitted by

akhaliq

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

·
8 authors

Submitted by

akhaliq

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

·
8 authors

Submitted by

akhaliq

Mini-GPTs: Efficient Large Language Models through Contextual Pruning

·
3 authors

Submitted by

akhaliq

Model-Based Control with Sparse Neural Dynamics

·
7 authors

Submitted by

akhaliq

UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections

·
6 authors

Submitted by

akhaliq

SpecNeRF: Gaussian Directional Encoding for Specular Reflections

·
8 authors

Submitted by

akhaliq

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

·
9 authors

Submitted by

akhaliq

RadEdit: stress-testing biomedical vision models via diffusion image editing

·
14 authors