7

Michael Rawle

therubberrabbit

AI & ML interests

None yet

Organizations

None yet

therubberrabbit's activity

upvoted 4 papers 6 days ago

MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation

Paper • 2502.01572 • Published 8 days ago • 20

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published 8 days ago • 33

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Paper • 2502.02589 • Published 7 days ago • 8

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published 7 days ago • 49

upvoted 3 papers 7 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 8 days ago • 53

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 9 days ago • 168

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published 11 days ago • 20