EgoLife

community

https://egolife-v1.github.io/

https://github.com/egolife-v1

Activity Feed Request to join this org

AI & ML interests

Egocentric Vision Assistant

Recent Activity

Nicous updated a Space 1 day ago

EgoLife-v1/EgoGPT

Jingkang updated a Space 1 day ago

EgoLife-v1/EgoGPT

Nicous updated a model 2 days ago

EgoLife-v1/EgoGPT

View all activity

EgoLife-v1's activity

Nicous

updated a Space 1 day ago

EgoGPT

💬

Generate descriptions from video and audio input

Jingkang

updated a Space 1 day ago

EgoGPT

💬

Generate descriptions from video and audio input

Nicous

updated a model 2 days ago

EgoLife-v1/EgoGPT

Updated 2 days ago • 100

Nicous

published a Space 2 days ago

EgoGPT

💬

Generate descriptions from video and audio input

THUdyh

authored a paper 5 days ago

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published 5 days ago • 20

THUdyh

authored a paper about 1 month ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 25

THUdyh

posted an update 3 months ago

Post

1034

🚀🚀🚀Introducing Insight-V! An early attempt towards o1-like multi-modal reasoning.
We offer a structured long-chain visual reasoning data generation pipeline and a multi-agent system to unleash the reasoning potential of MLLMs.
📜 Paper: https://arxiv.org/abs/2411.14432
🛠️ Github: https://github.com/dongyh20/Insight-V
💼 Model Weight: THUdyh/insight-v-673f5e1dd8ab5f2d8d332035

Jingkang

authored a paper 3 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 23

THUdyh

authored a paper 3 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 23

THUdyh

posted an update 4 months ago

Post

3316

🔥🔥🔥Introducing Oryx-1.5!
A series of unified MLLMs with much stronger performance on all the image, video, and 3D benchmarks 😍
🛠️Github: https://github.com/Oryx-mllm/Oryx
🚀Model: THUdyh/oryx-15-6718c60763845525c2bba71d
🎨Demo: THUdyh/Oryx
👋Try the top-tier MLLM yourself!

👀Stay tuned for more explorations on MLLMs!