PixMo Collection • A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items
Qwen2.5-1M Collection • The long-context version of Qwen2.5, supporting context lengths of up to 1M tokens • 2 items
Article • Train 400x faster Static Embedding Models with Sentence Transformers
Article • Introducing smolagents: simple agents that write actions in code • Dec 31, 2024
SmolVLM Collection • State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated Dec 22, 2024
Qwen2.5-VL Collection • Vision-language model series based on Qwen2.5 • 3 items
MARS: Unleashing the Power of Variance Reduction for Training Large Models • Paper • arXiv:2411.10438 • Published Nov 15, 2024
Multimodal Autoregressive Pre-training of Large Vision Encoders • Paper • arXiv:2411.14402 • Published Nov 21, 2024
SmolLM2 Collection • State-of-the-art compact LLMs for on-device applications in 1.7B, 360M, and 135M parameter variants • 16 items
Granite 3.0 Language Models Collection • A series of language models trained by IBM and released under the Apache 2.0 license, including both base pretrained and instruct models • 8 items • Updated Dec 18, 2024
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases • Paper • arXiv:2402.14905 • Published Feb 22, 2024