Daniel van Strien's picture

Daniel van Strien PRO

davanstrien

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset 5 minutes ago

davanstrien/curator-poems

liked a dataset 25 minutes ago

Anthropic/EconomicIndex

View all activity

Organizations

davanstrien's activity

upvoted a collection 4 days ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 5 items • Updated 5 days ago • 22

upvoted 3 papers 5 days ago

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 69

CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering

Paper • 2502.01523 • Published 8 days ago • 1

ScholaWrite: A Dataset of End-to-End Scholarly Writing Process

Paper • 2502.02904 • Published 6 days ago • 1

upvoted a collection 5 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 11 items • Updated about 7 hours ago • 48

upvoted 2 papers 6 days ago

Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation

Paper • 2412.15594 • Published Dec 20, 2024 • 1

s1: Simple test-time scaling

Paper • 2501.19393 • Published 11 days ago • 98

upvoted 3 papers 7 days ago

MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published 9 days ago • 22

FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction

Paper • 2403.02270 • Published Mar 4, 2024 • 3

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published 8 days ago • 14

upvoted an article 9 days ago

Article

Open-R1: Update #1

By

and 7 others •

10 days ago

• 267

upvoted 2 articles 11 days ago

Article

Replicating DeepSeek R1 for Information Extraction

By

•

11 days ago

• 30

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

By

•

11 days ago

• 33

upvoted a paper 11 days ago

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published 12 days ago • 17

upvoted a collection 11 days ago

WildChat-50m

All model responses associated with the WildChat-50m paper. • 55 items • Updated 13 days ago • 7

upvoted an article 13 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

15 days ago

• 705

upvoted an article 14 days ago

Article

Welcome to Inference Providers on the Hub 🔥

15 days ago

• 319

upvoted a collection 15 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 15 days ago • 336

upvoted a collection 16 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 16 days ago • 99

upvoted an article 19 days ago

Article

Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas

By

and 4 others •

19 days ago

• 30