Kashif Rasul's picture

Kashif Rasul

kashif

·

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

upvoted an article about 24 hours ago

Open R1: Update #2

upvoted an article 11 days ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

published a model 21 days ago

kashif/Qwen2-0.5B-SFT

View all activity

Organizations

kashif's activity

upvoted an article about 24 hours ago

Article

Open R1: Update #2

By

and 6 others •

about 24 hours ago

• 111

upvoted an article 11 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

By

•

11 days ago

• 33

published a model 21 days ago

kashif/Qwen2-0.5B-SFT

Updated 21 days ago

updated a model 22 days ago

kashif/Gemma2-2B-SFT

Text Generation • Updated 22 days ago • 18

published a model 22 days ago

kashif/Gemma2-2B-SFT

Text Generation • Updated 22 days ago • 18

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

upvoted an article about 1 month ago

Article

Process Reinforcement through Implicit Rewards

By

and 1 other •

Jan 3

• 23

liked 2 Spaces about 2 months ago

Scaling test-time compute

Enhance math problem solving by scaling test-time compute

Fev Leaderboard

Display benchmark results for time series models

liked a model 2 months ago

nicolas-dufour/PLONK_YFCC

Updated Dec 12, 2024 • 171 • 13

updated a model 2 months ago

huggingface/timesfm-tourism-monthly

Updated Dec 9, 2024 • 35 • 1

upvoted a paper 2 months ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24, 2024 • 5

liked a model 2 months ago

flair/bueble-lm-2b

Text Generation • Updated Dec 6, 2024 • 3.05k • 20

upvoted a paper 2 months ago

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 5

liked a model 2 months ago

TianqiLiuAI/RM-gemma2-2b

Text Generation • Updated Nov 18, 2024 • 94 • 1

updated a dataset 3 months ago

trl-lib/alpaca-cleaned

Viewer • Updated Nov 28, 2024 • 51.8k • 50

liked a dataset 3 months ago

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 33.8k • 155

updated a model 3 months ago

HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text • Updated Nov 26, 2024 • 215 • 18

liked 2 models 3 months ago

apple/coreml-mobileclip

Updated Nov 19, 2024 • 312 • 40

apple/aimv2-large-patch14-448

Image Feature Extraction • Updated Nov 28, 2024 • 2.2k • 1