Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Rohan Khopkar
ubiqtuitin
Follow
ubitquitin
AI & ML interests
None yet
Recent Activity
upvoted
an
article
about 15 hours ago
Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset
updated
a model
about 16 hours ago
ubiqtuitin/deepseek_r1_medical-ft_q4_k_m
published
a model
about 16 hours ago
ubiqtuitin/deepseek_r1_medical-ft_q4_k_m
View all activity
Organizations
None yet
models
7
Sort: Recently updated
ubiqtuitin/deepseek_r1_medical-ft_q4_k_m
Updated
about 16 hours ago
•
1
ubiqtuitin/deepseek_r1_medical_rk
Text Generation
•
Updated
about 16 hours ago
ubiqtuitin/q-Taxi-v3
Reinforcement Learning
•
Updated
Jun 20, 2022
ubiqtuitin/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jun 20, 2022
ubiqtuitin/PPO_CarRacing-v0
Reinforcement Learning
•
Updated
Jun 6, 2022
•
2
ubiqtuitin/PPO_CartPole-v1
Reinforcement Learning
•
Updated
Jun 6, 2022
•
6
ubiqtuitin/deeprltutorial1
Reinforcement Learning
•
Updated
Jun 6, 2022
•
1
datasets
None public yet