RLHFlow

university

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

weqweasdas updated a dataset about 12 hours ago
RLHFlow/numia_prompt_dpo_test
weqweasdas published a dataset about 12 hours ago
RLHFlow/numia_prompt_dpo_test
Chenlu123 updated a dataset about 13 hours ago
RLHFlow/numia_prompt_dpo9