Models trained/used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging ( https://arxiv.org/abs/2407.01470)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df9c78eda6d0311fd3d541f/aEdmfx1oeSE3ILQOB5OR4.png)
NTU Miulab
university
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
1
models
6
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df9c78eda6d0311fd3d541f/aEdmfx1oeSE3ILQOB5OR4.png)
miulab/SalesBot2_CoT_lora_w_neg_wo_dup_chitchat_e10
Updated
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df9c78eda6d0311fd3d541f/aEdmfx1oeSE3ILQOB5OR4.png)
miulab/SalesBot2_CoT_lora_w_neg
Updated
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df9c78eda6d0311fd3d541f/aEdmfx1oeSE3ILQOB5OR4.png)
miulab/llama2-7b-alpaca-sft-10k
Text Generation
•
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df9c78eda6d0311fd3d541f/aEdmfx1oeSE3ILQOB5OR4.png)
miulab/llama2-7b-ultrafeedback-rm
Text Classification
•
Updated
•
44
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df9c78eda6d0311fd3d541f/aEdmfx1oeSE3ILQOB5OR4.png)
miulab/llama2-7b-oss-instruct
Text Generation
•
Updated
•
7
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df9c78eda6d0311fd3d541f/aEdmfx1oeSE3ILQOB5OR4.png)
miulab/llama2-7b-magicoder-evol-instruct
Text Generation
•
Updated
•
33