chujiezheng/zephyr_0.05
Text Generation
Note zephyr-7b-sft-full trained by DPO with 5% UltraFeedback data
Note zephyr-7b-sft-full trained by DPO with 10% UltraFeedback data
Note alpha = 8.0
Note zephyr-7b-sft-full trained by DPO with 20% UltraFeedback data
Note alpha = 2.5
Note zephyr-7b-sft-full trained by DPO with 40% UltraFeedback data
Note zephyr-7b-sft-full trained by DPO with 20% UltraFeedback data and 2x the learning rate
Note zephyr-7b-sft-full trained by DPO with 20% UltraFeedback data and 3x the learning rate
Note zephyr-7b-sft-full trained by DPO with 20% UltraFeedback data and 2x the epochs
Note zephyr-7b-sft-full trained by DPO with 20% UltraFeedback data and 3x the epochs