ValueFX9507
/

Tifa-Deepsex-14b-CoT-GGUF-Q4

About RL

by Rhythmblue - opened 4 days ago

4 days ago

•

您好，请问这里的RL是怎么做的啊，有参考的资料吗。对于这种开放式的写作类型的，怎么加reward信号呢？

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment