Several trained models to compare the differences between each method. Each model has a complete description of hyperparams with wandb reports.
G
G-reen
AI & ML interests
SFT, DPO, ORPO, LLMs, text-generation
Recent Activity
updated
a model
2 days ago
G-reen/Qwen2.5-Coder-32b-Instruct-Fp8
updated
a model
2 days ago
G-reen/Mistral-Small-2501-Instruct-Fp8
published
a model
2 days ago
G-reen/Mistral-Small-2501-Instruct-Fp8
Organizations
None yet
Collections
1
models
23
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/Qwen2.5-Coder-32b-Instruct-Fp8
Updated
•
4
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/Mistral-Small-2501-Instruct-Fp8
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b
Text Generation
•
Updated
•
44
•
64
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/Duet_Minitron8b_v0.51
Updated
•
9
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/Duet_Minitron8b_v0.5
Text Generation
•
Updated
•
15
•
6
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/EXPERIMENT-ORPO-m7b2-2-merged
Text Generation
•
Updated
•
81
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/EXPERIMENT-ORPO-m7b2-2-lora
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/EXPERIMENT-ORPO-m7b2-1-merged
Text Generation
•
Updated
•
86
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/EXPERIMENT-ORPO-m7b2-1-lora
Text Generation
•
Updated
•
77
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a5c0e82823ba72ed2cee7d/-47llMc1c9Kou9j6KqpO8.png)
G-reen/EXPERIMENT-DPO-m7b2-3-merged
Text Generation
•
Updated
•
158
datasets
14
G-reen/Duet-v0.6
Viewer
•
Updated
•
5k
•
41
G-reen/reflexion-agi
Viewer
•
Updated
•
5k
•
46
•
40
G-reen/TheatreLM-v2.1-Characters
Viewer
•
Updated
•
5.01k
•
91
•
57
G-reen/Duet-v0.5
Viewer
•
Updated
•
5k
•
76
•
22
G-reen/deepmindcodecontestssharegpt
Viewer
•
Updated
•
13.1k
•
48
G-reen/TheatreLM-v2.0-Settings
Viewer
•
Updated
•
200
•
15
G-reen/TheatreLM-v2.0-Characters
Viewer
•
Updated
•
1k
•
17
G-reen/TheatreLM-v2.1-chats-preview
Viewer
•
Updated
•
3.94k
•
62
G-reen/TheatreLM-v2.0-chats-preview
Viewer
•
Updated
•
264
•
19
G-reen/TheatreLM-v1.0-DPO
Viewer
•
Updated
•
1
•
17