DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Datasets

Trained on bagel style DPO datasets

Prompt Template

Uses chatml style prompt template

Downloads last month: 1,563

Safetensors

Model size

1.1B params

Tensor type

FP16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for vihangd/DopeyTinyLlama-1.1B-v1

Merges

49 models

Quantizations

2 models