Commit 902c100 (verified) by ToastyPigeon · Parent(s): ab95e15

Update README.md

Files changed (1): README.md (+5 −36)
README.md CHANGED
@@ -11,43 +11,12 @@ tags:
 - merge
 
 ---
-# ms-idk-v13
-
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
-## Merge Details
-### Merge Method
-
-This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [unsloth/Mistral-Small-Instruct-2409](https://huggingface.co/unsloth/Mistral-Small-Instruct-2409) as a base.
-
-### Models Merged
-
-The following models were included in the merge:
-* [unsloth/Mistral-Small-Instruct-2409](https://huggingface.co/unsloth/Mistral-Small-Instruct-2409) + [Alfitaria/mistral-small-fujin-qlora](https://huggingface.co/Alfitaria/mistral-small-fujin-qlora)
-* [unsloth/Mistral-Small-Instruct-2409](https://huggingface.co/unsloth/Mistral-Small-Instruct-2409) + [ToastyPigeon/mistral-small-springdragon-qlora](https://huggingface.co/ToastyPigeon/mistral-small-springdragon-qlora)
-* output/tempered-rp
-
-### Configuration
-
-The following YAML configuration was used to produce this model:
-
-```yaml
-base_model: unsloth/Mistral-Small-Instruct-2409
-merge_method: task_arithmetic
-slices:
-- sources:
-  - layer_range: [0, 56]
-    model: output/tempered-rp
-    parameters:
-      weight: 0.4
-  - layer_range: [0, 56]
-    model: unsloth/Mistral-Small-Instruct-2409+Alfitaria/mistral-small-fujin-qlora
-    parameters:
-      weight: 0.5
-  - layer_range: [0, 56]
-    model: unsloth/Mistral-Small-Instruct-2409+ToastyPigeon/mistral-small-springdragon-qlora
-    parameters:
-      weight: 0.1
-  - layer_range: [0, 56]
-    model: unsloth/Mistral-Small-Instruct-2409
-```
+# Meadowlark - Alternate Version
+
+This is an alternate version of [Meadowlark](https://huggingface.co/allura-org/MS-Meadowlark-22B). It's distinct/fun enough that I wanted to post it, but not really distinct enough from Meadowlark to be its own thing.
+
+The recipe is very similar to Meadowlark's, but I swapped out the Creative_Writing_Multiturn training data for [Sunfall v0.7.0](https://huggingface.co/crestf411/MS-sunfall-v0.7.0) to get a less-synthetic-feeling RP segment.
+
+Sunfall does include some synthetic data, including some generated by Claude, so it's not exactly what I was looking for, but it does seem to help over what was there before.
+
+For usage information, see the model card for [Meadowlark](https://huggingface.co/allura-org/MS-Meadowlark-22B); everything there applies here as well.
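For reference, the task-arithmetic merge method mentioned in the diff combines models as `base + Σ wᵢ · (modelᵢ − base)`, i.e. weighted "task vectors" (fine-tune minus base) added back onto the base weights. A minimal sketch on toy parameter lists (the function name and all values here are illustrative assumptions, not mergekit's actual API or these checkpoints' weights):

```python
# Toy sketch of task arithmetic: merged = base + sum(w_i * (model_i - base)).
# Real merges operate tensor-by-tensor on checkpoints; plain floats shown here.

def task_arithmetic(base, models, weights):
    """Merge flat parameter lists by adding weighted task vectors to the base."""
    merged = list(base)
    for model, w in zip(models, weights):
        for i, (m, b) in enumerate(zip(model, base)):
            merged[i] += w * (m - b)  # task vector = fine-tuned minus base
    return merged

base = [1.0, 2.0]
tuned_a = [1.5, 2.0]  # hypothetical fine-tune A
tuned_b = [1.0, 3.0]  # hypothetical fine-tune B
print(task_arithmetic(base, [tuned_a, tuned_b], [0.5, 0.1]))  # → [1.25, 2.1]
```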