mav23 commited on
Commit
1d7c9fa
·
verified ·
1 Parent(s): 8ff883c

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +38 -0
  3. arco-plus.Q4_0.gguf +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ arco-plus.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - appvoid/arco
4
+ - h2oai/h2o-danube3-500m-base
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
+
10
+ ---
11
+ # arco+
12
+
13
+ This is an untrained passthrough model based on arco and danube as a first effort to train a small enough reasoning language model that generalizes across all kind of reasoning tasks.
14
+
15
+ #### Benchmarks
16
+
17
+ | Parameters | Model | MMLU | ARC | HellaSwag | PIQA | Winogrande | Average |
18
+ | -----------|--------------------------------|-------|-------|-----------|--------|------------|---------|
19
+ | 488m | arco-lite | **23.22** | 33.45 | 56.55| 69.70 | **59.19**| 48.46 |
20
+ | 773m | arco-plus | 23.06 | **36.43** | **60.09**|**72.36**| **60.46**| **50.48** |
21
+
22
+ #### Configuration
23
+
24
+ The following YAML configuration was used to produce this model:
25
+
26
+ ```yaml
27
+ slices:
28
+ - sources:
29
+ - model: appvoid/arco
30
+ layer_range: [0, 14]
31
+ - sources:
32
+ - model: h2oai/h2o-danube3-500m-base
33
+ layer_range: [4, 16]
34
+
35
+ merge_method: passthrough
36
+ dtype: float16
37
+
38
+ ```
arco-plus.Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e733740ceb8609600ca320be7c6ad18f286604c66104508f43f45076a7a2274c
3
+ size 448577984