Fishball02/llama-topical-chat

Files changed (8)

README.md +23 -7
adapter_model.safetensors +1 -1
runs/Nov10_01-01-47_69332eb5d77e/events.out.tfevents.1699578107.69332eb5d77e.14876.0 +3 -0
runs/Nov10_01-03-01_69332eb5d77e/events.out.tfevents.1699578182.69332eb5d77e.14876.1 +3 -0
runs/Nov10_01-06-45_69332eb5d77e/events.out.tfevents.1699578406.69332eb5d77e.14876.2 +3 -0
runs/Nov10_01-08-23_69332eb5d77e/events.out.tfevents.1699578504.69332eb5d77e.14876.3 +3 -0
runs/Nov10_01-10-41_69332eb5d77e/events.out.tfevents.1699578642.69332eb5d77e.14876.4 +3 -0
training_args.bin +2 -2

README.md CHANGED

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5527
 ## Model description
@@ -34,20 +34,36 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 147  | 2.8403          |
-| No log        | 2.0   | 294  | 2.6048          |
-| No log        | 3.0   | 441  | 2.5527          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.2213
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 4
+- mixed_precision_training: Native AMP
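
As a rough illustration of what `lr_scheduler_type: linear` implies for the `learning_rate` listed above, the sketch below decays the rate linearly from 2e-05 to zero over the run. The function name `linear_lr`, the `warmup_steps` default of 0 (the card lists no warmup), and the `total_steps` value are assumptions made for illustration; none of them are taken from this commit.

```python
def linear_lr(step, total_steps, base_lr=2e-5, warmup_steps=0):
    """Linear warmup (optional) followed by linear decay to zero.

    total_steps and warmup_steps are hypothetical inputs; the model card
    only states the base learning rate and the scheduler type.
    """
    if step < warmup_steps:
        # Ramp up from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay from base_lr at the end of warmup down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Hypothetical run length, chosen only to make the example concrete.
TOTAL_STEPS = 1000
for s in (0, 250, 500, 1000):
    print(s, linear_lr(s, TOTAL_STEPS))
```

With no warmup, the rate at the halfway point is half the base rate, which may help when reading why the validation loss in the results table flattens toward the end of training.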
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.4511        | 0.21  | 256  | 2.3800          |
+| 2.3704        | 0.43  | 512  | 2.3322          |
+| 2.3548        | 0.64  | 768  | 2.2984          |
+| 2.3161        | 0.86  | 1024 | 2.2793          |
+| 2.2941        | 1.07  | 1280 | 2.2647          |
+| 2.276         | 1.29  | 1536 | 2.2542          |
+| 2.2891        | 1.5   | 1792 | 2.2462          |
+| 2.2585        | 1.72  | 2048 | 2.2398          |
+| 2.2395        | 1.93  | 2304 | 2.2357          |
+| 2.2342        | 2.15  | 2560 | 2.2326          |
+| 2.2343        | 2.36  | 2816 | 2.2311          |
+| 2.217         | 2.58  | 3072 | 2.2277          |
+| 2.2263        | 2.79  | 3328 | 2.2252          |
+| 2.244         | 3.0   | 3584 | 2.2237          |
+| 2.2198        | 3.22  | 3840 | 2.2229          |
+| 2.2157        | 3.43  | 4096 | 2.2224          |
+| 2.1992        | 3.65  | 4352 | 2.2219          |
+| 2.2254        | 3.86  | 4608 | 2.2213          |
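
To put the validation losses in more familiar terms, the sketch below converts them to perplexity. It assumes the reported loss is the mean per-token cross-entropy in nats, which the card does not state explicitly; the helper name `perplexity` is introduced here for illustration.

```python
import math


def perplexity(loss):
    """Perplexity for a mean per-token cross-entropy loss given in nats."""
    return math.exp(loss)


# Final evaluation losses reported before and after this commit.
old_loss = 2.5527
new_loss = 2.2213

old_ppl = perplexity(old_loss)
new_ppl = perplexity(new_loss)
print(f"old: {old_ppl:.2f}  new: {new_ppl:.2f}")
```

Under that assumption, the drop in loss from 2.5527 to 2.2213 corresponds to perplexity falling from roughly 12.8 to roughly 9.2.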
 ### Framework versions

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:031d752ae33862d1a09639c82c3cbec7b612398c7ac9379c0df7b7c005fe31e1
 size 134235048

 version https://git-lfs.github.com/spec/v1
+oid sha256:e30fea857b97e033354a16c4aabae19f3d46b42975e1f8e287bc7b2b219ab0a5
 size 134235048
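
The three lines shown for each binary file are a Git LFS pointer, not the file contents: only the `oid` hash changes here while `size` stays the same. A minimal sketch of reading such a pointer follows; the function name `parse_lfs_pointer` and the returned dictionary layout are assumptions for illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer for the updated adapter_model.safetensors, copied from the diff.
EXAMPLE = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e30fea857b97e033354a16c4aabae19f3d46b42975e1f8e287bc7b2b219ab0a5
size 134235048
"""

pointer = parse_lfs_pointer(EXAMPLE)
print(pointer["oid_algo"], pointer["size"])
```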

runs/Nov10_01-01-47_69332eb5d77e/events.out.tfevents.1699578107.69332eb5d77e.14876.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f56664edaf0388ae3d586fc9b3a14803f43a4bcdc7303ad0fafdf7093852be88
+size 4619

runs/Nov10_01-03-01_69332eb5d77e/events.out.tfevents.1699578182.69332eb5d77e.14876.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9d4235777c75fd83f3272a1dcbe9d8096afe180a46b0a8b29ebeda40745f826b
+size 5545

runs/Nov10_01-06-45_69332eb5d77e/events.out.tfevents.1699578406.69332eb5d77e.14876.2 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:eca5cab4ddf413fb5f40d376d8aec5ef390864c8f5ba2d5e39540fc58a7430c7
+size 4770

runs/Nov10_01-08-23_69332eb5d77e/events.out.tfevents.1699578504.69332eb5d77e.14876.3 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:399caf6b885606cfdaba1f4054a1713744d43a80bd66a273487c9aaeb402de5e
+size 4772

runs/Nov10_01-10-41_69332eb5d77e/events.out.tfevents.1699578642.69332eb5d77e.14876.4 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:45497559df6ee8efb35701f0802db5afc8594b22f70dd2263dcc3526a2aad802
+size 12676

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:91cc86ea45b2fca7fae9fe3634cdd61f0cb3896ce3dff60ffc605df4c9d791b7
-size 4600

 version https://git-lfs.github.com/spec/v1
+oid sha256:283660df3ac10267291c7441e6210e2f31ce3de8207326fb26d26a366e9c4586
+size 4536