Spaces:

Korron
/

music-recommendation

Runtime error

App Files Files Community

Korron commited on May 29, 2024

Commit

5f08496

1 Parent(s): 8fc6d4d

baseline

Browse files

This view is limited to 50 files because it contains too many changes. See raw diff

Files changed (50) hide show

MusicBot-ft/checkpoint-10/README.md +202 -0
MusicBot-ft/checkpoint-10/adapter_config.json +28 -0
MusicBot-ft/checkpoint-10/adapter_model.safetensors +3 -0
MusicBot-ft/checkpoint-10/optimizer.pt +3 -0
MusicBot-ft/checkpoint-10/rng_state.pth +3 -0
MusicBot-ft/checkpoint-10/scheduler.pt +3 -0
MusicBot-ft/checkpoint-10/trainer_state.json +96 -0
MusicBot-ft/checkpoint-10/training_args.bin +3 -0
MusicBot-ft/checkpoint-12/README.md +202 -0
MusicBot-ft/checkpoint-12/adapter_config.json +28 -0
MusicBot-ft/checkpoint-12/adapter_model.safetensors +3 -0
MusicBot-ft/checkpoint-12/optimizer.pt +3 -0
MusicBot-ft/checkpoint-12/rng_state.pth +3 -0
MusicBot-ft/checkpoint-12/scheduler.pt +3 -0
MusicBot-ft/checkpoint-12/trainer_state.json +111 -0
MusicBot-ft/checkpoint-12/training_args.bin +3 -0
MusicBot-ft/checkpoint-14/README.md +202 -0
MusicBot-ft/checkpoint-14/adapter_config.json +28 -0
MusicBot-ft/checkpoint-14/adapter_model.safetensors +3 -0
MusicBot-ft/checkpoint-14/optimizer.pt +3 -0
MusicBot-ft/checkpoint-14/rng_state.pth +3 -0
MusicBot-ft/checkpoint-14/scheduler.pt +3 -0
MusicBot-ft/checkpoint-14/trainer_state.json +126 -0
MusicBot-ft/checkpoint-14/training_args.bin +3 -0
MusicBot-ft/checkpoint-16/README.md +202 -0
MusicBot-ft/checkpoint-16/adapter_config.json +28 -0
MusicBot-ft/checkpoint-16/adapter_model.safetensors +3 -0
MusicBot-ft/checkpoint-16/optimizer.pt +3 -0
MusicBot-ft/checkpoint-16/rng_state.pth +3 -0
MusicBot-ft/checkpoint-16/scheduler.pt +3 -0
MusicBot-ft/checkpoint-16/trainer_state.json +141 -0
MusicBot-ft/checkpoint-16/training_args.bin +3 -0
MusicBot-ft/checkpoint-18/README.md +202 -0
MusicBot-ft/checkpoint-18/adapter_config.json +28 -0
MusicBot-ft/checkpoint-18/adapter_model.safetensors +3 -0
MusicBot-ft/checkpoint-18/optimizer.pt +3 -0
MusicBot-ft/checkpoint-18/rng_state.pth +3 -0
MusicBot-ft/checkpoint-18/scheduler.pt +3 -0
MusicBot-ft/checkpoint-18/trainer_state.json +156 -0
MusicBot-ft/checkpoint-18/training_args.bin +3 -0
MusicBot-ft/checkpoint-2/README.md +202 -0
MusicBot-ft/checkpoint-2/adapter_config.json +28 -0
MusicBot-ft/checkpoint-2/adapter_model.safetensors +3 -0
MusicBot-ft/checkpoint-2/optimizer.pt +3 -0
MusicBot-ft/checkpoint-2/rng_state.pth +3 -0
MusicBot-ft/checkpoint-2/scheduler.pt +3 -0
MusicBot-ft/checkpoint-2/trainer_state.json +36 -0
MusicBot-ft/checkpoint-2/training_args.bin +3 -0
MusicBot-ft/checkpoint-20/README.md +202 -0
MusicBot-ft/checkpoint-20/adapter_config.json +28 -0

MusicBot-ft/checkpoint-10/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+library_name: peft
+base_model: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.11.1

MusicBot-ft/checkpoint-10/adapter_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}

MusicBot-ft/checkpoint-10/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b3f848d44395cbebea0a898556c5ca03163efadeee9f966352d6ef710cde35ff
+size 8397056

MusicBot-ft/checkpoint-10/optimizer.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:10476d98404b953836673f9a14d1b1d852783b13f38dbfeb74630d04bcda8152
+size 4279546

MusicBot-ft/checkpoint-10/rng_state.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:578dbb8e06c1f8419c8a4c6b88841fba86a58d3db56b75a421170205eea7dbb4
+size 14244

MusicBot-ft/checkpoint-10/scheduler.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e55b45c789d4b050974f1819cfbf5c274f2aacee67bdf8bfe1ed6c696623c88c
+size 1064

MusicBot-ft/checkpoint-10/trainer_state.json ADDED Viewed

	@@ -0,0 +1,96 @@

+{
+  "best_metric": 2.165768623352051,
+  "best_model_checkpoint": "MusicBot-ft/checkpoint-10",
+  "epoch": 5.0,
+  "eval_steps": 500,
+  "global_step": 10,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 1.0,
+      "grad_norm": 2.1322076320648193,
+      "learning_rate": 0.0003,
+      "loss": 3.3408,
+      "step": 2
+    },
+    {
+      "epoch": 1.0,
+      "eval_loss": 3.314174175262451,
+      "eval_runtime": 4.3893,
+      "eval_samples_per_second": 1.823,
+      "eval_steps_per_second": 0.456,
+      "step": 2
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 2.0614089965820312,
+      "learning_rate": 0.0002727272727272727,
+      "loss": 3.075,
+      "step": 4
+    },
+    {
+      "epoch": 2.0,
+      "eval_loss": 2.82991886138916,
+      "eval_runtime": 4.4015,
+      "eval_samples_per_second": 1.818,
+      "eval_steps_per_second": 0.454,
+      "step": 4
+    },
+    {
+      "epoch": 3.0,
+      "grad_norm": 1.8810017108917236,
+      "learning_rate": 0.00024545454545454545,
+      "loss": 2.618,
+      "step": 6
+    },
+    {
+      "epoch": 3.0,
+      "eval_loss": 2.520831823348999,
+      "eval_runtime": 4.3985,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 6
+    },
+    {
+      "epoch": 4.0,
+      "grad_norm": 2.1504602432250977,
+      "learning_rate": 0.00021818181818181816,
+      "loss": 2.3422,
+      "step": 8
+    },
+    {
+      "epoch": 4.0,
+      "eval_loss": 2.325821876525879,
+      "eval_runtime": 4.398,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 8
+    },
+    {
+      "epoch": 5.0,
+      "grad_norm": 2.3659000396728516,
+      "learning_rate": 0.0001909090909090909,
+      "loss": 2.1455,
+      "step": 10
+    },
+    {
+      "epoch": 5.0,
+      "eval_loss": 2.165768623352051,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 10
+    }
+  ],
+  "logging_steps": 500,
+  "max_steps": 24,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 12,
+  "save_steps": 500,
+  "total_flos": 43703827070976.0,
+  "train_batch_size": 4,
+  "trial_name": null,
+  "trial_params": null
+}

MusicBot-ft/checkpoint-10/training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6f7ebc35e6bb8f7d9e32cc230c02462c36709bdd1c89554a8250bf4f37175bfd
+size 4984

MusicBot-ft/checkpoint-12/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+library_name: peft
+base_model: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.11.1

MusicBot-ft/checkpoint-12/adapter_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}

MusicBot-ft/checkpoint-12/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b7877e4ad2685b8e045a82298cb34e464c276df1409a1e86bc806f248f5530ff
+size 8397056

MusicBot-ft/checkpoint-12/optimizer.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d92f7c76ab84c690042a6a632eca68b675ba90c91791e455fc7a6f6616569847
+size 4279546

MusicBot-ft/checkpoint-12/rng_state.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0c08c4723c8699bee87e1f0e00043fccd14116ae1819375e5d461f22525fe03b
+size 14244

MusicBot-ft/checkpoint-12/scheduler.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:26fd7fea26595da9e3df5d54844f861b21f2a292c3f864a7dce63619b82176df
+size 1064

MusicBot-ft/checkpoint-12/trainer_state.json ADDED Viewed

	@@ -0,0 +1,111 @@

+{
+  "best_metric": 2.0470268726348877,
+  "best_model_checkpoint": "MusicBot-ft/checkpoint-12",
+  "epoch": 6.0,
+  "eval_steps": 500,
+  "global_step": 12,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 1.0,
+      "grad_norm": 2.1322076320648193,
+      "learning_rate": 0.0003,
+      "loss": 3.3408,
+      "step": 2
+    },
+    {
+      "epoch": 1.0,
+      "eval_loss": 3.314174175262451,
+      "eval_runtime": 4.3893,
+      "eval_samples_per_second": 1.823,
+      "eval_steps_per_second": 0.456,
+      "step": 2
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 2.0614089965820312,
+      "learning_rate": 0.0002727272727272727,
+      "loss": 3.075,
+      "step": 4
+    },
+    {
+      "epoch": 2.0,
+      "eval_loss": 2.82991886138916,
+      "eval_runtime": 4.4015,
+      "eval_samples_per_second": 1.818,
+      "eval_steps_per_second": 0.454,
+      "step": 4
+    },
+    {
+      "epoch": 3.0,
+      "grad_norm": 1.8810017108917236,
+      "learning_rate": 0.00024545454545454545,
+      "loss": 2.618,
+      "step": 6
+    },
+    {
+      "epoch": 3.0,
+      "eval_loss": 2.520831823348999,
+      "eval_runtime": 4.3985,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 6
+    },
+    {
+      "epoch": 4.0,
+      "grad_norm": 2.1504602432250977,
+      "learning_rate": 0.00021818181818181816,
+      "loss": 2.3422,
+      "step": 8
+    },
+    {
+      "epoch": 4.0,
+      "eval_loss": 2.325821876525879,
+      "eval_runtime": 4.398,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 8
+    },
+    {
+      "epoch": 5.0,
+      "grad_norm": 2.3659000396728516,
+      "learning_rate": 0.0001909090909090909,
+      "loss": 2.1455,
+      "step": 10
+    },
+    {
+      "epoch": 5.0,
+      "eval_loss": 2.165768623352051,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 10
+    },
+    {
+      "epoch": 6.0,
+      "grad_norm": 4.212636470794678,
+      "learning_rate": 0.0001636363636363636,
+      "loss": 1.9805,
+      "step": 12
+    },
+    {
+      "epoch": 6.0,
+      "eval_loss": 2.0470268726348877,
+      "eval_runtime": 4.4026,
+      "eval_samples_per_second": 1.817,
+      "eval_steps_per_second": 0.454,
+      "step": 12
+    }
+  ],
+  "logging_steps": 500,
+  "max_steps": 24,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 12,
+  "save_steps": 500,
+  "total_flos": 52097446969344.0,
+  "train_batch_size": 4,
+  "trial_name": null,
+  "trial_params": null
+}

MusicBot-ft/checkpoint-12/training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6f7ebc35e6bb8f7d9e32cc230c02462c36709bdd1c89554a8250bf4f37175bfd
+size 4984

MusicBot-ft/checkpoint-14/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+library_name: peft
+base_model: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.11.1

MusicBot-ft/checkpoint-14/adapter_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}

MusicBot-ft/checkpoint-14/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a3221af7d2caad1b71f38d7877328c0147670c8baef5aae692d35a6ee3279d64
+size 8397056

MusicBot-ft/checkpoint-14/optimizer.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ec131a96a4796bb6f7d0221de0d12dd79e2018e45247c68e0b37f02fc9978d70
+size 4279546

MusicBot-ft/checkpoint-14/rng_state.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:693b96676e503de86029c44092e321874295d0d8c7dbf832a6a2bd2e07f6256e
+size 14244

MusicBot-ft/checkpoint-14/scheduler.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:56355bc3114a317b20553e75349b70c7d602db55255dbefaa1bdc107e13ba98d
+size 1064

MusicBot-ft/checkpoint-14/trainer_state.json ADDED Viewed

	@@ -0,0 +1,126 @@

+{
+  "best_metric": 1.9581711292266846,
+  "best_model_checkpoint": "MusicBot-ft/checkpoint-14",
+  "epoch": 7.0,
+  "eval_steps": 500,
+  "global_step": 14,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 1.0,
+      "grad_norm": 2.1322076320648193,
+      "learning_rate": 0.0003,
+      "loss": 3.3408,
+      "step": 2
+    },
+    {
+      "epoch": 1.0,
+      "eval_loss": 3.314174175262451,
+      "eval_runtime": 4.3893,
+      "eval_samples_per_second": 1.823,
+      "eval_steps_per_second": 0.456,
+      "step": 2
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 2.0614089965820312,
+      "learning_rate": 0.0002727272727272727,
+      "loss": 3.075,
+      "step": 4
+    },
+    {
+      "epoch": 2.0,
+      "eval_loss": 2.82991886138916,
+      "eval_runtime": 4.4015,
+      "eval_samples_per_second": 1.818,
+      "eval_steps_per_second": 0.454,
+      "step": 4
+    },
+    {
+      "epoch": 3.0,
+      "grad_norm": 1.8810017108917236,
+      "learning_rate": 0.00024545454545454545,
+      "loss": 2.618,
+      "step": 6
+    },
+    {
+      "epoch": 3.0,
+      "eval_loss": 2.520831823348999,
+      "eval_runtime": 4.3985,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 6
+    },
+    {
+      "epoch": 4.0,
+      "grad_norm": 2.1504602432250977,
+      "learning_rate": 0.00021818181818181816,
+      "loss": 2.3422,
+      "step": 8
+    },
+    {
+      "epoch": 4.0,
+      "eval_loss": 2.325821876525879,
+      "eval_runtime": 4.398,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 8
+    },
+    {
+      "epoch": 5.0,
+      "grad_norm": 2.3659000396728516,
+      "learning_rate": 0.0001909090909090909,
+      "loss": 2.1455,
+      "step": 10
+    },
+    {
+      "epoch": 5.0,
+      "eval_loss": 2.165768623352051,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 10
+    },
+    {
+      "epoch": 6.0,
+      "grad_norm": 4.212636470794678,
+      "learning_rate": 0.0001636363636363636,
+      "loss": 1.9805,
+      "step": 12
+    },
+    {
+      "epoch": 6.0,
+      "eval_loss": 2.0470268726348877,
+      "eval_runtime": 4.4026,
+      "eval_samples_per_second": 1.817,
+      "eval_steps_per_second": 0.454,
+      "step": 12
+    },
+    {
+      "epoch": 7.0,
+      "grad_norm": 4.396366596221924,
+      "learning_rate": 0.00013636363636363634,
+      "loss": 1.8691,
+      "step": 14
+    },
+    {
+      "epoch": 7.0,
+      "eval_loss": 1.9581711292266846,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 14
+    }
+  ],
+  "logging_steps": 500,
+  "max_steps": 24,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 12,
+  "save_steps": 500,
+  "total_flos": 60990648975360.0,
+  "train_batch_size": 4,
+  "trial_name": null,
+  "trial_params": null
+}

MusicBot-ft/checkpoint-14/training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6f7ebc35e6bb8f7d9e32cc230c02462c36709bdd1c89554a8250bf4f37175bfd
+size 4984

MusicBot-ft/checkpoint-16/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+library_name: peft
+base_model: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.11.1

MusicBot-ft/checkpoint-16/adapter_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}

MusicBot-ft/checkpoint-16/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5e057e44845720a9c29035efa4e9f48a2745a8fb4abaf6fc4d4fdcc343ddf492
+size 8397056

MusicBot-ft/checkpoint-16/optimizer.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fd5f151be3bbab353eaef672e427285146cb351d14e5acdd906ce8bb8dca02a0
+size 4279546

MusicBot-ft/checkpoint-16/rng_state.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:238374c4874bb27a0e6c707c18e07d383ab67519b59df12e56d9cf440264e056
+size 14244

MusicBot-ft/checkpoint-16/scheduler.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f6414c9e2c8bbada63e5376a8ae3f1ad7bc4eed74cf5ab4a9b627ccf90b4685a
+size 1064

MusicBot-ft/checkpoint-16/trainer_state.json ADDED Viewed

	@@ -0,0 +1,141 @@

+{
+  "best_metric": 1.8912162780761719,
+  "best_model_checkpoint": "MusicBot-ft/checkpoint-16",
+  "epoch": 8.0,
+  "eval_steps": 500,
+  "global_step": 16,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 1.0,
+      "grad_norm": 2.1322076320648193,
+      "learning_rate": 0.0003,
+      "loss": 3.3408,
+      "step": 2
+    },
+    {
+      "epoch": 1.0,
+      "eval_loss": 3.314174175262451,
+      "eval_runtime": 4.3893,
+      "eval_samples_per_second": 1.823,
+      "eval_steps_per_second": 0.456,
+      "step": 2
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 2.0614089965820312,
+      "learning_rate": 0.0002727272727272727,
+      "loss": 3.075,
+      "step": 4
+    },
+    {
+      "epoch": 2.0,
+      "eval_loss": 2.82991886138916,
+      "eval_runtime": 4.4015,
+      "eval_samples_per_second": 1.818,
+      "eval_steps_per_second": 0.454,
+      "step": 4
+    },
+    {
+      "epoch": 3.0,
+      "grad_norm": 1.8810017108917236,
+      "learning_rate": 0.00024545454545454545,
+      "loss": 2.618,
+      "step": 6
+    },
+    {
+      "epoch": 3.0,
+      "eval_loss": 2.520831823348999,
+      "eval_runtime": 4.3985,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 6
+    },
+    {
+      "epoch": 4.0,
+      "grad_norm": 2.1504602432250977,
+      "learning_rate": 0.00021818181818181816,
+      "loss": 2.3422,
+      "step": 8
+    },
+    {
+      "epoch": 4.0,
+      "eval_loss": 2.325821876525879,
+      "eval_runtime": 4.398,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 8
+    },
+    {
+      "epoch": 5.0,
+      "grad_norm": 2.3659000396728516,
+      "learning_rate": 0.0001909090909090909,
+      "loss": 2.1455,
+      "step": 10
+    },
+    {
+      "epoch": 5.0,
+      "eval_loss": 2.165768623352051,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 10
+    },
+    {
+      "epoch": 6.0,
+      "grad_norm": 4.212636470794678,
+      "learning_rate": 0.0001636363636363636,
+      "loss": 1.9805,
+      "step": 12
+    },
+    {
+      "epoch": 6.0,
+      "eval_loss": 2.0470268726348877,
+      "eval_runtime": 4.4026,
+      "eval_samples_per_second": 1.817,
+      "eval_steps_per_second": 0.454,
+      "step": 12
+    },
+    {
+      "epoch": 7.0,
+      "grad_norm": 4.396366596221924,
+      "learning_rate": 0.00013636363636363634,
+      "loss": 1.8691,
+      "step": 14
+    },
+    {
+      "epoch": 7.0,
+      "eval_loss": 1.9581711292266846,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 14
+    },
+    {
+      "epoch": 8.0,
+      "grad_norm": 6.510736465454102,
+      "learning_rate": 0.00010909090909090908,
+      "loss": 1.7738,
+      "step": 16
+    },
+    {
+      "epoch": 8.0,
+      "eval_loss": 1.8912162780761719,
+      "eval_runtime": 4.4091,
+      "eval_samples_per_second": 1.814,
+      "eval_steps_per_second": 0.454,
+      "step": 16
+    }
+  ],
+  "logging_steps": 500,
+  "max_steps": 24,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 12,
+  "save_steps": 500,
+  "total_flos": 69618047680512.0,
+  "train_batch_size": 4,
+  "trial_name": null,
+  "trial_params": null
+}

MusicBot-ft/checkpoint-16/training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6f7ebc35e6bb8f7d9e32cc230c02462c36709bdd1c89554a8250bf4f37175bfd
+size 4984

MusicBot-ft/checkpoint-18/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+library_name: peft
+base_model: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.11.1

MusicBot-ft/checkpoint-18/adapter_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}

MusicBot-ft/checkpoint-18/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:be31eb38929ef36597538b9ddcbea10dbdd1ead07ae34d4ce425d979d755477f
+size 8397056

MusicBot-ft/checkpoint-18/optimizer.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1eb47f8507de803c62d9507d1260bb3c409c80d2349020d9a547075a2f2c3720
+size 4279546

MusicBot-ft/checkpoint-18/rng_state.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:aba13aa2d646c3223d812a90205872708f2602b2cd07c030620189f98a57af39
+size 14244

MusicBot-ft/checkpoint-18/scheduler.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9dfb515925badad287b1fb4432c6ed6bf530efc8a89a11d532ed891c0ff17850
+size 1064

MusicBot-ft/checkpoint-18/trainer_state.json ADDED Viewed

	@@ -0,0 +1,156 @@

+{
+  "best_metric": 1.8643579483032227,
+  "best_model_checkpoint": "MusicBot-ft/checkpoint-18",
+  "epoch": 9.0,
+  "eval_steps": 500,
+  "global_step": 18,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 1.0,
+      "grad_norm": 2.1322076320648193,
+      "learning_rate": 0.0003,
+      "loss": 3.3408,
+      "step": 2
+    },
+    {
+      "epoch": 1.0,
+      "eval_loss": 3.314174175262451,
+      "eval_runtime": 4.3893,
+      "eval_samples_per_second": 1.823,
+      "eval_steps_per_second": 0.456,
+      "step": 2
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 2.0614089965820312,
+      "learning_rate": 0.0002727272727272727,
+      "loss": 3.075,
+      "step": 4
+    },
+    {
+      "epoch": 2.0,
+      "eval_loss": 2.82991886138916,
+      "eval_runtime": 4.4015,
+      "eval_samples_per_second": 1.818,
+      "eval_steps_per_second": 0.454,
+      "step": 4
+    },
+    {
+      "epoch": 3.0,
+      "grad_norm": 1.8810017108917236,
+      "learning_rate": 0.00024545454545454545,
+      "loss": 2.618,
+      "step": 6
+    },
+    {
+      "epoch": 3.0,
+      "eval_loss": 2.520831823348999,
+      "eval_runtime": 4.3985,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 6
+    },
+    {
+      "epoch": 4.0,
+      "grad_norm": 2.1504602432250977,
+      "learning_rate": 0.00021818181818181816,
+      "loss": 2.3422,
+      "step": 8
+    },
+    {
+      "epoch": 4.0,
+      "eval_loss": 2.325821876525879,
+      "eval_runtime": 4.398,
+      "eval_samples_per_second": 1.819,
+      "eval_steps_per_second": 0.455,
+      "step": 8
+    },
+    {
+      "epoch": 5.0,
+      "grad_norm": 2.3659000396728516,
+      "learning_rate": 0.0001909090909090909,
+      "loss": 2.1455,
+      "step": 10
+    },
+    {
+      "epoch": 5.0,
+      "eval_loss": 2.165768623352051,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 10
+    },
+    {
+      "epoch": 6.0,
+      "grad_norm": 4.212636470794678,
+      "learning_rate": 0.0001636363636363636,
+      "loss": 1.9805,
+      "step": 12
+    },
+    {
+      "epoch": 6.0,
+      "eval_loss": 2.0470268726348877,
+      "eval_runtime": 4.4026,
+      "eval_samples_per_second": 1.817,
+      "eval_steps_per_second": 0.454,
+      "step": 12
+    },
+    {
+      "epoch": 7.0,
+      "grad_norm": 4.396366596221924,
+      "learning_rate": 0.00013636363636363634,
+      "loss": 1.8691,
+      "step": 14
+    },
+    {
+      "epoch": 7.0,
+      "eval_loss": 1.9581711292266846,
+      "eval_runtime": 4.3935,
+      "eval_samples_per_second": 1.821,
+      "eval_steps_per_second": 0.455,
+      "step": 14
+    },
+    {
+      "epoch": 8.0,
+      "grad_norm": 6.510736465454102,
+      "learning_rate": 0.00010909090909090908,
+      "loss": 1.7738,
+      "step": 16
+    },
+    {
+      "epoch": 8.0,
+      "eval_loss": 1.8912162780761719,
+      "eval_runtime": 4.4091,
+      "eval_samples_per_second": 1.814,
+      "eval_steps_per_second": 0.454,
+      "step": 16
+    },
+    {
+      "epoch": 9.0,
+      "grad_norm": Infinity,
+      "learning_rate": 9.545454545454545e-05,
+      "loss": 1.7041,
+      "step": 18
+    },
+    {
+      "epoch": 9.0,
+      "eval_loss": 1.8643579483032227,
+      "eval_runtime": 4.408,
+      "eval_samples_per_second": 1.815,
+      "eval_steps_per_second": 0.454,
+      "step": 18
+    }
+  ],
+  "logging_steps": 500,
+  "max_steps": 24,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 12,
+  "save_steps": 500,
+  "total_flos": 78924365660160.0,
+  "train_batch_size": 4,
+  "trial_name": null,
+  "trial_params": null
+}

MusicBot-ft/checkpoint-18/training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6f7ebc35e6bb8f7d9e32cc230c02462c36709bdd1c89554a8250bf4f37175bfd
+size 4984

MusicBot-ft/checkpoint-2/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+library_name: peft
+base_model: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.11.1

MusicBot-ft/checkpoint-2/adapter_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}

MusicBot-ft/checkpoint-2/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ea3372491475ba92fd1211ef0274d0797286892b2987143b8f393c24a7f9ae76
+size 8397056

MusicBot-ft/checkpoint-2/optimizer.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0a9d39f7e32a772e7e53fdebc915ee6006af212e155f3c014c42c1fb4867ebcb
+size 4279546

MusicBot-ft/checkpoint-2/rng_state.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d78a5494664cf913ce325a101c6ab36fec200803191880843cac974227751885
+size 14244

MusicBot-ft/checkpoint-2/scheduler.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:81816625d794a79e004b837cc47a1be5d185a18f93cb9708671510ac834114f9
+size 1064

MusicBot-ft/checkpoint-2/trainer_state.json ADDED Viewed

	@@ -0,0 +1,36 @@

+{
+  "best_metric": 3.314174175262451,
+  "best_model_checkpoint": "MusicBot-ft/checkpoint-2",
+  "epoch": 1.0,
+  "eval_steps": 500,
+  "global_step": 2,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 1.0,
+      "grad_norm": 2.1322076320648193,
+      "learning_rate": 0.0003,
+      "loss": 3.3408,
+      "step": 2
+    },
+    {
+      "epoch": 1.0,
+      "eval_loss": 3.314174175262451,
+      "eval_runtime": 4.3893,
+      "eval_samples_per_second": 1.823,
+      "eval_steps_per_second": 0.456,
+      "step": 2
+    }
+  ],
+  "logging_steps": 500,
+  "max_steps": 24,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 12,
+  "save_steps": 500,
+  "total_flos": 8566552166400.0,
+  "train_batch_size": 4,
+  "trial_name": null,
+  "trial_params": null
+}

MusicBot-ft/checkpoint-2/training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6f7ebc35e6bb8f7d9e32cc230c02462c36709bdd1c89554a8250bf4f37175bfd
+size 4984

MusicBot-ft/checkpoint-20/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+library_name: peft
+base_model: TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.11.1

MusicBot-ft/checkpoint-20/adapter_config.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 8,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}