BEE-spoke-data/mega-small-embed-synthSTS-16384-v1

Sentence Similarity

sentence-transformers

feature-extraction

efficient attention

Inference Endpoints


pszemraj committed on Mar 15, 2024

Commit aa57866 · verified · 1 Parent(s): d5cd64c

Update README.md

Files changed (1)

README.md +3 -0

@@ -1,4 +1,5 @@
 ---
+base_model: BEE-spoke-data/mega-encoder-small-16k-v1
 library_name: sentence-transformers
 pipeline_tag: sentence-similarity
 tags:
@@ -21,6 +22,8 @@ language:
 This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
+- this model's primary use case is meant to be long-document similarity, i.e. computing embeddings of long documents and comparing those.
+  - check out the training dataset `pszemraj/synthetic-text-similarity` for details
 - pretrained & finetuned at context length 16384
 - This model is a "v1" and we may make improved versions in the future. Or, we may not.
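
The added README lines describe the intended workflow: embed long documents, then compare the embeddings. A minimal sketch of that comparison step follows, not the author's code. It assumes the model loads through the standard `SentenceTransformer(...)` constructor with no extra arguments, which this commit does not confirm. The helper names `embed_documents` and `cosine_similarity` are hypothetical, and the toy vectors stand in for the model's 768-dimensional outputs.

```python
import math
from typing import List, Sequence

# Repository id taken from the page header of this commit.
MODEL_ID = "BEE-spoke-data/mega-small-embed-synthSTS-16384-v1"


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_a * norm_b)


def embed_documents(texts: Sequence[str], model_id: str = MODEL_ID) -> List[List[float]]:
    """Embed documents with sentence-transformers (hypothetical helper).

    ASSUMPTION: the checkpoint loads with the default constructor; it may
    need extra loading options that are not shown in this commit.
    The import is deferred so the similarity helper works without the library.
    """
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id)
    return model.encode(list(texts)).tolist()


if __name__ == "__main__":
    # Toy 3-dimensional vectors in place of real 768-dimensional embeddings.
    doc_a = [0.2, 0.7, 0.1]
    doc_b = [0.25, 0.65, 0.05]
    print(cosine_similarity(doc_a, doc_b))
```

With the library installed, `embed_documents(["first long document ...", "second long document ..."])` would return one vector per document, and any pair can be passed to `cosine_similarity`.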