Update README.md
Browse files
README.md
CHANGED
@@ -10,17 +10,19 @@ This is a Turkish Tiny uncased BERT model, developed to fill the gap for small-s
|
|
10 |
|
11 |
#### ⚠ Uncased use requires manual lowercase conversion
|
12 |
|
13 |
-
|
|
|
14 |
```python
|
15 |
text.replace("I", "ı").lower()
|
16 |
```
|
|
|
17 |
|
18 |
Be aware that this model may exhibit biased predictions as it was trained primarily on crawled data, which inherently can contain various biases.
|
19 |
|
20 |
Other relevant information can be found in the [paper](https://arxiv.org/abs/2307.14134).
|
21 |
|
22 |
|
23 |
-
|
24 |
```python
|
25 |
from transformers import AutoTokenizer, BertForMaskedLM
|
26 |
from transformers import pipeline
|
|
|
10 |
|
11 |
#### ⚠ Uncased use requires manual lowercase conversion
|
12 |
|
13 |
+
|
14 |
+
**Don't** use the `do_lower_case = True` flag with the tokenizer. Instead, convert your text to lower case as follows:
|
15 |
```python
|
16 |
text.replace("I", "ı").lower()
|
17 |
```
|
18 |
+
This is due to a [known issue](https://github.com/huggingface/transformers/issues/6680) with the tokenizer.
|
19 |
|
20 |
Be aware that this model may exhibit biased predictions as it was trained primarily on crawled data, which inherently can contain various biases.
|
21 |
|
22 |
Other relevant information can be found in the [paper](https://arxiv.org/abs/2307.14134).
|
23 |
|
24 |
|
25 |
+
## Example Usage
|
26 |
```python
|
27 |
from transformers import AutoTokenizer, BertForMaskedLM
|
28 |
from transformers import pipeline
|