answerdotai/ModernBERT-base

#64 opened 5 days ago by

Joseph2805

Question about MLDR Evaluation Metrics in ModernBERT Paper

#62 opened 11 days ago by

WoutDeRijck

I have trained a multilingual version of ModernBert

#60 opened 11 days ago by

neavo

nan or 0.0 loss when training with flash attention

16

#59 opened 11 days ago by

roadtoagi

Modernbert with Golang

#58 opened 15 days ago by

Thibault-Requesty

ModernBERT fails to work without FlashAttention !

#56 opened 18 days ago by

benhachem

Import fails on AWS lamba instance.

#55 opened 20 days ago by

obeijbom

Performance vs the original architecture on approximate original data sizes (BooksCorpus/Wikipedia)

#54 opened 26 days ago by

tollefj

Problem with highly padded sequences

#49 opened about 1 month ago by

fmrs

Speed Benchmarks with MPS Backend

#47 opened about 1 month ago by

mlburnham

Continual pre-training for multilingual support (extend embedding matrix and tokenizer)

#46 opened about 1 month ago by

ibotana

Encountering Error: cannot import name 'shard_checkpoint' from 'transformers.modeling_utils'

#44 opened about 1 month ago by

rkabir

ModernBertModel works on the CPU but fails on the GPU

#43 opened about 1 month ago by

rudigung

ModernBERT-base-chinese

#42 opened about 1 month ago by

ZBW

Error: RuntimeError: Failed to import transformers.models.modernbert.modeling_modernbert because of the following error (look up to see its traceback): Windows not yet supported for torch.compile

#40 opened about 1 month ago by

JoAmps42i

ModernBART wen?

6

#38 opened about 1 month ago by

Fizzarolli

Pretraining Using HF Tokenizers and Transformers

#36 opened about 1 month ago by

akhooli

Update README.md

#35 opened about 1 month ago by

solankibhargav

Unpadding and Sequence Packing inference example?

#34 opened about 1 month ago by

denti

Interview Request: Thoughts on Model Documentation

#33 opened about 1 month ago by

evatang

Training Data?

#32 opened about 2 months ago by

binarymax

What is the position of this model in MTEB leaderboard?

#31 opened about 2 months ago by

deepak-banka

tokenizer

#24 opened about 2 months ago by

ulasarikaya

RuntimeError: Failed to import transformers.models.modernbert.modeling_modernbert

#21 opened about 2 months ago by

SantoshHF

Pretraining data cutoff?

#17 opened about 2 months ago by

ytsaig

How to use ModernBERT with the AutoModelForQuestionAnswering class?

#15 opened about 2 months ago by

sraj

Is ModernBERT already fine-tuned for IR tasks?

#13 opened about 2 months ago by

belerico

Question about output embedding vector of ModernBERT

#12 opened about 2 months ago by

Youm9602

ModernBert for multi-vector embeddings

#11 opened about 2 months ago by

admarcosai

How to use ModernBERT as a sentence transformer?

30

#9 opened about 2 months ago by

hungrybiker

multilingual

#8 opened about 2 months ago by

ale-volpe

Is this model meant for full bfloat16, AMP bfloat16 or no bfloat16?

#7 opened about 2 months ago by

umarbutler

# Fine-tuning ModernBERT on a Large Dataset with Masked Language Modelling

#6 opened about 2 months ago by

ssmits

Precisions about the config properties wrt the paper