Could you please do the same for Mistral 7B Instruct?
#27 opened 8 months ago
by
ZeroWw
How to load this model and what is the required hardware?
#26 opened 8 months ago
by
Vision-CAIR
Multi-needle In A Haystack
1
#25 opened 9 months ago
by
ElliottDyson
Rope Theta Value Difference?
#24 opened 9 months ago
by
fahadh4ilyas
Memory requirements to take advantage of full context window
1
#23 opened 9 months ago
by
andrewrreed
Fine-tuning
#22 opened 9 months ago
by
EkmekE
Adding Evaluation Results
#21 opened 9 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#20 opened 9 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#19 opened 9 months ago
by
leaderboard-pr-bot
Performance Degradation After Weight Update
7
#18 opened 9 months ago
by
evilperson068
Error: cannot load
#17 opened 9 months ago
by
yeyeyeyeye2
You should rename your weights every time you update them
#16 opened 9 months ago
by
AiCreatornator
ITS NOT REAL
8
#11 opened 10 months ago
by
rombodawg
GPU requirement for hosting this model?
3
#9 opened 10 months ago
by
csgxy2022
From your experience, what would be a good methodology for using a 1048k model to filter pre-training data?
#8 opened 10 months ago
by
TimeLordRaps
Can you please build an extended version of Mistral Instruct v0.2 too?
1
#6 opened 10 months ago
by
AiModelsMarket
Better context utilization
1
#5 opened 10 months ago
by
DataPhreak