SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.
Princeton NLP group
princeton-nlp
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 22 hours ago
WebOrganizer/FormatAnnotations-Llama-3.1-405B-FP8
published
a dataset
about 22 hours ago
WebOrganizer/FormatAnnotations-Llama-3.1-405B-FP8
updated
a dataset
about 22 hours ago
WebOrganizer/FormatAnnotations-Llama-3.1-8B
Organizations
Papers
1
models
259
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct
Updated
•
6.38k
•
18
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Llama-3-8B-ProLong-512k-Base
Updated
•
2.15k
•
8
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct
Text Generation
•
Updated
•
5.08k
•
13
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Llama-3-8B-ProLong-64k-Base
Text Generation
•
Updated
•
5.08k
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Mistral-7B-Base-SFT-CPO
Text Generation
•
Updated
•
5.07k
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Mistral-7B-Base-SFT-RRHF
Text Generation
•
Updated
•
5.07k
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation
•
Updated
•
29.7k
•
148
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/gemma-2-9b-it-DPO
Text Generation
•
Updated
•
4.95k
•
8
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2
Text Generation
•
Updated
•
5.42k
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1618969698200-noauth.png)
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2
Text Generation
•
Updated
•
4.99k
•
1
datasets
46
princeton-nlp/TextbooksBySubject
Viewer
•
Updated
•
129
•
31
princeton-nlp/TextbookChapters
Viewer
•
Updated
•
77.9k
•
41
•
6
princeton-nlp/SWE-bench_Multimodal
Viewer
•
Updated
•
612
•
413
•
15
princeton-nlp/fineweb_edu-swahili-translated
Viewer
•
Updated
•
137k
•
64
princeton-nlp/SWE-bench_Verified
Viewer
•
Updated
•
500
•
254k
•
134
princeton-nlp/SWE-bench
Viewer
•
Updated
•
21.5k
•
23.8k
•
94
princeton-nlp/prolong-ultrachat-64K
Preview
•
Updated
•
73
princeton-nlp/HELMET
Viewer
•
Updated
•
516
•
340
•
5
princeton-nlp/prolong-data-64K
Updated
•
5.44k
•
11
princeton-nlp/prolong-data-512K
Updated
•
3.77k
•
5