The *RLMT* collection. Coming soon!
Princeton NLP group
princeton-nlp
AI & ML interests
None yet
Organizations
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 1k • • 172 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 11 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 16 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 223 •
RLMT Experiments
The *RLMT* collection. Coming soon!
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 1k • • 172 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 11 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 16 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 223 •
models 306
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B-Instruct
8B • Updated • 3
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B-Instruct
8B • Updated • 3
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B
8B • Updated • 2
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B
8B • Updated • 3
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B-Instruct
8B • Updated • 3
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B-Instruct
8B • Updated • 2
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B
8B • Updated • 2
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B
8B • Updated • 2
princeton-nlp/zero__grpo__nothink__Qwen2.5-7B
8B • Updated • 2
princeton-nlp/zero__grpo__nothink__Llama-3.1-8B
8B • Updated • 3
datasets 47
princeton-nlp/rl_tulu3_wildchat-if_prompts
Viewer • Updated • 7.79k • 11 • 5
princeton-nlp/gemini_2.5_flash_0417_sft-data
Viewer • Updated • 6k • 21 • 1
princeton-nlp/prolong-data-512K
Updated • 10.8k • 12
princeton-nlp/SWE-bench_Lite
Viewer • Updated • 323 • 136k • 64
princeton-nlp/SWE-bench
Viewer • Updated • 21.5k • 29.4k • 143
princeton-nlp/SWE-bench_Verified
Viewer • Updated • 500 • 819k • 355
princeton-nlp/TextbooksBySubject
Viewer • Updated • 129 • 13 • 1
princeton-nlp/TextbookChapters
Viewer • Updated • 77.9k • 43 • 12
princeton-nlp/SWE-bench_Multimodal
Viewer • Updated • 612 • 4.6k • 21
princeton-nlp/fineweb_edu-swahili-translated
Viewer • Updated • 137k • 44 • 2