imdatta0's picture
Upload best holdout adapter checkpoint
394ca2d verified
raw
history blame contribute delete
270 Bytes
{
"step": 0,
"stage": "post_sft",
"pass@1_first_sample": 0.2,
"mean_reward": 0.3497,
"patch_applied_rate": 0.6857,
"dir": "/mnt/disks/unslothai/datta0/cache/qwen3-grpo-patch/20260605_045145_swegym_q4b-kl02-sft20k-hardmulti_10e3a3b/checkpoints/best_holdout"
}