PaTaRM is a Generative Reward Model (GRM) for RLHF alignment.
JianAi
AIJian
Recent Activity
upvoted a paper 26 days ago
OProver: A Unified Framework for Agentic Formal Theorem Proving
updated a model 2 months ago
AIJian/PaTaRM-8B