Announcement_0026
My collaborative paper “VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data” with Thomas Zeng, Prof. Kangwook Lee, and others is released at arXiv.
My collaborative paper “VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data” with Thomas Zeng, Prof. Kangwook Lee, and others is released at arXiv.