Ruprecht-Karls-Universität Heidelberg
Institut für Computerlinguistik

Bilder vom Neuenheimer Feld, Heidelberg und der Universität Heidelberg

Process Reward Modeling in LLMs

Module Description

Course Module Abbreviation Credit Points
BA-2010[100%|75%] CS-CL 6 LP
BA-2010[50%] BS-CL 6 LP
BA-2010[25%] BS-AC 4 LP
BA-2010 AS-CL 8 LP
Master SS-CL-TAC 8 LP
Lecturer Lei Tang
Module Type Proseminar / Hauptseminar
Language English
First Session 16.04.2026
Time and Place Thursday, 15:15 - 16:45,
INF 326 / SR 27
Commitment Period tbd.

Participants

All advanced CL Bachelor students and all CL master students. Students from MSc Data and Computer Science or MSc Scientific Computing with Field of Application Computational Linguistics are welcome after getting permission from the lecturer. MSc Scientific Computing students can only take the course as HS for 8 LP.  If the seminar should be oversubscribed, CL students will have priority.  

Prerequisites for Participation

  • Statistical Natural Language Processing
  • Basic Knowledge in Neural Networks

Assessment

  • Presentation (50%)
  • Project (50%)

Content

Recent advances in Large Language Models (LLMs) suggest that Process Reward Models (PRMs) offer a promising approach to verifying intermediate reasoning steps and enhancing model performance. In this seminar, we will provide a systematic review of foundational and influential papers on PRMs.

» More Materials

zum Seitenanfang