Vorlesung: 39734 Approximate Dynamic Programming (Reinforcement Learning) - Details

Vorlesung: 39734 Approximate Dynamic Programming (Reinforcement Learning) - Details

Sie sind nicht in Stud.IP angemeldet.

Allgemeine Informationen

Veranstaltungsname Vorlesung: 39734 Approximate Dynamic Programming (Reinforcement Learning)
Untertitel vormals "Advanced Topics in Management Science"
Veranstaltungsnummer 39734
Semester WiSe 24/25
Aktuelle Anzahl der Teilnehmenden 113
erwartete Teilnehmendenanzahl 200
Heimat-Einrichtung Lehrstuhl für Betriebswirtschaftslehre mit Schwerpunkt Management Science / Operations and Supply Chain Management
Veranstaltungstyp Vorlesung in der Kategorie Lehre (mit Prüfung)
Nächster Termin Montag, 25.11.2024 14:00 - 16:00 Uhr, Ort: (WIWI) HS 7
Art/Form
Teilnehmende
BA, WI, CMT, M, INF, AIE
Voraussetzungen
Mathematical maturity and the ability to write down precise and rigorous arguments.
Solid basic knowledge of modeling and optimization.
Lernorganisation
• Block course with interactive elements and classroom discussions;
• Solution and discussions of exercises and case studies;
• Online forums and discussions;
• A take-home mock exam to simulate the final exam of the course. Discussion of this mock exam;
• Blended learning, such as usage of software examples, videos and web-based exercises
Leistungsnachweis
a) Final exam 100 % or
b) Final exam 90% + 10 % for completing optional assignments during the semester
(with reservations)
SWS
2
Literatur
Bertsekas, D. P., and Tsitsiklis, J. N. (1996). Neuro-Dynamic Programming. Athena Scientific: Massachustetts.
Bertsekas, D. P. Dynamic Programming and Optimal Control. Athena Scientific: Massachustetts.
Bertsekas, D. P. Dynamic Programming and Optimal Control: Approximate Dynamic Programming. Athena Scientific: Massachustetts.
Bertsekas, D. P. Abstract Dynamic Programming. Athena Scientific: Massachustetts. Powell, W. B. Approximate Dynamic Programming. John Wiley and Sons.
Qualifikationsziele
After successful participation in the module, students will be able to:
Represent deterministic and stochastic optimization problems as dynamic programs
Solve deterministic and stochastic optimization problems exactly, incl. with the backward induction, value and policy iteration methods
Understand foundations of Markov chains and Markov decision processes and meaningfully apply them to solve stochastic optimization problems
Apply the approximate dynamic programming (reinforcement learning) algorithm, critically appreciate variations in its design
Critically evaluate the potential of AI and further the generic heuristic solution approaches in the light of the recent successes and of the no-free-lunch
Workload
Lecture 2 SWS (30 h attendance and 45h own work)
Excercise 2 SWS (30 h attendance and 45 own work)
Calculation basis: 15 weeks in a semester, including an examination week; each SWS corresponds to 60 minutes
Sonstiges
Sofern die Klausur PN 266193 "Advanced Topics in Management Science" bereits erfolgreich absolviert wurde, ist es nicht möglich, zusätzlich die Klausur PN 266194 "Approximate Dynamic Programming (Reinforcement Learning)" zu absolvieren.
ECTS-Punkte
5

Räume und Zeiten

(WIWI) HS 7
Montag: 14:00 - 16:00, wöchentlich (13x)
(ISA) SR 008
Montag: 14:00 - 16:00, wöchentlich (1x)

Studienbereiche

Die Angaben zu den Anrechenbarkeiten an der FIM sind ohne Gewähr. Bitte beachten Sie die verbindliche Liste der Anrechenbarkeiten .

Modulzuordnungen

Kommentar/Beschreibung

Dynamic programming (basic concepts, sequential decision making under uncertainty, understanding the curse of dimensionality, stochastic and deterministic shortest paths algorithms);
Markov Decision Processes, exact solution approaches to stochastic optimization problems
Approximate DP with cost-to-go function approximation (reinforcement learning);
Overview of theory and praxis of heuristic algorithms
Case studies.