Stochastic Processes and Optimization in Machine Learning – 28/04/2026

Model-Free Methods in Reinforcement Learning:
i) Direct Approximations, Random Trajectories & State-Action Counting
ii) Temporal-Difference & Stochastic Q-Learning, 2. Distributed Multi-Agent Reinforcement Learning, 3. Bellman-Ford Algorithm, BGP Routing in the Internet

Loading the player...

Βασίλης Μάγκλαρης - maglaris@netmode.ntua.gr

Σχολή Ηλεκτρολόγων Μηχ. & Μηχ. Υπολογιστών Ε.Μ.Π.

Slides