Online robot adaptation by actor-critic reinforcement learning

Vargas Grateron, Pablo Sebastian (2026) Online robot adaptation by actor-critic reinforcement learning. [Laurea magistrale], Università di Bologna, Corso di Studio in Ingegneria e scienze informatiche [LM-DM270] - Cesena
Documenti full-text disponibili:
[thumbnail of Thesis] Documento PDF (Thesis)
Disponibile con Licenza: Creative Commons: Attribuzione - Condividi allo stesso modo 4.0 (CC BY-SA 4.0)

Download (1MB)

Abstract

Robots in real-world environments must adapt their strategies as conditions change. This thesis investigates online strategy adaptation using an Actor-Critic Reinforcement Learning framework optimized by a Genetic Algorithm (GA). By evolving meta parameters and initial network weights over 50 generations, the agent achieves an optimal balance between exploration and exploitation. Using the ARGoS3 simulator, the research evaluates two scenarios: an energy-driven survival task and a multi-headed architecture for contextual rewards based on visual stimuli. Results show that combining learning and evolution enables robots to autonomously adapt to non-stationary environments.

Abstract
Tipologia del documento
Tesi di laurea (Laurea magistrale)
Autore della tesi
Vargas Grateron, Pablo Sebastian
Relatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Evolutionary Reinforcement Learning,Actor-Critic Architecture,Online Adaptation,Non-Stationary Environments,Catastrophic Forgetting,Genetic Algorithm,Multi-headed Neural Networks,Contextual Reward Problems
Data di discussione della Tesi
13 Marzo 2026
URI

Altri metadati

Statistica sui download

Gestione del documento: Visualizza il documento

^