A Comparative Analysis of Reinforcement Learning Algorithms in a Hybrid Learning and Optimization Framework

Campardo, Giorgia (2024) A Comparative Analysis of Reinforcement Learning Algorithms in a Hybrid Learning and Optimization Framework. [Laurea magistrale], Università di Bologna, Corso di Studio in Artificial intelligence [LM-DM270]

Salva citazione

Documenti full-text disponibili:

Documento PDF (Thesis)
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato
Download (3MB)

Abstract

The integration of Machine Learning (ML) and Constrained Optimization (CO) represents a promising avenue for enhancing decision-making capabilities in complex, uncertain environments. This thesis empirically evaluates the UNIFY framework, a novel approach that synergistically combines the predictive power of ML with the strategic prowess of CO. Focusing on the Energy Management System and Set Multi-cover with stochastic coverages problems, in this work we carry out a comparative analysis on the performance, efficiency, and scalability of four RL algorithms - A2C, PPO, TD3, and SAC - within the UNIFY framework. Empirical results reveal the strengths and limitations of these algorithms, highlighting SAC's superior sample efficiency and the benefits of training on multiple instances for improved model generalization. These findings underscore the potential of integrating ML and CO through RL, offering valuable insights for the development of advanced decision-making systems in various real-world applications.

Abstract

Tipologia del documento

Tesi di laurea (Laurea magistrale)

Autore della tesi

Campardo, Giorgia

Relatore della tesi

Lombardi, Michele

Scuola

Ingegneria e Architettura

Corso di studio

Artificial intelligence [LM-DM270]

Ordinamento Cds