Dealing with Long-Term Constraints in a Hybrid Learning and Optimization Method

Chinellato, Diego (2024) Dealing with Long-Term Constraints in a Hybrid Learning and Optimization Method. [Laurea magistrale], Università di Bologna, Corso di Studio in Artificial intelligence [LM-DM270]
Documenti full-text disponibili:
[img] Documento PDF (Thesis)
Disponibile con Licenza: Creative Commons: Attribuzione - Condividi allo stesso modo 4.0 (CC BY-SA 4.0)

Download (3MB)

Abstract

The increasing complexity of real-world decision-making problems, where long-term reasoning capabilities are required, is driving novel research on the integration of existing approaches. With this purpose, this thesis explores the synergies between Mathematical Optimization (MO) and Machine Learning (ML), focusing on integrating Reinforcement Learning (RL) with Constrained Optimization (CO) for complex decision-making problems. Our starting point is the UNIFY framework, an hybrid method comprising an offline ML-based phase as well as an online CO-based phase. Our core contribution is the extension of this framework to handle cumulative (long-term) constraints by recasting the problem as a Constrained Markov Decision Process (CMDP) and employing Lagrangian relaxation methods for its solution. We conduct empirical evaluations on an Energy Management System (EMS) tasked with optimal power flow allocation under long-term sustainability constraints. The results reveal: (1) the critical role of virtual parameters modeling in the learning process; (2) the challenges associated with the choice of the online optimization problem; and (3) the importance of hyperparameter tuning in optimizing the framework's performance. Overall, the proposed approach can offer strategic foresight allowing for proactive preparation for future requirements, while also retaining good performances even when dealing with strict constraints.

Abstract
Tipologia del documento
Tesi di laurea (Laurea magistrale)
Autore della tesi
Chinellato, Diego
Relatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Reinforcement Learning,Constrained Optimization,Artificial Intelligence,Hybrid offline-online Optimization
Data di discussione della Tesi
19 Marzo 2024
URI

Altri metadati

Statistica sui download

Gestione del documento: Visualizza il documento

^