Documenti full-text disponibili:
![[thumbnail of Thesis]](https://amslaurea.unibo.it/style/images/fileicons/application_pdf.png) |
Documento PDF (Thesis)
Full-text accessibile solo agli utenti istituzionali dell'Ateneo
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato
Download (27MB)
| Contatta l'autore
|
Abstract
This thesis addresses the industrial 3D Bin Packing Problem (3D-BPP) by investigating learning-based approaches for smart palletizing under realistic operational constraints.
Unlike classical formulations that focus mainly on geometric packing efficiency, industrial palletizing must also consider constraints related to mechanical stability, load distribution, stackability, and robotic execution feasibility. To capture this complexity, the palletizing process is modeled as a sequential decision-making problem within a learning framework.
The proposed methodology follows a progressive pipeline from imitation learning to reinforcement learning. First, a policy is trained through behavioral cloning using expert pallet layouts generated by an industrial optimization algorithm, allowing the model to learn structured packing strategies from validated solutions. Building on this baseline, offline reinforcement learning is explored to improve policy performance using fixed datasets. Due to the limitations observed in this setting, the approach ultimately evolves toward an online reinforcement learning framework based on Proximal Policy Optimization (PPO), where the agent learns directly through interaction with a simulated environment.
To support efficient learning while preserving physical realism, the palletizing environment is first represented using a voxel-based discretization and later extended to a continuous formulation enabling more precise placements. The framework also includes mechanisms for multi-pallet scenarios and industrial feasibility constraints.
The proposed approach is evaluated across multiple scenarios and validated through physics-based simulation in NVIDIA Isaac Sim, showing that the learned policy can generate stable and efficient palletizing layouts compatible with realistic industrial environments.
Abstract
This thesis addresses the industrial 3D Bin Packing Problem (3D-BPP) by investigating learning-based approaches for smart palletizing under realistic operational constraints.
Unlike classical formulations that focus mainly on geometric packing efficiency, industrial palletizing must also consider constraints related to mechanical stability, load distribution, stackability, and robotic execution feasibility. To capture this complexity, the palletizing process is modeled as a sequential decision-making problem within a learning framework.
The proposed methodology follows a progressive pipeline from imitation learning to reinforcement learning. First, a policy is trained through behavioral cloning using expert pallet layouts generated by an industrial optimization algorithm, allowing the model to learn structured packing strategies from validated solutions. Building on this baseline, offline reinforcement learning is explored to improve policy performance using fixed datasets. Due to the limitations observed in this setting, the approach ultimately evolves toward an online reinforcement learning framework based on Proximal Policy Optimization (PPO), where the agent learns directly through interaction with a simulated environment.
To support efficient learning while preserving physical realism, the palletizing environment is first represented using a voxel-based discretization and later extended to a continuous formulation enabling more precise placements. The framework also includes mechanisms for multi-pallet scenarios and industrial feasibility constraints.
The proposed approach is evaluated across multiple scenarios and validated through physics-based simulation in NVIDIA Isaac Sim, showing that the learned policy can generate stable and efficient palletizing layouts compatible with realistic industrial environments.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Palladino, Christian
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Indirizzo
AUTOMATION ENGINEERING
Ordinamento Cds
DM270
Parole chiave
imitation learning, reinforcement learning, learning-based, 3DBinPacking, IsaacSIm, robotics, palletizing, ceramics, maskablePPO, OfflineLearning, OnlineLearning, behavioral cloning, TD3 BC
Data di discussione della Tesi
25 Marzo 2026
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Palladino, Christian
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Indirizzo
AUTOMATION ENGINEERING
Ordinamento Cds
DM270
Parole chiave
imitation learning, reinforcement learning, learning-based, 3DBinPacking, IsaacSIm, robotics, palletizing, ceramics, maskablePPO, OfflineLearning, OnlineLearning, behavioral cloning, TD3 BC
Data di discussione della Tesi
25 Marzo 2026
URI
Statistica sui download
Gestione del documento: