Meglioraldi, Jacopo
(2024)
3D Gaussian Splatting reconstruction with depth enhanced initialization.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Artificial intelligence [LM-DM270]
Documenti full-text disponibili:
Abstract
This thesis proposes a novel pipeline incorporating segmentation masking and depth val-
ues to enhance the performance of Gaussian Splatting techniques. By leveraging 3D
Gaussian Splatting’s ability to achieve high-accuracy photorealistic reconstructions, the
pipeline focuses on singular object reconstruction through segmentation masking, which
removes unwanted backgrounds and accelerates the optimization process. Integrating re-
cent advances in fast semantic segmentation using neural networks, the pipeline produces
nearly ready-to-use models. Additionally, using a depth-sensing camera during acquisi-
tion allows for more accurate point cloud initialization with minimal overhead, leading
to a significant boost in visual accuracy during the early optimization steps. The pipeline
achieves a significant speedup, making it particularly advantageous for devices with lim-
ited computational capacity, though with a slight trade-off in the accuracy of the final
results. The depth-enhanced initialization is carried out by sampling and projecting mean-
ingful point information into the reconstructed space, offering a better approximation of
under-sampled regions. This step also ensures the reconstruction is to scale, enabling
precise measurements and further analysis.
Abstract
This thesis proposes a novel pipeline incorporating segmentation masking and depth val-
ues to enhance the performance of Gaussian Splatting techniques. By leveraging 3D
Gaussian Splatting’s ability to achieve high-accuracy photorealistic reconstructions, the
pipeline focuses on singular object reconstruction through segmentation masking, which
removes unwanted backgrounds and accelerates the optimization process. Integrating re-
cent advances in fast semantic segmentation using neural networks, the pipeline produces
nearly ready-to-use models. Additionally, using a depth-sensing camera during acquisi-
tion allows for more accurate point cloud initialization with minimal overhead, leading
to a significant boost in visual accuracy during the early optimization steps. The pipeline
achieves a significant speedup, making it particularly advantageous for devices with lim-
ited computational capacity, though with a slight trade-off in the accuracy of the final
results. The depth-enhanced initialization is carried out by sampling and projecting mean-
ingful point information into the reconstructed space, offering a better approximation of
under-sampled regions. This step also ensures the reconstruction is to scale, enabling
precise measurements and further analysis.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Meglioraldi, Jacopo
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Computer Vision,Gaussian Splatting,3D reconstruction,Structure from Motion,stereo camera,Luciano Pavarotti
Data di discussione della Tesi
5 Dicembre 2024
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Meglioraldi, Jacopo
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Computer Vision,Gaussian Splatting,3D reconstruction,Structure from Motion,stereo camera,Luciano Pavarotti
Data di discussione della Tesi
5 Dicembre 2024
URI
Statistica sui download
Gestione del documento: