Bolognini, Damiano
(2022)
Improving the convergence speed of NeRFs with depth supervision and weight initialization.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Ingegneria informatica [LM-DM270]
Documenti full-text disponibili:
Abstract
Neural rendering is a new and developing field where computer graphics and deep learning techniques are combined to generate photo-realistic images using deep neural networks.
In particular, Neural Radiance Fields (NeRF) is able to synthesise novel views of a scene with unprecedented quality by fitting a Multi-Layer Perceptron (MLP) to RGB images.
However, training this network requires plenty of time and computation even on modern GPUs, making this new technology hardly employable on practical specialized applications.
In this project, we show that employing the known depth of the scene as an additional supervision during the training, and starting from pre-trained weights of other scene with similar setups, instead of from scratch, leads to a convergence speed 3 to 5 time faster.
Abstract
Neural rendering is a new and developing field where computer graphics and deep learning techniques are combined to generate photo-realistic images using deep neural networks.
In particular, Neural Radiance Fields (NeRF) is able to synthesise novel views of a scene with unprecedented quality by fitting a Multi-Layer Perceptron (MLP) to RGB images.
However, training this network requires plenty of time and computation even on modern GPUs, making this new technology hardly employable on practical specialized applications.
In this project, we show that employing the known depth of the scene as an additional supervision during the training, and starting from pre-trained weights of other scene with similar setups, instead of from scratch, leads to a convergence speed 3 to 5 time faster.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Bolognini, Damiano
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
computer vision,deep learning,NeRF,neural network,3D reconstruction,computer graphics,MLP,weight initialization,depth supervision,Python,PyTorch,Blender,Ray Casting,meta learning,scene representation
Data di discussione della Tesi
22 Marzo 2022
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Bolognini, Damiano
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
computer vision,deep learning,NeRF,neural network,3D reconstruction,computer graphics,MLP,weight initialization,depth supervision,Python,PyTorch,Blender,Ray Casting,meta learning,scene representation
Data di discussione della Tesi
22 Marzo 2022
URI
Statistica sui download
Gestione del documento: