Colasuonno, Gabriele
(2023)
Investigating the Impact of Image Embedding on Generative Diffusion Model: Portrait Reification.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Artificial intelligence [LM-DM270], Documento full-text non disponibile
Il full-text non è disponibile per scelta dell'autore.
(
Contatta l'autore)
Abstract
An application of Generative Diffusion Techniques for the reification of human portraits in artistic paintings is presented. By reification, we intend the transformation of the painter's figurative abstraction into a real human face. The application exploits a recent embedding technique for Denoising Diffusion Implicit Models (DDIM) inverting the generative process and mapping the visible image into its latent representation. In this way, we can first embed the portrait into the latent space, and then use the reverse diffusion model, trained to generate real human faces, to produce the most likelihood real approximation of the portrait. The actual deployment of the application involves several additional techniques, mostly aimed to automatically identify, align and crop the relevant portion of the face, and to post-process the generated reification in order to enhance its quality and to allow a smooth reinsertion in the original painting.
Abstract
An application of Generative Diffusion Techniques for the reification of human portraits in artistic paintings is presented. By reification, we intend the transformation of the painter's figurative abstraction into a real human face. The application exploits a recent embedding technique for Denoising Diffusion Implicit Models (DDIM) inverting the generative process and mapping the visible image into its latent representation. In this way, we can first embed the portrait into the latent space, and then use the reverse diffusion model, trained to generate real human faces, to produce the most likelihood real approximation of the portrait. The actual deployment of the application involves several additional techniques, mostly aimed to automatically identify, align and crop the relevant portion of the face, and to post-process the generated reification in order to enhance its quality and to allow a smooth reinsertion in the original painting.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Colasuonno, Gabriele
Relatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Diffusion Model,Conditional Generation,Computer Vision,Face Generation,Deep Learning,Portrait Reification
Data di discussione della Tesi
20 Luglio 2023
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Colasuonno, Gabriele
Relatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Diffusion Model,Conditional Generation,Computer Vision,Face Generation,Deep Learning,Portrait Reification
Data di discussione della Tesi
20 Luglio 2023
URI
Gestione del documento: