A deep learning model via thermodynamics and condensed matter physics

Lapenna, Michela (2022) A deep learning model via thermodynamics and condensed matter physics. [Laurea magistrale], Università di Bologna, Corso di Studio in Physics [LM-DM270], Documento ad accesso riservato.
Documenti full-text disponibili:
[thumbnail of Thesis] Documento PDF (Thesis)
Full-text accessibile solo agli utenti istituzionali dell'Ateneo
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato

Download (7MB) | Contatta l'autore

Abstract

Deep Learning architectures give brilliant results in a large variety of fields, but a comprehensive theoretical description of their inner functioning is still lacking. In this work, we try to understand the behavior of neural networks by modelling in the frameworks of Thermodynamics and Condensed Matter Physics. We approach neural networks as in a real laboratory and we measure the frequency spectrum and the entropy of the weights of the trained model. The stochasticity of the training occupies a central role in the dynamics of the weights and makes it difficult to assimilate neural networks to simple physical systems. However, the analogy with Thermodynamics and the introduction of a well defined temperature leads us to an interesting result: if we eliminate from a CNN the "hottest" filters, the performance of the model remains the same, whereas, if we eliminate the "coldest" ones, the performance gets drastically worst. This result could be exploited in the realization of a training loop which eliminates the filters that do not contribute to loss reduction. In this way, the computational cost of the training will be lightened and more importantly this would be done by following a physical model. In any case, beside important practical applications, our analysis proves that a new and improved modeling of Deep Learning systems can pave the way to new and more efficient algorithms.

Abstract
Tipologia del documento
Tesi di laurea (Laurea magistrale)
Autore della tesi
Lapenna, Michela
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Indirizzo
THEORETICAL PHYSICS
Ordinamento Cds
DM270
Parole chiave
Deep Learning,Thermodynamics,Condensed Matter Physics,Stochastic Gradient Descent,Temperature,Frequency Spectrum,Entropy
Data di discussione della Tesi
15 Luglio 2022
URI

Altri metadati

Statistica sui download

Gestione del documento: Visualizza il documento

^