David, Alessandro
(2024)
Patchcore Transformed: A Vision Transformer Approach To Anomaly Detection.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Artificial intelligence [LM-DM270], Documento full-text non disponibile
Il full-text non è disponibile per scelta dell'autore.
(
Contatta l'autore)
Abstract
Recent advancements in natural language processing (NLP) have spurred innovation in computer vision, leading to the introduction of Vision Transformers (ViT). These transformer-based models have demonstrated remarkable performance, surpassing traditional Convolutional Neural Networks (CNN) in various computer vision tasks. This thesis explores the integration of ViT into PatchCore, an anomaly detection model, to leverage the capabilities of transformer architectures for anomaly detection tasks. Experiments are conducted on two datasets: MVTec AD, a widely used benchmark dataset, and a custom dataset tailored to specific application domains. The results demonstrate the effectiveness of the ViT-PatchCore model in detecting anomalies and outperforming existing methods, including traditional CNN-based approaches. Furthermore, this thesis proposes a novel solution that addresses some challenges in integrating ViT into PatchCore and suggests improvements to its algorithm to enhance its performance and efficiency. This research contributes to the advancement of anomaly detection techniques by harnessing the power of transformer based models in computer vision applications and suggests potential directions for future research in this field.
Abstract
Recent advancements in natural language processing (NLP) have spurred innovation in computer vision, leading to the introduction of Vision Transformers (ViT). These transformer-based models have demonstrated remarkable performance, surpassing traditional Convolutional Neural Networks (CNN) in various computer vision tasks. This thesis explores the integration of ViT into PatchCore, an anomaly detection model, to leverage the capabilities of transformer architectures for anomaly detection tasks. Experiments are conducted on two datasets: MVTec AD, a widely used benchmark dataset, and a custom dataset tailored to specific application domains. The results demonstrate the effectiveness of the ViT-PatchCore model in detecting anomalies and outperforming existing methods, including traditional CNN-based approaches. Furthermore, this thesis proposes a novel solution that addresses some challenges in integrating ViT into PatchCore and suggests improvements to its algorithm to enhance its performance and efficiency. This research contributes to the advancement of anomaly detection techniques by harnessing the power of transformer based models in computer vision applications and suggests potential directions for future research in this field.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
David, Alessandro
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
anomaly detection,computer vision,patchcore,vision transformers,swin,vit,swin transformer,positional distance variance,mvtec,mvtec ad,transformers
Data di discussione della Tesi
19 Marzo 2024
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
David, Alessandro
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
anomaly detection,computer vision,patchcore,vision transformers,swin,vit,swin transformer,positional distance variance,mvtec,mvtec ad,transformers
Data di discussione della Tesi
19 Marzo 2024
URI
Gestione del documento: