Synthetic data augmentation for tabular data via deep learning methods

Paradiso, Francesca (2024) Synthetic data augmentation for tabular data via deep learning methods. [Laurea magistrale], Università di Bologna, Corso di Studio in Automation engineering / ingegneria dell’automazione [LM-DM270], Documento full-text non disponibile
Il full-text non è disponibile per scelta dell'autore. (Contatta l'autore)

Abstract

Payments and online transactions have become an integral part of our daily lives. Consequently, it is of paramount importance to defend against attacks from malicious users who engage in various types of fraudulent activities. Currently, most fraud detection approaches require a training dataset containing records of both benign and malicious usage. However, in practice, there are very few records of the latter activities, making it increasingly difficult to detect and prevent such rare frauds. This thesis focuses on the analysis of different augmentation techniques applied to three highly unbalanced datasets. The goal is to evaluate the performance of each model on tabular datasets and compare them with state-of-the-art machine learning techniques such as SMOTE, GMM, and oversampling. The evaluated models include GAN, WGAN-GP, CTGAN, TVAE and RTF. Each model's performance will be assessed based on Resemblance, Privacy, and Utility evaluations

Abstract
Tipologia del documento
Tesi di laurea (Laurea magistrale)
Autore della tesi
Paradiso, Francesca
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Synthetic Data Augmentation,GAN,CTGAN,WGAN-GP,TVAE,RTF,Tabular Data
Data di discussione della Tesi
22 Luglio 2024
URI

Altri metadati

Gestione del documento: Visualizza il documento

^