Intermediate linguistic task fine-tuning on Multi-lingual models

Rispoli, Luca (2022) Intermediate linguistic task fine-tuning on Multi-lingual models. [Master's degree thesis (Laurea magistrale)], Università di Bologna, Degree programme in Artificial intelligence [LM-DM270], full-text document not available
The full text is not available by the author's choice. (Contact the author)

Abstract

State-of-the-art NLP systems generally assume that the underlying models can be trained on vast datasets. However, especially in multi-lingual contexts, data is often scarce, and more research is needed on training under such constraints. This thesis investigates the benefits of introducing an additional training step when fine-tuning NLP models, called Intermediate Training, which can be exploited to augment the data available for the training phase. In the Intermediate Training step, models are first trained on NLP tasks that are not strictly related to the target task, in order to verify whether they can leverage the knowledge learned from those tasks. Furthermore, to better analyze the synergies between different categories of NLP tasks, the experiments were also extended to Multi-Task Training, in which the model is trained on multiple tasks at the same time.
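As an illustration of the Multi-Task Training setup described above (the thesis does not publish its training code, and the task names and batch size here are hypothetical placeholders), one common way to interleave several tasks in a single training run is to sample each training batch from a task chosen with probability proportional to that task's dataset size:

```python
import random

def multi_task_batches(task_datasets, steps, batch_size=4, seed=0):
    """Yield (task_name, batch) pairs for a multi-task training loop.

    Each step, a task is drawn with probability proportional to its
    dataset size (size-proportional sampling), then a batch is sampled
    from that task's data. This is one common multi-task schedule, not
    necessarily the one used in the thesis.
    """
    rng = random.Random(seed)
    names = list(task_datasets)
    sizes = [len(task_datasets[n]) for n in names]
    for _ in range(steps):
        task = rng.choices(names, weights=sizes, k=1)[0]
        data = task_datasets[task]
        batch = rng.sample(data, k=min(batch_size, len(data)))
        yield task, batch

# Hypothetical task data: integers stand in for tokenized examples.
tasks = {"nli": list(range(100)), "pos_tagging": list(range(25))}
schedule = [name for name, _ in multi_task_batches(tasks, steps=10)]
```

With size-proportional sampling, larger tasks dominate the schedule; alternatives such as uniform or temperature-scaled sampling rebalance toward low-resource tasks, which is particularly relevant in the multi-lingual, data-scarce setting the thesis targets.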

Document type
Thesis (Master's degree / Laurea magistrale)
Thesis author
Rispoli, Luca
Thesis supervisor
Thesis co-supervisor
School
Degree programme
Degree programme regulations
DM270
Keywords
Natural Language Processing, transformers, transfer learning, deep learning, multi-lingual, huggingface
Thesis defence date
20 July 2022
URI

Other metadata
