Hartsuiker, Jens Matthias
(2023)
Finetuning commercial Large Language Models with LoRA for enhanced Italian language understanding.
[Master's thesis], Università di Bologna, Degree Programme in Artificial Intelligence [LM-DM270]
Abstract
In this thesis we take the first steps toward building a well-functioning LLM for the Italian language. We fine-tune two open-source, commercially licensed LLMs, MPT and LLaMA 2, on an Italian instruction dataset, Stambecco. Although the models do not perform as well as initially hoped, several of our findings are broadly applicable, and we believe this work justifies pretraining an LLM on a majority-Italian corpus.
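The abstract describes LoRA fine-tuning of LLaMA 2 and MPT on the Stambecco instruction dataset. As a rough illustration of what such a setup looks like, here is a minimal sketch using the Hugging Face transformers and peft libraries; the model id, dataset path, field names, and hyperparameters are assumptions chosen for illustration, not details taken from the thesis.

# Minimal LoRA fine-tuning sketch (assumed transformers + peft stack;
# identifiers and hyperparameters below are illustrative, not from the thesis).
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments, DataCollatorForLanguageModeling)

base = "meta-llama/Llama-2-7b-hf"  # hypothetical choice; the thesis also uses MPT
tokenizer = AutoTokenizer.from_pretrained(base)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(base)

# Low-Rank Adaptation: freeze the base weights and train small rank-r update
# matrices injected into the attention projections.
lora = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                  target_modules=["q_proj", "v_proj"],
                  task_type="CAUSAL_LM")
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # typically well under 1% of all parameters

# Italian instruction data; file path and field names are assumed,
# adapt them to the actual Stambecco schema.
data = load_dataset("json", data_files="stambecco.json", split="train")

def to_text(ex):
    # Flatten an instruction/response pair into a single training string.
    return {"text": f"### Istruzione:\n{ex['instruction']}\n\n### Risposta:\n{ex['output']}"}

data = data.map(to_text)
data = data.map(lambda ex: tokenizer(ex["text"], truncation=True, max_length=512),
                remove_columns=data.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="llama2-lora-it",
                           per_device_train_batch_size=4,
                           num_train_epochs=3, learning_rate=2e-4, fp16=True),
    train_dataset=data,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("llama2-lora-it")  # saves only the small LoRA adapter weights

Because only the low-rank adapter matrices are trained, the saved artifact is a few megabytes rather than a full model checkpoint, which is what makes this approach practical on commodity GPUs.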
Document type
Degree thesis
(Master's degree)
Thesis author
Hartsuiker, Jens Matthias
Thesis supervisor
Thesis co-supervisor
School
Degree programme
Artificial Intelligence [LM-DM270]
Degree programme regulations
DM270
Keywords
Natural Language Processing, Large Language Models, Transformers, Low Rank Adaptation, PwC, Finetuning, Deep Learning
Thesis defence date
16 December 2023
URI