Improving Deep Question Answering: The ALBERT Model

Del Vecchio, Matteo (2020) Improving Deep Question Answering: The ALBERT Model. [Laurea magistrale], Università di Bologna, Corso di Studio in Informatica [LM-DM270]

Salva citazione

Documenti full-text disponibili:

Documento PDF (Thesis)
Disponibile con Licenza: Creative Commons: Attribuzione - Non commerciale - Condividi allo stesso modo 3.0 (CC BY-NC-SA 3.0)
Download (3MB)

Abstract

Natural Language Processing is a field of Artificial Intelligence referring to the ability of computers to understand human speech and language, often in a written form, mainly by using Machine Learning and Deep Learning methods to extract patterns. Languages are challenging by definition, because of their differences, their abstractions and their ambiguities; consequently, their processing is often very demanding, in terms of modelling the problem and resources. Retrieving all sentences in a given text is something that can be easily accomplished with just few lines of code, but what about checking whether a given sentence conveys a message with sarcasm or not? This is something difficult for humans too and therefore, it requires complex modelling mechanisms to be addressed. This kind of information, in fact, poses the problem of its encoding and representation in a meaningful way. The majority of research involves finding and understanding all characteristics of text, in order to develop sophisticated models to address tasks such as Machine Translation, Text Summarization and Question Answering. This work will focus on ALBERT, from Google Research, which is one of the recently released state-of-the-art models and investigate its performance on the Question Answering task. In addition, some ideas will be developed and experimented in order to improve model's performance on the Stanford Question Answering Dataset (SQuAD), after exploring breakthrough changes that made training and fine-tuning of huge language models possible.

Abstract

Tipologia del documento

Tesi di laurea (Laurea magistrale)

Autore della tesi

Del Vecchio, Matteo

Relatore della tesi

Tamburini, Fabio

Correlatore della tesi

Cimiano, Philipp

Scuola

Scienze

Corso di studio

Informatica [LM-DM270]

Indirizzo

Curriculum C: Sistemi e reti

Ordinamento Cds

DM270

Parole chiave

deep learning,machine learning,question answering,natural language processing,questions,answers,dataset,fine tuning,natural language understanding,language,artificial intelligence

Data di discussione della Tesi

19 Marzo 2020

URI

https://amslaurea.unibo.it/id/eprint/20414

Altri metadati

Statistica sui download

Vedi altre statistiche

Gestione del documento:

Strumenti di navigazione

Collezioni AlmaDL

Improving Deep Question Answering: The ALBERT Model

Abstract

Altri metadati

Statistica sui download