Documenti full-text disponibili:
Abstract
This work aims to develop classification models able to automatically perform the task of emotion identification on Italian arias. These models enable the musicologists and the public interested in opera to investigate the emotion of Italian aria in a systematical way.
An aria can be seen as an independent unit of opera that is sung by one character. Each aria contains 1 to 8 verses. Considering an aria may transmit more than one emotion, a lower level granularity is adopted: the identification of the emotion transmitted at the verse level. On the basis of a manually labelled corpus comprised of 2,500 aria verses with their corresponding emotion,
the �first part of this work investigates different text representations and classification approaches.
Building on the results of the exploration in the �first part, the second part investigates emotion identification at the aria level. The size of supervised data is expanded by means of self-learning. The verse-level annotation is converted into aria-level annotation and each aria is assigned up to two emotion
labels. I experimented with pre-trained character trigram embeddings and convolutional neural network.
For the emotion identification at the verse level, the combination of character trigram based TF-IDF and neural network with 2 hidden layers outperformed other combinations, achieving an accuracy of 0.47 on the test set. As for the emotion identification at the aria level, a convolutional neural network combined with character trigram based embeddings developed based on a corpus of Italian arias achieved an accuracy of 0:68.
Abstract
This work aims to develop classification models able to automatically perform the task of emotion identification on Italian arias. These models enable the musicologists and the public interested in opera to investigate the emotion of Italian aria in a systematical way.
An aria can be seen as an independent unit of opera that is sung by one character. Each aria contains 1 to 8 verses. Considering an aria may transmit more than one emotion, a lower level granularity is adopted: the identification of the emotion transmitted at the verse level. On the basis of a manually labelled corpus comprised of 2,500 aria verses with their corresponding emotion,
the �first part of this work investigates different text representations and classification approaches.
Building on the results of the exploration in the �first part, the second part investigates emotion identification at the aria level. The size of supervised data is expanded by means of self-learning. The verse-level annotation is converted into aria-level annotation and each aria is assigned up to two emotion
labels. I experimented with pre-trained character trigram embeddings and convolutional neural network.
For the emotion identification at the verse level, the combination of character trigram based TF-IDF and neural network with 2 hidden layers outperformed other combinations, achieving an accuracy of 0.47 on the test set. As for the emotion identification at the aria level, a convolutional neural network combined with character trigram based embeddings developed based on a corpus of Italian arias achieved an accuracy of 0:68.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Zhang, Shibingfeng
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
emotion identification,computational linguistics,text classification,italian opera,active learning
Data di discussione della Tesi
14 Luglio 2021
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Zhang, Shibingfeng
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
emotion identification,computational linguistics,text classification,italian opera,active learning
Data di discussione della Tesi
14 Luglio 2021
URI
Statistica sui download
Gestione del documento: