Pieri, Valentina
(2023)
Stepping into the NLG-Metricverse: Design, Development, and Deployment of a PyPI Library for Evaluating Artificial Text.
[Laurea], Università di Bologna, Corso di Studio in
Ingegneria e scienze informatiche [L-DM270] - Cesena, Documento ad accesso riservato.
Documenti full-text disponibili:
|
Documento PDF (Thesis)
Full-text accessibile solo agli utenti istituzionali dell'Ateneo
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato
Download (2MB)
| Contatta l'autore
|
Abstract
The progress in Natural Language Generation (NLG) has resulted in the widespread use of artificial text in different areas, such as chatbots and content creation. Despite this, it is still difficult to determine the quality and coherence of generated text. This thesis tackles the issue of NLG evaluation by introducing the NLG-Metricverse— an open-source Python library that enables end-to-end evaluation of NLG. This library provides a range of evaluation metrics that allow researchers and practitioners to quantitatively analyze various aspects of generated text. These metrics can be evaluated through visualizations that aid in understanding the scores. The library’s implementation of virtual environments enhances the efficiency, modularity, and reproducibility of the
NLG-Metricverse framework. It creates a controlled space to manage dependencies separately from the system-wide Python environment. The NLGMetricverse project has significantly improved its code and refined its APIs. As a result, the library has been deployed on the Python Package Index (PyPI), making it easily accessible to users in the NLG community. This achievement is expected to empower users to conduct more accurate and insightful evaluations of artificial text, thereby advancing the field of NLG. The library is accompanied by comprehensive documentation that meticulously outlines its metrics and functions, making it easy to understand how it works. Furthermore, the library structure is designed to be extensible, allowing for the incorporation of new metrics and methodologies as the field of NLG continues to evolve, hopefully following and fostering new contributions. A data app is also being developed,
uploaded on Hugging Face Space, allowing future users to try the library without installing it.
Abstract
The progress in Natural Language Generation (NLG) has resulted in the widespread use of artificial text in different areas, such as chatbots and content creation. Despite this, it is still difficult to determine the quality and coherence of generated text. This thesis tackles the issue of NLG evaluation by introducing the NLG-Metricverse— an open-source Python library that enables end-to-end evaluation of NLG. This library provides a range of evaluation metrics that allow researchers and practitioners to quantitatively analyze various aspects of generated text. These metrics can be evaluated through visualizations that aid in understanding the scores. The library’s implementation of virtual environments enhances the efficiency, modularity, and reproducibility of the
NLG-Metricverse framework. It creates a controlled space to manage dependencies separately from the system-wide Python environment. The NLGMetricverse project has significantly improved its code and refined its APIs. As a result, the library has been deployed on the Python Package Index (PyPI), making it easily accessible to users in the NLG community. This achievement is expected to empower users to conduct more accurate and insightful evaluations of artificial text, thereby advancing the field of NLG. The library is accompanied by comprehensive documentation that meticulously outlines its metrics and functions, making it easy to understand how it works. Furthermore, the library structure is designed to be extensible, allowing for the incorporation of new metrics and methodologies as the field of NLG continues to evolve, hopefully following and fostering new contributions. A data app is also being developed,
uploaded on Hugging Face Space, allowing future users to try the library without installing it.
Tipologia del documento
Tesi di laurea
(Laurea)
Autore della tesi
Pieri, Valentina
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Natural Language Processing,Natural Language Generation,PyPI,Text Evaluation Metrics,Data App
Data di discussione della Tesi
5 Ottobre 2023
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Pieri, Valentina
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Natural Language Processing,Natural Language Generation,PyPI,Text Evaluation Metrics,Data App
Data di discussione della Tesi
5 Ottobre 2023
URI
Statistica sui download
Gestione del documento: