Raimondi, Bianca
(2023)
Large Language Models for Education.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Informatica [LM-DM270]
Documenti full-text disponibili:
Abstract
This thesis explores the use of large language models (LLMs) in specialized domains, with a particular focus on educational content found in LaTeX- formatted textbooks. The introduction highlights the importance of fine- tuning LLMs for domain-specific tasks and emphasizes the significance of preprocessing data from specialized sources like textbooks.
One of the key objectives is to demonstrate how fine-tuned LLMs, tailored to educational content, can effectively address the limitations of LLMs pre- trained on general text, especially in responding to single-choice questions. Additionally, the thesis examines the practical aspects of deploying LLMs, comparing the resource consumption of large pretrained models to smaller fine-tuned ones, offering insights into performance-efficiency trade-offs.
In summary, this thesis aims to contribute to the field of natural language processing by exploring the adaptation of LLMs to educational content and addressing their limitations. The research is structured into three chapters, each focusing on distinct aspects, and concludes with reflections on future directions in this evolving field.
Abstract
This thesis explores the use of large language models (LLMs) in specialized domains, with a particular focus on educational content found in LaTeX- formatted textbooks. The introduction highlights the importance of fine- tuning LLMs for domain-specific tasks and emphasizes the significance of preprocessing data from specialized sources like textbooks.
One of the key objectives is to demonstrate how fine-tuned LLMs, tailored to educational content, can effectively address the limitations of LLMs pre- trained on general text, especially in responding to single-choice questions. Additionally, the thesis examines the practical aspects of deploying LLMs, comparing the resource consumption of large pretrained models to smaller fine-tuned ones, offering insights into performance-efficiency trade-offs.
In summary, this thesis aims to contribute to the field of natural language processing by exploring the adaptation of LLMs to educational content and addressing their limitations. The research is structured into three chapters, each focusing on distinct aspects, and concludes with reflections on future directions in this evolving field.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Raimondi, Bianca
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Indirizzo
CURRICULUM A: TECNICHE DEL SOFTWARE
Ordinamento Cds
DM270
Parole chiave
Large Language Models,Machine Learning,Generative AI,Education
Data di discussione della Tesi
12 Ottobre 2023
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Raimondi, Bianca
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Indirizzo
CURRICULUM A: TECNICHE DEL SOFTWARE
Ordinamento Cds
DM270
Parole chiave
Large Language Models,Machine Learning,Generative AI,Education
Data di discussione della Tesi
12 Ottobre 2023
URI
Statistica sui download
Gestione del documento: