Huang, Xuanqiang (2024). Theory of Mind in Large Language Models. [Laurea], Università di Bologna, Corso di Studio in Informatica [L-DM270].
Abstract
Theory of Mind (ToM) is essential to human interaction and has significant implications across many fields. This thesis investigates the level of ToM in five modern Large Language Models (LLMs) and proposes a framework for measuring the complexity of ToM tasks. We begin with a brief historical overview, tracing ToM's evolution from philosophy and psychology to its influence on computer science. We then introduce a complexity measure that distinguishes between necessary and spurious states, which affect the difficulty of specific ToM problems. Building on this framework, we draw on ideas from World Models to develop Discrete World Models (DWMs): descriptions of states generated by the models themselves. We conclude with an analysis of the effectiveness of the proposed techniques, which improve accuracy by up to 9.99% in the best case, and an investigation of the consistency of the proposed theoretical measure against public datasets.
Finally, we discuss future directions for benchmarking ToM abilities in LLMs and provide a memorization analysis of the datasets used.
Document type: Tesi di laurea (Laurea)
Thesis author: Huang, Xuanqiang
Thesis supervisor:
Thesis co-supervisor:
School:
Degree programme:
Degree programme regulations (Ordinamento CdS): DM270
Keywords: Large Language Models, Theory of Mind, Artificial Intelligence, Prompt Engineering, World Models, Commonsense Reasoning
Thesis defence date: 10 July 2024
URI: