Generative Information Retrieval of Chest X-Rays

Mazzi, Riccardo (2024) Generative Information Retrieval of Chest X-Rays. [Bachelor's thesis (Laurea)], Università di Bologna, Degree programme in Ingegneria e scienze informatiche [L-DM270] - Cesena, restricted-access document.

Available full-text documents:

PDF document (Thesis)
Full text not accessible until 30 December 2027.
Available under license: Creative Commons Attribution - NonCommercial - NoDerivatives 4.0 (CC BY-NC-ND 4.0)

Abstract

Traditional information retrieval methods rely on similarity matching, where representations generated by encoder-only models are used to rank and retrieve relevant documents. Recent advances in pre-trained language models have enabled a new paradigm: generative information retrieval. This approach leverages the generative capabilities of large language models (LLMs) to produce relevant document identifiers directly. However, generative techniques remain unexplored in medicine. Within this domain, physicians often need to quickly find relevant radiographic images and reports of previous cases to assist in diagnosis and treatment. For these reasons, we present GENerative Information Retrieval of Chest X-Rays (GenIrCxr). We assign a numerical identifier to each report by applying hierarchical k-means on top of semantics-aware representations from PubMedBERT. Then, we train a decoder-only LLM from scratch to generate report identifiers in response to queries from a medical expert. Significant effort was invested in developing optimization techniques to enhance model performance, including custom output vocabularies and constrained beam search at inference time. We train GenIrCxr on a custom dataset built on top of MIMIC-CXR-JPG v2.0.0, where the input query describes medical concepts of interest and the output is the semantic identifier of the target report, computed offline. Retrieval performance is measured automatically using Recall@K and Mean Reciprocal Rank. Given the complexity of this task, GenIrCxr demonstrates strong performance, which we validate through comparative experiments against several encoders. GenIrCxr not only surpasses these baselines but also outperforms a seq-to-seq model specifically designed for generative information retrieval, accepted at NeurIPS 2022.
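
As a hedged illustration of the two automatic metrics named in the abstract, the following Python sketch computes Recall@K and Mean Reciprocal Rank from ranked lists of generated identifiers. It assumes exactly one target report per query, in line with the abstract's description of the dataset. The function names, the hyphenated identifier format, and the toy data are hypothetical and are not taken from the thesis.

```python
from typing import Sequence


def recall_at_k(ranked: Sequence[Sequence[str]], targets: Sequence[str], k: int) -> float:
    """Fraction of queries whose target identifier appears among the top-k candidates."""
    hits = sum(1 for candidates, target in zip(ranked, targets) if target in candidates[:k])
    return hits / len(targets)


def mean_reciprocal_rank(ranked: Sequence[Sequence[str]], targets: Sequence[str]) -> float:
    """Average of 1/rank of the target identifier; a query whose target is missing contributes 0."""
    total = 0.0
    for candidates, target in zip(ranked, targets):
        if target in candidates:
            total += 1.0 / (list(candidates).index(target) + 1)
    return total / len(targets)


# Toy data (hypothetical): each inner list stands for the ranked identifiers
# that a beam search might return for one query, best candidate first.
ranked = [
    ["3-1-4", "3-1-2", "0-2-5"],  # target at rank 1
    ["1-0-0", "2-2-1", "2-2-3"],  # target at rank 3
    ["4-4-0", "4-1-1", "0-0-7"],  # target not retrieved
]
targets = ["3-1-4", "2-2-3", "1-1-1"]

print(recall_at_k(ranked, targets, 1))
print(recall_at_k(ranked, targets, 3))
print(mean_reciprocal_rank(ranked, targets))
```

On this toy data one of three targets is ranked first and two of three appear in the top three, so Recall@1 is 1/3, Recall@3 is 2/3, and MRR is (1 + 1/3 + 0) / 3 = 4/9.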

Document type

Bachelor's thesis (Laurea)

Thesis author

Mazzi, Riccardo

Thesis supervisor

Moro, Gianluca

Thesis co-supervisor

Frisoni, Giacomo ; Molfetta, Lorenzo

School

Engineering and Architecture

Degree programme