Tavana, Ali
(2023)
Efficient Information Extraction from Logistics Documents using OCR and AI Model: LayoutLMv3.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Artificial intelligence [LM-DM270], Documento full-text non disponibile
Il full-text non è disponibile per scelta dell'autore.
(
Contatta l'autore)
Abstract
In the logistics industry, the rapid and precise extraction of vital information from documents plays a pivotal role in optimizing operations and decision-making processes. This research focuses on the application of Optical Character Recognition (OCR) technologies and AI models like LayoutLMv3 to streamline and enhance the extraction of critical data from logistics documents.
The study investigates the utilization of various OCR tools, including Tesseract, Google Vision OCR, and PaddlePaddle, in the context of logistics document processing. It explores the strengths and limitations of these OCR technologies and their compatibility with AI-driven information extraction methods.
Our research methodology encompasses the development and optimization of AI models for the automated recognition of essential data fields, such as shipping details, product descriptions, and quantities, within unstructured logistics documents. Fine-tuning and performance evaluation are central to this investigation, with a focus on accuracy, recall, and overall efficiency.
In addition to OCR and AI model development, this study explores strategies for the seamless integration of these technologies into logistics workflows. We emphasize the importance of user-friendly interfaces to facilitate interaction between logistics professionals and the AI-powered document processing system.
The findings of this research hold the potential to significantly improve logistics document processing by reducing manual labor and enhancing data accuracy.
Abstract
In the logistics industry, the rapid and precise extraction of vital information from documents plays a pivotal role in optimizing operations and decision-making processes. This research focuses on the application of Optical Character Recognition (OCR) technologies and AI models like LayoutLMv3 to streamline and enhance the extraction of critical data from logistics documents.
The study investigates the utilization of various OCR tools, including Tesseract, Google Vision OCR, and PaddlePaddle, in the context of logistics document processing. It explores the strengths and limitations of these OCR technologies and their compatibility with AI-driven information extraction methods.
Our research methodology encompasses the development and optimization of AI models for the automated recognition of essential data fields, such as shipping details, product descriptions, and quantities, within unstructured logistics documents. Fine-tuning and performance evaluation are central to this investigation, with a focus on accuracy, recall, and overall efficiency.
In addition to OCR and AI model development, this study explores strategies for the seamless integration of these technologies into logistics workflows. We emphasize the importance of user-friendly interfaces to facilitate interaction between logistics professionals and the AI-powered document processing system.
The findings of this research hold the potential to significantly improve logistics document processing by reducing manual labor and enhancing data accuracy.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Tavana, Ali
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Logistics,Document Processing,OCR Technology,Information Extraction,Automation,Efficiency.,layoutlmv3
Data di discussione della Tesi
16 Dicembre 2023
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Tavana, Ali
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Logistics,Document Processing,OCR Technology,Information Extraction,Automation,Efficiency.,layoutlmv3
Data di discussione della Tesi
16 Dicembre 2023
URI
Gestione del documento: