Mannini, Leonardo
(2026)
Enabling Multi-Species Bird Classification on Low-Power Bioacoustic Loggers.
[Laurea magistrale], Università di Bologna, Corso di Studio in
Artificial intelligence [LM-DM270], Documento ad accesso riservato.
Documenti full-text disponibili:
![[thumbnail of Thesis]](https://amslaurea.unibo.it/style/images/fileicons/application_pdf.png) |
Documento PDF (Thesis)
Full-text non accessibile fino al 26 Maggio 2026.
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato
Download (11MB)
| Contatta l'autore
|
Abstract
Passive acoustic monitoring is widely used in ecological research to collect and store large volumes of environmental audio for biodiversity assessment and species monitoring. A central challenge is moving from passive recording alone to in-situ inference, especially on low-power bioacoustic loggers with strict limits on memory, computation, and energy.
The main goal of this thesis is to design an efficient bird sound classification pipeline that remains both accurate and deployable on microcontroller-class hardware. The proposed system, named WrenNet, combines a compact neural architecture with a semi-learnable spectral front-end that preserves an interpretable filterbank structure while allowing limited adaptation of frequency allocation.
Training uses augmentation and knowledge distillation to improve robustness to noise, class imbalance, and acoustically similar species. The system is evaluated on a large dataset of bird recordings and environmental sounds. Beyond offline metrics, the model is deployed and tested on AudioMoth hardware, where latency, memory footprint, and energy consumption are measured.
Results show that the proposed approach achieves effective multi-species classification while remaining compatible with the strict resource constraints of low-power bioacoustic loggers, improving scalable and energy-efficient bioacoustic monitoring.
Abstract
Passive acoustic monitoring is widely used in ecological research to collect and store large volumes of environmental audio for biodiversity assessment and species monitoring. A central challenge is moving from passive recording alone to in-situ inference, especially on low-power bioacoustic loggers with strict limits on memory, computation, and energy.
The main goal of this thesis is to design an efficient bird sound classification pipeline that remains both accurate and deployable on microcontroller-class hardware. The proposed system, named WrenNet, combines a compact neural architecture with a semi-learnable spectral front-end that preserves an interpretable filterbank structure while allowing limited adaptation of frequency allocation.
Training uses augmentation and knowledge distillation to improve robustness to noise, class imbalance, and acoustically similar species. The system is evaluated on a large dataset of bird recordings and environmental sounds. Beyond offline metrics, the model is deployed and tested on AudioMoth hardware, where latency, memory footprint, and energy consumption are measured.
Results show that the proposed approach achieves effective multi-species classification while remaining compatible with the strict resource constraints of low-power bioacoustic loggers, improving scalable and energy-efficient bioacoustic monitoring.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Mannini, Leonardo
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Edge AI, TinyML, Deep Learning, Embedded Systems, Bioacoustics, Sound Classification, Learnable Filterbanks
Data di discussione della Tesi
26 Marzo 2026
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Mannini, Leonardo
Relatore della tesi
Correlatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Edge AI, TinyML, Deep Learning, Embedded Systems, Bioacoustics, Sound Classification, Learnable Filterbanks
Data di discussione della Tesi
26 Marzo 2026
URI
Gestione del documento: