Felletti, Lorenzo
(2023)
Edge Cloud Computing for Geospatial Data Processing and Approximate Queries.
[Laurea magistrale], Università di Bologna, Corso di Studio in Ingegneria informatica [LM-DM270]
Documenti full-text disponibili:
Documento PDF (Thesis)
Disponibile con Licenza: Creative Commons: Attribuzione - Condividi allo stesso modo 4.0 (CC BY-SA 4.0) Download (28MB) |
Abstract
Architecture for optimizing geospatial data processing pipelines in the cloud by making use of edge nodes deployed on containers in an urban moving taxi scenario (specifically Shenzhen, China). Edge nodes are using Geohash for efficient data preprocessing, including Geohash-based stratified sampling, and neighborhood location of incoming messages. Apache Kafka was then used to send data to a Spark cluster using a spatially-aware technique for data distribution. In particular, a Kafka topic for each neighborhood of the city considered was created, and each of these topics contained only messages originated in the same neighborhood.
Abstract