Benchmarking Materialized Views of SQL-based Stream Processing Systems

Parrinello, Angelo (2024) Benchmarking Materialized Views of SQL-based Stream Processing Systems. [Laurea magistrale], Università di Bologna, Corso di Studio in Ingegneria e scienze informatiche [LM-DM270] - Cesena
Documenti full-text disponibili:
[thumbnail of Thesis] Documento PDF (Thesis)
Disponibile con Licenza: Creative Commons: Attribuzione - Non commerciale - Non opere derivate 4.0 (CC BY-NC-ND 4.0)

Download (7MB)

Abstract

With the growing importance of real-time analytics, streaming processing systems are becoming an essential tool; in particular, the paradigm of the materialized view is gaining traction due to its simplicity and power. Moreover, a standard benchmark for these systems is still missing, although multiple benchmarks have been proposed for the more general field of stream processing. Many tools can support this kind of processing, both well-established and newer to the scene with different feature sets and technical choices. The goal of this thesis is to thoroughly evaluate and compare these tools in order to gain a deep understanding of their performances, characteristics, and limits. The competitors considered in this evaluation are Flink, Materialize, and ksqlDB (previously KSQL). Multiple aspects will be evaluated, from more abstract ones such as architecture, features, and technical choices to more concrete ones revolving around performance, scalability, and ease of use. This thesis proposes a baseline analysis of the three systems aim to exploit the advantages and disadvantages of each system and provide a comparison between them. As a starting point, we will use the Nexmark benchmark, which is a relatively simple and uncharted benchmark for stream processing systems.

Abstract
Tipologia del documento
Tesi di laurea (Laurea magistrale)
Autore della tesi
Parrinello, Angelo
Relatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Stream Processing System,Materialized View,Big Data,Benchmarking,Apache Flink,Materialize,ksqlDB,Apache Kafka,Nexmark
Data di discussione della Tesi
15 Marzo 2024
URI

Altri metadati

Statistica sui download

Gestione del documento: Visualizza il documento

^