Documenti full-text disponibili:
Abstract
With the growing importance of real-time analytics, streaming processing systems are becoming an essential tool; in particular, the paradigm of the materialized view is gaining traction due to its simplicity and power. Moreover, a standard benchmark for these systems is still missing, although multiple benchmarks have been
proposed for the more general field of stream processing. Many tools can support this kind of processing, both well-established and newer to the scene with different feature sets and technical choices. The goal of this thesis is to thoroughly evaluate and compare these tools in order to gain a deep understanding of their performances, characteristics, and limits. The competitors considered in this evaluation are Flink, Materialize, and ksqlDB (previously KSQL). Multiple aspects will be
evaluated, from more abstract ones such as architecture, features, and technical choices to more concrete ones revolving around performance, scalability, and ease of use. This thesis proposes a baseline analysis of the three systems aim to exploit the advantages and disadvantages of each system and provide a comparison between them. As a starting point, we will use the Nexmark benchmark, which is a relatively simple and uncharted benchmark for stream processing systems.
Abstract
With the growing importance of real-time analytics, streaming processing systems are becoming an essential tool; in particular, the paradigm of the materialized view is gaining traction due to its simplicity and power. Moreover, a standard benchmark for these systems is still missing, although multiple benchmarks have been
proposed for the more general field of stream processing. Many tools can support this kind of processing, both well-established and newer to the scene with different feature sets and technical choices. The goal of this thesis is to thoroughly evaluate and compare these tools in order to gain a deep understanding of their performances, characteristics, and limits. The competitors considered in this evaluation are Flink, Materialize, and ksqlDB (previously KSQL). Multiple aspects will be
evaluated, from more abstract ones such as architecture, features, and technical choices to more concrete ones revolving around performance, scalability, and ease of use. This thesis proposes a baseline analysis of the three systems aim to exploit the advantages and disadvantages of each system and provide a comparison between them. As a starting point, we will use the Nexmark benchmark, which is a relatively simple and uncharted benchmark for stream processing systems.
Tipologia del documento
Tesi di laurea
(Laurea magistrale)
Autore della tesi
Parrinello, Angelo
Relatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Stream Processing System,Materialized View,Big Data,Benchmarking,Apache Flink,Materialize,ksqlDB,Apache Kafka,Nexmark
Data di discussione della Tesi
15 Marzo 2024
URI
Altri metadati
Tipologia del documento
Tesi di laurea
(NON SPECIFICATO)
Autore della tesi
Parrinello, Angelo
Relatore della tesi
Scuola
Corso di studio
Ordinamento Cds
DM270
Parole chiave
Stream Processing System,Materialized View,Big Data,Benchmarking,Apache Flink,Materialize,ksqlDB,Apache Kafka,Nexmark
Data di discussione della Tesi
15 Marzo 2024
URI
Statistica sui download
Gestione del documento: