Final Thesis: A Systematic Review of Tools to Create ETL Pipelines
Abstract: ETL data pipelining tools are used to create pipelines that extract data from a source, transform it, and load it into a destination. Such pipelines can be created in various ways, and the right approach depends on multiple factors. To understand these factors, it is important to examine the design considerations behind these tools, along with the assessment approaches and benchmarks used to determine their effectiveness. Beyond aiding tool users, this understanding is also valuable for developers who create new ETL tools, such as those working on Jayvee. In this thesis, we conduct a systematic literature review following Kitchenham’s (2004) guidelines to examine ETL pipeline creation tools by type, target audience, evaluation methods, and monetization models. Additionally, we analyze the trade-offs associated with different design choices. The results show that developers have a wide range of options depending on their goals, each involving trade-offs with distinct strengths and weaknesses. We also identify and discuss the intended users of these systems, along with the evaluation methods and key metrics explored in the literature. The literature, however, leaves monetization models largely unexplored.
Keywords: Data Engineering, Research Thesis, JValue, Systematic Literature Review
PDF: Master Thesis
Reference: Mujeeb Ahmed. A Systematic Review of Tools to Create ETL Pipelines. Master Thesis. Friedrich-Alexander-Universität Erlangen-Nürnberg: 2025.