The JValue Project

Making open data safe, easy, and reliable to use

Category: Student Thesis

  • Final Thesis: ETL Data Pipelines Configurations in Spark

    Abstract: The JValue Open Data Service (ODS) is an ETL data pipeline that provides data extraction from different source systems (Extract), performs transformations on the extracted data (Transform), and loads the data to a target database (Load). There are different kinds of stream processing engines that cope with data that have high volume, variety, and…

  • Final Thesis: Design and Implementation of a Version Control System for Open Data Modelling Projects

    Abstract: Many modern software applications and research projects depend on the ability to access high-quality data sources. Even though there is already a large number of openly available data sets, such data sets are often hard to (re)use due to various barriers such as incomplete documentation, wrong or missing values, and more. To address these…

  • Final Thesis: A Study and Analysis of the Performance of the JValue Open Data Service as Part of a Data Pipeline Supporting An Online Learning Model

    Abstract: Open data is known to have data quality issues that require complex data cleansing and data transformation before it is usable for data analysis, data visualization, training machine learning algorithms, and other data science activities. Open Data Service (ODS) is a software project that aims at creating an interface for reliable and…

  • Final Thesis: Giving Structure to Open Data in the JValue ODS

    Abstract: Nowadays the internet provides a lot of open data for public use. This data can be published in various formats and covers plenty of subjects. The main problem is the absence of a standard: every provider can decide for themselves how the data is structured. The JValue project is dedicated…

  • Final Thesis: Implementing an Open Data ETL Processing Engine with Kafka

    Abstract: The JValue project group is developing a modeling ecosystem for Extract Transform Load (ETL) processes. Part of this ecosystem is a description model for those. This thesis suggests a conversion process from the description model into an Apache Kafka runtime, described in a cloud-native format, like Docker Compose. The conversion is implemented as a…

  • Final Thesis: Elasticity Concept for Microservice-based System

    Abstract: Software Elasticity is the concept of adapting available resources to the current or expected workload. This concept fits modern and stateless microservice architectures, which are scalable by design. Their scalability is closely related to Software Resilience and places new demands on cloud architectures. The JValue Open Data Service (JValue ODS) is an open data…

  • Final Thesis: Testing Microservice Integration with Consumer-Driven Contract Tests

    Abstract: Microservice systems consist of independent, distributed services that communicate with one another over network connections. Testing service integrations can be challenging in such systems, because several services must run at the same time and there are many potential sources of false-negative test results. Consumer-Driven Contract Testing (CDCT) is an approach that can be used for both…

  • Final Thesis: Konzept und Implementierung zur Observability für microservicebasierte Anwendungen (Concept and Implementation of Observability for Microservice-Based Applications)

    Abstract: ‘Microservices’ are a well-known and popular architectural pattern today. Many world-renowned tech companies have adopted it. Decoupling and splitting tasks into smaller services bring challenges as well as the resulting benefits. A central drawback in developing these services is that debugging becomes harder, as does the difficulty of…

  • Final Thesis: Value Types in TypeScript for JValue

    Abstract: Over the past years, TypeScript has steadily gained popularity because it eases the development of scalable and robust applications while syntactically being a superset of JavaScript. With the growing complexity of data-driven environments, it is essential for programming languages to cope with value types beyond their primitive data…