The JValue Project

Making open data safe, easy, and reliable to use

Month: October 2022

  • Final Thesis: ETL Data Pipelines Configurations in Spark

    Abstract: The JValue Open Data Service (ODS) is an ETL data pipeline that provides data extraction from different source systems (Extract), performs transformations on the extracted data (Transform), and loads the data to a target database (Load). There are different kinds of stream processing engines that cope with data that have high volume, variety, and…

  • Final Thesis: Design and Implementation of a Version Control System for Open Data Modelling Projects

    Abstract: Many modern software applications and research projects depend on the ability to access high-qualitative data sources. Even though there is already a large number of openly available data sets, such data sets are often hard to (re)use due to various barriers such as incomplete documentation, wrong or missing values, and more. To address these…