ProSA: Provenance Management using Schema mappings with Annotations

In our project Provenance Management using Schema mappings with Annotations, we use additional provenance information to determine which part of a huge amount of data is necessary for evaluating a query result in terms of reproducibility, reconstructability and traceability. The determined source data we call (minimal) sub-database.

In ProSA, we answer questions such as: (i) Where does the data come from, (ii) why, and (iii) how is a query result calculated, by combining the CHASE -- a technique for transforming databases -- with data provenance. The same technique allows us to process temporal databases as well. Query evaluation, evolution and their corresponding inverses are processed by the CHASE, implemented in a separate library called ChaTEAU.

Note: ProSA was developed at the University of Rostock (externer Link, ?ffnet neues Fenster) and is now being further developed at the University of Regensburg.

Associated Projects and Theses

T. Han?ffner: Schema-Mapping von Neo4j nach ProSA. Bachelor Thesis, 2025

Git-Repository

References

T. Auge: ProSA: A provenance system for reproducing query results.
WWW (Companium Volumn), 2023 (paper)
T. Auge: Provenance Management unter Verwendung von Schemaabbildungen mit Annotationen. University of Rostock, Germany, 2023 (pdf)
T. Auge, M. Hanzig, A. Heuer: ProSA Pipeline -- Provenance Conquers the Chase.
ADBIS (Short Paper), 2023 (paper)
T. Auge, N. Scharlau, J. Zimmer, A. G?rres, A. Heuer: ChaTEAU -- A Universal Toolkit for Applying the Chase, 2022 (paper)
T. Auge, N. Scharlau, A. Heuer: Provenance and Privacy in ProSA -- A Guided Interview on Privacy-Aware Provenance. DEXA Workshops, 2021 (paper)
T. Auge: Extended Provenance Management for Data Science Applications.
PhD@VLDB, 2020 (paper)
T. Auge, A. Heuer: ProSA -- Using the CHASE for Provenance Management.
ADBIS, 2019 (paper)

nach oben