IIT Database Group

header bar

PUG: a framework and practical implementation for why and why-not provenance

Authors

Materials

Abstract

Explaining why an answer is (or is not) returned by a query is important for many applications including auditing, debugging data and queries, and answering hypothetical questions about data. In this work, we present the first practical approach for answering such questions for queries with negation (first-order queries). Specifically, we introduce a graph-based provenance model that, while syntactic in nature, supports reverse reasoning and is proven to encode a wide range of provenance models from the literature. The implementation of this model in our PUG (Provenance Unification through Graphs) system takes a provenance question and Datalog query as an input and generates a Datalog program that computes an explanation, i.e., the part of the provenance that is relevant to answer the question. Furthermore, we demonstrate how a desirable factorization of provenance can be achieved by rewriting an input query. We experimentally evaluate our approach demonstrating its efficiency.

bibtex

@article{LL18,
  author = {Lee, Seokki and Lud{\"a}scher, Bertram and Glavic, Boris},
  date-added = {2018-08-29 19:09:06 -0500},
  date-modified = {2018-08-29 19:09:33 -0500},
  day = {23},
  doi = {10.1007/s00778-018-0518-5},
  issn = {0949-877X},
  issue = {1},
  journal = {The VLDB Journal},
  keywords = {Datalog; Provenance; Missing Answers; Semirings; PUGS},
  longversionurl = {https://arxiv.org/pdf/1808.05752.pdf},
  month = aug,
  pages = {47---71},
  projects = {PUGS},
  title = {PUG: a framework and practical implementation for why and why-not provenance},
  venueshort = {VLDBJ},
  volume = {28},
  year = {2019}
}

Reference

PUG: a framework and practical implementation for why and why-not provenance Seokki Lee, Bertram Ludäscher and Boris Glavic The VLDB Journal. 28, 1 (Aug. 2019) , 47—71.