Discovery of Genuine Functional Dependencies from Relational Data with Missing Values [Abstract for INFORSID 2019] - IRD - Institut de recherche pour le développement Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Discovery of Genuine Functional Dependencies from Relational Data with Missing Values [Abstract for INFORSID 2019]

Hazar Harmouch
  • Fonction : Auteur
  • PersonId : 1032231
Felix Naumann
  • Fonction : Auteur
  • PersonId : 1032232
Thirumuruganathan Saravanan
  • Fonction : Auteur
  • PersonId : 1032233

Résumé

This article is an extended abstract of our work published at VLDB’2018. The full paper is available at www.vldb.org/pvldb/vol11/p880-berti-equille.pdf . Functional dependencies (FDs) play an important role in maintaining data quality in relational databases. They can be used to enforce data consistency and guide data repairs. In this work, we investigate the problem of missing values and its impact on FD discovery. When using exist- ing FD discovery algorithms, some genuine FDs could not be detected precisely due to missing values and some non-genuine FDs can be discovered even though they are caused by missing values depending on the considered semantics for NULL values. We define the notion of gen- uineness of FDs and propose algorithms to compute the FD genuineness score. This can be used to identify genuine FDs among the set of all valid dependencies that hold on the data. We evaluate the quality of our method over various real-world and semi-synthetic datasets with extensive experiments. The results show that our method performs well for relatively large FD sets and is able to accurately capture genuine FDs.
Fichier principal
Vignette du fichier
genuineFD_INFORSID_2019_paper_32.pdf (95.36 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

ird-02092569 , version 1 (21-02-2020)

Identifiants

  • HAL Id : ird-02092569 , version 1

Citer

Laure Berti-Equille, Hazar Harmouch, Felix Naumann, Noël Novelli, Thirumuruganathan Saravanan. Discovery of Genuine Functional Dependencies from Relational Data with Missing Values [Abstract for INFORSID 2019]. INFORSID 2019, Jun 2019, Paris, France. ⟨ird-02092569⟩
246 Consultations
73 Téléchargements

Partager

Gmail Facebook X LinkedIn More