Measuring Approximate Functional Dependencies
M Parciak, S Weytjens, F Neven, L Peeters, N Hens, S Vansummeren
09/06/2023 - Knowledge Graphs for Data Integration Workshop, UHasselt
Introduction: Functional Dependencies (FDs)
Introduction: Approximate FDs (AFDs)
Introduction: Approximate FDs (AFDs)
Our Aim
Compare AFD measures proposed in the literature.
AFD measures: literature review
Literature review
Since 1954, 12 AFD measures were described.
We identify three groups
AFD measures: formal comparison
Identifying two new measures
Evaluation on Real-World Data
Approach
Test AFD measures on a real-world benchmark, which we manually annotate, and compare the measures on precision, recall and rankings.
Evaluation on Real-World Data
Evaluating the ranking power
Evaluation on Real-World Data
Evaluating the ranking power
Evaluation on Real-World Data
Evaluating the ranking power
Sensitivity Analysis
Structural properties and AFD measures
Based on synthetic datasets, we find that
RFI’+ & 𝜇+ (both orange) separate FDs from non- FDs best.
Contributions
Conclusions
1
2
3
𝜇+
RFI’+
g’3