Publication | Open Access
Conditional Functional Dependencies for Data Cleaning
367
Citations
14
References
2007
Year
Unknown Venue
Conditional Functional DependenciesEngineeringStructured DataConstraintsVerificationData PreparationSemanticsSemantic WebFormal VerificationConstraint SolvingData ScienceInconsistency DetectionManagementData IntegrationSchema EvolutionData ManagementKnowledge DiscoveryComputer ScienceData CleansingSemantic ReasonerDatabase TheoryCfd ViolationsAutomated ReasoningFormal MethodsData TreatmentData Modeling
Conditional functional dependencies extend traditional functional dependencies by incorporating data bindings to capture consistency, but the resulting abundance of constraints complicates violation detection. The paper proposes CFDs and develops SQL‑based techniques for detecting violations and checking multiple constraints in a single query. An inference system analogous to Armstrong's axioms and consistency analysis are provided, and the authors experimentally evaluate CFD‑based inconsistency detection performance. The work establishes a constraint theory for CFDs and moves toward a practical constraint‑based method for improving data quality.
We propose a class of constraints, referred to as conditional functional dependencies (CFDs), and study their applications in data cleaning. In contrast to traditional functional dependencies (FDs) that were developed mainly for schema design, CFDs aim at capturing the consistency of data by incorporating bindings of semantic ally related values. For CFDs we provide an inference system analogous to Armstrong's axioms for FDs, as well as consistency analysis. Since CFDs allow data bindings, a large number of individual constraints may hold on a table, complicating detection of constraint violations. We develop techniques for detecting CFD violations in SQL as well as novel techniques for checking multiple constraints in a single query. We experimentally evaluate the performance of our CFD-based methods for inconsistency detection. This not only yields a constraint theory for CFDs but is also a step toward a practical constraint-based method for improving data quality.
| Year | Citations | |
|---|---|---|
Page 1
Page 1