Publication | Closed Access
Privacy preserving ID3 using Gini Index over horizontally partitioned data
57
Citations
18
References
2008
Year
Unknown Venue
Privacy ProtectionEngineeringInformation SecurityHardware SecurityId3 AlgorithmData ScienceData MiningDecision TreeData AnonymizationPrivacy SystemData IntegrationPrivacy-preserving CommunicationData ManagementKnowledge DiscoveryData PrivacyPrivate Information RetrievalComputer ScienceDifferential PrivacyPrivacyData SecurityCryptographyGini IndexBig Data
The ID3 algorithm is a standard, popular, and simple method for data classification and decision tree creation. Since privacy-preserving data mining should be taken into consideration, several secure multi-party computation protocols have been presented based on this technique. Entropy and Gini Index are two protocols which compute information-gain at each step when producing a decision tree. The Gini index, however, has been less studied in privacy-preserving data mining protocols. In this paper, we show how Gini can be used in privacy-preserving ID3 algorithms to create decision tree classifications in such a way that involved parties can jointly compute the gain value of each normal attribute without revealing their own private information to each other, while the database is horizontally partitioned over two or more parties. Three secure multiparty sub-protocols are presented to evaluate the intermediate computations. The communication overhead has been kept reasonably low to make the whole protocol efficient and practical.
| Year | Citations | |
|---|---|---|
Page 1
Page 1