Concepedia

Publication | Closed Access

Privacy preserving ID3 using Gini Index over horizontally partitioned data

57

Citations

18

References

2008

Year

Saeed Samet, Ali Miri

Unknown Venue

Abstract

The ID3 algorithm is a standard, popular, and simple method for data classification and decision tree creation. Since privacy-preserving data mining should be taken into consideration, several secure multi-party computation protocols have been presented based on this technique. Entropy and Gini Index are two protocols which compute information-gain at each step when producing a decision tree. The Gini index, however, has been less studied in privacy-preserving data mining protocols. In this paper, we show how Gini can be used in privacy-preserving ID3 algorithms to create decision tree classifications in such a way that involved parties can jointly compute the gain value of each normal attribute without revealing their own private information to each other, while the database is horizontally partitioned over two or more parties. Three secure multiparty sub-protocols are presented to evaluate the intermediate computations. The communication overhead has been kept reasonably low to make the whole protocol efficient and practical.

References

YearCitations

Page 1