Concepedia

Publication | Closed Access

The Optimality of Naive Bayes.

1.4K

Citations

7

References

2004

Year

Harry Zhang

Unknown Venue

Abstract

Naive Bayes is one of the most efficient and effective inductive learning algorithms for machine learning and data mining. Its competitive performance in classifica-tion is surprising, because the conditional independence assumption on which it is based, is rarely true in real-world applications. An open question is: what is the true reason for the surprisingly good performance of naive Bayes in classification? In this paper, we propose a novel explanation on the superb classification performance of naive Bayes. We show that, essentially, the dependence distribution; i.e., how the local dependence of a node distributes in each class, evenly or unevenly, and how the local dependen-cies of all nodes work together, consistently (support-

References

YearCitations

Page 1