Publication | Closed Access
Predicting disease by using data mining based on healthcare information system
54
Citations
17
References
2012
Year
Unknown Venue
EngineeringMachine LearningDiagnosisHealthcare Information SystemDisease ClassificationClassification MethodData ScienceData MiningNaive BayesianClass ImbalanceBiostatisticsDisease DiagnosisStatisticsMultiple Classifier SystemHealthcare Big DataPredictive AnalyticsKnowledge DiscoveryHealthcare Information SystemsEpidemiologyMedical Data MiningData ClassificationMedical RecordsMedical Information SystemBusinessClassificationMedicineData Mining ProcessClinical Decision Support SystemHealth Informatics
This paper applies the data mining process to predict hypertension from patient medical records with eight other diseases. A sample with the size of 9862 cases has been studied. The sample was extracted from a real world Healthcare Information System database containing 309383 medical records. We observed that the distribution of patient diseases in the medical database is imbalanced. Under-sampling technique has been applied to generate training data sets, and data mining tool Weka has been used to generate the NaIve Bayesian and J-48 classifiers. In addition, an ensemble of five J-48 classifiers was created trying to improve the prediction performance, and rough set tools were used to reduce the ensemble based on the idea of second-order approximation. Experimental results showed a little improvement of the ensemble approach over pure Na'ive Bayesian and J-48 in accuracy, sensitivity, and F-measure.
| Year | Citations | |
|---|---|---|
Page 1
Page 1