Publication | Open Access
Comparative Analysis of Data Mining Techniques for Malaysian Rainfall Prediction
97
Citations
15
References
2016
Year
Climate Change PredictionEngineeringMachine LearningWeather ForecastingEnsemble AlgorithmEarth ScienceOptimization-based Data MiningData ScienceData MiningPattern RecognitionManagementDecision Tree LearningDrought ForecastingComparative AnalysisStatisticsMultiple Classifier SystemInstances Random ForestHydrometeorologyMeteorologyPredictive AnalyticsKnowledge DiscoveryComputer ScienceForecastingEvolutionary Data MiningData ClassificationDroughtClassificationClassifier SystemRandom ForestData Modeling
Climate change prediction analyses the behaviours of weather for a specific time. Rainfall forecasting is a climate change task where specific features such as humidity and wind will be used to predict rainfall in specific locations. Rainfall prediction can be achieved using classification task under Data Mining. Different techniques lead to different performances depending on rainfall data representation including representation for long term (months) patterns and short-term (daily) patterns. Selecting an appropriate technique for a specific duration of rainfall is a challenging task. This study analyses multiple classifiers such as Naïve Bayes, Support Vector Machine, Decision Tree, Neural Network and Random Forest for rainfall prediction using Malaysian data. The dataset has been collected from multiple stations in Selangor, Malaysia. Several pre-processing tasks have been applied in order to resolve missing values and eliminating noise. The experimental results show that with small training data (10%) from 1581 instances Random Forest correctly classified 1043 instances. This is the strength of an ensemble of trees in Random Forest where a group of classifiers can jointly beat a single classifier.
| Year | Citations | |
|---|---|---|
Page 1
Page 1