scholarly journals Classification model of air quality in Jakarta using decision tree algorithm based on air pollutant standard index

Author(s):  
F M Putra ◽  
I S Sitanggang
2012 ◽  
Vol 190-191 ◽  
pp. 347-351
Author(s):  
Bing Xiang Liu ◽  
Yan Wu ◽  
Meng Shan Li

The decision tree is a widely used classification model and inductive learning method based on examples. It is characterized by the simple classification rules, easy understanding for users and so on, but we also can see some disadvantages in certain situations. The paper puts forward the multivariable decision tree algorithm which based on a rough set to a combination of rough sets theory and decision tree algorithm. The multivariable decision tree algorithm has reduced the complexity of decision tree while not affect the readability of the classification rules. Experimental analysis has witnessed the feasibility and efficiency of the algorithm.


Air is the most essential natural resource for the survival of humans, animals, and plants on the planet. Air is polluted due to the burning of fuels, exhaust gases from factories and industries, and mining operations. Now, air pollution becomes the most dangerous pollution that humanity ever faced. This causes many health effects on humans like respiratory, lung, and skin diseases, which also causes effects on plants, and animals to survive. Hence, air quality prediction and evaluation as becoming an important research area. In this paper, a machine learning-based prediction model is constructed for air quality forecasting. This model will help us to find the major pollutant present in the location along with the causes and sources of that particular pollutant. Air Quality Index value for India is used to predict air quality. The data is collected from various places throughout India so that the collected data is preprocessed to recover from null values, missing values, and duplicate values. The dataset is trained and tested with various machine learning algorithms like Logistic Regression, Naïve Bayes Classification, Random Forest, Support Vector Machine, K Nearest Neighbor, and Decision Tree algorithm in order to find the performance measurement of the above-mentioned algorithms. From this, the prediction model is constructed using the Decision Tree algorithm to predict the air quality, because it provides the best and highest accuracy of 100%. The machine learning-based air quality prediction model helps India meteorological department in predicting the future of air quality, and its status and depends on that they can take action.


Sign in / Sign up

Export Citation Format

Share Document