Improved k-NN Regression Model Using Random Forests for Air Pollution Prediction.

Siddharth Sharma, L. Rajya Lakshmi

SmartNets(2023)

引用 0|浏览12
暂无评分
摘要
In this paper, we review various k-Nearest-Neighbor (k-NN) based models and their accuracies to develop a better model to predict concentrations of air pollutants. The proposed model splits the range of target variable values into a number of buckets first. Then, a hybrid k-NN model, which is a combination of weighted attribute k-NN and distance-weighted k-NN, and where the weights are assigned by calculating Information Gain, is used for each attribute, to calculate the target variable value of each test case. The proposed model decreases the root mean square error (RMSE) of predicted NO, NO 2 and NO x values by 28.29%, 29.44%, and 16.51% respectively, compared to the state-of the-art. Similarly, the mean absolute error (MAE) values for NO, NO 2 , and NO x are decreased by 18.26%, 33.67%, and 14.54%, compared to the state-of the-art. This model gives good results when the size of each bucket is nearly equal.
更多
查看译文
关键词
k-NN,Random forests,Air pollution data analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要