Estimating Pm2.5 From Multisource Data: A Comparison Of Different Machine Learning Models In The Pearl River Delta Of China

URBAN CLIMATE(2021)

引用 26|浏览24
暂无评分
摘要
Air pollution with high concentrations of fine particulate matter (PM2.5) poses severe threats to human health. Accurate estimation of PM2.5 concentrations can timely assist relevant agencies to conduct air pollution treatment and provide essential data sources for epidemiological research related to PM2.5 exposure. Although China has established a network for monitoring ground-level PM2.5 concentrations over the past decades, the limited available records from the sparsely located PM2.5 monitoring sites hinder the fine-resolution research of air pollution. Many studies have been conducted to fill the data gap caused by sparsely distributed monitoring sites, but the accuracy of different models varies greatly. In recent years, machine learning models have become the preferred choices due to their high estimation accuracy. However, the estimation accuracy may differ significantly in different study areas with different models, and there are few studies on model performance evaluation regarding the Pearl River Delta (PRD) region of China. This study evaluated the performance of six machine learning models for estimating PM2.5 concentrations in PRD from August 2014 to December 2019. Moreover, multi-source data were adopted for reliable daily PM2.5 concentration estimation, including meteorology, vegetation, topography, and point of interest (POI). The results show that the tree-structured models (i.e., Random Forest (RF) and Gradient Boosting Regression Tree (GBRT)) generally produce better estimations than other models. Two neural network models (i.e., Back Propagation Neural Network (BPNN) and Elman Neural Network (ENN)) show a similar estimation accuracy. Additionally, the Generalized Additive Model (GAM) generally gives the worst performance, followed by the Support Vector Machines (SVM) model. RF is thus highly recommended based on the estimation accuracy, while GBRT is also a promising model for daily PM2.5 estimation in PRD. Our study provides a reference for selecting an appropriate model for daily PM2.5 concentration estimation in PRD and other regions with climate background.
更多
查看译文
关键词
PM2.5, Air pollution, Machine learning, Pearl River Delta, Point of Interest (POI)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要