UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China
Wu, Boxi1,2,3; Wu, Cheng1,2,3; Ye, Yuchen4; Pei, Chenglei5; Deng, Tao6; Li, Yong Jie7; Lu, Xingcheng8; Wang, Lei9; Hu, Bin1; Li, Mei1,2,3; Wu, Dui1,2,3
2024-03-05
Source PublicationAtmospheric Environment
ISSN1352-2310
Volume321Pages:120347
Abstract

Long-term air pollution data are essential for formulating air quality management policies and assessing their corresponding impacts on public health. However, missing data are inevitably encountered during air pollution observations at different sites. This study proposed a machine learning approach that utilizes data from neighboring sites to reconstruct missing data. Hourly observation data from three neighboring sites in the Pearl River Delta (PRD) region in South China, were used for data retrieval, including the NC site (2006–2015), JXL site and PYZX site (2014–2022). The overlapped data (2014.05–2015.12) were used to train and evaluate the machine learning models. The performance of 11 algorithms (CatBoost, XGBoost, LightGBM, LightGBMXT, LightGBMLarge, RandomForestMSE, ExtraTreeMSE, NeuralNetTorch, NeuralNetFastAI, KNeighborsDist, and KNeighborsUnif) for the retrieval of major air pollutants, including O3, NO2, PM2.5, PM10 and SO2 was benchmarked by a set of evaluation metrics. CatBoost showed the best performance; thus, it was adopted for air pollutant data reconstruction in NC (2016–2022) and PYZX (2008–2014). Long-term data (2006–2022) at the NC were obtained by combining the observation and retrieval data. In the past 15 years, the O3 concentration of NC has increased by 72% at a rate of 0.83 ppb yr−1 (3.2% yr−1 ). On the contrary, substantial reductions were observed for NO2 (61%), PM2.5 (51%) and PM10 (42%) at the NC site, with the rates of −1.27 ppb yr−1 (−5.9% yr−1 ), −1.96 μg m−3 yr−1 (−5.8% yr−1 ) and −2.32 μg m−3 yr−1 (−5.2% yr−1 ), respectively. SO2 exhibits the most pronounced reduction (79%) among all species, with two distinct rates of −4.10 ppb yr−1 (−27.4% yr−1 ) and −0.40 ppb yr−1 (−6.2% yr−1), for 2008–2012 and 2012–2022, respectively. This study demonstrates the feasibility of machine learning in filling the data gap of air pollution monitoring network and highlights the importance of continuous long-term air pollution data in reviewing air quality management policies. 

KeywordAir Pollution Data Gap Long-term Trend Machine Learning Monitoring Network
DOI10.1016/j.atmosenv.2024.120347
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaEnvironmental Sciences & Ecology ; Meteorology & Atmospheric Sciences
WOS SubjectEnvironmental Sciences ; Meteorology & Atmospheric Sciences
WOS IDWOS:001166916500001
PublisherPERGAMON-ELSEVIER SCIENCE LTD, THE BOULEVARD, LANGFORD LANE, KIDLINGTON, OXFORD OX5 1GB, ENGLAND
Scopus ID2-s2.0-85182892646
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
DEPARTMENT OF CIVIL AND ENVIRONMENTAL ENGINEERING
DEPARTMENT OF OCEAN SCIENCE AND TECHNOLOGY
Corresponding AuthorWu, Cheng
Affiliation1.Institute of Mass Spectrometry and Atmospheric Environment, Guangdong Provincial Engineering Research Center for on-line Source Apportionment System of Air Pollution, Jinan University, Guangzhou, 510632, China
2.Guangdong-Hongkong-Macau Joint Laboratory of Collaborative Innovation for Environmental Quality, Guangzhou, 510632, China
3.Guangdong MS Institute of Scientific Instrument Innovation, Guangzhou, 510530, China
4.School of Atmospheric Sciences, Nanjing University of Information Science and Technology, Nanjing, 210044, China
5.Guangzhou Sub-branch of Guangdong Ecological and Environmental Monitoring Center, Guangzhou, 518049, China
6.Institute of Tropical and Marine Meteorology, CMA, Guangzhou, 510080, China
7.Department of Civil and Environmental Engineering, Centre for Regional Oceans, And Department of Ocean Science and Technology, Faculty of Science and Technology, University of Macau, Taipa, Macau, China
8.Department of Geography and Resource Management, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China
9.Institute of Traditional Chinese Medicine and Natural Products, College of Pharmacy, And JNU-HKUST Joint Laboratory for Neuroscience and Innovative Drug Research, Jinan University, Guangzhou, 510632, China
Recommended Citation
GB/T 7714
Wu, Boxi,Wu, Cheng,Ye, Yuchen,et al. Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China[J]. Atmospheric Environment, 2024, 321, 120347.
APA Wu, Boxi., Wu, Cheng., Ye, Yuchen., Pei, Chenglei., Deng, Tao., Li, Yong Jie., Lu, Xingcheng., Wang, Lei., Hu, Bin., Li, Mei., & Wu, Dui (2024). Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China. Atmospheric Environment, 321, 120347.
MLA Wu, Boxi,et al."Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China".Atmospheric Environment 321(2024):120347.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wu, Boxi]'s Articles
[Wu, Cheng]'s Articles
[Ye, Yuchen]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wu, Boxi]'s Articles
[Wu, Cheng]'s Articles
[Ye, Yuchen]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wu, Boxi]'s Articles
[Wu, Cheng]'s Articles
[Ye, Yuchen]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.