Affiliations 

  • 1 Department of Energy and Mineral Resources Engineering, Sejong University, 209 Neudong-ro Gwangjin-gu, Seoul, 05006, Republic of Korea
  • 2 School of Civil and Environmental Engineering, University of New South Wales, Sydney, NSW, Australia
  • 3 Department of Geology, Faculty of Sciences, Ibn Zohr University, B.P 8106, 80000, Agadir, Morocco
  • 4 Geospatial Analysis and Modelling (GAM) Research Laboratory, Department of Civil and Environmental Engineering, Universiti Teknologi PETRONAS (UTP), Seri Iskandar, 32610, Perak, Malaysia. alateef.babatunde@utp.edu.my
J Expo Sci Environ Epidemiol, 2021 07;31(4):709-726.
PMID: 33159165 DOI: 10.1038/s41370-020-00271-8

Abstract

Accurate identification of distant, large, and frequent sources of emission in cities is a complex procedure due to the presence of large-sized pollutants and the existence of many land use types. This study aims to simplify and optimize the visualization mechanism of long time-series of air pollution data, particularly for urban areas, which is naturally correlated in time and spatially complicated to analyze. Also, we elaborate different sources of pollution that were hitherto undetectable using ordinary plot models by leveraging recent advances in ensemble statistical approaches. The high performing conditional bivariate probability function (CBPF) and time-series signature were integrated within the R programming environment to facilitate the study's analysis. Hourly air pollution data for the period between 2007 to 2016 is collected using four air quality stations, (ca0016, ca0058, ca0054, and ca0025), situated in highly urbanized locations that are characterized by complex land use and high pollution emitting activities. A conditional bivariate probability function (CBPF) was used to analyze the data, utilizing pollutant concentration values such as Sulfur dioxide (SO2), Nitrogen oxides (NO2), Carbon monoxide (CO) and Particulate Matter (PM10) as a third variable plotted on the radial axis, with wind direction and wind speed variables. Generalized linear model (GLM) and sensitivity analysis are applied to verify and visualize the relationship between Air Pollution Index (API) of PM10 and other significant pollutants of GML outputs based on quantile values. To address potential future challenges, we forecast 3 months PM10 values using a Time Series Signature statistical algorithm with time functions and validated the outcome in the 4 stations. Analysis of results reveals that sources emitting PM10 have similar activities producing other pollutants (SO2, CO, and NO2). Therefore, these pollutants can be detected by cross selection between the pollution sources in the affected city. The directional results of CBPF plot indicate that ca0058 and ca0054 enable easier detection of pollutants' sources in comparison to ca0016 and ca0025 due to being located on the edge of industrial areas. This study's CBPF technique and time series signature analysis' outcomes are promising, successfully elaborating different sources of pollution that were hitherto undetectable using ordinary plot models and thus contribute to existing air quality assessment and enhancement mechanisms.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.