Affiliations 

  • 1 School of Mathematical Sciences, Universiti Sains Malaysia, 11800 USM, Penang, Malaysia. Electronic address: farihasyaqina@usm.my
  • 2 Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia 43600 Bangi, Selangor, Malaysia. Electronic address: msn@ukm.edu.my
  • 3 Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia 43600 Bangi, Selangor, Malaysia. Electronic address: fatima84@ukm.edu.my
  • 4 Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia 43600 Bangi, Selangor, Malaysia. Electronic address: munira@ukm.edu.my
  • 5 Department of Mathematical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia 43600 Bangi, Selangor, Malaysia. Electronic address: mohdalmie@ukm.edu.my
J Environ Manage, 2022 Mar 15;306:114434.
PMID: 35065362 DOI: 10.1016/j.jenvman.2022.114434

Abstract

Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.