1. What are the geographical classifications of the 31 provinces in China?
The 31 provinces in China are geographically classified into six parts: North Central (NC), North East (NE), North West (NW), South East (SE), South West (SW), and the Tibetan Plateau (TP). These classifications represent diverse social-economical and geo-climatic conditions. NC and SE belong to relatively developed regions in eastern China, while NW, SW, and TP are less developed regions. The Qinling Mountain-Huaihe River Line serves as a boundary between the humid south (SE and SW) with more precipitation and the drier north (NC).
read more
2. How does RF model handle nonlinear relationships?
The Random Forest (RF) model is a state-of-the-art statistical method that deals with complicated nonlinear relationships between the response variable and interpretation variables. It uses ensemble learning, where the RF regression predictions are determined as the average of multiple regression trees based on the bootstrap sampling method. The model performance strongly depends on two crucial parameters, ntree (number of regression trees) and mtry (number of interpretation variables sampled for splitting at each node). Significant correlations of regression trees can be avoided by not including all interpretation variables in the node splitting process. Backward variable selection is also performed to achieve better performance. The RF model was applied to estimate the spatiotemporal pattern of dry deposition for individual N and S species, using 1000 trees and 3 interpretation variables for splitting at each node. The model was trained using the 'caret' package in R software (version 4.1.2; Kuhn, 2021).
read more
3. How to estimate sulfate concentration?
To estimate sulfate concentration, a simple linear relationship between SO2 and sulfate concentration is used with CTM. The formula GG SSSS 4 2- = GG SSSS 2 x ff(GG CCCCCC-SSSS 4 2-, GG CCCCCC-SSSS 2) is applied, where 2 is the monthly ground-level concentration at CNEMC, and -4 2- and -2 are the sulfate and SO2 concentrations simulated by CTM, respectively. The ratio of simulated sulfate to SO2, denoted as f, is determined based on the significant positive correlation between the two (Luo et al., 2016). This method compensates for the lack of large-scale ground observation data for sulfate concentration.
read more
4. How is wet deposition flux estimated using GAM?
Wet deposition flux is estimated using a nonlinear Generalized Additive Model (GAM) developed in previous work. The model applies the 'link' function to establish the relationship between response variables and interpretation variables. For SO4 2-, NO3-, and NH4+, observed monthly wet or bulk deposition at NNDMN serves as the response variables. Interpretation variables include precipitation, satellite-derived VCDs, PM2.5 concentrations, total column liquid water, temperature, boundary layer height, forest-cover, and urban-cover. The data sources and model performance evaluation are detailed in Zhao et al. (2022). The deposition obtained through GAM is uniformly defined as wet deposition in this study.
read more