
Short-term PM2.5 and cardiovascular admissions in NY State: assessing sensitivity to exposure model choice



Air pollution health studies have been increasingly using prediction models for exposure assessment even in areas without monitoring stations. To date, most studies have assumed that a single exposure model is correct, but estimated effects may be sensitive to the choice of exposure model.


We obtained county-level daily cardiovascular (CVD) admissions from the New York (NY) Statewide Planning and Research Cooperative System (SPARCS) and four sets of fine particulate matter (PM2.5) spatio-temporal predictions (2002–2012). We employed overdispersed Poisson models to investigate the relationship between daily PM2.5 and CVD admissions, adjusting for potential confounders, separately for each state-wide PM2.5 dataset.


For all PM2.5 datasets, we observed positive associations between PM2.5 and CVD. Across the modeled exposure estimates, effect estimates ranged from 0.23% (95%CI: -0.06, 0.53%) to 0.88% (95%CI: 0.68, 1.08%) per 10 µg/m3 increase in daily PM2.5. We observed the highest estimate, 0.96% (95%CI: 0.62, 1.30%), using monitored concentrations for the subset of counties where these data were available.


Effect estimates varied by a factor of almost four across methods to model exposures, likely due to varying degrees of exposure measurement error. Nonetheless, we observed a consistently harmful association between PM2.5 and CVD admissions, regardless of model choice.



The association between air pollution and adverse health outcomes is one of the most well-researched topics in epidemiology, with studies spanning different pollutants [1,2,3], timescales of exposure [4,5,6], and outcomes of interest [7,8,9]. Historically, time-series studies in air pollution epidemiology have primarily utilized data from monitoring stations for exposure assignment. In the United States, this is typically done using data from the Environmental Protection Agency’s (EPA) Air Quality System (AQS) database [10]. Although monitors provide information on pollutant concentrations, using such data in health studies requires strong assumptions. For example, Bell et al. [11] noted that because the location of monitor systems also depends on regulatory compliance and not solely on population density, depending on the pollutant, monitor data are not necessarily best suited for public health research. Furthermore, monitor locations are by definition points in space and, thus, may not adequately capture population exposures in a pre-specified area in the time series analysis (e.g., a city) [12, 13].

To reduce exposure measurement error and, further, include populations in areas without monitors, there has been an increasing use of prediction models in air pollution epidemiology for exposure assessment. These prediction models provide outputs with full coverage at a much finer spatial resolution than a spatial point, which monitors represent, and predict both spatial and temporal changes in air pollution. Initially, such models were simple, and early efforts used largely statistical approaches, such as land use regression models and generalized additive mixed models [14,15,16]. With increased computation capacity and demand for higher spatial and temporal resolution, the prediction models have grown increasingly sophisticated. Examples include the integration of remote sensing data, predictions from chemical transport models, and more robust methods for higher predictive accuracy (e.g., random forests, neural networks, and ensemble models) [17,18,19,20].

Many research groups are currently developing and improving prediction models for exposure assessment in epidemiologic studies. However, most epidemiologic studies to date use air pollution predictions from a single model to assign exposures, although in recent years there have been additional efforts to develop statistical and computational exposure models with exhaustive datasets [21, 22]. This is of critical importance because the results from these epidemiologic studies are often used to inform regulations, but the exposure–response functions that are generated from studies using different models for exposure assessment are not necessarily comparable, both spatially and temporally. Our study aims to address this critical knowledge gap by assessing the sensitivity of fine particle (PM2.5; particles with aerodynamic diameter ≤ 2.5 µm) health effect estimates to the choices of different models for exposure assessment in a time series setting. As a case study, we focus on the association between daily PM2.5 concentrations and cardiovascular disease (CVD)-related hospitalizations in New York State (NYS) using a daily time series design at the county level. The goal of this paper is not to identify the “best” PM2.5 product, which would depend on the specific goals of a particular research project. Rather, we aim to characterize the potential variability in the results of epidemiologic analyses by using different PM2.5 products, and whether these results will allow us to reach similar or different conclusions in this NYS case study.


Exposure assessment

We obtained five publicly available, daily PM2.5 exposure products over NYS. These include data from the United States EPA’s AQS database, which provides PM2.5 monitoring data in 18 of 62 counties in NYS [10]; daily output from the Community Multiscale Air Quality Modeling System (CMAQ), an atmospheric chemical transport model developed by the EPA to simulate regional air pollution [23]; the Fused Air Quality Surface Using Downscaling (FAQSD), which uses a Bayesian space–time downscaler model to fuse the AQS measurements with CMAQ estimates [23]; a model developed by the United States Centers for Disease Control and Prevention’s Wide-ranging Online Data for Epidemiologic Research (CDC WONDER), which links satellite-derived and spatially interpolated ground-based PM2.5 using linear regression [24]; and a product from Emory University, which integrates satellite aerosol optical depth (AOD), land use data, and meteorological variables in a random forest model [19]. In Table 1 we present all of the PM2.5 datasets used in this study, with details regarding their spatial and temporal coverage and resolution. All PM2.5 datasets used in this study provided daily PM2.5 concentrations. More details regarding the PM2.5 products, including validation and comparative statistics, can be found in an existing publication [25].

Table 1 Summary of PM2.5 datasets used in this study

Exposure data were available from 2002 – 2012, except from the CDC WONDER model, which was available from 2003 – 2011. Daily average temperature and relative humidity were obtained from the North American Land Data Assimilation Systems (NLDAS), which provides the meteorological data at 1/8th degree grids over the study area [26]. We averaged all available grids within a county to obtain daily county-level averages for all PM2.5 products and the corresponding meteorological variables.
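The county-level aggregation described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: it assumes each grid cell has already been assigned to a county, that an unweighted mean across cells is intended, and that cells with no prediction on a given day are skipped. The function name, record layout, and sample values are hypothetical.

```python
from collections import defaultdict
from statistics import fmean


def county_daily_means(grid_records):
    """Average gridded values to one value per (county, date).

    grid_records: iterable of (county, date, value) tuples, one per grid cell
    and day; value is None when a cell has no prediction for that day.
    """
    cells = defaultdict(list)
    for county, date, value in grid_records:
        if value is not None:  # skip missing cells instead of counting them as 0
            cells[(county, date)].append(value)
    # Unweighted mean over all available cells (an assumption of this sketch).
    return {key: fmean(values) for key, values in cells.items()}


# Hypothetical grid-cell predictions in µg/m3.
records = [
    ("Albany", "2005-07-01", 12.0),
    ("Albany", "2005-07-01", 10.0),
    ("Albany", "2005-07-01", None),
    ("Kings", "2005-07-01", 15.5),
]
means = county_daily_means(records)
```

The same function applies unchanged to the gridded temperature and relative humidity fields, since it only needs a county label, a date, and a value.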

Outcome assessment

Daily total cardiovascular hospital admission counts for each county were obtained from the New York Department of Health’s Statewide Planning and Research Cooperative System (SPARCS). SPARCS is a comprehensive data reporting system that collects information on hospital admissions and emergency department visits within NYS, and includes approximately 98% of all hospitalizations in non-federal acute care facilities, regardless of insurance status [27]. The International Classification of Diseases, 9th revision (ICD-9) was used to classify cardiovascular hospitalizations (ICD-9 codes 390 – 459).
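As an illustration of the case definition, the following sketch flags diagnosis codes that fall in the ICD-9 range 390–459. It is a simplified example under stated assumptions: the text does not say which diagnosis field was used or how codes were formatted in SPARCS, so the function name and the handling of decimal points and E/V codes are hypothetical.

```python
def is_cvd_icd9(code):
    """Return True if an ICD-9 diagnosis code falls in categories 390-459."""
    code = code.strip().upper()
    if not code or code[0] in ("E", "V"):
        # Supplementary E and V codes are outside the numeric 390-459 range.
        return False
    category = code.replace(".", "")[:3]  # first three digits give the category
    return category.isdigit() and 390 <= int(category) <= 459


# Hypothetical codes, with and without the decimal point.
examples = ["410.71", "4280", "459.9", "460", "389.9", "V45.81"]
flags = [is_cvd_icd9(c) for c in examples]
```

Daily county counts would then be the number of flagged admissions per county and admission date.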

Columbia University Institutional Review Board approval was obtained to conduct the analysis. The same board waived the need for informed consent because of the public nature of the data.

Statistical analysis

This study is a daily time series analysis conducted at the county level. We employed overdispersed Poisson regression models to investigate the relationship between total CVD-related hospitalizations and same day PM2.5 exposure, separately for each set of PM2.5 concentrations, using all available data. In the models, we included smooth functions of calendar time to adjust for seasonality and long-term trends (using natural cubic splines with 4 df per year), as well as indicator variables for day of the week. To control for potential confounding by factors varying across counties, we included indicator variables for all counties used in the analyses (fixed effects). We controlled for potential confounding by weather by including smooth functions for daily average temperature (natural spline, 3 df) and relative humidity (natural spline, 3 df) in all models.
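One way to write the model described above is shown below. The notation is ours and is offered only as a sketch: $Y_{c,t}$ is the CVD admission count in county $c$ on day $t$, $\alpha_c$ is the county fixed effect, $\mathrm{ns}(\cdot\,; k)$ is a natural cubic spline with $k$ df, and $\phi$ is the overdispersion parameter. The text does not mention a population offset, so none is included here.

```latex
\begin{align}
\log \mathbb{E}[Y_{c,t}] &= \alpha_c + \beta\,\mathrm{PM}_{2.5,\,c,t}
  + \mathrm{ns}(t;\ 4\ \text{df/year})
  + \mathrm{ns}(\mathrm{Temp}_{c,t};\ 3)
  + \mathrm{ns}(\mathrm{RH}_{c,t};\ 3)
  + \sum_{d=1}^{6} \gamma_d\,\mathbb{1}[\mathrm{DOW}_t = d], \\
\operatorname{Var}(Y_{c,t}) &= \phi\,\mathbb{E}[Y_{c,t}].
\end{align}
```

Under this form, $\beta$ is the change in the log admission rate per 1 µg/m3 of same-day PM2.5, and the model is fit once per PM2.5 product.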

We selected the best-fitting model and the appropriate df for all non-linear terms based on the quasi-Akaike Information Criterion (qAIC). Specifically, we tested calendar time df from 4 to 7 per year, and temperature and relative humidity df from 3 to 6. We also assessed potential nonlinearity in all PM2.5 products using a natural spline with 3 df and selected the model with the best fit using qAIC. For all PM2.5 products, the linear model yielded a better fit, so we only present results from the linear PM2.5 models.
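The text does not state which qAIC formula was used. The sketch below shows one form that is common in time-series environmental epidemiology, qAIC = -2 log L + 2 φ k, where log L is the Poisson log-likelihood at the fitted values, φ is the Pearson-based dispersion estimate, and k is the number of parameters. Function names and the sample values are hypothetical; lower qAIC indicates the preferred model.

```python
import math


def poisson_loglik(y, mu):
    """Poisson log-likelihood of counts y at fitted means mu."""
    return sum(yi * math.log(mi) - mi - math.lgamma(yi + 1) for yi, mi in zip(y, mu))


def pearson_dispersion(y, mu, n_params):
    """Pearson chi-square divided by residual degrees of freedom."""
    chi2 = sum((yi - mi) ** 2 / mi for yi, mi in zip(y, mu))
    return chi2 / (len(y) - n_params)


def qaic(y, mu, n_params):
    """qAIC in the form -2 log L + 2 * phi * k (one common variant, assumed here)."""
    phi = pearson_dispersion(y, mu, n_params)
    return -2.0 * poisson_loglik(y, mu) + 2.0 * phi * n_params


# Hypothetical observed counts and fitted values from a candidate model.
y = [3, 5, 4, 8, 6]
mu = [4.0, 4.0, 5.0, 7.0, 6.0]
score = qaic(y, mu, n_params=1)
```

In practice the fitted values would come from each candidate quasi-Poisson fit (for example, calendar-time df of 4, 5, 6, or 7 per year), and the candidate with the lowest score would be retained.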

We first ran analyses with all available information for each exposure model. To ensure comparability across all PM2.5 products, we then restricted the analysis to only the 18 counties where AQS data were available (“AQS only”), and finally to a dataset only with overlapping observations across all exposure datasets (“complete-case analysis”). We recognize that the results from this last set of analyses may have limited generalizability; however, our aim with this last analysis was to facilitate direct comparison across models. To maximize spatial coverage when comparing products, we performed a sensitivity analysis of the last model on a subset excluding AQS. Additionally, we also performed sensitivity analyses using the average of the same day and the previous day’s PM2.5 (lag 0–1) as the exposure window of interest, as well as adding federal holidays as a potential confounder.
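The lag 0–1 exposure used in the sensitivity analysis is the mean of the same-day and previous-day concentrations. A minimal sketch follows, assuming a list of consecutive daily values for a single county; returning None when either day is missing is our assumption, as the text does not describe how gaps were handled.

```python
def lag01_average(series):
    """Mean of same-day and previous-day values; None where either is missing."""
    out = []
    for i, today in enumerate(series):
        yesterday = series[i - 1] if i > 0 else None  # no previous day for the first entry
        if today is None or yesterday is None:
            out.append(None)
        else:
            out.append((today + yesterday) / 2)
    return out


# Hypothetical daily PM2.5 values (µg/m3) with one missing day.
daily_pm = [10.0, 12.0, None, 8.0, 6.0]
lag01 = lag01_average(daily_pm)
```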

It is likely that different exposure models perform differently in space and time. To assess the impact of varying prediction model performance in space and time, thus, in a secondary analysis we evaluated potential spatio-temporal effect modification using all available data. To assess effect modification by urban density, we obtained data on the urban and rural populations by county in NYS from the 2010 United States Census, and included in each model an interaction term between PM2.5 and number of individuals living in rural areas within each county. For effect modification varying by season, we broke each year up into four 3-month increments to define seasons: spring (March – May), summer (June – August), autumn (September – November), and winter (December – February) and included interaction terms between PM2.5 and season. We assessed statistical significance of the continuous interaction term (rural population) directly in the model, and of the interaction with the categorical season variable using a likelihood ratio test by comparing it to a model without the interaction term with season.
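The season definition above maps each calendar date to one of four categories. A small sketch, with a hypothetical function name:

```python
from datetime import date


def season_of(day):
    """Assign a date to the 3-month seasons defined in the text."""
    month = day.month
    if month in (3, 4, 5):
        return "spring"
    if month in (6, 7, 8):
        return "summer"
    if month in (9, 10, 11):
        return "autumn"
    return "winter"  # December, January, February


labels = [season_of(d) for d in (date(2005, 3, 1), date(2005, 8, 31), date(2010, 12, 31))]
```

Note that December is labeled winter together with the following January and February, so a single winter spans two calendar years. The resulting label is the categorical variable that enters the PM2.5-by-season interaction.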

We present all results in the main analysis as percentage change in CVD admission rates per 10 µg/m3 increase in PM2.5. To facilitate comparison across PM2.5 products, we also present the results of our primary analysis per interquartile range in the Supplement. All statistical analyses were performed using the R Statistical Software, version 3.6.1 (Foundation for Statistical Computing, Vienna, Austria).
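For a log-linear model, a coefficient β per 1 µg/m3 converts to a percentage change per Δ µg/m3 as 100 × (exp(βΔ) − 1). The sketch below applies this to a point estimate and a Wald-type 95% interval; the interval method, function name, and input values are assumptions for illustration, not figures from the study.

```python
import math


def percent_change(beta, se, delta=10.0, z=1.959963984540054):
    """Convert a log-rate coefficient (per 1 µg/m3) to % change per `delta` µg/m3.

    Returns (estimate, lower, upper) using a Wald interval beta +/- z * se.
    """
    def pct(b):
        return 100.0 * (math.exp(b * delta) - 1.0)

    return pct(beta), pct(beta - z * se), pct(beta + z * se)


# Hypothetical coefficient chosen so that the estimate is 0.88% per 10 µg/m3.
beta = math.log(1.0088) / 10
estimate, lower, upper = percent_change(beta, se=0.0001)
```

Passing a product-specific interquartile range as `delta` gives the per-IQR scaling used for the supplementary comparison.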


Figure 1 shows the PM2.5 county-wide exposure estimates by PM2.5 product averaged across the entire study period, and Table 2 shows the descriptive statistics of the variables used for the daily models for all available data. The average daily PM2.5 levels in the AQS, CMAQ, FAQSD, CDC WONDER, and Emory datasets were 10.7, 8.7, 9.8, 9.5, and 8.2 µg/m3, respectively. On average, 6.8 inpatient CVD admissions occurred per day and county. Descriptive statistics by season, quartiles of rural population, and those used for our complete-case analysis can be found in the Supplement (Tables S1–S3).

Fig. 1 Spatial distributions of average PM2.5 exposure estimates by PM2.5 product

Table 2 Descriptive statistics for all 62 counties, unless otherwise noted (2002 – 2012)

Table 3 displays the correlation coefficients across the different PM2.5 products. In general, the AQS, Fused, CDC, and Emory products were all highly correlated with each other, with correlations ranging from 0.83 to 0.92. The CMAQ product, however, was moderately correlated with the other four products, with correlations only ranging from 0.49 to 0.61.

Table 3 Pairwise correlation coefficients across PM2.5 products for all available data

Figure 2 shows the percent change in CVD rate per 10 µg/m3 increase in PM2.5 across the different PM2.5 products and types of analyses. Effect estimates ranged from 0.23% (95%CI: -0.06, 0.53%) to 0.96% (95%CI: 0.70, 1.21%). Corresponding numeric estimates are presented in Table S4, and results showing the percent change in CVD rate per interquartile range increase in PM2.5 are shown in Figure S2. In general, we obtained the highest effect estimates with the tightest confidence intervals when we used CMAQ, while the CDC WONDER data yielded the lowest effect estimates. Our sensitivity analysis comparing effect estimates on overlapping observations across PM2.5 products excluding AQS and using average lag 0–1 PM2.5 exposure yielded very similar results to those obtained in the main analysis (Figures S2 and S3). Similarly, adding holidays as a potential confounder did not change our results (Figure S4).

Fig. 2 Percent increase in daily CVD admissions rates per 10 µg/m3 for all PM2.5 products. “All Data” refers to analyses using all available data for each exposure model from all 62 counties; “AQS Only” refers to analyses using data only in counties where AQS monitors were available (18 of 62 counties); “Complete Case” refers to analyses using data without any missingness across all five PM2.5 products

We detected evidence of effect modification by season for all PM2.5 products. Figure 3 shows the results assessing effect modification by season using all available data for each exposure model. For most products, we generally observed higher effect estimates in the autumn and winter seasons, reaching as high as a 1.87% increase in CVD admissions per 10 µg/m3 increase in PM2.5 in the autumn (for AQS). In comparison, the lowest effect estimates were observed in the spring, some of which were even negative.

Fig. 3 Effect modification by season using all available data

Figure 4 shows the results assessing effect modification by rural population. Results are displayed as the percent increase in CVD admissions for a 10 µg/m3 increase in PM2.5 for each 1000-person increase in the rural population of each county. In all but the CMAQ model, we detected decreases in the effect estimates as rural population increased, i.e., the highest effect estimates were observed in urban areas.

Fig. 4 Spatial effect modification using all available data


Using five different sets of PM2.5 data spanning 2002 to 2012, we investigated the relationship between daily PM2.5 and CVD hospital admissions in NYS and found consistently harmful associations across all exposure metrics, although effect estimates varied by a factor of almost four. In subsequent analyses, we explored potential spatial and temporal effect modification. We found higher effect estimates in the autumn and winter and higher effect estimates in more urban areas. These results were also largely consistent across exposure metrics.

A few papers in the literature evaluate performance across different air pollution models, most of which focus on only one or two models. Bravo et al. compared PM2.5 predictions from a CMAQ simulation to measurements from ground-based monitors and found that CMAQ underestimated PM2.5, with substantial seasonal variation [28]. Lee et al. developed a space–time geostatistical kriging model to predict PM2.5 and compared these predictions to satellite-based PM2.5 estimates directly from AOD retrievals; they found that the kriging model provided more accurate estimates within 100 km of a monitoring station, while satellite estimates were more accurate for locations greater than 100 km from a monitoring station [29]. Jin et al. compared seven publicly available PM2.5 products over NYS from 2002 to 2012, including information from ground-based observations, remote sensing, and chemical transport models, and found that while the products differed in spatial patterns, all showed consistent decreases in PM2.5 over the observed time period [25]. A recent study by Kelly et al. comparing nine PM2.5 exposure models in the United States in 2011 found generally consistent PM2.5 concentrations but more variation at finer scales [30].

To date, most existing air pollution epidemiologic studies that assign exposure based on prediction models use data from a single model [17, 18, 31]. We are only aware of a few existing epidemiologic studies that incorporate more than one exposure model. Weber et al. [32] conducted a case-crossover study over New York City from 2004–2006 looking at the association between short-term exposure to PM2.5 and heart failure, utilizing five different exposure models that combine air pollution monitors, aerosol optical depth (AOD), and CMAQ. They found that effect estimates across the models were similar. McGuinn et al. [33] investigated the association between long-term exposure to PM2.5 and cardiovascular disease using different exposure assessment methods. Utilizing station monitoring data, two CMAQ models, and two satellite-based models from 2002 to 2009 for a cohort of patients residing in North Carolina who had undergone a cardiac catheterization, they found nearly equivalent results for all exposure assessment methods. Sellier et al. [34] used four different exposure models to estimate nitrogen dioxide (NO2) and particulate matter (PM) levels in two French metropolitan areas and explored their association with infant birthweight in a pregnancy cohort. They found consistent estimated health effects for the PM products, but less so for the NO2 products. Wang et al. [35] compared exposure estimates and associations for NO2 and various PM sizes based on predictions from a land-use regression and a dispersion model, and found that health effect estimates did not differ significantly. More recently, Gariazzo et al. [36] investigated the effects of long-term exposure to PM and NO2 using four exposure models in a large administrative cohort, and found consistent health effect estimates across the different exposure assessment models. Our work expands on this body of literature and assesses variability in acute CVD effect estimates from a time-series analysis using different existing exposure models.

In our main analysis, we found harmful associations across all models, although the confidence intervals of one model crossed the null. However, effect estimates fluctuated by a factor of four across the different exposure models. These differences are likely due to varying degrees of measurement error across the different models, as non-differential exposure measurement error in time series studies biases effect estimates towards the null [13]. The predictive accuracy of all these models likely varies in space and time in different ways, which could best explain the differences in our results. In the most comparable analyses (i.e., “AQS only” and “complete-case analyses”), the confidence intervals for all effect estimates widely overlap. While the results from the “complete-case analysis” may not be generalizable, because this analysis focuses on predominantly urban areas, it is the only analysis that allows direct comparison of the effect estimates across the five PM2.5 products. Therefore, we cannot conclude that these models truly yield different estimates.

In our secondary analysis of seasonal effect modification, we found higher effect estimates in the autumn and winter seasons. With the available information that we have, it is not possible to attribute this finding to varying impacts of exposure measurement error across seasons or to a biological mechanism. One possible explanation for these findings could be that all prediction models perform better during the autumn and winter months [25]; smaller amounts of exposure measurement error would result in de-attenuation of effect estimates. A second explanation could be that in NYS, PM2.5 effects on CVD admissions are stronger during the autumn and winter months. Existing literature has had mixed findings regarding this topic. Studies by Bell et al. and Hsu et al. that investigated the effects of PM2.5 on CVD morbidity found the highest effect estimates in the winter [37,38,39]. On the other hand, studies by Peng et al. and Dai et al., who investigated the effects of PM2.5 on mortality, found higher effect estimates in the spring and summer [40, 41]. In our study, the higher effect estimates may indicate a higher contribution from more localized particles, such as traffic, during the colder months. Traffic has been consistently identified as a particularly toxic source of PM2.5, full of combustion products such as black carbon and heavy metals [42,43,44]. Since the mixing height in the winter is lower, this will likely result in higher near-surface PM2.5 concentrations even if emissions remain the same, as they dilute within a smaller near-surface volume.

In comparison to the results for seasonal effect modification, the results for effect modification by rural population were much more consistent: effect estimates were highest in urbanized areas, and decreased as rural population increased. The decrease is least pronounced in the CMAQ model, where the interaction term was not statistically significant, and most pronounced in the CDC WONDER model. Again, this finding could be due to two possible explanations. First, it is possible that all models except CMAQ have higher predictive accuracy in more densely populated areas. This would not be surprising, as AQS monitors are located near urban centers or more densely populated areas and all models except CMAQ were trained on or fused with PM2.5 concentrations measured at monitoring stations [25]. Given that our results for spatial effect modification were very similar across all models except CMAQ, we would expect these models to perform similarly in NYS counties with limited or no monitoring. Second, our findings of effect modification by urbanicity may indicate that particle composition in urban areas may be more toxic than that of rural areas, which is consistent with our interpretation of seasonality as discussed above. Similar findings have been reported in the literature [45, 46], also likely because the distribution of potential effect modifiers, other than just PM2.5 composition, is different in urban versus rural areas.

Our study has several limitations. First and foremost, the results from our analyses may have some comparability issues. As mentioned previously, only 18 of 62 NYS counties included monitors reporting to the AQS database. Consequently, the results using all data available from each product are not directly comparable to each other. We ran additional analyses restricting to counties with monitoring sites and to no missingness across all exposure models. While these results are more directly comparable to each other, their overall generalizability is lower, as they reflect predominantly urban areas. Furthermore, in the analyses with restricted observations, the comparability problem still remains: even by looking at only counties with AQS monitors, oftentimes these counties may only include a single monitor, and its measurements are then uniformly assigned as the exposure for the entire county. In comparison, this is very different from the PM2.5 output we obtain from any of the exposure models that provide much finer spatial resolution.

Second, our analysis is conducted at the county level, which does not take full advantage of the fine-scale spatial resolution provided by the modeled exposure data. It is possible that the spatial aggregation may have introduced additional biases in the differences among the four exposure models used in our analyses, as the differences observed across the exposure models could be due to two sources: the true differences in predictions generated by each model at a specific location and any differences introduced as a result of the aggregation. However, our goal was to evaluate differences in estimated effects when using different exposure models in time series designs, which use aggregated health and exposure data. It is possible that any observed differences in effect estimates in our study would be different in other study designs, e.g., using individual-level data.

Third, we do not explore potential effects of lagged exposure to PM2.5 on CVD admissions in this study. A number of previous studies have found evidence for lagged effects for CVD related outcomes, mainly for exposure on the same and previous day of the CVD event (lag 0–1) and that of the same and previous 3 days combined (lag 0–3) [4, 47,48,49]. However, the goal of our analysis was to compare different exposure metrics and not to identify critical exposure windows, which we believe is beyond the scope of this study. Nonetheless, to facilitate comparison to other studies of PM2.5 and CVD outcomes, we include the results of lag 0–1 exposure for all PM2.5 products in the Supplement (Figure S3).

Lastly, none of the modeled PM2.5 data we used were perfect: each was built and optimized for different reasons and, thus, could overpredict in certain areas and underpredict in others. For example, CMAQ was originally designed to address regional air pollution problems across the United States, while the Emory product focuses on providing accurate PM2.5 predictions over NYS only. All PM2.5 products in this analysis (with the exception of CMAQ) utilized AQS monitors as part of the modeling process, which means that these PM2.5 products are likely to provide more accurate estimates near monitoring sites, i.e., mainly in urban areas. Furthermore, information on the predictive accuracy of these models is not always available, making it difficult to conduct a formal comparison of the different PM2.5 products beyond a qualitative assessment. Our previous work has attempted to evaluate these different PM2.5 products using three major criteria: resolution, availability, and accuracy. We found that no single product stood out for all three criteria, and the choice of PM2.5 product for the purposes of epidemiologic studies should depend on the research question of interest [25].

Nonetheless, our aim was not to identify the “best product” out of the ones examined. All models examined have nationwide coverage and our NYS results may not generalize to other states. Moreover, the choice of model should primarily depend on the study design and whether spatial vs. temporal contrasts are more important for each specific design and research question. Rather, we aimed to characterize the potential variability in effect estimates in time-series analyses, using the NYS daily PM2.5 – CVD association as a case study. Based on our results, we are comfortable concluding that while the point effect estimates in our main analysis differ by as much as a factor of four, their corresponding confidence intervals were largely overlapping, and that different PM2.5 products do indeed reach the same conclusion. We also recognize that current health impact assessments primarily use only point effect estimates from epidemiologic studies as inputs into calculations, and that the resulting conclusions of such assessments may differ drastically depending on the choice of the exposure–response function, which in turn depends on the choice of exposure model. Our findings highlight the importance of incorporating different sources of uncertainty in the exposure–response curves used in health impact assessments, including uncertainty due to the choice of exposure model.

Our study has numerous strengths. We were able to investigate the sensitivity of estimated PM2.5 – CVD effects to exposure model choice in short-term epidemiologic studies. Given the increasing use of modeled air pollution data in health studies, our work is critical as it provides an example of how much estimated effects may vary across exposure models. In previous work, we evaluated how multiple PM2.5 products perform differently in a health impacts assessment [25]; this current work takes a step further by evaluating the impact of multiple PM2.5 products on the effect estimates that are used in such assessments. Finally, our findings send a strong public health message: increased PM2.5 exposure results in an increase in CVD hospitalizations, regardless of the choice of exposure model.


In conclusion, we investigated the relationship between short-term PM2.5 exposure and cardiovascular admissions in NYS from 2002 – 2012 using five different PM2.5 products and found consistent, harmful associations regardless of exposure metric. However, uncertainty related to exposure model selection is not captured in the individual estimated effects. Methods are needed for improved exposure assessment that minimize error and include uncertainty characterization and propagation into the health models [50].

Availability of data and materials

The exposure data that support the findings of this study are available from the corresponding author on reasonable request. The health data that support the findings of this study are available from the New York State Department of Health but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the New York State Department of Health.


  1. Dockery DW, Pope CA, Xu X, Spengler JD, Ware JH, Fay ME, et al. An association between air pollution and mortality in six U.S. cities. N Engl J Med. 1993;329:1753–9. Available from:

    Article  CAS  Google Scholar 

  2. Jerrett M, Burnett RT, Pope CA, Ito K, Thurston G, Krewski D, et al. Long-term ozone exposure and mortality. N Engl J Med. 2009;360:1085–95. Available from:

    Article  CAS  Google Scholar 

  3. Burnett RT, Stieb D, Brook JR, Cakmak S, Dales R, Raizenne M, et al. Associations between short-term changes in nitrogen dioxide and mortality in Canadian cities. Arch Environ Health. 2004;59:228–36.

    Article  CAS  Google Scholar 

  4. Chen C, Zhu P, Lan L, Zhou L, Liu R, Sun Q, et al. Short-term exposures to PM 2.5 and cause-specific mortality of cardiovascular health in China. Environ Res. 2018;161:188–94. Elsevier Inc. Available from:

    Article  CAS  Google Scholar 

  5. He MZ, Kinney PL, Li T, Chen C, Sun Q, Ban J, et al. Short- and intermediate-term exposure to NO2 and mortality: a multi-county analysis in China. Environ Pollut. 2020;261:114165. Elsevier Ltd.

    Article  CAS  Google Scholar 

  6. Ying Z, Xu X, Bai Y, Zhong J, Chen M, Liang Y, et al. Long-term exposure to concentrated ambient PM2.5 increases mouse blood pressure through abnormal activation of the sympathetic nervous system: a role for hypothalamic inflammation. Environ Health Perspect. 2014;122:79–86.

    Article  Google Scholar 

  7. Kioumourtzoglou M-A, Schwartz J, James P, Dominici F, Zanobetti A. PM2.5 and mortality in 207 US cities. Epidemiology. 2015;27:1.

  8. Dominici F, Peng RD, Bell ML, Pham L, McDermott A, Zeger SL, et al. Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. J Am Med Assoc. 2006;295(10):1127–34.

  9. Sørensen M, Daneshvar B, Hansen M, Dragsted LO, Hertel O, Knudsen L, et al. Personal PM2.5 exposure and markers of oxidative stress in blood. Environ Health Perspect. 2003;111:161–5.

  10. U.S. Environmental Protection Agency (EPA). Air Quality System (AQS) | US EPA. 2020. Cited 2020 Apr 27.

  11. Bell ML, O’Neill MS, Cifuentes LA, Braga ALF, Green C, Nweke A, et al. Challenges and recommendations for the study of socioeconomic factors and air pollution health effects. Environ Sci Policy. 2005;8(5):525–33.

  12. Sarnat SE, Klein M, Sarnat JA, Flanders WD, Waller LA, Mulholland JA, et al. An examination of exposure measurement error from air pollutant spatial variability in time-series studies. J Expo Sci Environ Epidemiol. 2010;20:135–46. Nature Publishing Group.

  13. Zeger SL, Thomas D, Dominici F, Samet JM, Schwartz J, Dockery D, et al. Exposure measurement error in time-series studies of air pollution: concepts and consequences. Environ Health Perspect. 2000;108:419–26.

  14. Gilbert NL, Goldberg MS, Beckerman B, Brook JR, Jerrett M. Assessing spatial variability of ambient nitrogen dioxide in Montréal, Canada, with a land-use regression model. J Air Waste Manag Assoc. 2005;55:1059–63.

  15. Hoek G, Beelen R, Kos G, Dijkema M, van der Zee SC, Fischer PH, et al. Land use regression model for ultrafine particles in Amsterdam. Environ Sci Technol. 2011;45:622–8.

  16. Yanosky JD, Paciorek CJ, Laden F, Hart JE, Puett RC, Liao D, et al. Spatio-temporal modeling of particulate air pollution in the conterminous United States using geographic and meteorological predictors. Environ Health. 2014;13:1–15.

  17. Turner MC, Jerrett M, Pope CA, Krewski D, Gapstur SM, Diver WR, et al. Long-term ozone exposure and mortality in a large prospective study. Am J Respir Crit Care Med. 2016;193:1134–42.

  18. Di Q, Wang Y, Zanobetti A, Wang Y, Koutrakis P, Choirat C, et al. Air pollution and mortality in the Medicare population. N Engl J Med. 2017;376:2513–22.

  19. Bi J, Belle JH, Wang Y, Lyapustin AI, Wildani A, Liu Y. Impacts of snow and cloud covers on satellite-derived PM2.5 levels. Remote Sens Environ. 2018;221:665–74. Elsevier.

  20. Di Q, Amini H, Shi L, Kloog I, Silvern R, Kelly J, et al. An ensemble-based model of PM2.5 concentration across the contiguous United States with high spatiotemporal resolution. Environ Int. 2019;130:104909. Elsevier.

  21. Berrocal VJ, Guan Y, Muyskens A, Wang H, Reich BJ, Mulholland JA, et al. A comparison of statistical and machine learning methods for creating national daily maps of ambient PM2.5 concentration. Atmos Environ. 2020;222:117130. Elsevier Ltd.

  22. Yoo EH, Pu Q, Eum Y, Jiang X. The impact of individual mobility on long-term exposure to ambient PM2.5: assessing effect modification by travel patterns and spatial variability of PM2.5. Int J Environ Res Public Health. 2021;18:1–16.

  23. U.S. Environmental Protection Agency (EPA). RSIG-related downloadable data files. 2019. Cited 2020 Apr 23.

  24. Al-Hamdan MZ, Crosson WL, Limaye AS, Rickman DL, Quattrochi DA, Estes MG, et al. Methods for characterizing fine particulate matter using ground observations and remotely sensed data: potential use for environmental public health surveillance. J Air Waste Manag Assoc. 2009;59:865–81.

  25. Jin X, Fiore AM, Civerolo K, Bi J, Liu Y, Van Donkelaar A, et al. Comparison of multiple PM2.5 exposure products for estimating health benefits of emission controls over New York State, USA. Environ Res Lett. 2019;14(8):084023. IOP Publishing.

  26. Mitchell KE, Lohmann D, Houser PR, Wood EF, Schaake JC, Robock A, et al. The multi-institution North American Land Data Assimilation System (NLDAS): utilizing multiple GCIP products and partners in a continental distributed hydrological modeling system. J Geophys Res D Atmos. 2004;109:1–32.

  27. New York State Department of Health. Statewide Planning and Research Cooperative System. 2020. Cited 2020 Apr 27.

  28. Bravo MA, Fuentes M, Zhang Y, Burr MJ, Bell ML. Comparison of exposure estimation methods for air pollutants: ambient monitoring data and regional air quality simulation. Environ Res. 2012;116:1–10.

  29. Lee SJ, Serre ML, van Donkelaar A, Martin RV, Burnett RT, Jerrett M. Comparison of geostatistical interpolation and remote sensing techniques for estimating long-term exposure to ambient PM2.5 concentrations across the continental United States. Environ Health Perspect. 2012;120(12):1727–32.

  30. Kelly JT, Jang C, Timin B, Di Q, Schwartz J, Liu Y, et al. Examining PM2.5 concentrations and exposure using multiple models. Environ Res. 2020;196:110432. Elsevier Inc.

  31. Kloog I, Ridgway B, Koutrakis P, Coull BA, Schwartz JD. Long- and short-term exposure to PM2.5 and mortality: using novel exposure models. Epidemiology. 2013;24:555–61.

  32. Weber SA, Insaf TZ, Hall ES, Talbot TO, Huff AK. Assessing the impact of fine particulate matter (PM2.5) on respiratory-cardiovascular chronic diseases in the New York City Metropolitan area using Hierarchical Bayesian Model estimates. Environ Res. 2016;151:399–409. Elsevier.

  33. McGuinn LA, Ward-Caviness C, Neas LM, Schneider A, Di Q, Chudnovsky A, et al. Fine particulate matter and cardiovascular disease: comparison of assessment methods for long-term exposure. Environ Res. 2017;159:16–23.

  34. Sellier Y, Galineau J, Hulin A, Caini F, Marquis N, Navel V, et al. Health effects of ambient air pollution: do different methods for estimating exposure lead to different results? Environ Int. 2014;66:165–73. Elsevier Ltd.

  35. Wang M, Gehring U, Hoek G, Keuken M, Jonkers S, Beelen R, et al. Air pollution and lung function in Dutch children: a comparison of exposure estimates and associations based on land use regression and dispersion exposure modeling approaches. Environ Health Perspect. 2015;123:847–51.

  36. Gariazzo C, Carlino G, Silibello C, Tinarelli G, Renzi M, Finardi S, et al. Impact of different exposure models and spatial resolution on the long-term effects of air pollution. Environ Res. 2021;192:110351.

  37. Bell ML, Ebisu K, Peng RD, Walker J, Samet JM, Zeger SL, et al. Seasonal and regional short-term effects of fine particles on hospital admissions in 202 US counties, 1999–2005. Am J Epidemiol. 2008;168:1301–10.

  38. Bell ML, Son JY, Peng RD, Wang Y, Dominici F. Brief report: ambient PM2.5 and risk of hospital admissions: do risks differ for men and women? Epidemiology. 2015;26:575–9.

  39. Hsu WH, Hwang SA, Kinney PL, Lin S. Seasonal and temperature modifications of the association between fine particulate air pollution and cardiovascular hospitalization in New York state. Sci Total Environ. 2017;578:626–32. Elsevier B.V.

  40. Peng RD, Dominici F, Pastor-Barriuso R, Zeger SL, Samet JM. Seasonal analyses of air pollution and mortality in 100 US cities. Am J Epidemiol. 2005;161:585–94.

  41. Dai L, Zanobetti A, Koutrakis P, Schwartz JD. Associations of fine particulate matter species with mortality in the United States: a multicity time-series analysis. Environ Health Perspect. 2014;122(8):837–42.

  42. Suglia SF, Gryparis A, Schwartz J, Wright RJ. Association between traffic-related black carbon exposure and lung function among urban women. Environ Health Perspect. 2008;116:1333–7.

  43. U.S. Environmental Protection Agency (EPA). Black carbon. 2016. Cited 2020 Mar 23.

  44. Johansson C, Norman M, Burman L. Road traffic emission factors for heavy metals. Atmos Environ. 2009;43:4681–8. Elsevier Ltd.

  45. Wang B, Eum KD, Kazemiparkouhi F, Li C, Manjourides J, Pavlu V, et al. The impact of long-term PM2.5 exposure on specific causes of death: Exposure-response curves and effect modification among 53 million U.S. Medicare beneficiaries. Environ Health. 2020;19:1–12. Environmental Health.

  46. Kioumourtzoglou MA, Coull BA, Dominici F, Koutrakis P, Schwartz J, Suh H. The impact of source contribution uncertainty on the effects of source-specific PM2.5 on hospital admissions: a case study in Boston, MA. J Expo Sci Environ Epidemiol. 2014;24:365–71. Nature Publishing Group.

  47. Dominici F, Peng R, Bell ML, Pham L, McDermott A. Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. JAMA. 2006;295:1127–34.

  48. Wei Y, Wang Y, Di Q, Choirat C, Wang Y, Koutrakis P, et al. Short term exposure to fine particulate matter and hospital admission risks and costs in the Medicare population: time stratified, case crossover study. BMJ. 2019;367:l6258.

  49. Milojevic A, Wilkinson P, Armstrong B, Bhaskaran K, Smeeth L, Hajat S. Short-term effects of air pollution on a range of cardiovascular events in England and Wales: case-crossover analysis of the MINAP database, hospital admissions and mortality. Heart. 2014;100:1093–8.

  50. Liu JZ, Paisley J, Kioumourtzoglou MA, Coull B. Accurate uncertainty estimation and decomposition in ensemble learning. 2019. p. 1–12.


Not applicable.


This study was partially supported by the National Institute of Environmental Health Sciences (NIEHS) Individual Fellowship Grant (F31 ES029372), Institutional Research Training Grant (T32 ES023770), Research Project Grant (R01 ES030616) and Center Core Grant (P30 ES009089), the New York State Energy Research and Development Authority (Grant number: 91268), NASA Health and Air Quality Applied Sciences Team (HAQAST, Grant NNX16AQ20G), and NASA Applied Sciences Program (Grant NNX16AQ28G), the Columbia Global Policy Initiative Faculty Grant, and the Columbia Weatherhead East Asian Institute Sasakawa Young Leaders Fellowship Fund.

Author information

Authors and Affiliations



M.Z.H. designed and initiated the study and performed the statistical analyses. M-A.K., P.L.K., and A.M.F. critically revised the manuscript for important intellectual content. A.M.F., Y.L., and T.Z.I. provided the data for analysis. All authors contributed to the write-up of the manuscript and approved its final version prior to submission.

Authors’ information

This manuscript is primarily written from the standpoint of air pollution epidemiologists and supplemented by feedback and expertise from air pollution modelers and atmospheric scientists. M.Z.H. (Postdoctoral Fellow, Icahn School of Medicine at Mount Sinai), M-A.K. (Assistant Professor, Columbia University Mailman School of Public Health), P.L.K. (Beverly Brown Professor Urban Health, Boston University School of Public Health), and T.Z.I. (Assistant Professor, University at Albany School of Public Health) are air pollution epidemiologists. A.M.F. (Professor, Columbia University and Lamont Doherty Earth Observatory), X.J. (Postdoctoral Fellow, University of California Berkeley), J.B. (Postdoctoral Fellow, University of Washington School of Public Health), and Y.L. (Gangarosa Distinguished Professor, Emory University Rollins School of Public Health) are air pollution modelers and atmospheric scientists.

Corresponding author

Correspondence to Mike Z. He.

Ethics declarations

Ethics approval and consent to participate

This study was reviewed by the New York State Department of Health Institutional Review Board and approved for exemption (Board Ref #16–049).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Descriptive statistics by season. Table S2. Descriptive statistics by quartiles of rural population. Table S3. Descriptive statistics for complete-case analysis. Table S4. Percent increase in daily CVD admissions and 95% confidence intervals (CI) per 10 µg/m3 for all PM2.5 products. Table S5. Quasi-Akaike’s Information Criterion (qAIC) for select seasonal and long-term trends. Figure S1. Percent increase in daily CVD admissions rates per interquartile range for all PM2.5 products. Figure S2. Percent increase in daily CVD admissions rates for subset excluding AQS. Figure S3. Percent increase in lag 01 CVD admissions rates per 10 µg/m3 for all PM2.5 products. Figure S4. Sensitivity analysis adding bank holiday as a potential confounder. Figure S5. Time series of average PM2.5 exposure estimates by PM2.5 product.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

He, M.Z., Do, V., Liu, S. et al. Short-term PM2.5 and cardiovascular admissions in NY State: assessing sensitivity to exposure model choice. Environ Health 20, 93 (2021).
