Skip to main content

Quantifying the short-term effects of air pollution on health in the presence of exposure measurement error: a simulation study of multi-pollutant model results



Most epidemiological studies estimate associations without considering exposure measurement error. While some studies have estimated the impact of error in single-exposure models we aimed to quantify the effect of measurement error in multi-exposure models, specifically in time-series analysis of PM2.5, NO2, and mortality using simulations, under various plausible scenarios for exposure errors. Measurement error in multi-exposure models can lead to effect transfer where the effect estimate is overestimated for the pollutant estimated with more error to the one estimated with less error. This complicates interpretation of the independent effects of different pollutants and thus the relative importance of reducing their concentrations in air pollution policy.


Measurement error was defined as the difference between ambient concentrations and personal exposure from outdoor sources. Simulation inputs for error magnitude and variability were informed by the literature. Error-free exposures with their consequent health outcome and error-prone exposures of various error types (classical/Berkson) were generated. Bias was quantified as the relative difference in effect estimates of the error-free and error-prone exposures.


Mortality effect estimates were generally underestimated with greater bias observed when low ratios of the true exposure variance over the error variance were assumed (27.4% underestimation for NO2). Higher ratios resulted in smaller, but still substantial bias (up to 19% for both pollutants). Effect transfer was observed indicating that less precise measurements for one pollutant (NO2) yield more bias, while the co-pollutant (PM2.5) associations were found closer to the true. Interestingly, the sum of single-pollutant model effect estimates was found closer to the summed true associations than those from multi-pollutant models, due to cancelling out of confounding and measurement error bias.


Our simulation study indicated an underestimation of true independent health effects of multiple exposures due to measurement error. Using error parameter information in future epidemiological studies should provide more accurate concentration-response functions.

Peer Review reports


Air pollution is the major environmental factor affecting human health [1, 2]. Many publications report associations of morbidity and mortality outcomes with exposure to air pollutants, such as PM2.5, NO2 and O3 [3, 4]. It is important to assess the independent effects of each pollutant adjusting for potential confounding by other pollutants in the mixture, as policies to reduce emissions differ by targeted pollutant. Multi-pollutant models are the most commonly used method, and, if correctly specified, provide estimates of the independent effects of each exposure [5,6,7].

Most studies addressing the associations between pollutants and health, or performing health impact assessments (HIA), do not account for exposure measurement error (ME), although it is a well-known problem [8,9,10,11]. ME may lead to biased concentration-response functions (CRFs) for which the magnitude and type of bias is not assessed or corrected. While the methodology to correct for ME is expanding, there is a gap between theory and practice [12, 13]. To evaluate ME we must define the “gold standard” against which “proxy” measures are compared. For exposure to ambient concentrations, the most appropriate gold standard is individual exposure to ambient sources of the pollutant of interest. However, most epidemiological studies investigating ME in air pollution exposure, measure total personal exposure, to which indoor and personally-generated sources contribute substantially [14].

An appropriate framework describing the types of error for exposure variables has not been discussed thoroughly in the literature. Zeger et al. (2000) mention that “classical and Berkson models represent two extremes of a continuum”, so the proper model combines elements of each type [9]. Few studies have examined the impact of mixture error [15,16,17]. Previous studies examining the impact of ME in regression models either only used single-pollutant models or addressed only spatial heterogeneity of the pollutants and their errors or have made specific assumptions for their error definition using empirical data from certain locations which might not be generalisable [18,19,20,21,22,23]. Multi-pollutant models, in addition, provide unstable estimates because of the complex relationships between pollutants, both in space and time, and the differing degree and structure of their errors - especially the variability and correlation between errors.

In our study, we focus on multi-pollutant models using a mixture error model of classical and Berkson components based on the error decomposition of Zeger and colleagues (2000). The percentages of each type depend on the pollutant under investigation, i.e. fine particulate matter, PM2.5, or nitrogen dioxide, NO2. We used simulated data to generate a wide range of plausible scenarios of the pollutant errors under a mixture error model, and to quantify the real impact of ME on the multi-pollutant model effect estimates, specifically in time-series analysis.


Simulation set-up

We previously conducted a systematic review on the differences between ambient concentrations and personal exposures from outdoor origins to inform our simulation inputs and quantify the magnitude and variability of the pollutant errors [24]. In particular, we created a simulated daily time-series for error-free (A) and error-prone (C) exposures and a health outcome over a period of 4 years, under various scenarios for factors that drive ME bias (Fig. S1, Supplementary material). Our focus was only on the quantification of exposure measurement error bias, so we did not include some features of time-series that may introduce other forms of bias and need to be taken into account in real data analysis, such as autocorrelation, trends or confounders measured with error.

Each scenario was created as a combination of key parameters driving ME bias, i.e. the correlation between the exposures, the error variability (\( SD\left({\delta}_{P{M}_{2.5}}\right) \), \( SD\left({\delta}_{NO_2}\right) \)) and the true exposure variance/error variance ratio (\( SD\left({A}_{P{M}_{2.5}}\right)/ SD\left({\delta}_{P{M}_{2.5}}\right), SD\left({A}_{NO_2}\right)/ SD\left({\delta}_{NO_2}\right) \)) [23]. The latter was informed according to data from three different areas, for which we had adequate information [24]. In total, 144 scenarios were investigated, and their inputs are summarised in Figure 1. Due to space constraints, we expanded only one branch in the tree plot (in bold); this was assumed to be the core scenario compared with which all sensitivity analyses were performed. For each scenario, 1000 simulated datasets were generated.

Fig. 1
figure 1

Diagram showing the construction of the 144 scenarios assumed in the analysis

Briefly, we assumed:

  • Variance ratios: We created scenarios separately for Europe, Eastern and Western North America which correspond to different ratios of the true exposure variance over the error variance. Differences in the ratios between these regions can be attributed to various factors, such as the use of Diesel vehicles, air conditioning, and legislation [25].

  • Correlation between the exposures: To the best of our knowledge, no data for the correlation between personal exposures from outdoor sources to PM2.5 and NO2 are reported in the literature. Thus, we assumed that it lies between published correlation coefficients of total personal exposures and correlations of the ambient measurements for the same pollutants [26,27,28]. A range of values was hypothesized to cover various plausible scenarios.

  • Correlations between the errors: Because we did not have real data to estimate the relationship between these errors, an analytical approach for their covariance was applied using the pairwise associations of personal and ambient concentrations for NO2 and PM2.5 (eq. 2).

  • Error structures: Little is known about the distributions and types of exposure ME. We assumed that the errors are additive on the logarithmic scale [29, 30]. Moreover, we assumed that a mixture of classical and Berkson error best describes the type of exposure misclassification [9]. However, scenarios of entirely classical and entirely Berkson error, alternative figures for the classical-Berkson ratio and multiplicative error were also assessed.

Input data

The key input parameters are summarised in Table 1. We assumed different “true” exposures and error variability for each study area which resulted in different exposure variance over error variance ratios. Three correlation coefficients between the exposures (low = 0.25/moderate = 0.5/high = 0.75) and four values for the error variability for each pollutant were hypothesized. For example, for Europe (core scenario) we hypothesized moderate error variability (5.2 μg/m3 for PM2.5 and 6.2 ppb for NO2), very low (0.5 μg/m3 and 0.6 ppb respectively (0.1 × moderate)), low (2.6 μg/m3 and 3.1 ppb respectively (0.5 × moderate)), and high (6.8 μg/m3 and 8.1 ppb respectively (1.3 × moderate)). The 144 scenarios tested derived from all the possible combinations of these parameters.

Table 1 Simulation inputs for the assumed “true” exposures and the error variability of the “error-prone” exposures

Generating "error-free" or "true" exposures (A):

We used a bivariate log-normal distribution to generate 1461 daily measurements of the “error-free” exposures for each scenario s, as follows:

$$ Lognormal\left({\boldsymbol{\mu}}_{\boldsymbol{s}},{\boldsymbol{\varSigma}}_{\boldsymbol{s}}\right),{\boldsymbol{\mu}}_{\boldsymbol{s}}=\left[\begin{array}{c}{\mu}_1\\ {}{\mu}_2\end{array}\right],{\boldsymbol{\varSigma}}_{\boldsymbol{s}}=\left[\begin{array}{cc}{\sigma}_1^2& {\sigma}_{1,2}\\ {}{\sigma}_{2,1}& {\sigma}_2^2\end{array}\right] $$

where μs is the vector of pollutant means, Σs the variance-covariance matrix and 1, 2 = NO2 and PM2.5, respectively. For example, for the European area, we had:

$$ {\boldsymbol{\mu}}_{\boldsymbol{eu}}=\left[\begin{array}{c}21.6\\ {}21.1\end{array}\right],{\boldsymbol{\varSigma}}_{\boldsymbol{eu}}=\left[\begin{array}{cc}{9.0}^2& 27.6\\ {}27.6& {10.9}^2\end{array}\right]. $$

Generating “error-prone” or “apparent” exposures (C):

We considered a mixture of classical and Berkson ME additive on the logarithmic scale as the most appropriate model because the sources of error can originate from both classical error (omitting individuals’ mobility, the infiltration efficiency of the buildings, and instrument errors) and Berkson error from unincorporated spatial heterogeneity [9]. Various scenarios for the errors allowed evaluation of how the error type affects the health effect estimates, especially in two-pollutant models where the magnitude and direction of bias cannot easily be predicted in typical epidemiologic studies without specifically investigating ME.

Classical error

Apparent exposures of classical error were generated simply by adding random error to the assumed true exposures. For this type of error, we have:

$$ {C}_i^t={A}_i^t+{\delta}_i^t $$

where i is NO2 or PM2.5, t is day, t = 1,2, …, 1461, C and A are as defined above, and the errors, δ, of the pollutants were created from a bivariate normal distribution:

$$ {\delta}_i^t\mid {A}_i^t\sim N\left(\mu, \varSigma \right),\mu =\left[\begin{array}{c}0\\ {}0\end{array}\right],\varSigma =\left[\begin{array}{cc} Var\left({\delta}_{N{O}_2}^t\right)& Cov\left({\delta}_{N{O}_2}^t,{\delta}_{P{M}_{2.5}}^t\right)\\ {} Cov\left({\delta}_{N{O}_2}^t,{\delta}_{P{M}_{2.5}}^t\right)& Var\left({\delta}_{P{M}_{2.5}}^t\right)\end{array}\right] $$

The elements in the diagonal of Σ are the error variances as estimated from [24]. The covariance was estimated analytically:

$$ Cov\left({\delta}_{N{O}_2}^t,{\delta}_{P{M}_{2.5}}^t\right)= Cov\left({C}_{N{O}_2}^t-{A}_{N{O}_2}^t,{C}_{P{M}_{2.5}}^t-{A}_{P{M}_{2.5}}^t\right)= Cov\left({C}_{N{O}_2}^t,{C}_{P{M}_{2.5}}^t\right)- Cov\left({C}_{N{O}_2}^t,{A}_{P{M}_{2.5}}^t\right)- Cov\left({A}_{N{O}_2}^t,{C}_{P{M}_{2.5}}^t\right)+ Cov\left({A}_{N{O}_2}^t,{A}_{P{M}_{2.5}}^t\right) $$

There was no information in the literature for the associations between C and A, so we used covariances from studies of total personal exposure instead [26,27,28].

Berkson error

In general, a Berkson error model can be described as follows:

$$ {A}_i^t={C}_i^t+{\delta}_i^t, $$

We used a formula to generate Berkson error proposed by Carroll et al. (2006) [11]. Briefly, if C = A + δ (i.e. classical model) we know that the best linear predictor of A given C is (1-λ)∙E(A) + λ∙C, where \( \lambda =\frac{\sigma_C^2}{\sigma_C^2+{\sigma}_{\delta}^2} \). Then, the following equation (4) results in the generation of a Berksonian-type error-prone exposure. For simplicity, we omitted the indicators i, t:

$$ C={\mu}_A+\left(A-{\mu}_A\right)\bullet \frac{\sigma_C^2}{\sigma_C^2+{\sigma}_{\delta}^2}+{\delta}^{\ast } $$

where \( {\delta}^{\ast}\sim N\left(0,\frac{\sigma_C^2\bullet {\sigma}_{\delta}^2}{\sigma_C^2+{\sigma}_{\delta}^2}\right) \) and \( {\sigma}_{\delta}^2 \) is the error variance (Supplementary material).

Mixture error

Using the error decomposition from Zeger and colleagues [9], with data from other studies [31, 32], we quantified the error-prone variables of the mixture type. We estimated the error as 43% classical and 57% Berkson for PM2.5 and 33% classical and 67% Berkson for NO2 in the main analysis, while in sensitivity analyses we assumed increased percentages of classical error for both pollutants, i.e. either (55,45%) for PM2.5 and (45,55%) for NO2 or (70,30%) for PM2.5 and (60,40%) for NO2. As the hypothesized true exposures are log-normally distributed, we assumed additive error on the log-scale for the ME models based on previous studies, but also applied a multiplicative approach [29, 30].

The mixture model consists of the variables described in eqs. 1 and 2, along with a latent intermediate variable L, between A and C, that allows for mixtures of Berkson and classical error as described elsewhere [15, 33]. Briefly, the model is:

$$ A=L+{\delta}_b $$
$$ C=L+{\delta}_c $$

where δb, δc denote Berkson-type and classical-type error (Supplementary material).

Generating all-cause mortality

For each day t, the number of deaths Yt was assumed to follow a Poisson distribution with overdispersion φ. The mean μt of the Poisson distribution was created to be dependent only on the simulated “true” concentrations of both pollutants on every particular day as follows:

$$ \log \left({\mu}_t\right)={\beta}_0+{\beta}_1\bullet {A}_{N{O}_2}^t+{\beta}_2\bullet {A}_{P{M}_{2.5}}^t $$

where \( {A}_{N{O}_2}^t,{A}_{P{M}_{2.5}}^t \) are the daily concentrations for NO2 and PM2.5, respectively.

To simulate this Poisson over-dispersed data, we assumed that daily mortality followed a Negative Binomial distribution, NB (μt,θ), where μt is the mean daily number of deaths, and θ = φ/(μt-1). To get reasonable estimates for β0, φ we used real mortality and air pollution data during 2011–2014 in Greater London.

For the selection of β1, β2 we reviewed the literature for studies that reported associations between short-term exposure to NO2 and PM2.5 and all-cause mortality and derived their CRFs. We chose plausible CRF values from a recent meta-analysis of time series studies despite the fact that they are not adjusted for ME, i.e. a 0.6 and 0.54% increase in all-cause mortality per 10 units increase in NO2 and PM2.5 respectively (β1 = 0.0006, β2 = 0.00054) [34]. We also tested whether the results remain unchanged when the true coefficients were assumed to be half or twice the above percentage increases or when only one pollutant had a true health effect (βi = 0 for the co-pollutant).

Epidemiological analysis

In the context of this study, we are interested in the ME bias - quantified as the difference between the health effect estimates of error-free and error-prone exposures. The Poisson time-series model allowing for over-dispersion used in every iteration was:

$$ \log \left(E\left({Y}_t\right)\right)={\beta}_0+{\beta}_1\bullet {C}_{N{O}_2}^t+{\beta}_2\bullet {C}_{P{M}_{2.5}}^t $$

where Yt is the death count for day t and \( {C}_{N{O}_2}^t \), \( {C}_{P{M}_{2.5}}^t \) the corresponding error-prone exposure based on every scenario. We also calculated coverage probability as the percentage of 95% confidence intervals that include the assumed true exposure-response association, and power as the percentage of statistically significant estimates at the 5% level.


We confirmed that the generated exposure variables had close to the expected mean values (Table 1) and the expected distributions for the classical, Berkson or mixed error types (Fig. S2, Supplementary Material).

Table 2 contains the average Poisson regression estimates across all scenarios by areas under investigation. Pure Berkson error model resulted in overestimation of the true mortality effect for all areas and pollutants by 1.5 to 8.2%, except for the NO2 coefficient for Europe (a decrease of 7.9%), the assumed area scenario with the lower exposure variance/error variance ratio and higher corresponding ratio for the co-exposure. Otherwise, the mortality effects were underestimated with greater bias observed in Europe for NO2 (27.4%, mixture model). Interestingly, the bias in North America was similar for both pollutants, around 15–19% for classical error and 9–13% for mixture (area scenarios with similar, and relatively low exposure variance/error variance ratios for both pollutants). In Europe, the bias was significantly increased for NO2 compared with PM2.5, 10.0 and 25.1% for classical and 1.3 and 27.4% for mixture error respectively. Coverage and power for both pollutants across all scenarios by area and error type were 51–60% and 60–80% respectively.

Table 2 Summary of the true and error-prone regression coefficients, their standard errors (SE) x 10-4 and relative bias from 144,000 simulated datasets on the impact of three error models (classical, Berkson and mixture) on 2-pollutant Poisson regression by area of study. Results presented for all scenarios (N = 48,000 in each row)

The results of mixture error are illustrated in Fig. 2 showing the relationship between PM2.5 and NO2 coefficients by study area for all scenarios. No scenario resulted in overestimation of the effect of both pollutants (upper right quartile). Between pollutants, higher values of \( {\hat{\beta}}_1 \) tend to be associated with lower values of \( {\hat{\beta}}_2 \), in an approximate quadratic negative relationship. No clear pattern was observed by study area, except for a group of European results in the lower right of the graph (> 70% decrease in the NO2 effect estimate). NO2 coefficient attenuation is higher than in other scenarios because the hypothesized exposure variance/error variance ratio was lower compared to PM2.5.

Fig. 2
figure 2

Scatter plot of the PM2.5 and NO2 Poisson regression coefficients by area of study when mixture error was assumed. In dots are the averages of 1000 simulated datasets across the same scenario. Diamonds show the averages of all the datasets assuming the same area of study. The two lines illustrate the assumed true mortality effect of the pollutants. Results presented for all 144 scenarios

Figure 3 plots the mortality estimates for both pollutants sorted in increasing order of NO2 effect size (left). Effect transfer can clearly be observed either from NO2 to PM2.5 (upper part) or from PM2.5 to NO2 (lower part), but also, there are some scenarios where both coefficients were attenuated (middle part). Interestingly, in the middle and even at the top of the graph, there are scenarios in which the estimates for both pollutants are to the left of the true beta line i.e. attenuated. Nonetheless, large attenuations for both pollutant estimates did not occur simultaneously, as can be seen from the absence of points in the bottom left corner of Fig. 2.

Fig. 3
figure 3

Plot for the comparison of PM2.5 and NO2 mortality estimates (with 95% CIs) for mixture error type (averages of 1000 simulated datasets across the same scenario, sorted by the NO2 regression coefficients). Results presented for all 144 scenarios. The vertical lines illustrate the assumed true mortality effect of the pollutants. In green: statistically significant estimates, in orange: not-statistically significant estimates, in red: significantly biased estimates

Table 3 illustrates the model estimates with different assumed values for the correlation between the exposures and the error variances. Results from the most plausible error model are reported, i.e. mixture error. Effect transfer is clearly observed, as the effect estimates for one exposure are decreasing when its error variability is increasing (less precise exposure), while the co-exposure’s coefficients are increasing, and its error variability is kept fixed. For example, in the “moderate” correlation scenarios, keeping the PM2.5 error variability fixed at “low” (SD(\( {\delta}_{P{M}_{2.5}} \)) = 2.9 μg/m3), the bias in the PM2.5 Poisson regression estimates range from − 3.7% in the “very low” NO2 error variability scenario to + 17.8% in the “high” scenario. On the contrary, the corresponding bias for NO2 varies from + 0.7% to − 50.0% (“very low” to “high” SD(\( {\delta}_{NO_2} \))). Bias in the effect estimates varies highly by assumed error structure from 26.1% overestimation to 44.0% underestimation of the true effect for PM2.5, and from + 12.1% to − 58.4% for NO2. The highest biases are observed in the most extreme scenarios, but in the more plausible “low”-“moderate” error variability scenarios, bias is not negligible (values up to approximately 30% towards the null). Correlation between the exposures does not seem to be a driving factor for the bias.

Table 3 Summary of the regression coefficients, their standard errors (SE)(×10− 4) and relative bias of 144,000 simulated datasets on the impact of mixture error model on 2-pollutant Poisson regression by the error variability of PM2.5 and NO2. Results presented for all scenarios (N = 3000 in each row)

Comparing multi- and single-pollutant model coefficients (βM and βS respectively), Table 4 suggests that, in the absence of ME, using βS may result in a large overestimation of the combined effect of PM2.5 and NO2. Thus, βS, in general, includes effects attributable to both pollutants where both pollutants have a true effect on mortality. For mixture error, regression coefficients increased by 22.5% for PM2.5 and 17.7% for NO2, when single- instead of multi-pollutant models were fitted. Surprisingly, the combined effect, \( {\beta}_{S_1}+{\beta}_{S_2} \), of the mixture error-prone variables was closer to the true combined effect, i.e. 5.4 + 6.0, compared to \( {\beta}_{M_1}+{\beta}_{M_2} \). No significant differences were observed between the standard errors of βM and βS.

Table 4 Summary of the regression coefficients, their standard errors (SE)(x10-4) and the percentage decrease from single- to multi-pollutant model estimates for 144,000 simulated datasets on the impact of three error models (classical, Berkson and mixture) on 2-pollutant Poisson regression. Results presented for all scenarios (N = 144,000 in each row)

We performed sensitivity analyses with the Europe, additive, mixture error, core scenario to check which of the simulation inputs are driving ME bias. No difference in the relative bias was observed when we assumed a “true” CRF half or twice the core one. Mortality estimates were attenuated even more when only one pollutant had a true effect on health compared to the case where both had an effect. Interestingly, effect transfer is observed from the pollutant with the true effect to the co-pollutant assumed to have no true effect. When the classical part in the error mixture was increased for both pollutants, the relative bias increased for PM2.5 (from 1.3 to 6.0%), while it only decreased slightly for NO2 (from 27.4 to 25.4%). Finally, multiplicative error resulted in great underestimation of the exposure-response relationship, i.e. 84.5 and 90.0% for PM2.5 and NO2 respectively (Table 5).

Table 5 Summary of the regression coefficients, their standard errors (SE)(x10-4) and relative bias of 48,000 simulated datasets on the impact of mixture error model on 2-pollutant Poisson regression. Results presented for the core scenario (Area: Europe, Error type: Additive-Mixture) and sensitivity analyses (N = 48,000 in each row)


We performed a simulation study to quantify the bias in mortality effect estimates caused by ME in multi-pollutant, time-series models including PM2.5 and NO2. While the impact of ME can be more easily predicted when single exposures are measured with error, multiple error-prone exposures of any error type (i.e. purely classical, purely Berkson or mixture) can distort the health effect estimates. Our results can be applied to other outcomes and exposures as well.

Mixture error model was found to attenuate the effect of both pollutants, with higher attenuation for NO2, the exposure variable measured with more error as found previously [24]. ME also reduced coverage of 95% confidence intervals and statistical power. The largest underestimations of the true effect were in North America for PM2.5 (10.3%) and in European studies for NO2 (27.4%). Differences by study area were observed because of the different pollutant variability/error variability ratios in these areas and/or potentially other unobserved parameters related to ME. For example, the exposure assessment method used (e.g. measured or modelled concentrations) and the exposure metric (e.g. 24-h mean/max, ambient or personal exposure) may vary by study area. When various values for the correlation between the exposures were assessed, mortality estimates did not change significantly. This is important because correlation between pollutant concentrations is the most common factor discussed in air pollution epidemiological studies as driving unreliability in multi-pollutant models. Day et al. (2004) have reached to similar conclusions for the correlations between the exposures and between the errors except for the case of high correlations (both > 0.8), for a nutritional epidemiology study context [35]. No change was observed in the relative bias with different “true” CRFs, while a small, false positive effect was observed for the pollutant assumed to have no true effect, when both exposures are measured with error. In contrast, when we assumed multiplicative instead of additive error (i.e. a less likely scenario), both pollutant health effect estimates were > 85% biased. Finally, only the PM effect estimate underestimation increased when varying the classical-Berkson ratio of the mixture.

Effect transfer was clearly observed concluding that less precise measurements for one pollutant yield more bias while the co-pollutant effect estimates were closer to the true. This decrease in the bias of the co-pollutant, however, can be regarded as due to the net effect of underestimation due to ME and overestimation due to effect transfer; the latter cancelling out the effect of the former. Szpiro et al. (2011) showed in their simulation study that more accurate exposure predictions do not necessarily improve the health effect estimates [22]. They considered the effects of long-term exposure to air pollution and on comparisons between exposures from correctly specified and misspecified prediction models. However, similar approaches in a multi-pollutant framework have shown that measurement error bias can be severe and correcting for it can strengthen the exposure-response associations [36]. Time-series studies showed that health effect estimates from modelled data are more prone to ME than from measured concentrations [37]. Goldman et al. (2011) reported that spatial error, (only a part of our error decomposition), attenuated the risk ratios from 19 to 31% for primary pollutants (including NO2), but only from 2 to 9% for secondary pollutants (PM2.5 regarded as such) [30]. These values are close to our overall bias estimates of 17 and 7% respectively, even though their characterisation of NO2 and PM2.5 as primary and secondary pollutants respectively might be questionable. Similarly, Dionisio et al. (2016) fitting two-pollutant time-series models with additive and multiplicative error reported total effect attenuation up to 85% for NO2 (close to our estimates for multiplicative error), indicating multi-pollutant model estimates are even more susceptible to ME [38]. Blangiardo et al. (2019) also found, under a Bayesian framework, that NO2 effects were considerably biased when error-prone concentrations were used [39]. However, they focused on collinearity in multi-pollutant models without assessing error structures/types.

When misspecified single-pollutant models were fitted, increased effect estimates were observed compared to two-pollutant model coefficients. Interestingly, with mixture error-prone variables, the sum of the single-pollutant model effect estimates was closer to the true combined effect compared with the corresponding sum from multi-pollutant models (due to an accidental cancelling out of overestimation in single-pollutant models from confounding by the co-pollutant and underestimation due to ME). These conclusions hold only when the pollutants are positively correlated. Hence, even if it is accepted that both pollutants have a true causal effect on health [40], should single- or multi-pollutant models be used for quantifying the combined effects of air pollution? Clearly, multi-pollutant models take into account the confounding effects between the pollutants, but if ME is disregarded, biased estimates are produced. This implies that in a “multi-pollutant air quality management framework”, ME correction methods should not be neglected [41]. If a correction method cannot be applied, single-pollutant model effects may not be significantly overestimated, but this might only hold under some conditions (e.g. specific correlations between pollutants and/or between errors).

Several previous studies have considered the effects of a mixture of classical and Berkson error in exposures other than air pollution. Mallick et al. (2002) found that the mixture error bias in their relative risks for thyroid disease and radiation fallout ranged from 3.2 to 42.7% [15]. Tapsoba et al. (2019) studying medications in HIV patients report biases from 0 to 22% depending on the correction method [33]. These values are close to our findings, as is their assumed percentage of Berkson error in the exposure that lies between 20 and 80%. In contrast, Deffner et al. (2018), examining the effects of ultra-fine particles on heart rate, reported that mixture error had little impact on their results [17]. This, however, may be due to their error definitions: they assumed that total personal measurements include only classical error, measurements from fixed sites only Berkson.

This simulation study has some limitations. Firstly, several of our assumptions increased the uncertainty of our estimates. For example, we used pooled estimates for the error structures across wide study areas (e.g. cities within our areas may have different concentrations), and approximations for the correlation between exposures and between errors. However, to the best of our knowledge, no study has attempted to quantify the important error variables in specific areas, apart from Dionisio et al. (2014) in Atlanta (using a different, more spatial framework than our study) [23]. We used more generalisable inputs provided from previously published work [24]. In addition, equal exposure misclassification across days was assumed, increasing the uncertainty about the relationship between sources of spatial and temporal variations. We could not investigate the dependency or correlation between daily exposures and how this might interact with the spatial variation. Moreover, we assumed a constant percent error model additive on the logarithmic scale which implies that at low exposures, error is also low, and vice versa. Error may be higher at high exposures, but it might not drop substantially at lower concentrations. In future work, we plan to use real data from panel studies to identify the gaps in our information and describe properly the error structures of the air pollutants.

This study discusses PM2.5 and NO2 and how to separate their effects on health. There is debate over whether NO2 is acting as an indicator for other traffic pollutants. To the extent that these other pollutants have similar spatial error characteristics and some, e.g. CO, ultrafine particles, may also have greater infiltration indoors than PM2.5, our findings regarding effect transfer in multi-pollutant models with PM2.5 will also apply. Moreover, our work could not quantify the error structures of ozone so could not assess its exposure misclassification bias. This work was on two pollutants that are generally positively correlated. Ozone, due to its formation and characteristics, can be negatively correlated with PM2.5 and NO2, and this may change our conclusions. However, the correlations are not expected to be very high, and according to this and previous studies, the effect of pollutant correlations on ME bias is not expected to be substantial [35]. We are currently working on this, using previously analysed raw data from a panel study on schoolchildren [42]. We will also assess the impact of ME on the shape of the exposure-response curve. This work did not find any significant distortions for the shape of the CRFs for the health effects of short-term exposures when the error is additive, but the effects of multiplicative error might be more profound especially for the identification of the long-term effects of air pollution, for which the CRFs are used in cost-benefit analyses [43]. We addressed effect estimates from short-term exposures. Future work should address the multi-pollutant model estimates for long-term exposures which may be more biased, as indicated in studies assessing single-pollutant exposures [31, 32]. Finally, measurement error correction is an important aspect which becomes more complex if we consider expanding the findings from exposure to air pollution to the whole exposome, i.e. the totality of environmental exposures throughout a lifetime. In such studies, variable selection methods are usually used, which rely on empirical data and may create false positive and false negative selections in the presence of ME [44].

In summary, this study quantified the effects of exposure measurement error on multi-pollutant, time-series model estimates. Using simulations, under an extensive range of scenarios, we showed that non-trivial underestimation in health effect estimates can result from measurement error, especially for NO2, which was found to be more prone to error, but for PM2.5 as well. We recommend that ME should be considered in every epidemiological analysis assessing exposures prone to large ME, and that studies of personal exposure should provide information on relevant error parameters, such as correlation between errors and error variability, in order to better understand the correct error structures of the pollutants. It is important that correct health effect estimates should be derived in order, not only to separate the independent effects of air pollutants, but also to correctly quantify the health impacts of air pollution, inform interpretation and recommend future approaches for policy making.

Availability of data and materials

The datasets used and analysed during the current study are available from the corresponding author on reasonable request. This work was produced using simulated data.


  1. Cohen AJ, Brauer M, Burnett R, Anderson HR, Frostad J, Estep K, et al. Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: an analysis of data from the global burden of diseases study 2015. Lancet. 2017;389(10082):1907–18.

  2. World Health Organization (WHO). Review of evidence on health aspects of air pollution–REVIHAAP project: final technical report. Bonn: WHO European Centre for Environment and Health; 2013.

    Google Scholar 

  3. Health Effects Institute (HEI). State of Global Air 2019.

    Google Scholar 

  4. Kelly FJ, Fussell JC. Air pollution and public health: emerging hazards and improved understanding of risk. Environ Geochem Health. 2015;37(4):631–49.

    Article  CAS  Google Scholar 

  5. Dominici F, Peng RD, Barr CD, Bell ML. Protecting human health from air pollution: shifting from a single-pollutant to a multi-pollutant approach. Epidemiology. 2010;21(2):187.

    Article  Google Scholar 

  6. Stafoggia M, Breitner S, Hampel R, Basagaña X. Statistical approaches to address multi-pollutant mixtures and multiple exposures: the state of the science. Current environmental health reports. 2017;4(4):481–90.

    Article  CAS  Google Scholar 

  7. Davalos AD, Luben TJ, Herring AH, Sacks JD. Current approaches used in epidemiologic studies to examine short-term multipollutant air pollution exposures. Ann Epidemiol. 2017;27(2):145–53.e1.

    Article  Google Scholar 

  8. Zidek JV, Wong H, Le ND, Burnett R. Causality, measurement error and multicollinearity in epidemiology. Environmetrics. 1996;7(4):441–51.;2-V.

  9. Zeger SL, Thomas D, Dominici F, Samet JM, Schwartz J, Dockery D, et al. Exposure measurement error in time-series studies of air pollution: concepts and consequences. Environ Health Perspect. 2000;108(5):419–26.

  10. van Smeden M, Lash TL, Groenwold RH. Reflection on modern methods: five myths about measurement error in epidemiological research. Int J Epidemiol. 2020;49(1):338–47.

    Article  Google Scholar 

  11. Carroll RJ, Ruppert D, Stefanski LA, Crainiceanu CM. Measurement error in nonlinear models: a modern perspective: CRC press; 2006.

    Book  Google Scholar 

  12. Samoli E, Butland BK. Incorporating measurement error from modeled air pollution exposures into epidemiological analyses. Current environmental health reports. 2017;4(4):472–80.

    Article  Google Scholar 

  13. Shaw PA, Deffner V, Keogh RH, Tooze JA, Dodd KW, Küchenhoff H, et al. Epidemiologic analyses with error-prone exposures: review of current practice and recommendations. Ann Epidemiol. 2018;28(11):821–8.

  14. Van Roosbroeck S, Li R, Hoek G, Lebret E, Brunekreef B, Spiegelman D. Traffic-related outdoor air pollution and respiratory symptoms in children: the impact of adjustment for exposure measurement error. Epidemiology. 2008;19(3):409–16.

    Article  Google Scholar 

  15. Mallick B, Hoffman FO, Carroll RJ. Semiparametric regression modeling with mixtures of Berkson and classical error, with application to fallout from the Nevada test site. Biometrics. 2002;58(1):13–20.

    Article  Google Scholar 

  16. Heid I, Küchenhoff H, Miles J, Kreienbrock L, Wichmann H. Two dimensions of measurement error: classical and Berkson error in residential radon exposure assessment. J Expo Sci Environ Epidemiol. 2004;14(5):365–77.

    Article  CAS  Google Scholar 

  17. Deffner V, Küchenhoff H, Breitner S, Schneider A, Cyrys J, Peters A. Mixtures of Berkson and classical covariate measurement error in the linear mixed model: Bias analysis and application to a study on ultrafine particles. Biom J. 2018;60(3):480–97.

    Article  Google Scholar 

  18. Sarnat SE, Klein M, Sarnat JA, Flanders WD, Waller LA, Mulholland JA, et al. An examination of exposure measurement error from air pollutant spatial variability in time-series studies. J Expo Sci Environ Epidemiol. 2010;20(2):135–46.

  19. Setton E, Marshall JD, Brauer M, Lundquist KR, Hystad P, Keller P, et al. The impact of daily mobility on exposure to traffic-related air pollution and health effect estimates. J Expo Sci Environ Epidemiol. 2011;21(1):42–8.

  20. Strickland MJ, Gass KM, Goldman GT, Mulholland JA. Effects of ambient air pollution measurement error on health effect estimates in time-series studies: a simulation-based analysis. J Expo Sci Environ Epidemiol. 2015;25(2):160–6.

    Article  CAS  Google Scholar 

  21. Gryparis A, Paciorek CJ, Zeka A, Schwartz J, Coull BA. Measurement error caused by spatial misalignment in environmental epidemiology. Biostatistics. 2009;10(2):258–74.

    Article  Google Scholar 

  22. Szpiro AA, Paciorek CJ, Sheppard L. Does more accurate exposure prediction necessarily improve health effect estimates? Epidemiology. 2011;22(5):680–5.

    Article  Google Scholar 

  23. Dionisio KL, Baxter LK, Chang HH. An empirical assessment of exposure measurement error and effect attenuation in bipollutant epidemiologic models. Environ Health Perspect. 2014;122(11):1216–24.

    Article  CAS  Google Scholar 

  24. Evangelopoulos D, Katsouyanni K, Keogh RH, et al. PM2.5 and NO2 exposure errors using proxy measures, including derived personal exposure from outdoor sources: A systematic review and meta-analysis. Environ Int. 2020;137:105500.

    Article  CAS  Google Scholar 

  25. Katsouyanni K, Samet JM, Anderson H, et al. Air pollution and health: a European and North American approach (APHENA). Res Rep Health Eff Inst. 2009;142:5–90.

    CAS  Google Scholar 

  26. Sarnat JA, Schwartz J, Catalano PJ, Suh HH. Gaseous pollutants in particulate matter epidemiology: confounders or surrogates? Environ Health Perspect. 2001;109(10):1053–61.

    Article  CAS  Google Scholar 

  27. Sarnat JA, Brown KW, Schwartz J, Coull BA, Koutrakis P. Ambient gas concentrations and personal particulate matter exposures - implications for studying the health effects of particles. Epidemiology. 2005;16(3):385–95.

    Article  Google Scholar 

  28. Sarnat SE, Coull BA, Schwartz J, Gold DR, Suh HH. Factors affecting the association between ambient concentrations and personal exposures to particles and gases. Environ Health Perspect. 2006;114(5):649–54.

    Article  CAS  Google Scholar 

  29. Armstrong BG. Effect of measurement error on epidemiological studies of environmental and occupational exposures. Occup Environ Med. 1998;55(10):651–6.

    Article  CAS  Google Scholar 

  30. Goldman GT, Mulholland JA, Russell AG, et al. Impact of exposure measurement error in air pollution epidemiology: effect of error type in time-series studies. Environ Health. 2011;10(61):10.1186.

    Google Scholar 

  31. Butland BK, Samoli E, Atkinson RW, Barratt B, Beevers SD, Kitwiroon N, et al. Comparing the performance of air pollution models for nitrogen dioxide and ozone in the context of a multilevel epidemiological analysis. Environmental Epidemiology. 2020;4(3):e093.

  32. Samoli E, Butland BK, Rodopoulou S, Atkinson RW, Barratt B, Beevers SD, et al. The impact of measurement error in modeled ambient particles exposures on health effect estimates in multilevel analysis: a simulation study. Environmental Epidemiology. 2020;4(3):e094.

  33. Tapsoba JD, Chao EC, Wang C-Y. Simulation extrapolation method for cox regression model with a mixture of Berkson and classical errors in the covariates using calibration data. The international journal of biostatistics. 2019;15(2).

  34. Mills I, Atkinson R, Anderson H, Maynard R, Strachan D. Distinguishing the associations between daily mortality and hospital admissions and nitrogen dioxide from those of particulate matter: a systematic review and meta-analysis. BMJ Open. 2016;6(7):e010751.

    Article  CAS  Google Scholar 

  35. Day NE, Wong MY, Bingham S, Khaw KT, Luben R, Michels KB, et al. Correlated measurement error—implications for nutritional epidemiology. Int J Epidemiol. 2004 Dec 1;33(6):1373–81.

  36. Bergen S, Sheppard L, Kaufman JD, Szpiro AA. Multipollutant measurement error in air pollution epidemiology studies arising from predicting exposures with penalized regression splines. Journal of the Royal Statistical Society. Series C, Applied statistics. 2016 Nov;65(5):731.

    Article  Google Scholar 

  37. Butland BK, Armstrong B, Atkinson RW, Wilkinson P, Heal MR, Doherty RM, et al. Measurement error in time-series analysis: a simulation study comparing modelled and monitored data. BMC Med Res Methodol. 2013;13(1):136.

  38. Dionisio KL, Chang HH, Baxter LK. A simulation study to quantify the impacts of exposure measurement error on air pollution health risk estimates in copollutant time-series models. Environ Health. 2016;15(1):114.

    Article  CAS  Google Scholar 

  39. Blangiardo M, Pirani M, Kanapka L, Hansell A, Fuller G. A hierarchical modelling approach to assess multi pollutant effects in time-series studies. PLoS One. 2019 Mar 4;14(3):e0212565.

    Article  CAS  Google Scholar 

  40. Schwartz J, Fong K, Zanobetti A. A national multicity analysis of the causal effect of local pollution, NO 2, and PM 2.5 on mortality. Environ Health Perspect. 2018;126(8):087004.

    Article  CAS  Google Scholar 

  41. Mauderly JL, Burnett RT, Castillejos M, Özkaynak H, Samet JM, Stieb DM, et al. Is the air pollution health research community prepared to support a multipollutant air quality management framework? Inhal Toxicol. 2010;22(sup1):1–19.

  42. Samoli E, Dimakopoulou K, Evangelopoulos D, Rodopoulou S, Karakatsani A, Veneti L, et al. Is daily exposure to ozone associated with respiratory morbidity and lung function in a representative sample of schoolchildren? Results from a panel study in Greece. J Expo Sci Environ Epidemiol. 2017;27(3):346–51.

  43. Crump KS. The effect of random error in exposure measurement upon the shape of the exposure response. Dose-Response. 2005 Oct 1;3:4.

    Article  Google Scholar 

  44. Agier L, Slama R, Basagaña X. Relying on repeated biospecimens to reduce the effects of classical-type exposure measurement error in studies linking the exposome to health. Environ Res. 2020 Jul 1;186:109492.

    Article  CAS  Google Scholar 

Download references


We thank Professor Ruth Keogh (London School of Hygiene and Tropical Medicine) for her useful feedback in our analysis. We also thank our colleagues from Environmental Research Group (Imperial College London), Dr. Ben Barratt and Dr. David Green, for providing data used as simulation inputs.


This work was supported by the National Institute for Health Research Health Protection Research Unit (NIHR HPRU) in Health Impact of Environmental Hazards at King’s College London in partnership with Public Health England (PHE) and Imperial College London. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, the Department of Health & Social Care or Public Health England.

Author information

Authors and Affiliations



DE: Statistical Methodology, Statistical Analysis, Writing - Original Draft, Review & Editing, KK: Conceptualisation, Statistical Methodology, Writing – Review & Editing, JS: Methodology, Writing – Review & Editing, HW: Conceptualisation, Statistical Methodology, Writing – Review & Editing, Supervision. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Dimitris Evangelopoulos.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Evangelopoulos, D., Katsouyanni, K., Schwartz, J. et al. Quantifying the short-term effects of air pollution on health in the presence of exposure measurement error: a simulation study of multi-pollutant model results. Environ Health 20, 94 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: