On the estimation of heat-intensity and heat-duration effects in time series models of temperature-related mortality in Stockholm, Sweden
- Joacim Rocklov^{1}Email author,
- Adrian G Barnett^{2} and
- Alistair Woodward^{3}
DOI: 10.1186/1476-069X-11-23
© Rocklov et al.; licensee BioMed Central Ltd. 2012
Received: 26 November 2011
Accepted: 10 April 2012
Published: 10 April 2012
Abstract
Background
We examine the effect of heat waves on mortality, over and above what would be predicted on the basis of temperature alone.
Methods
Present modeling approaches may not fully capture extra effects relating to heat wave duration, possibly because the mechanisms of action and the population at risk are different under more extreme conditions. Modeling such extra effects can be achieved using the commonly left-out effect-modification between the lags of temperature in distributed lag models.
Results
Using data from Stockholm, Sweden, and a variety of modeling approaches, we found that heat wave effects amount to a stable and statistically significant 8.1-11.6% increase in excess deaths per heat wave day. The effects explicitly relating to heat wave duration (2.0–3.9% excess deaths per day) were more sensitive to the degrees of freedom allowed for in the overall temperature-mortality relationship. However, allowing for a very large number of degrees of freedom indicated over-fitting the overall temperature-mortality relationship.
Conclusions
Modeling additional heat wave effects, e.g. between lag effect-modification, can give a better description of the effects from extreme temperatures, particularly in the non-elderly population. We speculate that it is biologically plausible to differentiate effects from heat and heat wave duration.
Background
Heat stress can lead to fatal consequences due to: dehydration; increased cardiovascular stress; kidney dysfunction; and electrolyte disorders [1, 2]. At a population level, many studies show mortality tends to rise with higher temperatures [3]. Two approaches are generally used to quantify excess mortality: studies that focus exclusively on heat waves (so called episode studies); and studies that use time series to estimate the effects of temperature on mortality by averaging over hot days and heat waves. Heat waves are commonly referred to as a period of extreme heat stress relative to the normal climate, although the exact definition varies according to the number of consecutive days of heat, temperature variable(s) and heat threshold. Many time series studies, assuming the association between temperature and mortality is non-linear, report associations between heat and mortality that are immediate or delayed by up to a week [4, 5]. However, the validity of this approach is challenged by research that reports an additional effect for heat waves [6]. Other studies have reported that heat-related mortality is sensitive to the duration of heat waves regardless of the intensity of the ambient heat (e.g. in France during the 2003 heat wave [7]). A number of studies have since explored additional heat waves effects with respect to their timing, intensity, duration and location [6, 8–13]. All studies found statistically significant additional risks that may relate to the duration of heat waves and the cumulative extreme heat exposure. The main differences between the studies were the models used to estimate the heat-mortality relationship and the location.
The reason why the overall temperature-mortality relationship may not fully explain effects during heat waves is because the physiological effects of high temperatures and heat waves are different. For example, cumulative heat stress during a prolonged heat wave is more likely to cause dehydration. Cumulative heat stress is also more strongly related to cardiovascular deaths [8, 13]. Differences in age stratified relative risks to heat and to heat waves have shown that the population at risk may differ, with the middle aged population potentially at the highest risk during heat waves [8, 13].
We explain why additional heat wave effects are not perfectly captured in models of temperature-related mortality, and illustrate the estimation of additional heat wave effects using empirical data. We also explore how the additional heat wave effect is dependent on the complexity of temperature-mortality model. Using prior studies and our own results, we argue that future studies should evaluate potential additional effects from heat waves by decomposing heat exposure into a temperature term and an added heat wave term. Increasing model complexity will not suffice if there is no differentiation between heat wave days and non-heat wave days.
Modelling weather-related mortality
Time series methods based on daily data have been developed and applied in studies of the short-term health effects of environmental factors like air pollution and weather [14, 15]. Models often include lagged effects of exposure and adjustment for potential mortality displacement [14, 16, 17]. Studies have also explored: the use of non-linear functions to adjust for confounding (e.g., season), allowing non-linear exposure-response relationships, and the use of model fitting criteria [17, 18].
Why additional heat wave effects?
Additional heat wave effects can also arise due to cumulative heat stress. It is tempting to believe that cumulative stress can be estimated just like other delayed effects of heat exposure effects, e.g. by distributed lag models. This is not the case. Distributed lag models allow temperature a few days or weeks prior to day t to affect mortality on day t, and so are perfectly able to capture delayed effects of temperature. However, distributed lag models assume the delayed effects are related to the temperature on day t only. This means that delayed lag effects are independent of other lag days, for example, days t-1 and t-2. They cannot model health effects caused by the temperature being above a heat threshold for a number of consecutive days. Thus, the distributed lag effect at a certain day of the heat wave does not estimate the effects relating to persistent heat stress. In order to estimate the effects relating to several days consecutive heat exposure above a certain threshold one would need to include non-linear interactions (effect-modifications) between the temperature lag effects. Here the non-linearity relates to the fact that the additional heat wave effects are thought to appear above some extreme temperature threshold. Thus, short-term cumulative stress of extreme heat can be described by lag effect-modification.
The model (d) estimates the effects of the duration of heat waves as a linear function avoiding potential collinearity induced by explicitly including many lag interaction variables. The models above assume a dichotomous exposure-response relationship for temperature, and that effects of temperature and heat waves are not extended over more than 3 lag days. However, the same principle can be used when fitting a distributed lag model and a larger number of lag days.
Model choice and additional heat wave effects
While recent studies found significant additional effects from heat waves, particularly in cold climates (such as the northern parts of the US and Sweden [9, 19]), the size of this effect has been estimated using different approaches [9, 10, 12, 13]. The main differences were: i) the complexity of the model used for the exposure-response relationship between temperature and mortality; ii) allowing for spatial heterogeneity in the additional heat wave effect; iii) allowing for heterogeneity in the additional heat wave effects between population sub-groups.
The study presenting the smallest additional effect from heat waves found the size of the effects to be almost negligible when the exposure-response relationship was allowed a very flexible parameterization using two dimensional cubic spline functions for temperature and lag day and modeling the main effect in a first stage model and the effect-modification in a second stage [12]. However, the estimates were for all-cause mortality independent of geographical differences in U.S. cities, while there is evidence that heat wave effects may be very sensitive to age, cause of death and location [8, 9, 13, 19]. In particular, additional heat wave effects were negligible in the southern US, and large in the north east of the US [1, 9, 19].
Two studies estimated the overall temperature-mortality relationship using non-linear distributed lags and two-dimensional spline functions, and then tested the sensitivity of this parameterization [10, 20]. Other studies used less complex linear and/or non-linear exposure response relationships, with a small number of lag days of between 1 and 3 [6, 8, 9, 13, 21]. The lag days in these studies were chosen according to prior literature and model fit criteria.
Bobb et al. argued against fitting one model across a range of climates (using the same degrees of freedom, splines and temperature measures), as they found that in most cities there were two or more models with a similar fit to the data [19]. Interestingly, after averaging over many different models they found larger effects of temperatures on heat wave days compared to non-heat wave days [19].
Estimation of additional heat wave effects in Stockholm, Sweden
Methods
Descriptive statistics for daily deaths and environmental variables in Stockholm County, 1990–2002, per season
Daily deaths, all non-injury causes, all ages | 40 ± 7.2 |
Ages 45–79 | 18 ± 4.6 |
Ages 80+ | 20 ± 5.3 |
Mean temperature (˚C) | 7.5 ± 7.6 (7 %) |
range | –15.7, 26.4 |
Maximum temperature (˚C) | 10.5 ± 8.6 (7 %) |
range | –13.4, 33.5 |
Maximum temperature 98^{th} percentile (˚C) | 27.5 C |
Values and frequencies taken by the heat wave duration and the heat wave indicator variable
Variable | Heat wave duration (HWD) | Heat wave indicator (HWI) | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Value | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 |
Frequency | 4697 | 23 | 11 | 7 | 4 | 3 | 2 | 1 | 4697 | 51 |
In model (3) we estimated the additional effects from heat waves (maximum temperature above 98^{th} percentile for at least two days) using an indicator variable, HWI. This model does not estimate additional effects due to heat wave duration explicitly, but the average additional excess mortality during the heat wave periods. The heat wave indicator estimates the effect modification of lag terms above the 98^{th} percentile assuming all days above this threshold are equally contributing to the mortality response.
Here t is the time in days, S is a cubic spline function, the spline function of temperature is two dimensional with lag degree of freedom given by lag.df and variable degree of freedom given by var.df. HWD is a linear variable denoting the day of the heat waves, HWI is an indicator variable for heat waves. time_{t} estimates trends and seasonal changes using a spline with 6 degrees of freedom per year (78 degrees of freedom in total), DOW denotes the day of week, and HD denotes national holidays.
We used the “dnlm” package in R [22]. We tested degrees of freedom for temperature from 3 to 8. The lagged effects of temperature were examined over 20 days and allowed 2, 4 or 6 degrees of freedom. The Akaike Information Criterion (AIC) was used to judge the optimal degrees of freedom.
We tested for differences by age through studying the groups: all ages; ages 80 years; and ages between 45–79 years.
We calculated the variation inflation factor as VIF = 1/(1–R-squared) to assess if the variance estimation was inflated through multi-collinearity introduced by having both temperature and heat wave terms in the same model. The variation inflation factor (VIF) of the duration variable and temperature was estimated to 1.048, and for the heat wave indicator variable to 1.058. So collinearity was not a concern for model 2 or 3.
Results
AIC values for the three models using 3 to 8 degrees of freedom for temperature and 2 to 6 degrees of freedom for lag, Stockholm, 1990–2002 in all ages
df Temperature Spline | No heat wave variable (model 1) | With heat wave duration variable (model 2) | With heat wave indicator variable (model 3) |
---|---|---|---|
df Lag Spline = 2 | |||
3 | 20052 | 20044 | 20040 |
4 | 20056 | 20047 | 20044 |
5 | 20059 | 20051 | 20048 |
6 | 20058 | 20051 | 20047 |
7 | 20062 | 20055 | 20052 |
8 | 20063 | 20057 | 20054 |
df Lag Spline = 4 | |||
3 | 20054 | 20049 | 20047 |
4 | 20055 | 20052 | 20051 |
5 | 20060 | 20059 | 20057 |
6 | 20061 | 20061 | 20059 |
7 | 20061 | 20061 | 20059 |
8 | 20067 | 20068 | 20065 |
df Lag Spline = 6 | |||
3 | 20061 | 20056 | 20053 |
4 | 20065 | 20063 | 20061 |
5 | 20075 | 20073 | 20071 |
6 | 20077 | 20077 | 20075 |
7 | 20082 | 20081 | 20079 |
8 | 20089 | 20089 | 20086 |
AIC values for the three models using 3 to 8 degrees of freedom for temperature and 2 to 6 degrees of freedom for lag, Stockholm, 1990–2002 in ages 80 years of age and above
df Temperature Spline | No heat wave variable (model 1) | With heat wave duration variable (model 2) | With heat wave indicator variable (model 3) |
---|---|---|---|
df Lag Spline = 2 | |||
3 | 17962 | 17961 | 17957 |
4 | 17966 | 17965 | 17961 |
5 | 17970 | 17969 | 17965 |
6 | 17971 | 17970 | 17966 |
7 | 17974 | 17972 | 17968 |
8 | 17977 | 17975 | 17971 |
df Lag Spline = 4 | |||
3 | 17958 | 17960 | 17958 |
4 | 17959 | 17961 | 17961 |
5 | 17967 | 17969 | 17968 |
6 | 17971 | 17973 | 17972 |
7 | 17977 | 17979 | 17977 |
8 | 17982 | 17984 | 17983 |
df Lag Spline = 6 | |||
3 | 17958 | 17959 | 17957 |
4 | 17961 | 17963 | 17963 |
5 | 17972 | 17974 | 17974 |
6 | 17980 | 17981 | 17981 |
7 | 17989 | 17991 | 17990 |
8 | 17997 | 17999 | 17998 |
AIC values for the three models using 3 to 8 degrees of freedom for temperature and 2 to 6 degrees of freedom for lag, Stockholm, 1990–2002 in ages 45 to 79 years of age
df Temperature Spline | No heat wave variable (model 1) | With heat wave duration variable (model 2) | With heat wave indicator variable (model 3) |
---|---|---|---|
df Lag Spline = 2 | |||
3 | 17410 | 17405 | 17406 |
4 | 17415 | 17409 | 17410 |
5 | 17416 | 17412 | 17412 |
6 | 17416 | 17412 | 17413 |
7 | 17417 | 17415 | 17415 |
8 | 17419 | 17417 | 17417 |
df Lag Spline = 4 | |||
3 | 17418 | 17413 | 17413 |
4 | 17423 | 17418 | 17419 |
5 | 17428 | 17424 | 17425 |
6 | 17430 | 17428 | 17428 |
7 | 17426 | 17426 | 17426 |
8 | 17434 | 17434 | 17434 |
df Lag Spline = 6 | |||
3 | 17426 | 17421 | 17421 |
4 | 17435 | 17430 | 17430 |
5 | 17443 | 17440 | 17440 |
6 | 17448 | 17447 | 17446 |
7 | 17448 | 17448 | 17447 |
8 | 17456 | 17457 | 17456 |
Figure 6 shows the corresponding marginal excess mortality predictions for a single hot day (non-heat wave day; lag 0 only). The model without the added heat wave effect (model 1) predicts higher mortality on non-heat wave days compared to the models that differentiate between heat wave and non-heat wave days (model 2 and 3). Thus, model 1 (without the additional heat wave effect) may over-estimate the effect of heat on non-heat wave days, while it appears to under-estimate the effect from heat on heat wave days. Models 2 or 3 do better in this sense, differentiating the effects between heat wave days and non-heat wave days through the variables capturing the effect modification between the lags of temperature. The more complex parameterizations of model 2 and 3 strongly indicate over-fitting in graphical examinations, whilst having better AIC values.
Relative risks (RRs) and confidence intervals (CI; 95%) associated with heat waves
Heat wave duration variable (model 2; unit: days of duration) | Heat wave indicator variable (model 3; unit: heat wave = {yes, no}) | |||
---|---|---|---|---|
df Temperature Spline | df Lag Spline = 2 | |||
RR | CI | RR | CI | |
3 | 1.037 | 1.014, 1.060 | 1.112 | 1.050, 1.176 |
4 | 1.039 | 1.015, 1.063 | 1.116 | 1.053, 1.183 |
5 | 1.038 | 1.013, 1.062 | 1.114 | 1.050, 1.182 |
6 | 1.038 | 1.013, 1.063 | 1.114 | 1.049, 1.182 |
7 | 1.038 | 1.013, 1.063 | 1.114 | 1.048, 1.183 |
8 | 1.037 | 1.011, 1.062 | 1.111 | 1.045, 1.180 |
df lag spline = 4 | ||||
3 | 1.032 | 1.008, 1.056 | 1.100 | 1.034, 1.169 |
4 | 1.028 | 1.002, 1.054 | 1.088 | 1.018, 1.162 |
5 | 1.026 | 0.998, 1.053 | 1.086 | 1.011, 1.165 |
6 | 1.023 | 0.994, 1.051 | 1.081 | 1.003, 1.165 |
7 | 1.022 | 0.992, 1.051 | 1.087 | 1.005, 1.174 |
8 | 1.020 | 0.990, 1.051 | 1.088 | 1.004, 1.179 |
df lag spline = 6 | ||||
3 | 1.032 | 1.008, 1.056 | 1.102 | 1.037, 1.172 |
4 | 1.028 | 1.002, 1.054 | 1.090 | 1.020, 1.165 |
5 | 1.025 | 0.998, 1.053 | 1.088 | 1.013, 1.168 |
6 | 1.023 | 0.994, 1.052 | 1.083 | 1.005, 1.167 |
7 | 1.022 | 0.993, 1.052 | 1.088 | 1.006, 1.176 |
8 | 1.019 | 0.989, 1.050 | 1.088 | 1.003, 1.179 |
We did not assess the estimates’ sensitivity to the parameterization (df) of long-term time trends, as this appears to have less influence on the heat wave effect [12, 19].
Discussion
Estimates of additional heat wave effects in models of temperature-related mortality can be interpreted as a constrained form of non-linear effect-modifications between lags of high temperature (or, similarly lag interactions). This can explain why such effects have been found to significantly contribute to additional deaths during heat waves in previous studies of temperature related mortality, over and above the effects of temperature overall. This dispels the widespread belief that such effects are incorporated through distributed lag models. From a mechanistic perspective including additional effects from heat waves are supported through the physiological stress incurred by cumulative exposure being potentially different from the stress from shorter periods of extreme heat, and can also result in differences in the population at risk such as contrasting susceptibility with age [8–10, 13, 19].
We found the additional heat wave effects were more important in middle age populations and the elderly compared to the very elderly. We found the size of the heat wave effect depended on the complexity of the main temperature-mortality parameterization, more specifically on the df used for modeling non-linearity of temperature and lagged effects. This indicates that there is, not surprisingly, some overlap between the effects of hot days and the effects of heat waves. Our results show, however, that there is also likely to be an independent extra effect of heat waves that is not captured by hot days, e.g. an effect modification. Differences in the temperature-mortality parameterization probably explain many of the differences between the conclusions of recent studies on the effects of heat waves [6, 8, 9, 11–13, 19, 21] In our example we found that a simple model for the temperature-mortality association was better than a complex model. The additional heat wave effects are substantial and important to account for in order not to underestimate mortality risks during heat wave days. We conclude that it is important not to over fit the data by using too complex non-linear lag parameterizations, and that simple parameters for the additional effects of heat waves are useful. However, the more complex parameterizations appear to better capture effects and the high end. It appear it may not be reasonable to distribute the degrees of freedom uniformly over the temperature and lag scales as such assumption gave rise to over-fitting in regions of the temperature and lag scale where the relationship is not very complex. Overall, model fit improved as the complexity of the model and flexibility of the splines were reduced. These simpler models also had larger additional heat wave effects, as reported elsewhere [6, 8, 9, 11–13, 19, 21].
Some studies have used two-stage models to describe the effects from temperature using distributed lag non-linear models in the first stage and the additional effect associated with heat waves in a second step [10, 12]. We note, however, that this deviates from the conventional framework for modeling of effect-modifications, and that it can potentially affect the estimates of additional heat wave effects downward.
We achieved a better model fit using an indicator variable for heat waves rather than the duration variable when studying all ages; nevertheless, models with the duration variable performed better than the models without additional heat wave components, and similarly well in the age group 45–79 years of age.
Conclusions
We conclude that it is important to continue to explore the magnitude of additional heat wave effects in future studies of temperature-related mortality, e.g. temperature lag effect-modifications. It appear also important to fit models that are location sensitive in the parameterization (choice of df), as well as in the evaluation of potential additional heat wave effects. Fitting a complex distributed lag non-linear model may reduce the heat wave signal and over-estimate mortality on non-heat wave days, compared to a model including a heat wave term. It is important to be able to differentiate between extended periods of heat and single days of extremely high temperatures, since several recent studies have shown that duration of heat exposure is related to mortality risk. Increasing the accuracy of heat-wave mortality models will assist public health authorities to direct preventive actions when they are most needed. Future studies should continue to study and identify potential differences in the population at risk to heat and heat waves, as well as describe the mechanistic differences.
Declarations
Acknowledgements
Part of this work was undertaken within the Umeå Centre for Global Health Research at Umeå University, with support from FAS, the Swedish Council for Working Life and Social Research (grant no: 2006–1512).
Authors’ Affiliations
References
- Kilbourne EM: The spectrum of illness during heat waves. Am J Prev Med. 1999, 16 (4): 359-360.View Article
- Parsons K: Human Thermal Environments. The effects of hot, moderate and cold temperatures on human health, comfort and performance. 2003, CRC Press, New York, 2003-2
- Kovats RS, Hajat S: Heat stress and public health: a critical review. Annu Rev Public Health. 2008, 29: 41-55. 10.1146/annurev.publhealth.29.020907.090843.View Article
- Baccini M, Biggeri A, Accetta G, Kosatsky T, Katsouyanni K, Analitis A, Anderson HR, Bisanti L, D’Ippoliti D, Danova J: Heat effects on mortality in 15 European cities. Epidemiol. 2008, 19 (5): 711-719. 10.1097/EDE.0b013e318176bfcd.View Article
- Braga AL, Zanobetti A, Schwartz J: The time course of weather-related deaths. Epidemiol. 2001, 12 (6): 662-667. 10.1097/00001648-200111000-00014.View Article
- Hajat S, Armstrong B, Baccini M, Biggeri A, Bisanti L, Russo A, Paldy A, Menne B, Kosatsky T: Impact of high temperatures on mortality: is there an added heat wave effect?. Epidemiol. 2006, 17 (6): 632-638. 10.1097/01.ede.0000239688.70829.63.View Article
- Fouillet A, Rey G, Laurent F, Pavillon G, Bellec S, Guihenneuc-Jouyaux C, Clavel J, Jougla E, Hemon D: Excess mortality related to the August 2003 heat wave in France. Int Arch Occup Environ Health. 2006, 80 (1): 16-24. 10.1007/s00420-006-0089-4.View Article
- Anderson BG, Bell ML: Weather-related mortality: how heat, cold, and heat waves affect mortality in the United States. Epidemiol. 2009, 20 (2): 205-213. 10.1097/EDE.0b013e318190ee08.View Article
- Anderson GB, Bell ML: Heat waves in the United States: mortality risk during heat waves and effect modification by heat wave characteristics in 43 U.S. communities. Environ Health Perspect. 2011, 119 (2): 210-218.View Article
- Barnett AG, Hajat S, Gasparrini A, Rocklov J: Cold and heat waves in the United States. Environ Res. 2012, 112: 218-224.View Article
- D'Ippoliti D, Michelozzi P, Marino C, de’Donato F, Menne B, Katsouyanni K, Kirchmayer U, Analitis A, Medina-Ramon M, Paldy A: The impact of heat waves on mortality in 9 European cities: results from the EuroHEAT project. Environ Health. 2010, 9: 37-10.1186/1476-069X-9-37.View Article
- Gasparrini A, Armstrong B: The impact of heat waves on mortality. Epidemiol. 2011, 22 (1): 68-73. 10.1097/EDE.0b013e3181fdcd99.View Article
- Rocklov J, Ebi K, Forsberg B: Mortality related to temperature and persistent extreme temperatures: a study of cause-specific and age-stratified mortality. Occup Environ Med. 2011, 68 (7): 531-536. 10.1136/oem.2010.058818.View Article
- Dominici F: Time-series analysis of air pollution and mortality: a statistical review. Res Rep Health Eff Inst. 2004, 123: 3-27. discussion 29–33
- Zeger SL: A Regression-Model for Time-Series of Counts. Biometrika. 1988, 75 (4): 621-629. 10.1093/biomet/75.4.621.View Article
- Schwartz J: The distributed lag between air pollution and daily deaths. Epidemiol. 2000, 11 (3): 320-326. 10.1097/00001648-200005000-00016.View Article
- Zanobetti A, Wand MP, Schwartz J, Ryan LM: Generalized additive distributed lag models: quantifying mortality displacement. Biostatistics. 2000, 1 (3): 279-292. 10.1093/biostatistics/1.3.279.View Article
- Peng RD, Dominici F, Louis TA: Model choice in time series studies of air pollution and mortality. J R Statist Soc A. 2006, 169: 25-View Article
- Bobb JF, Dominici F, Peng RD: A Bayesian model averaging approach for estimating the relative risk of mortality associated with heat waves in 105 U.S. cities. Biometrics. 2011, 67 (4): 1605-1616. 10.1111/j.1541-0420.2011.01583.x.View Article
- Sattar A, Rubessa M, Di Francesco S, Longobardi V, Di Palo R, Zicarelli L, Campanile G, Gasparrini B: The influence of gamete co-incubation length on the in vitro fertility and sex ratio of bovine bulls with different penetration speed. Reprod Domest Anim. 2011, 46 (6): 1090-1097. 10.1111/j.1439-0531.2011.01791.x.View Article
- Rocklov J, Forsberg B: The effect of temperature on mortality in Stockholm 1998–2003: a study of lag structures and heatwave effects. Scand J Public Health. 2008, 36 (5): 516-523. 10.1177/1403494807088458.View Article
- Gasparrini A: Distributed Lag Linear and Non-Linear Models in R: The Package dlnm. J Stat Softw. 2011, 43 (8): 1-20.View Article
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.