A global perspective on coal-fired power plants and burden of lung cancer
Environmental Health volume 18, Article number: 9 (2019)
The Letter to the Editor to this article has been published in Environmental Health 2019 18:54
Exposure to ambient particulate matter generated from coal-fired power plants induces long-term health consequences. However, epidemiologic studies have not yet focused on attributing these health burdens specifically to energy consumption, impeding targeted intervention policies. We hypothesize that the generating capacity of coal-fired power plants may be associated with lung cancer incidence at the national level.
Age- and sex-adjusted lung cancer incidence from every country with electrical plants using coal as primary energy supply were followed from 2000 to 2016. We applied a Poisson regression longitudinal model, fitted using generalized estimating equations, to estimate the association between lung cancer incidence and per capita coal capacity, adjusting for various behavioral and demographic determinants and lag periods.
The average coal capacity increased 1.43-fold, from 16.01 gigawatts (GW) (2000~2004) to 22.82 GW (2010~2016). With a 1 kilowatt (KW) increase in coal capacity per person in a country, the relative risk of lung cancer increases by 59% (95% CI = 7.0%~135%) among males and 85% (95% CI = 22%~182%) among females. Based on the model, we estimate that a total of 1.37 (range = 1.34~1.40) million standardized incident cases of lung cancer will be associated with coal-fired power plants in 2025.
These analyses suggest an association between lung cancer incidence and increased reliance on coal for energy generation. Such data may be helpful in addressing a key policy question about the externality costs and estimates of the global disease burden from preventable lung cancer attributable to coal-fired power plants at the national level.
Coal-fired power plants are the dominant source of energy production, yielding > 40% of global electrical power since the 1970s. Indeed, global production of coal increased nearly 2.2-fold from 1958 million tons of oil equivalent (Mtoe) in 1980 to 4270 Mtoe in 2010. However, air pollutants emitted from coal power plants and their potential impact on population health have aroused widespread concerns; fine particulate matter (PM2.5) can cause both short-term and long-term adverse health outcomes [2,3,4]. Long-term exposure to PM2.5 is associated with shorter life expectancy and higher mortality risks from lung cancer and cardiovascular diseases [5,6,7,8]. In fact, the International Agency for Research on Cancer (IARC) has listed several coal-fired power plant-related agents, including coal combustion, coal production, outdoor air pollution, and radon, as human carcinogens. While lung cancer is prevalent, the proportion of cases attributed to environmental factors such as air pollution varies by country and is difficult to estimate. Nonetheless, improved air quality has been correlated with better health [6, 11], prompting many countries to implement regulations on air pollution.
Most available estimates of health risk associated with electricity generation are oversimplified since they are calculated by multiplying a factor to air pollution levels (either PM2.5 or PM10) without considering the heterogeneous compositions of particles from different sources [13,14,15]. Moreover, lower global levels of PM2.5 are not necessarily associated with reduced adverse health effects, likely due to regional variations in composition [16, 17]. For example, satellite-driven PM2.5 measurement showed a high level of air pollution concentrated in sub-Saharan Africa . Yet, a major component of that PM was dust from the earth’s crust rather than from human activities. Therefore, simply using PM to estimate health effects may result in misguided conclusions.
To clarify the long-term health effects of coal-fired power plants at the national level and to link the capacity market in energy economics to health externalities, we aim to estimate changes in national lung cancer incidence decades after building or closing coal-fired power plants.
Study period and design
Annual lung cancer incidence rates from 2000 to 2016 among males and females from countries which have had coal-fired power plants were included in the analyses. Most countries in the study are located in Europe (38.55%) and Asia (27.71%) (Additional file 1: Table S1). Country names and geographical categories reflect the United Nations’ country classification .
Dependent variables & independent variables
Annual lung cancer incidence rates were obtained from Global Burden of Disease Study . Lung cancer codes were B101 or 162 in International Classification of Diseases version 9 (ICD-9); C028, 162, 231.1, or 231.2 in ICD-9CM; and C33, or C34 in ICD-10. Calculated age-adjusted incidence rates were based on the WHO 2000–2025 standard population for each country . We use “independent variables” and “covariates” interchangeably throughout.
The electrical capacity of power plants that primarily relied on coal as generating fuel was the exposure of interest. Coal capacity was defined as the annual accumulation of generating capacity from every coal-fired power plant in a given country. Similarly, we define plant capacity as the accumulation of total generating capacity from all power plants in a country. Non-coal capacity was plant capacity minus coal capacity. Coal percentage was defined as the ratio of coal capacity to plant capacity for each country. Per capita coal capacity is the coal capacity divided by the total population of the corresponding country. Total coal consumption is the annual coal usage in all sectors (including electricity, industrial, and residential use; units in quadrillion Btu) in a given country. Capacity data were derived from the Utility Data Institute World Electric Power Plants Data Base (WEPP); we merged the WEPP database with incidence data by country and year. After matching, a total of 83 countries were included in the study.
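The derived exposure measures defined above are simple arithmetic on coal capacity, plant capacity, and population. The following is a minimal sketch of those definitions; the function name and the input values are hypothetical and chosen only for illustration, not taken from the WEPP data.

```python
def derived_capacity_measures(coal_gw, plant_gw, population):
    """Compute the derived measures defined in the text.

    coal_gw and plant_gw are capacities in gigawatts; population is a head count.
    The function name and argument names are illustrative assumptions.
    """
    non_coal_gw = plant_gw - coal_gw              # plant capacity minus coal capacity
    coal_percentage = 100.0 * coal_gw / plant_gw  # ratio of coal to plant capacity, in %
    per_capita_coal_kw = coal_gw * 1e6 / population  # 1 GW = 1e6 KW
    return {
        "non_coal_gw": non_coal_gw,
        "coal_percentage": coal_percentage,
        "per_capita_coal_kw": per_capita_coal_kw,
    }


# Hypothetical country: 20 GW of coal, 50 GW in total, 40 million people.
example = derived_capacity_measures(coal_gw=20.0, plant_gw=50.0, population=40_000_000)
print(example)
```

Expressing per capita coal capacity in KW per person matches the unit used for the relative risk estimates reported later.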
We collected data on covariates of smoking prevalence, economic indexes, industrial indexes, and traffic indexes for each country. Annual smoking prevalence within each country was estimated, sex- and age-adjusted . Per capita gross domestic product adjusted for purchasing power parity [GDP(PPP)] and inflation to base year 2011 USD was used to capture the country’s standard of living and healthcare level . The indicator of CO2 emissions only from manufacturing industries and construction (% of total fuel combustion) was used to characterize industrialization . Traffic index, or the level of urbanization, measured as the proportion of a country’s population living in urban areas, was applied to capture air pollutants emitted from all mechanical vehicles and public transports . The missing data in North Korea and Taiwan were obtained from supplementary sources [25, 26].
The longitudinal model for which we predict lung cancer incidence is the following Poisson regression:
where index i denotes the country, t denotes the year, and T is the assumed lag before per capita coal capacity affects the current lung cancer incidence rate λ_it. For completeness, we consider three lags of T = 5, 10, and 15 years for coal capacity and assume an adequate lag of 10 years for smoking and for the other covariates, except per capita GDP.
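The displayed model equation is not present in this copy of the text. One log-linear form consistent with the description, offered as a sketch and not as the authors' exact specification, is shown below. The coefficient symbols, the symbol C for per capita coal capacity, S for smoking prevalence, and the grouping of the remaining adjustment covariates into a vector Z are all notational assumptions.

```latex
\log \lambda_{it} \;=\; \beta_0 \;+\; \beta_1\, C_{i,\,t-T} \;+\; \beta_2\, S_{i,\,t-10} \;+\; \boldsymbol{\gamma}^{\top} \mathbf{Z}_{it},
\qquad T \in \{5,\ 10,\ 15\}
```

Under a form like this, exp(β_1) is the relative risk of lung cancer per 1 KW increase in per capita coal capacity.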
The model stated above is a marginal model; specifically, we are not concerned with how the effect varies across individual countries, but rather with the “overall” effect averaged over all countries. We must, however, account for within-country correlation across the years, which generalized estimating equations (GEE) are well suited to handle. GEE’s strengths lie in its semiparametric properties: assuming no residual confounding or other sources of bias, GEE produces consistent estimates of the beta coefficients regardless of the within-country correlation structure specified, although a specification closer to the true correlation structure leads to lower standard errors.
The GEE fit was performed using the geepack package within R version 3.2.5 to estimate the effect of the selected covariates on standardized lung cancer incidence. We use an independence correlation structure, and fit for males and females separately, each weighted by the corresponding male and female populations. Figures were also drawn in R version 3.2.5.
To investigate the possibility that general health improvements correlated with coal capacity may obscure our lung-cancer results, we identify colorectal and anal cancer (CRC) as falsification outcomes. Although one study reported the possible association between CRC mortality and NO2 , results from the other studies suggested negative or inconclusive association between PM and CRC [30, 31]. CRC was coded as B093, B094, 153 or 154 in ICD-9; and C18 to C21 in ICD-10 . We applied the same models to CRC to examine any association with coal capacity.
Burden of diseases analysis
We estimate the population attributable fraction (PAF) of lung cancer to coal-fired power plants in 2015 and predict the PAF in 2025 among studied countries. The PAF is the proportion of lung cancer incidence attributable to anthropogenic coal capacity. Detailed step-by-step calculations are summarized in the GBD study and our previous work. Briefly, to calculate PAF_it, the PAF for country i in year t, we need the quantity RR_it, the relative risk of lung cancer incidence given coal capacity at year t − 10, holding all other covariates, including smoking, fixed. This can be deduced immediately from our data analysis portion (10-year-lag model) using the relationship
where RR_0 is the relative risk for every KW/capita unit increase in lag 10 coal capacity (1.585 for males, 1.851 for females) as we obtained from the 10-year-lag model (Table 2). P_{i,t−10} is the proportion of males or females. PAF_it is useful, because we can then calculate the standardized attributable cases:
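The two displayed equations referred to above are missing from this copy. The first relation below follows from a log-linear model and agrees with the numerical check later in the text, which raises the per-unit relative risk to the power of the change in per capita coal capacity. The PAF and attributable-case expressions are assumptions: the PAF is written in the standard Levin form, and the case count as PAF times the standardized incidence rate times a population N, where C and N are symbols introduced here for illustration.

```latex
\mathrm{RR}_{it} = \mathrm{RR}_0^{\,C_{i,\,t-10}},
\qquad
\mathrm{PAF}_{it} = \frac{P_{i,\,t-10}\,\bigl(\mathrm{RR}_{it}-1\bigr)}{P_{i,\,t-10}\,\bigl(\mathrm{RR}_{it}-1\bigr)+1},
\qquad
\text{attributable cases}_{it} = \mathrm{PAF}_{it}\times\lambda_{it}\times N_{it}
```

The exact forms used by the authors are given in the GBD study and the earlier work they cite, and may differ from this sketch.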
Coal capacities were calculated from a total of 13,581 generating units among 83 countries. All countries have complete 17-year follow-up data from 2000 to 2016. Coal capacities at four time points (years 2000, 2005, 2010, 2015) are mapped in Fig. 1. Coal capacity varied widely both within and between countries across time. Additional file 2: Figure S1 shows coal capacity, plant capacity, coal percentage and total coal consumption of the top 5 countries with the highest levels of coal capacity in the world: China, Germany, Russia, the United Kingdom (UK), and the United States (US). Coal capacity in China has exceeded the sum of the other four countries for many years, reaching 434.87 GW after 2006. China caught up to the US in terms of plant capacity after 2013. Also, the coal percentage in China (65%~75%) was significantly higher than in the other four countries, which reflects the fundamental difference in energy matrices between countries (Additional file 3: Table S2).
Table 1 displays the mean and 95% confidence intervals of all covariates during the three periods of 2000~2004, 2005~2010 and 2011~2016; note that these summaries are averaged over countries and time; obtained from empirical data without any distribution assumptions. From the first period to the last, average age-standardized incidence rates from lung cancer decreased by 46 (i.e., from 454 to 408) per hundred thousand (10%) in males but increased by 12 (i.e., from 143 to 155) per hundred thousand (8%) in females. Coal capacity increased from 16 GW to 23 GW. Smoking prevalence decreased by 9% in males and 11% in females, respectively.
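The percentage changes quoted above can be reproduced from the period averages given in the text. The following is a small sketch of that arithmetic; the helper name is an assumption, and the inputs are the rounded values reported in this article.

```python
def percent_change(old, new):
    """Relative change from old to new, in percent."""
    return 100.0 * (new - old) / old


# Age-standardized lung cancer incidence, first period vs last period (from the text).
male_change = percent_change(454, 408)    # about -10.1, i.e. a 10% decrease
female_change = percent_change(143, 155)  # about +8.4, i.e. an 8% increase

# Average coal capacity in GW, first period vs last period (from the abstract).
coal_ratio = 22.82 / 16.01                # about 1.43-fold

print(round(male_change, 1), round(female_change, 1), round(coal_ratio, 2))
```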
Figure 2 (males) and Fig. 3 (females) show the relationship between 10-year-lag log coal capacity and log incidence rates of lung cancer in 2000, 2005, 2010 and 2015. Among both sexes, coal capacity was significantly positively correlated with lung cancer incidence rate (male, slopes = 0.10 to 0.13, all p-values < 0.05; females, slopes = 0.09 to 0.11, all p-values < 0.05).
Figs. 2 and 3. Y axis: ln(lung cancer incidence rate), unit: ln(cases per 100,000); x axis: ln(coal capacity), unit: ln(MW); smoking prevalence, unit: %.
Univariable, behavior-environmental, 5-year-lag, 10-year-lag and 15-year-lag models were applied to examine the effect among males and females, respectively (Table 2). Longer smoking lags of 20 and 30 years were also applied as a sensitivity analysis (Additional file 1: Table S3). The point estimates of per capita coal capacity among the year-lag models were similar, so we picked the 10-year-lag model as our primary model. With a 1 KW increase in coal capacity per person in a country, the relative risk of lung cancer increases by 58.5% (95% CI = 7%~135%) among males and 85% (95% CI = 22%~182%) among females. Meanwhile, a 1% increase in smoking prevalence is associated with an increase in lung cancer incidence of 3% (95% CI = 1%~5%) and 2% (95% CI = 0%~5%) among males and females, respectively.
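The percentages above are relative risks restated as percent increases. The following is a minimal sketch of that conversion, using the 10-year-lag relative risks of 1.585 (males) and 1.851 (females) given in the methods; the function names are assumptions introduced for illustration.

```python
import math


def rr_to_percent_increase(rr):
    """Express a relative risk as a percent increase over baseline."""
    return (rr - 1.0) * 100.0


def percent_increase_to_rr(percent):
    """Inverse conversion: a percent increase back to a relative risk."""
    return 1.0 + percent / 100.0


male_pct = rr_to_percent_increase(1.585)    # 58.5% increase per 1 KW per person
female_pct = rr_to_percent_increase(1.851)  # 85.1% increase per 1 KW per person

# The reported male interval of 7%~135% corresponds to relative risks of 1.07 and 2.35.
male_ci_rr = (percent_increase_to_rr(7.0), percent_increase_to_rr(135.0))

# On the log scale used by a Poisson regression, the coefficient is log(RR).
male_beta = math.log(1.585)

print(round(male_pct, 1), round(female_pct, 1), male_ci_rr, round(male_beta, 3))
```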
No statistically significant interactions between smoking and coal capacity, or any other time-varying effects on the estimates, were discovered, and thus these results were omitted. In the falsification test, coal capacity was not associated with CRC incidence rates in either males or females for any lag model (Additional file 1: Table S4).
Additional file 4: Table S5 presents the PAFs and standardized lung cancer cases attributable to coal-fired power plants among males and females, respectively, in 2015 and 2025. PAFs are higher for females than males in most countries due to higher RRs. Australia (39.26%) and the US (32.65%) had the highest PAFs in 2015, corresponding to more than ten thousand and 233 thousand standardized lung cancer cases among females, respectively. In China, we estimated more than 347 thousand (range = 341,000~355,000) standardized lung cancer cases among females (PAF = 19%) and 786,000 (range = 769,000~803,000) among males (PAF = 15%) in 2025, based on the different fertility scenarios estimated by the UN.
Calculating per capita coal capacity as a determinant of lung cancer is a novel approach and should be interpreted differently from PM as seen in most studies. Firstly, per capita coal capacity can be regarded as the averaged individual energy consumption from coal for every citizen within a country, and thus may provide a more meaningful approach to energy policy than PM. As countries compose their Intended Nationally Determined Contributions (INDC) goals for the coming decades, an analysis of reducing construction of or shutting down existing coal power plants may reveal further co-benefits of mitigating global warming and adverse health outcomes. Secondly, since not all pollutants related to lung cancer are known, and known pollutants compose a small fraction of PM2.5, per capita coal capacity could serve as a better estimate of externality than pollutant composition measurements. Pollutants such as SOx, NOx, and heavy metals have been associated with lung cancer in previous studies. Thirdly, although capacity factors varied among countries, the range of capacity factors was approximately 40–60%; this indicates that the quantity of coal combustion remained fixed after a plant was built. Finally, coal prices in a local market reflect coal quality. Although coal quality might vary between countries, it remains constant within a plant across time. Country-specific effects, such as coal quality, are marginalized out by GEE in the analysis. By weighting the model by country population, we approximate individual-level data by assigning the aggregated mean value of per capita coal capacity to each individual.
The association between per capita coal capacity and lung cancer incidence can be used to understand the potential number of lives affected by different levels of reliance on coal power. In 2015, we estimate that a total of 865,805 male and 542,848 female standardized lung cancer cases can be attributed to anthropogenic power plants using coal as the primary energy source. There is little difference between the lag 5 and lag 10 models in terms of quasi-likelihood information criterion (QIC) and coefficients, and a longer latency period for smoking also yields similar results. Therefore, for the sake of consistency with the other covariates, we fix lag 10 for coal capacity as the primary model and estimate PAFs. These numbers should be interpreted as the total attributable cases given that every country has the WHO 2000–2025 standard population, and should not be compared directly to other estimates. However, these numbers adjust for age distributions in different countries and can be a valuable tool for country-to-country comparisons of the effect of coal capacity.
These estimates are comparable with prior reports but should be interpreted differently. The Global Burden of Disease group estimated that ambient air pollution globally caused 278.29 thousand lung cancer deaths for males in 2015. WHO suggested a total of three million deaths were attributable to ambient air pollution in 2012 based on PM2.5 measurement. However, the method used here is barely linked to PM2.5 or its components. The Health and Environment Alliance estimated a total of 22,900 premature all-cause deaths due to coal-fired power plants in the EU in 2013. The present study provides a direct approach for calculating health effects attributable to coal capacity at the national level.
The model also provides a hint of the effect sizes of coal-fired power plants and smoking prevalence. Comparing 2005 to 2015 in the US, 10-year-lag coal capacity increased from 321.06 GW to 322.29 GW, corresponding to an increase of 0.12 KW/person. Meanwhile, 10-year-lag smoking prevalence decreased by 3.50% among males (data not shown). The increased per capita coal capacity is associated with a higher risk of lung cancer by a factor of 5.68% (= 1.59^0.12), while the decreasing smoking prevalence lowered the risk by a factor of 11.28% (= 1.03^3.50). This is meant as a quick numerical check; however, one should not try to surmise any statistical results from this.
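The quick check above raises a per-unit relative risk to the power of the change in the covariate. The following is a minimal sketch of that calculation; the function name is an assumption, the coal relative risk is the 10-year-lag male estimate of 1.585, and the smoking relative risk is the rounded 1.03 quoted in the results. With the rounded smoking value the result comes out near 10.9% instead of the reported 11.28%, presumably because the authors used an unrounded coefficient; that explanation is a guess.

```python
def percent_risk_change(rr_per_unit, delta):
    """Percent change in risk for a covariate change of `delta` units,
    given the relative risk per one-unit increase (log-linear model)."""
    return (rr_per_unit ** delta - 1.0) * 100.0


# Coal: 0.12 KW/person increase, RR of 1.585 per KW/person (males, 10-year lag).
coal_effect = percent_risk_change(1.585, 0.12)   # roughly 5.7%

# Smoking: 3.50 percentage-point change, RR of about 1.03 per point (rounded).
smoking_effect = percent_risk_change(1.03, 3.50)  # roughly 10.9% with the rounded RR

print(round(coal_effect, 2), round(smoking_effect, 2))
```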
Despite the ecological study design, the biological plausibility of our results, the lack of any association in the falsification analysis, and the consistency of our estimates with those from previous investigations indicate that a strong impact of ecologic bias is very unlikely. Moreover, our analysis of aggregated data is meant to inform policy decisions at the national level and to support international comparison. Other factors related to the ecological design that may lead to overestimation or underestimation should also be considered. To address concerns about data quality and other country-specific biases, we fitted a Poisson regression longitudinal model with GEE to account for time-independent confounders such as underreporting and/or over-diagnosis of diseases. GEE is a semiparametric technique in that it makes no assumptions about the correlation structure among outcomes. One disadvantage of GEE is a potential loss of efficiency compared to mixed models, had we been able to specify the true correlation structure correctly in a parametric form. However, we are willing to sacrifice some efficiency for statistical robustness, a property GEE possesses while mixed models do not. Regardless, this disadvantage would be germane had we failed to reject the hypothesis that coal capacity has a null effect on lung cancer, but since we did reject it, fitting a correctly specified mixed model would only serve to increase the significance of the effect.
Our identified confounders associated with both coal capacity and lung cancer at the national level included adjustments for the appropriate latency period and strong temporality justifications for causal inference. However, residual and unmeasured confounders, such as national-level educational attainment or occupational exposure, may exist; adding more parameters to our analysis would destabilize estimates and cause loss of statistical power. Potential misclassifications of meteorological factors, such as wind direction, and/or geographical factors cannot be adjusted for in our model. Since neither the electricity matrix nor meteorological/geographical factors are relevant to a country’s healthcare system, misclassification is non-differential and more likely biases toward the null. Potential misclassifications of lung cancer diagnosis across countries must also be considered, even though the GBD study is the best available data we can obtain. The GBD study does not provide incidence by type of lung cancer for country-to-country comparison. Both adenocarcinoma and squamous cell carcinoma [46, 47] of the lung might be associated with environmental factors. Further studies focusing on different types of cancer and coal-fired power plants should be conducted.
Our estimates may be conservative since not all time-varying covariates were considered in our model, such as indoor biomass combustion [48,49,50,51]. Although most countries included in this study were high-income countries with a limited proportion of indoor biomass combustion, the true effect of coal power plants might be even higher if biomass combustion remained constant rather than decreasing. We adjusted for total coal consumption in the model, which included indoor combustion. Not considering the control technologies in place at coal-fired power plants might lead to misclassification of the exposure level. Previous studies have shown that a 10% national reduction in SOx emissions was associated with lower CVD incidence rates, by 0.28% for males and 1.69% for females. Further studies should address the effectiveness in terms of lung cancer incidence. Finally, although smoking is unlikely to be a confounder at the national level (due to lack of association with coal capacity), we are still interested in the nuanced differences in smoking prevalence and included it in the model since it might be collinear with uncontrolled confounding from occupational exposures. Differences might exist by age, heavy or light smoking, and/or synergistic effects between tobacco smoking and environmental exposure.
We demonstrated an association between lung cancer incidence and coal-fired power plants via a novel approach that measures per capita coal capacity rather than PM. The study may be helpful in addressing a key policy question about the externality cost of coal power plants and estimates of the global disease burden from preventable lung cancer attributable to coal-fired power plants. Further studies might focus on the effectiveness of pollutant controls on health outcomes, quality of coal, synergistic effects between tobacco smoking and environmental exposure, and the financial burden of coal on healthcare expenditures.
International Energy Agency. World Balance: IEA Sankey Diagram; 2017. Available from: https://www.iea.org/Sankey/.
Dockery DW, Pope CA 3rd, Xu X, Spengler JD, Ware JH, Fay ME, et al. An association between air pollution and mortality in six U.S. cities. N Engl J Med. 1993;329(24):1753–9.
Cui P, Huang Y, Han J, Song F, Chen K. Ambient particulate matter and lung cancer incidence and mortality: a meta-analysis of prospective studies. Eur J Pub Health. 2015;25(2):324–9.
Grant WB. Air pollution in relation to U.S. cancer mortality rates: an ecological study; likely role of carbonaceous aerosols and polycyclic aromatic hydrocarbons. Anticancer Res. 2009;29(9):3537–45.
Miller KA, Siscovick DS, Sheppard L, Shepherd K, Sullivan JH, Anderson GL, et al. Long-term exposure to air pollution and incidence of cardiovascular events in women. N Engl J Med. 2007;356(5):447–58.
Pope CA 3rd, Ezzati M, Dockery DW. Fine-particulate air pollution and life expectancy in the United States. N Engl J Med. 2009;360(4):376–86.
Hu Z, Rao KR. Particulate air pollution and chronic ischemic heart disease in the eastern United States: a county level ecological study using satellite aerosol data. Environ Health. 2009;8:26.
Hu Z. Spatial analysis of MODIS aerosol optical depth, PM2.5, and chronic coronary heart disease. Int J Health Geogr. 2009;8:27.
International Agency for Research on Cancer (IARC). IARC Monogr Eval Carcinog Risks Hum; 2016. Available from: https://monographs.iarc.fr/wp-content/uploads/2018/06/Table4.pdf.
Global Burden of Disease Collaborative Network. Global Burden of Disease Study 2016 (GBD 2016) Results. Institute for Health Metrics and Evaluation (IHME); 2016. Available from: http://ghdx.healthdata.org/gbd-results-tool.
Jerrett M, Burnett RT, Ma R, Pope CA 3rd, Krewski D, Newbold KB, et al. Spatial analysis of air pollution and mortality in Los Angeles. Epidemiology. 2005;16(6):727–36.
U.S. EPA. Integrated science assessment (ISA) for particulate matter (final report, Dec 2009). Washington, DC; 2009.
Padula AM, Mortimer K, Hubbard A, Lurmann F, Jerrett M, Tager IB. Exposure to traffic-related air pollution during pregnancy and term low birth weight: estimation of causal associations in a semiparametric model. Am J Epidemiol. 2012;176(9):815–24.
Sarah Penney JB, John Balbus. Estimating the health impacts of coal-fired power plants receiving international financing. Environmental Defense Fund; 2009.
Markandya A, Wilkinson P. Electricity generation and health. Lancet. 2007;370(9591):979–90.
William M. Hodan WRB. Evaluating the contribution of PM2.5 precursor gases and re-entrained road emissions to Mobile source PM2.5 particulate matter emissions. MACTEC Federal Programs.
Harrison RM, Yin J. Particulate matter in the atmosphere: which particle properties are important for its effects on health? Sci Total Environ. 2000;249(1–3):85–101.
National Aeronautics and Space Administration. New Map Offers a Global View of Health-Sapping Air Pollution; 2010. Available from: https://www.nasa.gov/topics/earth/features/health-sapping.html.
United Nations Statistics Division. Methodology: Standard country or area codes for statistical use (M49). Available from: https://unstats.un.org/unsd/methodology/m49/.
Omar B. Ahmad CB-P, Alan D. Lopez, Christopher JL Murray, Rafael Lozano, Mie Inoue. Age standardization of rates: a new WHO standard. Geneva, World Health Organization; 2001.
US Energy Information Administration. Primary Coal Consumption; 2015. Available from: http://www.eia.gov/cfapps/ipdbproject/IEDIndex3.cfm?tid=1&pid=1&aid=2.
UDI World Electric Power Plants Database (WEPP). World Electric Power Plants Database; 2016. Available from: https://www.spglobal.com/platts/en/our-methodologies/survey.
Ng M, Freeman MK, Fleming TD, Robinson M, Dwyer-Lindgren L, Thomson B, et al. Smoking prevalence and cigarette consumption in 187 countries, 1980-2012. JAMA. 2014;311(2):183–92.
The World Bank. World Development Indicators; 2016. Available from: http://data.worldbank.org/data-catalog/world-development-indicators.
Groningen Growth and Development Centre, Faculty of Economics and Business. The Database Penn World Table version 9.0; 2016.
National Statistics Taiwan. Gross Domestic Product by Kind of Activity and Implicit Price Deflators; 2016. Available from: https://eng.stat.gov.tw/ct.asp?xItem=37408&CtNode=5347&mp=5
Ezzati M, Lopez AD. Estimates of global mortality attributable to smoking in 2000. Lancet. 2003;362(9387):847–52.
Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73(1):10.
Turner MC, Krewski D, Diver WR, Pope CA 3rd, Burnett RT, Jerrett M, et al. Ambient air pollution and Cancer mortality in the Cancer prevention study II. Environ Health Perspect. 2017;125(8):087013.
Ancona C, Badaloni C, Mataloni F, Bolignano A, Bucci S, Cesaroni G, et al. Mortality and morbidity in a population exposed to multiple sources of air pollution: a retrospective cohort study using air dispersion models. Environ Res. 2015;137:467–74.
Wong CM, Tsang H, Lai HK, Thomas GN, Lam KB, Chan KP, et al. Cancer mortality risks from long-term exposure to ambient fine particle. Cancer Epidemiol Biomark Prev. 2016;25(5):839–45.
Lee LJ, Lin CK, Hung MC, Wang JD. Impact of work-related cancers in Taiwan-estimation with QALY (quality-adjusted life year) and healthcare costs. Prev Med Rep. 2016;4:87–93.
Buonocore JJ, Lambert KF, Burtraw D, Sekar S, Driscoll CT. An analysis of costs and health co-benefits for a U.S. power plant carbon standard. PLoS One. 2016;11(6):e0156308.
Ellingsen DG, Andersen A, Nordhagen HP, Efskind J, Kjuus H. Incidence of cancer and mortality among workers exposed to mercury vapour in the Norwegian chloralkali industry. Br J Ind Med. 1993;50(10):875–80.
Kwon A. Electric generator capacity factors vary widely across the world. US Energy Information Administration; 2015.
Mernier A. Putting a Price on Energy: International Coal Pricing. Energy Charter Secretariat; 2010.
Pan W. Akaike's information criterion in generalized estimating equations. Biometrics. 2001;57(1):120–5.
Global Burden of Disease Collaborative Network. Global burden of disease study 2016 (GBD 2016) results. Seattle, United States: Institute for Health Metrics and Evaluation (IHME); 2017.
World Health Organization. Ambient Air: A global Assessment of Exposure and Burden of Disease. 2016.
Dave Jones JH, Lauri Myllyvirta, Rosa Gierens, Joanna Flisowska, Kathrin Gutmann, Darek Urbaniak, Sarah Azau. Europe's Dark Cloud. WWF European Policy Office: WWF European Policy Office, Sandbag, CAN Europe and HEAL in Brussels, Belgium.; 2016.
Robinson WS. Ecological correlations and the behavior of individuals. Int J Epidemiol. 2009;38(2):337–41.
Idrovo AJ. Three criteria for ecological fallacy. Environ Health Perspect. 2011;119(8):A332.
John E. Overall ST. Robustness of generalized estimating equation (GEE) tests of significance against misspecification of the error structure model. Biom J. 2004;46(2):11.
Hill AB. The environment and disease: association or causation? Proc R Soc Med. 1965;58:295–300.
Gharibvand L, Lawrence Beeson W, Shavlik D, Knutsen R, Ghamsary M, Soret S, et al. The association between ambient fine particulate matter and incident adenocarcinoma subtype of lung cancer. Environ Health. 2017;16(1):71.
Tseng CY, Huang YC, Su SY, Huang JY, Lai CH, Lung CC, et al. Cell type specificity of female lung cancer associated with sulfur dioxide from air pollutants in Taiwan: an ecological study. BMC Public Health. 2012;12:4.
Lamichhane DK, Kim HC, Choi CM, Shin MH, Shim YM, Leem JH, et al. Lung Cancer risk and residential exposure to air pollution: a Korean population-based case-control study. Yonsei Med J. 2017;58(6):1111–8.
Richard Hosier JD. Household fuel choice in Zimbabwe: an empirical test of the energy ladder hypothesis. Resour Energy. 1987;4(4):15.
Seow WJ, Hu W, Vermeulen R, Hosgood Iii HD, Downward GS, Chapman RS, et al. Household air pollution and lung cancer in China: a review of studies in Xuanwei. Chin J Cancer. 2014;33(10):471–5.
Kim C, Gao YT, Xiang YB, Barone-Adesi F, Zhang Y, Hosgood HD, et al. Home kitchen ventilation, cooking fuels, and lung cancer risk in a prospective cohort of never smoking women in Shanghai, China. Int J Cancer. 2015;136(3):632–8.
Lui KH, Dai WT, Chan CS, Tian L, Ning BF, Zhou Y, et al. Cancer risk from gaseous carbonyl compounds in indoor environment generated from household coal combustion in Xuanwei, China. Environ Sci Pollut Res Int. 2017;24(21):17500–10.
Lin CK, Lin RT, Chen PC, Wang P, De Marcellis-Warin N, Zigler C, et al. A global perspective on sulfur oxide controls in coal-fired power plants and cardiovascular disease. Sci Rep. 2018;8(1):2611.
We acknowledge the support of Graduate Consortium, Harvard University Center for the Environmental Health. We are also sincerely grateful to Pi-Cheng Chen and He Lin for their important input at the initial stage of study design, and to Professor Stefanos Kales for in-depth discussion and advice.
Availability of data and materials
The datasets supporting the conclusions of this article are included within the article and its additional files.
This research received no financial support from any agency in the public, commercial, or not-for-profit sectors. No funding source or external agency played any role in study design, data collection, data analysis, data interpretation, or writing of this report.
Ethics approval and consent to participate
The study was reviewed and approved by the Harvard T.H. Chan School of Public Health Office of Human Research Administration (IRB protocol #: IRB16–21). No individual data were used in the research.
Consent for publication
We declare that we have no conflict of interest.
Table S1. Countries included in the analysis, by geographical region a (N = 83). Table S3. Relative risk (RR) and 95% confidence intervals (CIs) of the increase in lung cancer incidence with change in coal capacity, among males and females. Table S4. Relative risk (RR) and 95% confidence intervals (CIs) of the increase in colorectal cancer with change in coal capacity, adjusted for different variables in different models among males and females. (DOCX 23 kb)
Figure S1. Coal capacity, plant capacity, coal percentage and total coal consumption of the top 5 countries with the highest levels of coal capacity in the world. (JPG 755 kb)
Table S2. Estimated population attributable factors (2015, 2025) and standardized attributable cases (2015) among males and females of studied countries. (XLSX 40 kb)
Table S5. National cancer incidence of lung, colorectum, population, smoking prevalence by gender and coal capacity in 2000, 2005, 2010, and 2015. (XLSX 44 kb)