Skip to main content

The impact of new research technologies on our understanding of environmental causes of disease: the concept of clinical vulnerability


In spite of decades of epidemiological research, the etiology and causal patterns for many common diseases, such as breast and colon cancer or neurodegenerative diseases, are still largely unknown. Such chronic diseases are likely to have an environmental origin. However, "environmental" risks have been often elusive in epidemiological studies. This is a conundrum for current epidemiological research. On the other side, the relative contribution of genes to chronic diseases, as emerging from GWAS, seems to be modest (15-50% increase in disease risk). What is yet to be explored extensively is a model of disease based on long-term effects of low doses of environmental exposures, incorporating both genetic and acquired susceptibility ("clinical vulnerability"), and the cumulative effects of different exposures. Such a disease model would be compatible with the weak associations found by GWAS and the still elusive role of many (low-level) environmental exposures. We also propose that the introduction of "-omic" high-throughput technologies, such as transcriptomics, proteomics and metabolomics, may provide, in the next years, powerful tools to investigate early effects of environmental exposures and understand the etiology of common diseases better, according to the "clinical vulnerability model". The development of "-omics", in spite of current limitations and lack of sound validation, could greatly contribute to the elucidation of the disease model we propose.

Peer Review reports


We are still far from completely understanding the etiology and causal patterns for many common diseases, such as breast and colon cancer, or neurodegenerative diseases. In spite of decades of epidemiological research, the etiology of many such conditions is still largely unknown.

For a long time it has been hypothesized that most chronic diseases have an environmental origin (using the term "environment" in a broad sense, to indicate essentially what is not due to genetic predisposition). This claim was based on descriptive data showing the broad range of incidence rates in different parts of the world, the rapid temporal changes - such as those currently occurring in China and India - and the crucial observation of incidence rates in migrant populations. The latter unequivocally showed that migrants rapidly acquire - sometimes already in the first generation after migration - the risk of disease that is typical of the population where they move.

However, "environmental" risks have been largely elusive in epidemiological research, and there is still much debate on the real impact of the environment, whose role tends to be overshadowed in recent times by the successes of genetics, in particular Genome-wide association studies (GWAS). Clearly, we need a disease model that allows us to encompass both the mass of genetic data coming from GWAS and the still elusive role of many environmental exposures.

Genes vs. environment

The technological revolution in genetics, which in 2007-2009 led to the identification of a number of novel genes for several common diseases, is inspiring but at the same time a little disappointing. Most highly-penetrant genes had already been uncovered by linkage studies, so that "genome-wide scans" could mainly identify low-penetrant genes. It is a fact that - with a few exceptions - what has been found is genetic variants weakly associated with chronic diseases, with relative risks of 1.15-1.5. These risks are much lower than some known environmental exposures, and of the same order of magnitude of environmental exposures that are usually rejected as too weakly associated with disease to be meaningful or interpretable. For example, Hung et al have found an allelic relative risk of 1.3 for a gene variant associated with lung cancer, in the putative nicotinic acetylcholine receptor subunit on chromosome 15q25 [1]. The genotype nicely combines with the habit of smoking, increasing the probability of developing lung cancer in smokers (Figure 1) [1]. However, it is clear from the figure that the effect of smoking is by far greater than the effect of the genotype. The former increases the cumulative risk in life from less than 1% (non smokers who have never smoked) to about 15-20% (current smokers), while the effect of the genotype is a small additional amount (~1 to 2%) that does not seem to contribute much to the Public Health problem of lung cancer.

Figure 1
figure 1

Cumulative risk of lung cancer by rs8034191 genotype. Relevance of smoking and of rs8034191 genotype to lung cancer mortality in men aged 45-75 years. Cumulative risk (in the absence of other causes of death) based on national lung cancer death rates for men in Poland in the year 2000, assuming that the prevalence of current smoking, former smoking, and never smoking are as in this study and that the relative risks for lung cancer incidence and mortality are similar (Hung et al 2008) [1].

Of course genetic research is not tailored for Public Health purposes. It can be worthwhile, instead, (a) for the investigation of disease mechanisms, (b) to shed light on potential therapies, or (c) to suggest clues for unidentified etiological agents in the case of diseases whose etiology is still unknown. It is unclear, however, how goals (b) and (c) can be met if the relative contribution of genes is so small (15-50% increase in disease risk) being in the same order of magnitude of relative risks found for unclear and controversial environmental exposures, such as electromagnetic fields and leukemia, or some pesticides and prostate cancer. It is therefore useful to reflect on the potential reasons why weak genetic associations (generally) are perceived as much more noteworthy than environmental associations.

1. One reason - perhaps the main one - is certainly the greater robustness of technical detection of gene variants compared to environmental exposures. It is estimated that the error rate with common genotyping techniques is less than 10%, while it can be as high as 50% for environmental exposure assessment, at least for dichotomous exposure variables. Moreover genes are stable in the course of life, while environmental exposures are changing over time (let us think of diet) leading to more inherent measurement error in environmental exposure than in gene variance assessment.

2. A second obvious reason is replication. In the study mentioned above on lung cancer, the association was found in five independent studies. This usually is not the case for environmental exposures, where replication is not easily achievable. For example, a cohort study from Belgium suggested that high levels of cadmium in urine are associated rather strongly with lung cancer [2]. The study is strong (though small), because exposure assessment is based on a biomarker reflecting long-term exposure (i.e. with much higher accuracy than usually available in epidemiology). However, the study is not easily reproducible, because there are not many other populations with similar levels of environmental exposure to cadmium, and there might be several confounders or effect modifiers that either obscure the association with cadmium or change the strength of the association. This is commonly observed in environmental epidemiology, i.e. a number of small-medium size associations that are difficult to reproduce in different settings, both because of methodological difficulties and because of variable study contexts.

Low doses and acquired susceptibility to disease

In spite of the limitations of environmental epidemiology, in particular of exposure assessment, we have seen recently increasing ability of epidemiological tools to consistently unravel the effects of low-level environmental exposures. The cases of air pollution and environmental tobacco smoke (also called ETS, secondhand smoke or passive smoke) are exemplar. In the seventies and eighties many researchers thought it was impossible to detect plausible causal associations with such low-level exposures. The underlying idea was that "noise" (bias, confounding) was larger than the signal. However, a large number of well-conducted studies have been published since: in the case of ETS more than 60 studies show (with few exceptions) increased risks of lung cancer in the order of 1.25, i.e. the same magnitude as the gene for the nicotine acetylcholine receptor [3]. For air pollution, an association with lung cancer has been reported in six cohort studies. One of these studies is sufficiently large as to show the association in non-smokers [3, 4]. Again, the relative risk is around 1.25. In both cases biomarkers, such as cotinine or NNK for ETS and DNA adducts for air pollution, have contributed to make the association more plausibly causal.

We have thus to acknowledge that ETS and air pollution (mainly due to traffic exhaust) are able to induce, after long-term exposure, chronic diseases such as cancer and also coronary artery disease. The interesting observation is that this happens at dose levels that are much lower than those of tobacco smoke or "classical" carcinogens. For example, exposure to ETS occurs at levels that are 1/100 compared to active smoking [5].

One potential explanation for the effects of low doses and the absence of a threshold is based on acquired susceptibility and the cumulative effects of different exposures. As defined by Rothman and Greenland, "the cause of a disease event is an antecedent event, condition or characteristic that was necessary (given that all other conditions are fixed) for the occurrence of the disease at the moment it occurred" [6]. Said in other words, a cause can also be viewed as something that "completes an incomplete causal chain" [7] or precipitates a chain of events, which creates a state of vulnerability. This concept of "clinical vulnerability" is expressed in Figure 2[8]. Exposure to low levels of e.g. ETS or air pollution is not a "cause" of cancer in itself (like an accident is the cause of a death), but because it occurs on top of pre-existing vulnerability. This could well explain why small changes in environmental exposures can have big effects, if they occur in a population of vulnerable subjects.

Figure 2
figure 2

Vulnerability plus exposure events change the physiological state, but a reserve is present. When the reserve is overtaken, clinical manifestations appear. From,21, courtesy of R Anderson.

Vulnerability can be acquired or genetically-based. The concept of acquired "clinical vulnerability" is related to previous insults/pathophysiological changes that predispose to disease, as expressed in Figure 2. An example is the finding of a greater effect of ETS among ex-smokers compared to non smokers who have never smoked in a large prospective investigation [9]. It is plausible that ex-smokers have a greater vulnerability because of already existing mutations or epigenetic changes, so that further exposure to ETS leads to selection and clonal expansion of mutated cells.

Another type of vulnerability (more often called hypersusceptibility) is genetically determined. Many years ago we showed that subjects with the genetically based NAT2 slow acetylator genotype could have greater susceptibility to adverse biological and/or health effects by tobacco smoke related arylamines at lower levels of exposure rather than at higher levels [10]. Our reasoning was that on very rare occasions, e.g. among people exposed to high doses of potent carcinogens, the whole population or a vast majority develops cancer. This is what happened among British chemical workers exposed to 2-naphthylamine in the 1950s and before. For example, all 15 workers exposed to 2-naphthylamine in a plant developed bladder cancer, probably one of the few examples of a "sufficient" exposure in the history of carcinogenesis [11]. It is clear that in that case genetic susceptibility was totally irrelevant. In most cases, however, when exposure levels are lower or much lower, genetic susceptibility can play a role in modulating individual risk.

The contribution of intermediate biomarkers and '-omics'

The concept of acquired "clinical vulnerability" is related to previous insults/pathophysiological changes that predispose to disease. Intermediate markers and specifically '-omics' could be particularly useful in tracing the "history" of such insults and in reflecting the cumulative effect of different exposures.

Intermediate Biomarkers

Intermediate biomarkers directly or indirectly represent events on the continuum between exposure and disease. Intermediate biomarkers can provide important mechanistic insight into the pathogenesis of environmental diseases. As such, they complement classic epidemiological studies that use disease endpoints. Examples of early biologic effect biomarkers include measures of cellular toxicity, chromosomal alterations, DNA, RNA and protein expression, and early non-neoplastic alterations in cell function (e.g., altered DNA repair, altered immune function). For maximum utility, an intermediate biomarker must be shown to be predictive of developing disease. The criteria for validating intermediate biomarkers have been discussed by Schatzkin and colleagues [12, 13] and focus on the calculation of the etiologic fraction of the intermediate endpoint, which varies from 0 to 1. The closer the etiologic fraction is to 1, the more the biologic marker reflects events, either directly or indirectly, on the causal pathway to disease. For instance, bone-marrow toxicity has been linked to future risk of leukemia. Therefore, environmental exposures that lead to bone-marrow toxicity are potentially leukemogens. Bone-marrow toxicity can be measured by studying peripheral blood cell counts including circulating progenitor cells. Studies based on such endpoints have the advantage that the biological marker can be measured quantitatively and therefore increase the power as compared to dichotomous (rare) disease endpoints. Furthermore, these studies can be carried out using cross-sectional or semi-longitudinal study designs in which exposure can be accurately assessed. An example of the use of intermediate markers is a recent study on hematological effects of benzene at low levels of exposure [14]. In this study peripheral blood cells and progenitor cell colony formation were significantly decreased among low exposed individuals in a dose-dependent matter. Moreover, these effects occurred at exposure levels that were lower than previously thought safe [14].


"-Omics" tools can be directly applied to samples from an epidemiologic case-control or cohort study to better characterise intermediate pathways, potentially providing the 'missing links' among exposures, genes, and diseases [15]. The term "-omics" generally refers to the rigorous study of a complete set of biological molecules with high-throughput techniques [16]. One of the main hallmarks is the departure from traditional molecular biology focusing on a single biological structure (e.g. a single gene or protein) to a more comprehensive analysis of biological systems.

Cells and tissues alter their metabolism or gene expression in response to exposure as indicated by, for example, the induction of specific groups of genes in response to heat shock, hypoxia, or osmotic stress [1719]. These responses can produce patterns of specific changes in gene expression, proteins, or metabolic profiles that reflect exposure to a particular agent or class of agents which can be interpreted as a "history" of such exposures and might reflect the cumulative effect of different exposures. Some of the "-omic" technologies in environmental health that offer new opportunities to detect and quantify these changes are discussed below (also see table 1). We propose some examples of uses of "-omics" in environmental health, mainly to elucidate which kind of contributions we may expect.

Table 1 "-omics" in Environmental Health (except genomics).


Transcriptomics is a mature technology, however still strongly constrained by the intrinsic heterogeneity and instability of mRNA. It focuses on gene expression profiling, which can be used to determine which genes are differently expressed as a result of changes in environmental conditions. A study typically involves a group of individuals with similar exposure level or phenotype (e.g. disease status) and compares the gene expression profile of this group to the profile of a reference group. To avoid the problems of large interindividual variations compared with the small changes caused by environmental exposure, Wang et al conducted a self-controlled study design, with measurements made before and after individual exposure to welding fumes. The group was stratified according to smoking status (which profoundly affected the expression profiles). Nonsmokers after exposure to welding fumes exhibited altered gene expression in 35 genes from eight functional pathways, including processes related to oxidative stress, proinflammatory responses, phosphate metabolism, cell proliferation, and apoptosis [20].

A study by Lu et al used a human cancer cDNA expression array to detect differential gene expression patterns determined in liver tissue samples from arsenic exposed individuals and healthy liver tissues [21]. Approximately 60 genes (10%) were differentially expressed in arsenic exposed human liver tissues compared to controls. The differentially expressed genes included those involved in cell-cycle regulation, apoptosis, DNA damage response and intermediate filaments. The observed gene alterations appeared to be reflective of hepatic degenerative lesions seen in arsenic-exposed patients.

However, transcriptomic changes are likely to be temporary, i.e. reflect short-term exposures, and unlikely to express the long-term changes that we need in order to investigate chronic diseases.


Epigenomics involves the study of epigenetic processes, which are independent of DNA sequence and are involved in the inhibition of gene expression (gene silencing), on a large scale [2224]. It benefits from the robustness of technologies for genomics. Altered gene silencing plays a causal role in human diseases [2528]. Epigenetic modifications may provide a plausible link between environmental exposures and alterations in gene expression that might lead to disease phenotypes. A large amount of laboratory animal studies already provide evidence that supports the role of environmental epigenetics in disease susceptibility [25]. Epigenomics is mainly based on a few comprehensively studied mechanisms: (a) DNA methylation, which takes place at the carbon-5 position of cytosine in CpG dinucleotides, (b) changes to the chromatin packaging of DNA by post-translational 'histone modification', and more recently (c) changes in RNA interference by non-coding RNAs such as microRNA and siRNA.

In mice, maternal dietary methyl-donor supplementation with folic acid, vitamin B, choline and betaine was shown to result in an increase in DNA methylation at CpG sites, thus demonstrating that the effect of a mother's diet during pregnancy on the adult phenotype of her offspring was directly linked to DNA-methylation changes in the epigenome [25]. A few more studies on maternal dietary exposure, e.g. to phytoestrogen genistein during gestation given at a level comparable to that of human consumption with high soy diets, are particularly interesting as these results suggest the possibility that hypermethylating dietary supplements could reduce the effects of environmental toxicants, which tend to cause DNA hypomethylation [25].

Recent experiments have shown that epigenetic changes are associated with chromatin remodelling and regulation of gene expression that underlie the development of the metabolic syndrome, and to other common diseases like atherosclerosis and type 2 diabetes [29]. Deciphering epigenetic processes should allow us to target the development of new diets and drugs to prevent aberrant gene silencing, which may be involved in resistance to treatment. The importance of DNA methylation in cancer has also been established, and research now includes the mechanisms by which other chromatin modifications play a role in cancer development [30]. In particular, the link between covalent histone modifications and nucleosomal remodeling is increasingly being explored [30, 31].

Also several environmental exposures have been associated with altered levels of gene methylation. In one study, decreased repeated-element methylation in gene LINE-1 was observed after exposure to traffic particles (beta = -0.09; 95% CI, -0.17 to -0.01; P = 0.03) [32]. Also airborne benzene was associated with a significant reduction in LINE-1 (-2.33% for a 10-fold increase in airborne benzene levels; P = 0.009) and AluI (-1.00%; P = 0.027) methylation. Hypermethylation in p15 (+0.35%; P = 0.018) and hypomethylation in MAGE-1 (-0.49%; P = 0.049) were associated with increasing airborne benzene levels [33] In another study, Rusiecki et al found inverse correlations between global methylation levels and POP (Persistent Organic Pollutant) concentrations in the blood of Inuit populations [34]. Linear regressions, adjusting for age and cigarette smoking, showed statistically significant inverse linear relationships mainly for the Alu assay for p, p'-DDT (1,1,1-trichloro-2,2-bis(p-chlorophenyl)ethane; beta = -0.26), p, p'-DDE [1,1-dichloro-2,2-bis(p-chlorophenyl) ethylene; beta = -0.38], beta-hexachlorocyclohexane (beta = -0.48), oxychlordane (beta = -0.32), alpha-chlordane (beta = -0.75), mirex (beta = -0.27), sum of polychlorinated biphenyls (beta = -0.56), and sum of all POPs (beta = -0.48). Linear regressions for the LINE-1 assay showed beta estimates of similar magnitudes to those using the Alu assay, however, none was statistically significant.


The proteome consists of all proteins present in specific cell types or tissues and, in contrast to the genome, it is highly dynamic over time, between cell types and in response to the environment. Proteomics is perhaps the least mature of the "-omic" technologies. Whether agent-specific changes can be identified is a key question to be answered about proteomic approaches (note, the same issue applies to other "-omics", such as metabolomics) and the issue will be resolved only after a much larger body of proteomic data on exposed cells, tissues, and biofluids is collected and analyzed [35]. Proteomic approaches are twofold: the first is the standardisation of high-throughput methods, and the second is to combine extensive protein fractionation, removal of abundant proteins and tandem MS/MS analysis of peptide sequence data to allow protein identification.

Joo et al performed proteomic analysis of plasma proteins among workers exposed to benzene and matched unexposed controls, and associated up-regulation of specific proteins with lymphocyte DNA damage [36]. Using MS and Western blot, they found statistically significant differences in protein profiles between the exposed and unexposed subjects. The specific proteins that were found to be up-regulated in benzene exposed workers were TCR beta, FKBP51 and MMP13. Vermeulen et al examined the impact of benzene on the human serum proteome in a study of benzene- exposed workers and unexposed controls, where protein-expression patterns were analysed by SELDI-TOF MS. They found that three proteins were consistently down-regulated in the exposed group compared to controls and all proteins were highly inversely correlated with individual estimates of benzene exposure (r > 0.75) [37].

Proteomics has been useful in the field of cancer, particularly in the potential discovery of new biomarkers that may improve the diagnosis of prostate and other cancers [38]. Proteomic approaches may also provide insights into the disease mechanism of rheumatic disease [39, 40]. Lopez-Pedrera reviews how recent genomic and proteomic approaches have provided new insights into autoimmune diseases [41]. Obesity, which is caused by a combination of genetic predisposition and environmental factors, is an area where proteomic approaches are also being applied. Barcelo-Batllori and Gomis discuss the application of proteomics in understanding the pathogenesis or the molecular mechanism of obesity [42].

However proteomics is still open to several methodological problems that have not been solved yet. The most controversial example has been in reporting a distinctive pattern of low-molecular weight proteins that characterises the sera of women with early-stage ovarian cancer that is very rarely found in control sera [43]. The study was later refuted by others who offered alternative interpretations of the initial observations [44]. This was a case where the basic epidemiologic principle of treating samples from cases identically to those from controls was possibly not followed, with the apparent differences between cases and controls reflecting artefacts of sample handling [45].


Changes in gene and protein expression can alter metabolism in particular ways that can provide distinct signatures. The metabolome is the biochemical network consisting of small molecules (e.g. lipids, vitamins) also known as metabolites. Metabolites are involved in energy transmission in cells by interacting with other biological molecules following metabolic pathways. The metabolome is highly variable and time dependent and consists of a wide range of different chemical structures. The most important challenge in this field is to acquire qualitative and quantitative information on metabolites that occur under normal circumstances in order to be able to detect perturbations in the complement of metabolites due to changes in environmental factors. Global metabolomic profiling may be more amenable than transcriptomic or proteomic profiling to high-throughput screening, as suggested for example by studies showing that the main yeast metabolome consists of fewer than 600 low-molecular-weight compounds. It should be noted however that the yeast has only 4,000 genes and is a unicellular organism, representing thus an oversimplified model to study humans. Substantial metabolomic analysis of exposure and effects will be needed to address the possibilities offered by this approach [35, 46]. Global metabolomic analysis in exposure studies in laboratory animals has just begun, but the low-level exposures of interest in large populations have not yet been addressed.

Metabolomic studies are performed with noninvasive samples such as biofluids and breath condensate as well as on tissues in vivo. Profiles within a tissue or cell are compared with profiles in biologic fluids or with cell secretion products to understand the metabolic consequences of xenobiotic-induced toxicity. Examples of this approach are the studies by Waters and colleagues, where NMR and pattern recognition analysis were used to investigate time-related metabolic effects of α-naphthylisothiocyanate on liver, urine, and plasma in laboratory animals [47, 48]. Waters and colleagues found an association between hepatic lipidosis and increased urinary excretion of taurine and creatine. In addition, there was reduced urinary excretion of intermediates in the tricarboxylic acid cycle and increased excretion of plasma ketone bodies. These studies enabled a clearer understanding of key metabolic effects during the development of and recovery from toxic lesions. In the past, there was a failure in specificity for cancer, which hampered biomarker discovery by metabolomics, but technologies like NMR and MS are being reapplied to study a range of cancers [49, 50]. Recently a metabolomic study has been successfully applied to epithelial ovarian cancer and was apparently able to discriminate women with epithelial ovarian cancer from healthy controls, based on 1H NMR spectroscopic profiles of preoperative serum specimens [49].

Holmes et al [51] used a large-scale exploratory analytical approach to investigate metabolic phenotype variation across and within four human populations, based on 1H NMR spectroscopy. Metabolites discriminating across populations were then linked to data for individuals on blood pressure. They analysed spectra from two 24-hour urine specimens for each of 4,630 participants from the INTERMAP epidemiological study, involving 17 population samples in China, Japan, UK and USA. They showed that urinary metabolite excretion patterns for East Asian and western population samples, with contrasting diets, diet-related major risk factors, and coronary heart disease/stroke rates, were significantly differentiated (P < 10(-16)), as were Chinese/Japanese metabolic phenotypes, and subgroups with differences in dietary vegetable/animal protein and blood pressure. Among discriminatory metabolites, they quantified four and showed associations (P < 0.05 to P < 0.0001) of mean 24-hour urinary formate excretion with blood pressure in multiple regression analyses. Mean 24-hour urinary excretion of alanine (direct) and hippurate (inverse), reflecting diet and gut microbial activities, were also associated with blood pressure [51].


The prevailing model of "environmental disease" for decades has been influenced by toxicology, with single agents evaluated at a time, and dose as a key issue. A more recent model, which was not overall very successful, was that of "gene-environment" interactions in which environmental agents at low doses interact with individual genetic susceptibility. What has not been explored extensively yet is another potential model for the effects of low doses and the absence of thresholds, represented by acquired susceptibility and the cumulative effects of different exposures. The concept of "clinical vulnerability" could explain why small changes in exposure can have big effects, if they occur in a population of vulnerable subjects. The concept of acquired "clinical vulnerability" is related to previous insults/pathophysiological changes that predispose to disease.

The new "-omics" can be extremely useful in tracing the "history" of such insults and to reflect the cumulative effect of different exposures. However, the success of "-omics" applications to epidemiology will depend on a coherent and effective model of disease, that addresses issues such the role of environmental low-level exposures.

The ultimate goal of using "-omics" technologies to identify environmental causes of disease is to derive an integrated view of the biological processes involved in the continuum from exposure to disease. Although high-sensitivity and data-rich toxicogenomic approaches may already be feasible for analyzing human responses to environmental stressors in some settings, applying these technologies to human populations exposed to low-level environmental contaminants will require considerably more development. In this context, the integration of information from multiple toxicogenomic approaches may provide significantly more analytical power than any one approach alone.

Current limitations of "-omics" include low reproducibility across laboratories, high intra-individual variability that hampers inter-individual comparisons, high costs, difficulties in data analysis and uncertainties in biological interpretation (see table 2). Type I, II, and III errors need to be addressed. Type I error refers to false positive results deriving from the large number of comparisons. This can be tackled with in-built replication with repeat measures in independent studies. Type II error derives from the relative lack of sensitivity of some techniques and the ensuing false negatives issue. Type III error refers to the lack of relevance of some biomarkers, with uncertain biological meaning, for the interpretation of exposure disease relationships. Lumbreras et al discuss some of the sources of errors in regard to "-omics" based diagnostic techniques and their solutions, and the effects on diagnostic accuracy [52, 53].

Table 2 Using "-omics": advantages and limitations

There is also a challenge for accurate exposure assessment of environmental exposures with "-omics." Wild, for example, has expressed "the need for developing methods with the same precision for an individual's exposure, as we have for the individual's genome" [54]. He goes on to describe the concept of an 'exposome' which encompasses lifetime environmental exposure starting from prenatal period, and stresses the importance of even partial, targeted understanding of exposure, in addition to the genome, that could provide valuable advances. A prospective cohort study design can be carried out to best address the exposure biomarker approach, because it gives opportunities for repeat measures, as well as collection of samples before the onset of disease.

"-Omics" has the potential to play an important role in the generation of new insights into the etiology of disease and gene-environment interactions, but it is important to learn "what technologies can be usefully applied to which questions." [45] With the development of validated technologies, appropriate study designs, adequate sample size, inclusion of quality control and advanced statistical methods for data interpretation, "-omics" could potentially contribute significantly to the identification of environmental causes of disease and will help the field progress towards an integrated view of the interaction between environment and human health.



genome-wide association studies


environmental tobacco smoke


International; Agency for Research on Cancer


cardiovascular disease


4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone (NNK)


Persistent Organic Pollutant


Radio-frequency modulated electromagnetic fields


International study of macro- and micro-nutrients and blood pressure.


  1. Hung RJ, McKay JD, Gaborieau V, et al.: A susceptibility locus for lung cancer maps to nicotinic acetylcholine receptor subunit genes on 15q25. Nature. 2008, 452: 633-637. 10.1038/nature06885.

    Article  CAS  Google Scholar 

  2. Nawrot T, Plusquin M, Hogervorst J, et al.: Environmental exposure to cadmium and risk of cancer: a prospective population-based study. Lancet Oncol. 2006, 7: 119-126. 10.1016/S1470-2045(06)70545-9.

    Article  CAS  Google Scholar 

  3. Vineis P, Hoek G, Krzyzanowski M, et al.: Lung cancers attributable to environmental tobacco smoke and air pollution in non-smokers in different European countries: a prospective study. Environ Health. 2007, 6: 7-10.1186/1476-069X-6-7.

    Article  Google Scholar 

  4. Beelen R, Hoek G, Brandt van den PA, et al.: Long-term exposure to traffic-related air pollution and lung cancer risk. Epidemiology. 2008, 19: 702-710. 10.1097/EDE.0b013e318181b3ca.

    Article  Google Scholar 

  5. Vineis P, Alavanja M, Garte S: Dose-response relationship in tobacco-related cancers of bladder and lung: a biochemical interpretation. Int J Cancer. 2004, 108: 2-7. 10.1002/ijc.11467.

    Article  CAS  Google Scholar 

  6. Rothman KJ, Greenland S: Modern Epidemiology. 1998, Lippincott Williams & Wilkins, 3

    Google Scholar 

  7. Vineis P, Kriebel D: Causal models in epidemiology: past inheritance and genetic future. Environ Health. 2006, 5: 21-10.1186/1476-069X-5-21.

    Article  Google Scholar 

  8. Anderson HR, Atkinson RW, Bremner SA, Marston L: Particulate air pollution and hospital admissions for cardiorespiratory diseases: are the elderly at greater risk?. Eur Respir J Suppl. 2003, 40: 39s-46s. 10.1183/09031936.03.00402203.

    Article  CAS  Google Scholar 

  9. Vineis P, Airoldi L, Veglia F, et al.: Environmental tobacco smoke and risk of respiratory cancer and chronic obstructive pulmonary disease in former smokers and never smokers in the EPIC prospective study. BMJ. 2005, 330: 277-10.1136/bmj.38327.648472.82.

    Article  CAS  Google Scholar 

  10. Vineis P, Bartsch H, Caporaso N, et al.: Genetically based N-acetyltransferase metabolic polymorphism and low-level environmental exposure to carcinogens. Nature. 1994, 369: 154-156. 10.1038/369154a0.

    Article  CAS  Google Scholar 

  11. Case RA, Hosker ME, McDonald DB, Pearson JT: Tumours of the urinary bladder in workmen engaged in the manufacture and use of certain dyestuff intermediates in the British chemical industry. I. The role of aniline, benzidine, alpha-naphthylamine, and beta-naphthylamine. Br J Ind Med. 1954, 11: 75-104.

    CAS  Google Scholar 

  12. Schatzkin A, Freedman LS, Schiffman MH, Dawsey SM: Validation of intermediate end points in cancer research. J Natl Cancer Inst. 1990, 82: 1746-1752. 10.1093/jnci/82.22.1746.

    Article  CAS  Google Scholar 

  13. Schatzkin A, Gail M: The promise and peril of surrogate end points in cancer research. Nat Rev Cancer. 2002, 2: 19-27. 10.1038/nrc702.

    Article  CAS  Google Scholar 

  14. Lan Q, Zhang L, Li G, et al.: Hematotoxicity in workers exposed to low levels of benzene. Science. 2004, 306: 1774-1776. 10.1126/science.1102443.

    Article  CAS  Google Scholar 

  15. Thomas DC: High-volume "-omics" technologies and the future of molecular epidemiology. Epidemiology. 2006, 17: 490-491. 10.1097/01.ede.0000229950.29674.68.

    Article  Google Scholar 

  16. Smith MT, Vermeulen R, Li G, et al.: Use of 'Omic' technologies to study humans exposed to benzene. Chem Biol Interact. 2005, 153: 123-127. 10.1016/j.cbi.2005.03.017.

    Article  Google Scholar 

  17. Finkel T, Holbrook NJ: Oxidants, oxidative stress and the biology of ageing. Nature. 2000, 408: 239-247. 10.1038/35041687.

    Article  CAS  Google Scholar 

  18. Berra E, Ginouves A, Pouyssegur J: The hypoxia-inducible-factor hydroxylases bring fresh air into hypoxia signalling. EMBO Rep. 2006, 7: 41-45. 10.1038/sj.embor.7400598.

    Article  CAS  Google Scholar 

  19. Chellappan SP: HOG on the promoter: regulation of the osmotic stress response. Sci STKE. 2001, 93: PE1-

    Google Scholar 

  20. Wang Z, Neuburg D, Li C, et al.: Global gene expression profiling in whole-blood samples from individuals exposed to metal fumes. Environ Health Perspect. 2005, 113: 233-241.

    Article  CAS  Google Scholar 

  21. Lu T, Liu J, LeCluyse EL, Zhou YS, Cheng ML, Waalkes MP: Application of cDNA microarray to the study of arsenic-induced liver diseases in the population of Guizhou, China. Toxicol Sci. 2001, 59: 185-192. 10.1093/toxsci/59.1.185.

    Article  CAS  Google Scholar 

  22. Callinan PA, Feinberg AP: The emerging science of epigenomics. Hum Mol Genet. 2006, 15 (Spec No 1): R95-101. 10.1093/hmg/ddl095.

    Article  CAS  Google Scholar 

  23. Bird A: Perceptions of epigenetics. Nature. 2007, 447: 396-398. 10.1038/nature05913.

    Article  CAS  Google Scholar 

  24. Egger G, Liang G, Aparicio A, Jones PA: Epigenetics in human disease and prospects for epigenetic therapy. Nature. 2004, 429: 457-463. 10.1038/nature02625.

    Article  CAS  Google Scholar 

  25. Jirtle RL, Skinner MK: Environmental epigenomics and disease susceptibility. Nat Rev Genet. 2007, 8: 253-262. 10.1038/nrg2045.

    Article  CAS  Google Scholar 

  26. Esteller M: Cancer epigenomics: DNA methylomes and histone-modification maps. Nat Rev Genet. 2007, 8: 286-298. 10.1038/nrg2005.

    Article  CAS  Google Scholar 

  27. Liu L, Wylie RC, Andrews LG, Tollefsbol TO: Aging, cancer and nutrition: the DNA methylation connection. Mech Ageing Dev. 2003, 124: 989-998. 10.1016/j.mad.2003.08.001.

    Article  CAS  Google Scholar 

  28. Moss TJ, Wallrath LL: Connections between epigenetic gene silencing and human disease. Mutat Res. 2007, 618: 163-174.

    Article  CAS  Google Scholar 

  29. Gallou-Kabani C, Junien C: Nutritional epigenomics of metabolic syndrome: new perspective against the epidemic. Diabetes. 2005, 54: 1899-1906. 10.2337/diabetes.54.7.1899.

    Article  CAS  Google Scholar 

  30. Jones PA, Baylin SB: The epigenomics of cancer. Cell. 2007, 128: 683-692. 10.1016/j.cell.2007.01.029.

    Article  CAS  Google Scholar 

  31. Zhang Y, Ng HH, Erdjument-Bromage H, Tempst P, Bird A, Reinberg D: Analysis of the NuRD subunits reveals a histone deacetylase core complex and a connection with DNA methylation. Genes Dev. 1999, 13: 1924-1935. 10.1101/gad.13.15.1924.

    Article  CAS  Google Scholar 

  32. Baccarelli A, Wright RO, Bollati V, Tarantini L, Litoniua AA, Suh HH, Zanobetti A, Sparrow D, Vokonas PS, Schwartz J: Rapid DNA methylation changes after exposure to traffic particles. Am J Respir Crit Care Med. 2009, 179 (7): 572-578. 10.1164/rccm.200807-1097OC.

    Article  CAS  Google Scholar 

  33. Bollati V, Baccarelli A, Hou L, Bonzini M, Fustinoni S, Cavallo D, Byun HM, Jiang J, Marinelli B, Pesatori AC, Bertazzi PA, Yang AS: Changes in DNA methylation patterns in subjects exposed to low-dose benzene. Cancer Res. 2007, 67 (3): 876-880. 10.1158/0008-5472.CAN-06-2995.

    Article  CAS  Google Scholar 

  34. Rusiecki JA, Baccarelli A, Bollati V, Tarantini L, Moore LE, Bonofeld-Jorgensen EC: Global DNA hypomethylation is associated with high serum-persistent organic pollutants in Greenlandic Inuit. Environ Health Perspect. 2008, 116 (11): 1547-1552.

    Article  CAS  Google Scholar 

  35. Committee on Applications of Toxicogenomic Technologies to Predictive Toxicology and Risk Assessment NRC: Application to Exposure Assessment. Applications of Toxicogenomic Technologies to Predictive Toxicology and Risk Assessment. 2007, The National Academies Press, Board on Environmental Studies and Toxicology, Board on Life Sciences

    Google Scholar 

  36. Joo WA, Sul D, Lee DY, Lee E, Kim CW: Proteomic analysis of plasma proteins of workers exposed to benzene. Mutat Res. 2004, 558: 35-44.

    Article  CAS  Google Scholar 

  37. Vermeulen R, Lan Q, Zhang L, et al.: Decreased levels of CXC-chemokines in serum of benzene-exposed workers identified by array-based proteomics. Proc Natl Acad Sci USA. 2005, 102: 17041-17046. 10.1073/pnas.0508573102.

    Article  CAS  Google Scholar 

  38. Rehman I, Azzouzi A, Catto J, Hamdy FC: The Use of Proteomics in Urological Research. Molecular Biology for the Urologist. 2005, 3: 171-179.

    Google Scholar 

  39. Camafeita E, Lamas J, Calvo E, Lopez J, Fernandez-Gutierrez B: Proteomics: New insights into rheumatic diseases. Proteomics. 2009, 3: 226-241.

    CAS  Google Scholar 

  40. Ruiz-Romero C, López-Armada M, Blanco F: Proteomic characterization of human normal articular chondrocytes: A novel tool for the study of osteoarthritis and other rheumatic diseases. Proteomics. 2005, 5: 3048-3059. 10.1002/pmic.200402106.

    Article  CAS  Google Scholar 

  41. Lopez-Pedrera C, Barbarroja N, Aguirre MA, Torres LA, Velasco F, Cuadrado MJ: Genomics and proteomics: a new approach for assessing thrombotic risk in autoimmune diseases. Lupus. 2008, 17: 904-915. 10.1177/0961203308095285.

    Article  CAS  Google Scholar 

  42. Barcelo-batllori S, Gomis R: Proteomics in obesity research. Proteomics Clin Appl. 2009, 3: 263-278. 10.1002/prca.200800178.

    Article  CAS  Google Scholar 

  43. Petricoin EF, Ardekani AM, Hitt BA, et al.: Use of proteomic patterns in serum to identify ovarian cancer. Lancet. 2002, 359: 572-577. 10.1016/S0140-6736(02)07746-2.

    Article  CAS  Google Scholar 

  44. Diamandis EP: Analysis of serum proteomic patterns for early cancer diagnosis: drawing attention to potential problems. J Natl Cancer Inst. 2004, 96: 353-356.

    Article  Google Scholar 

  45. Hunter DJ: Genomics and proteomics in epidemiology: treasure trove or "high-tech stamp collecting"?. Epidemiology. 2006, 17: 487-489. 10.1097/01.ede.0000229955.07579.f0.

    Article  Google Scholar 

  46. Oliver SG, Winson MK, Kell DB, Baganz F: Systematic functional analysis of the yeast genome. Trends Biotechnol. 1998, 16: 373-378. 10.1016/S0167-7799(98)01214-1.

    Article  CAS  Google Scholar 

  47. Waters NJ, Holmes E, Williams A, Waterfield CJ, Farrant RD, Nicholson JK: NMR and pattern recognition studies on the time-related metabolic effects of alpha-naphthylisothiocyanate on liver, urine, and plasma in the rat: an integrative metabonomic approach. Chem Res Toxicol. 2001, 14: 1401-1412. 10.1021/tx010067f.

    Article  CAS  Google Scholar 

  48. Waters NJ, Holmes E, Waterfield CJ, Farrant RD, Nicholson JK: NMR and pattern recognition studies on liver extracts and intact livers from rats treated with alpha-naphthylisothiocyanate. Biochem Pharmacol. 2002, 64: 67-77. 10.1016/S0006-2952(02)01016-X.

    Article  CAS  Google Scholar 

  49. Odunsi K, Wollman RM, Ambrosone CB, et al.: Detection of epithelial ovarian cancer using 1H-NMR-based metabonomics. Int J Cancer. 2005, 113: 782-788. 10.1002/ijc.20651.

    Article  CAS  Google Scholar 

  50. Beckonert O, Monnerjahn J, Bonk U, Leibfritz D: Visualizing metabolic changes in breast-cancer tissue using 1H-NMR spectroscopy and self-organizing maps. NMR Biomed. 2003, 16: 1-11. 10.1002/nbm.797.

    Article  CAS  Google Scholar 

  51. Holmes E, Loo RL, Stamler J, et al.: Human metabolic phenotype diversity and its association with diet and blood pressure. Nature. 2008, 453: 396-400. 10.1038/nature06882.

    Article  CAS  Google Scholar 

  52. Lumbreras B, Porta M, Marquez S, Pollan M, Parker LA, Hernandez-Aguado I: QUADOMICS: an adaptation of the Quality Assessment of Diagnostic Accuracy Assessment (QUADAS) for the evaluation of the methodological quality of studies on the diagnostic accuracy of '-omics'-based technologies. Clin Biochem. 2008, 41: 1316-1325. 10.1016/j.clinbiochem.2008.06.018.

    Article  Google Scholar 

  53. Lumbreras B, Porta M, Marquez S, Pollan M, Parker LA, Hernandez-Aguado I: Sources of error and its control in studies on the diagnostic accuracy of "-omics" technologies. Proteomics Clin Appl. 2009, 3: 173-184. 10.1002/prca.200800092.

    Article  CAS  Google Scholar 

  54. Wild CP: Complementing the genome with an "exposome": the outstanding challenge of environmental exposure measurement in molecular epidemiology. Cancer Epidemiol Biomarkers Prev. 2005, 14: 1847-1850. 10.1158/1055-9965.EPI-05-0456.

    Article  CAS  Google Scholar 

Download references


This study was made possible by the ECNIS grant from the European Union (FOOD-CT-2005-513943). We are grateful to Ross Anderson for thoughtful comments and the permission to reprint Figure 2.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Paolo Vineis.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

PV and RV have drafted the manuscript, revised it critically and gave final approval of the version to be submitted.

JV has revised the manuscript and contributed to writing parts of it.

AK has contributed in writing the sections on "-omics", and also on revising the document.

All the authors have read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Vineis, P., Khan, A.E., Vlaanderen, J. et al. The impact of new research technologies on our understanding of environmental causes of disease: the concept of clinical vulnerability. Environ Health 8, 54 (2009).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: