Bradford Hill criteria
Criteria for measuring cause and effect
From Wikipedia, the free encyclopedia
The Bradford Hill criteria, otherwise known as Hill's criteria for causation, are a group of nine principles that can be useful in evaluating epidemiologic evidence of a causal relationship between a presumed cause and an observed effect and have been widely used in public health research. They were proposed in 1965 by the English epidemiologist Sir Austin Bradford Hill, although Hill did not use the term "criteria" himself and instead described nine "viewpoints from all of which we should study association before we cry causation."[1] Modern interpretations of Hill's viewpoints[2] focus on this more nuanced framing, in line with Hill's original assertion that "none of my nine viewpoints can bring indisputable evidence for or against the cause-and-effect hypothesis and none can be required as a sine qua non."[1]
In 1996, David Fredricks and David Relman remarked on Hill's criteria in their pivotal paper on microbial pathogenesis.[3]
Definition
In 1965, the English statistician Sir Austin Bradford Hill outlined a set of nine principles to consider when evaluating epidemiologic evidence of a causal relationship between an exposure and outcome. (For example, he demonstrated the connection between cigarette smoking and lung cancer). The list of principles is as follows:[1]
- Strength (effect size): A small association does not mean that there is not a causal effect, though the larger the association, the more likely that it is causal.
- Consistency (reproducibility): Consistent findings observed by different persons in different places with different samples strengthens the likelihood of an effect.
- Specificity: Causation is likely if there is a very specific population at a specific site and disease with no other likely explanation. The more specific an association between a factor and an effect is, the bigger the probability of a causal relationship.[1]
- Temporality: The effect has to occur after the cause (and if there is an expected delay between the cause and expected effect, then the effect must occur after that delay).
- Biological gradient (dose–response relationship): Greater exposure should generally lead to greater incidence of the effect. However, in some cases, the mere presence of the factor can trigger the effect. In other cases, an inverse proportion is observed: greater exposure leads to lower incidence.[1]
- Plausibility: A plausible mechanism between cause and effect is helpful (but Hill noted that knowledge of the mechanism is limited by current knowledge).
- Coherence: Coherence between epidemiological and laboratory findings increases the likelihood of an effect. However, Hill noted that "lack of such [laboratory] evidence cannot nullify the epidemiological effect on associations".
- Experiment: "Occasionally it is possible to appeal to experimental evidence".
- Analogy: The use of analogies or similarities between the observed association and any other associations.
Some authors[4] consider, also, Reversibility: If the cause is deleted then the effect should disappear as well.
Debate in epidemiology
All scientific work is incomplete. All scientific work is liable to be upset or modified by advancing knowledge. That does not confer upon us a freedom to ignore the knowledge we already have, or to postpone the action that it appears to demand at a given time.
Bradford Hill's criteria had been widely accepted as useful guidelines for investigating causality in epidemiological studies but their value has been questioned because they have become somewhat outdated.[6]
In addition, their method of application is debated.[citation needed] Some proposed options how to apply them include:
- Using a counterfactual consideration as the basis for applying each criterion.[7]
- Subdividing them into three categories: direct, mechanistic and parallel evidence, expected to complement each other. This operational reformulation of the criteria has been recently proposed in the context of evidence-based medicine.[8]
- Considering confounding factors and bias.[9]
- Using Hill's criteria as a guide, but not considering them to give definitive conclusions.[10]
- Separating causal association and interventions, because interventions in public health are more complex than can be evaluated by use of Hill's criteria[11]
An argument against the use of Bradford Hill criteria as exclusive considerations in proving causality is that the basic mechanism of proving causality is not in applying specific criteria—whether those of Bradford Hill or counterfactual argument—but in scientific common sense deduction.[12] Others argue that the specific study from which data has been produced is important, and while the Bradford Hill criteria may be applied to test causality in these scenarios, the study type may rule out deducing or inducing causality, and the criteria are only of use in inferring the best explanation of this data.[13]
Debate over the scope of application of the criteria includes, whether they can be applied to social sciences.[14] The argument proposes that there are different motives behind defining causality; the Bradford Hill criteria applied to complex systems such as health sciences are useful in prediction models where a consequence is sought; explanation models as to why causation occurred are deduced less easily from Bradford Hill criteria because the instigation of causation, rather than the consequence, is needed for these models.[citation needed]
Examples of application
Researchers have applied Hill's criteria for causality in examining the evidence in several areas of epidemiology, including connections between exposures to molds and infant pulmonary hemorrhage,[15] ultraviolet B radiation, vitamin D and cancer,[16][17] vitamin D and pregnancy and neonatal outcomes,[18] alcohol and cardiovascular disease outcomes,[19] infections and risk of stroke,[20] nutrition and biomarkers related to disease outcomes,[21] foods and nutrients related to cardiovascular disease and diabetes[22] and sugar-sweetened beverage consumption and the prevalence of obesity and obesity-related diseases.[23] They have also been used in non-human epidemiological studies, such as on the effects of neonicotinoid pesticides on honey bees.[24] Their use in quality improvement of health care services has been proposed, highlighting how quality improvement methods can be used to provide evidence for the criteria.[25]
Since the description of the criteria, many methods to systematically evaluate the evidence supporting a causal relationship have been published, for example the five evidence-grading criteria of the World Cancer Research Fund (Convincing; Probable; Limited evidence – suggestive; Limited evidence – no conclusion; Substantial effect on risk unlikely).[26]
See also
- Causal inference – Branch of statistics
- Granger causality – Statistical hypothesis test for forecasting
- Koch's postulates – Four criteria showing a causal relationship between a causative microbe and a disease
- Public health – Promoting health through informed choices
- MAGIC criteria – Set of guidelines for using statistical analysis
- Correlation does not imply causation – A refutation of one logical fallacy