7.1 Introduction
The term cohort derives from the Latin word cohors, meaning “an enclosure.”a This is an apt description of the cohort method because cohort studies follow the experiences of individuals enclosed in closed populations. In their simplest sense, cohort studies follow two groups of individuals. One group is characterized by an exposure and the other is “nonexposed.” Study outcomes in individuals are ascertained over time, tallied, and compared in the form of incidence rates or incidence proportions.
Cohort studies can be either experimental or observational (see Chapter 5 for the distinction). However, when used without specification, “cohort study” almost always refers to an observational cohort study. Chapter 6 addressed experimental cohort studies, while this chapter considers observational cohort studies.
Let us start our consideration of observational cohort studies with two examples.
Many groundbreaking findings about the causes of heart disease and stroke were identified and confirmed by the landmark Framingham Heart Study. The original Framingham cohort consisted of 5209 cardiovascular disease-free volunteers between the ages of 29 and 62 recruited from the moderately sized town of Framingham, Massachusetts, USA. The original Framingham cohort was recruited between 1948 and 1950. Individuals in the cohort have been examined every two years since the inception of the study. For the current illustration, cases of coronary heart disease (angina, myocardial infarction, and sudden death) were confirmed using criteria recommended by the New York Heart Association. Table 7.1 displays the data for 40–59 year old men and women for the first six years of follow-up. Note the progressive increases in coronary heart disease incidence with increasing serum cholesterol levels in both men and women.
Source: Kannel et al. 1961.
As a second example, let us consider an observational study from The Women’s Health Initiative (WHI) project. The WHI project, introduced in Illustrative Example 5.2, included both experimental and observational cohorts. In Chapter 6, we discussed one of the experimental cohort studies. In this chapter, we consider one of its observational cohort studies.
The recruitment period for the observational component of the WHI study began in September 1993 and ended in July 1998. Participants gave informed consent, were screened for eligibility, and were followed prospectively for up to 15 years (WHI, 1998). The current analysis explores the relation between non-narcotic pain medicine use and breast cancer (Harris et al., 2003). Information about the use of aspirin, ibuprofen, other nonsteroidal anti-inflammatory drugs (NSAIDs), and acetaminophen was collected from an interview-administered questionnaire.b For those individuals who reported use of an NSAID or acetaminophen at least two times in each of the two weeks preceding the interview, the type of compound, dose, and duration of use were recorded. The investigators checked pill bottle labels and prescription records to validate medication use.
Breast cancer cases were identified through health-care contacts and annual follow-up questionnaires. Follow-up time for each study subject was accrued from enrollment to the date of breast cancer diagnosis, death, or withdrawal from the study. Cases were confirmed by review of clinical, diagnostic, and pathology reports by physicians blinded to the exposure status of potential cases.
Table 7.2 summarizes findings from this study. An inverse dose–response relation between aspirin use, ibuprofen use, and NSAID use is evident. No such trend is found for acetaminophen use. This suggests that extended use of NSAIDs may reduce the risk of breast cancer.
Before addressing modern cohort studies further, let us gain an historical perspective into the method.
7.2 Historical perspective
The idea of observing people in their natural setting in order to gain insight into health determinants goes back a very long way. Hippocrates (circa 400 BCE to circa 370 BCE) urged us to consider the health of individuals in relation to “the mode in which the inhabitants live, and what are their pursuits, whether they are fond of drinking and eating in excess, and given to indolence, or are fond of exercise and labor and not given to excess in eating and drinking” (Hippocrates, 400 BCE). Here we recognize the seed of the observational cohort design, that is, comparing the long term health experiences of groups of individuals based on personal characteristics and exposures.
First century (CE) Roman authors noted differences in the patterns of diseases among various worker groups. Specific references to the diseases of slaves, sulfur workers, blacksmiths, and miners were made by the poets Pliny, Martial, and Luca (Rosen, 1993).
However, it was not until the 18th century when the focus on the health of worker cohorts met an important milestone with the publication of the treatise De Morbis Artificum Diatriba (“Diseases of Workers,” 1713) by Bernardino Ramazzini (1633–1714). De Morbis documented the effects of dozens of hazardous environmental exposures, such as specific chemicals, dusts, and abrasives. In addition, it documented the ill-effects of specific occupational practices and lifestyles. For example, the sedentary lifestyle and constant bent work posture of cobblers was cited by Ramazzini (1713) as a cause of ill-health.
In 1755, the surgeon Percival Pott (1713–1788) observed enormously elevated rates of scrotal cancer in chimney sweeps. He attributed this to the lodgment of soot in the rugae of the scrotum of the chimney sweeps, which is perhaps the first identification of an environment carcinogen.
Eighteenth century French physicians such as Louis and Pinel brought observational cohort studies into a clinical setting. In one study, Pierre Charles Alexandre Louis (1787–1872) observed superior cure rates in pneumonia patients who experienced delayed bloodletting compared with those who experienced early treatment. Phillippe Pinel (1745–1826) used the clinical histories of patients with mental illnesses to demonstrate superior cure rates at institutions that practiced humane methods of treatment compared with the standard inhumane practices of the time.
The Victorian physician and statistician William Farr (1807–1883), who is usually associated with vital statistics studies in open populations, also understood the importance of the longitudinal observation of individuals, declaring individual subjects in studies of health “should be followed from the beginning to the end; every death or recovery should be recorded.” Farr (1838) applied this concept in studying the prognosis of smallpox patients using sophisticated survival analysis techniques (Gerstman, 2003).
The 20th century brought with it the development of the modern cohort study. The British scientist Janet Elizabeth Lane-Claypon (1877–1967) reported on the longitudinal results of weight gain in infants fed either boiled cows’ milk or human breast milk (1912). One year later, the German physician Wilhelm Weinberg (1862–1937) published the results of a large retrospective cohort study comparing the health experiences of 18 212 children whose fathers and mothers had previously died of tuberculosis with that of an “nonexposed” cohort of 7574 children of parents who died of non-tubercular causes (Morabia and Guthold, 2007).
In 1914, Joseph Goldberger (1847–1929) published cohort observations on pellagra, noting the absence of pellagra in nurses and ward attendants at hospitals in which 98 cases of pellagra had occurred in the patient population. Although the common belief at the time was that pellagra was infectious, this suggested to Goldberger that pellagra was not contagious.
It was not until 1935, however, that the first recorded use of the term cohort made its appearance in epidemiology when Wade Hampton Frost referred to rates of tuberculosis in generational cohorts (Doll, 2001). Frost’s generational cohort studies are discussed in the last section of this chapter.
By the middle of the 20th century, the epidemiologic transition made it clear that large-scale long-term follow-up studies were needed in order to untangle the causes of chronic diseases in human populations. These modern cohort studies initially addressed cigarette-related diseases, cancers, and heart disease. One such study from this era—The Framingham Heart Study—was introduced as Illustrative Example 7.1. Another historically important cohort study from this era was the British Doctors Study.
The British Doctors study was launched by Richard Doll and Bradford Hill in 1951 with a seven question survey form sent to approximately 59 600 medical doctors in the United Kingdom. Of the 59 600 mailings, roughly 50 000 were thought to have been received by living physicians. Of these, 34 440 physicians replied, representing a response rate of 69% of the men who were reached (Doll and Peto, 1976). A second questionnaire was sent out beginning in late 1957. By that time, 3122 of the cohort members had died, leaving 31 318 still alive. Of those remaining alive, 30 810 (98%) replied. By the time the third questionnaire was sent out in 1966, an additional 7301 had died. Of the remaining 27 139 living individuals, 26 163 (96%) replied to the survey. The fourth questionnaire, sent out in 1972, had a response rate of 98%. The nonresponse rates of 2, 4, and 2% are remarkably low, demonstrating the cooperative nature of the physicians that constituted the cohort.
The British Doctors cohort has now been followed for more than half a century. This study has identified or confirmed excess mortality in smokers due to dozens of neoplastic, vascular, and respiratory diseases (Doll et al., 2004). It also has also confirmed a negative association between smoking and Parkinson’s disease (Doll et al., 1994).
7.3 Assembling and following a cohort
Before embarking on a cohort investigation, it is essential to obtain the cooperation of the study population. One of the reasons the landmark Framingham Heart Study was based in Framingham was because of its supportive population and health-care system (Dawber et al., 1963). The same can be said of The British Doctors Study. As another example, the Nurses’ Health Study selected this population for long-term follow-up because nurses represented a large group of cooperative, health-conscious women who would be compliant and relatively easy to follow (Belanger et al., 1978).
However, even with a highly cooperative population, a certain percentage of individuals will refuse to participate. The aforementioned British Doctors, Framingham, and Nurses cohort studies had, for example, initial nonresponse rates that were all around 30%, (Doll and Peto, 1976; Dawber et al., 1963; Belanger et al., 1973).
In addition, subjects who enroll in a study may become lost to follow-up, refuse to continue in the study, or die from unrelated causes during the course of the study. These individuals are referred to as withdrawals. For example, the Framingham study began with 5209 subjects attending the initial set of examinations. Over the next ten years there were 950 withdrawals leaving the cohort with 4259 study subjects (Framingham Heart Study, 2011). This amounts to a ten-year withdraw rate of about 18%.
Nonresponses and withdrawals raise a potential for a “selection bias” in a cohort study. However, if nonresponse and withdrawal are independent of the exposures and diseases being studied, observed association will remain unbiased.
Once enrolled in a cohort study, each study subject is followed until:
- the subject withdraws from the study
- the study ends, or
- the study outcome is experienced.
The method of identifying the cases in a cohort study will depend greatly on the type of disease or health outcome being studied and the diagnostic and technologic resources that are available. Cases are ascertained based on criteria referred to as the case definition. The case definition is then uniformly applied to screen for and confirm cases using relevant diagnostic and other information. See Chapter 16 for additional details regarding the construction of case definitions.
7.4 Prospective, retrospective, and ambidirectional cohorts
The cohort studies discussed to this point in the chapter were planned to observe events that had yet to occur. Cohort studies carried out in this manner are referred to as prospective cohort studies. Cohort studies can also be carried out using records of events that had occurred in the past. Studies of this second type are called retrospective cohort studies or historical cohort studies. Cohort studies that combine prospective data with retrospective data are called ambidirectional cohort studies. Figure 7.1 illustrates these temporal relationships.
Note that the study design feature that determines whether a cohort study is prospective or retrospective is the proximity in time of the data collection to the time events actually occurred. Prospective cohort studies used data that are proximal to data collection. Retrospective cohort studies use data from the past.
Retrospective data can be obtained from a variety of sources, including medical records, administrative data sources, vital records systems, surveillance systems, and employment records. In addition, we may interview study subjects or their proxies about prior events to obtain retrospective data. An example of a retrospective cohort study follows.
Case and colleagues (1954) compiled a cohort of workmen in Great Britain based on historical records from 21 companies involved in the manufacture of aniline-based dyes. Among a total of 4622 men exposed to aniline dyes between 1921 to 1952, there were 127 mentions of bladder tumors on death certificates. Based on expected rates using national statistics for Britain as a whole, approximately four such death were expected in a group of this size and age distribution. Thus, the overall risk of dying of bladder cancer in the aniline-exposed cohort was approximately 30 times the expected rate.
The above example illustrates an advantage of retrospective data: the investigator need not wait the many years required for disease to develop following exposure to a harmful substance to study outcomes with long induction. In addition, because the historical cohort studies use existing records, such studies can be completed relatively rapidly and economically.
7.5 Addressing the potential for confounding
The objective of a cohort study is to provide accurate information about the independent effects of study exposures on health outcomes. To accomplish this goal, like-to-like comparisons are necessary. Without like-to-like comparisons, differences in incidence found at the end of the study could be due to factors other than the study exposure. This phenomenon is known as confounding.3
Conceptually, the ideal nonexposed cohort would consist of the same individuals as the exposed cohort had these individuals been nonexposed. Of course this is counterfactual, that is, not possible in fact. Nevertheless, this counterfactual ideal defines a way to think about suitable comparisons. To address this issue of comparability, cohort studies compare the distribution of risk factors and other attributes in exposure groups at the onset of the follow-up period. An example follows.
The Nurses’ Health Study is a prospective cohort study of married, female registered nurses born between 1 January 1921 and 31 December 1946. One of the many investigations that stemmed from this project addressed postmenopausal hormonal replacement therapy and cardiovascular disease. Table 7.3 compares the prevalence of cardiovascular risk factors among the exposure groups at enrollment. This table reveals that current hormone users were less likely to have diabetes, were leaner, and were more likely to engage in regular physical activity, to have had a surgical menopause, and to have used oral contraceptives in the past.