Systematic Review: Comparative Effectiveness of Medications to Reduce Risk for Primary Breast Cancer

  1. Heidi D. Nelson, MD, MPH;
  2. Rongwei Fu, PhD;
  3. Jessica C. Griffin, MS;
  4. Peggy Nygren, MA;
  5. M.E. Beth Smith, DO; and
  6. Linda Humphrey, MD, MPH
  1. From Oregon Health & Science University, Veterans Affairs Medical Center, and the Women and Children's Health Research Center, Providence Health & Services, Portland, Oregon.

    Abstract

    Background: Trials demonstrate the efficacy of medications to reduce the risk for invasive breast cancer.

    Purpose: To summarize benefits and harms of tamoxifen citrate, raloxifene, and tibolone to reduce the risk for primary breast cancer.

    Data Sources: MEDLINE and Cochrane databases from inception to January 2009, Web of Science, trial registries, and manufacturer information.

    Study Selection: Predefined eligibility criteria were used to select articles. English-language reports of randomized, controlled trials (RCTs) for benefits and RCTs and observational studies for harms were included.

    Data Extraction: Two reviewers assessed study data, quality, and applicability.

    Data Synthesis: Seven placebo-controlled RCTs and 1 head-to-head trial provide results for main outcomes. Tamoxifen (risk ratio, 0.70 [95% CI, 0.59 to 0.82]; 4 trials), raloxifene (risk ratio, 0.44 [CI, 0.27 to 0.71]; 2 trials), and tibolone (risk ratio, 0.32 [CI, 0.13 to 0.80]; 1 trial) reduce risk for invasive breast cancer compared with placebo by 7 to 10/1000 women per year. Tamoxifen and raloxifene reduce estrogen receptor–positive breast cancer, but not estrogen receptor–negative breast cancer, noninvasive breast cancer, or mortality; all medications reduce fractures. Tamoxifen (risk ratio, 1.93 [CI, 1.41 to 2.64]; 4 trials) and raloxifene (risk ratio, 1.60 [CI, 1.15 to 2.23]; 2 trials) increase thromboembolic events by 4 to 7/1000 women per year; raloxifene causes fewer events than tamoxifen. Tamoxifen increases risk for endometrial cancer (risk ratio, 2.13 [CI, 1.36 to 3.32]; 3 trials) compared with placebo by 4/1000 women per year and causes cataracts compared with raloxifene. Tibolone causes strokes in older women.

    Limitations: Biases, trial heterogeneity, and few head-to-head trials limit this review. Data are lacking on doses, duration, and timing of the medications; long-term effects; and nonwhite and premenopausal women.

    Conclusion: Three medications reduce risk for primary breast cancer but increase risk for thromboembolic events (tamoxifen, raloxifene), endometrial cancer (tamoxifen), or stroke (tibolone).

    Primary Funding Source: Agency for Healthcare Research and Quality.

    Editors' Notes

    Context

    • Medications that reduce breast cancer risk may have other potential benefits and harms.

    Contribution

    • This review of 8 trials found that tamoxifen, raloxifene, and tibolone each reduced risk for invasive breast cancer in women more than placebo. Tamoxifen and raloxifene reduced risk for estrogen receptor–positive but not estrogen receptor–negative breast cancer or death. All drugs reduced fracture risks. They also increased risk for thromboembolic events (tamoxifen and raloxifene), endometrial cancer (tamoxifen), and strokes (tibolone).

    Implication

    • Although tamoxifen, raloxifene, and tibolone sometimes reduce risk for primary breast cancer, they also may increase a woman's risk for particular harmful events.

    —The Editors

    Recent clinical trials have shown the efficacy of the selective estrogen receptor modulators tamoxifen citrate and raloxifene and of the selective tissue estrogenic activity regulator tibolone to reduce the risk for invasive breast cancer in women without preexisting cancer. Tamoxifen and raloxifene are approved by the U.S. Food and Drug Administration (FDA) for this purpose among women at high risk for developing breast cancer. The FDA indications define high risk as having a breast biopsy showing lobular carcinoma in situ or atypical hyperplasia, 1 or more first-degree relatives with breast cancer, or a 5-year predicted risk for breast cancer of 1.66% or more calculated by the modified Gail model (1–3). These are similar to entry criteria of the major U.S. clinical trials (4–8). Tamoxifen is also approved for treatment of early and advanced estrogen receptor–positive breast cancer in pre- and postmenopausal women. Raloxifene is most often used to prevent and treat osteoporosis in postmenopausal women. Tibolone is not currently approved for use in the United States, but is approved to treat menopausal symptoms in 90 countries and to prevent osteoporosis in 45 (9, 10). In the United States, use of these medications for reducing breast cancer risk is low (4).

    This comparative effectiveness review summarizes the available evidence for the effectiveness and safety of tamoxifen citrate, raloxifene, and tibolone for reducing risk for primary breast cancer in women in general and among population subgroups of women. This review highlights outcomes that are most commonly associated with the medications and are clinically most important.

    Methods

    For all steps of the review, we followed a standard protocol that is consistent with the Agency for Healthcare Research and Quality (AHRQ) Effective Healthcare Program Methods Reference Guide for Effectiveness and Comparative Effectiveness Reviews (11). Additional information about methods for this review, including detailed search strategies, inclusion criteria, extraction, and rating processes, is available in the appendices to this article and the full technical report (http://effectivehealthcare.ahrq.gov) (12).

    Key questions guiding this review were developed through the Effective Healthcare Program. Investigators created an analytic framework that incorporated the key questions and outlined the patient population, interventions, and clinical outcomes (Appendix Figure 1). Additional outcomes and evidence related to key questions about risk stratification and adherence to medications are available in the full technical report (http://effectivehealthcare.ahrq.gov) (12). The target population includes women who did not have preexisting invasive or noninvasive breast cancer or precursor conditions and were not known carriers of breast cancer susceptibility mutations (BRCA1, BRCA2, or others).

    Appendix Figure 1.
    View larger version:
    Appendix Figure 1. Analytic framework and key questions for full comparative effectiveness review.

    * Detailed descriptions are provided in the exclusion and exclusion criteria in Appendix Figure 2.

    Data Sources and Searches

    In conjunction with a research librarian, the investigators searched MEDLINE, the Cochrane Central Register of Controlled Trials, and the Cochrane Database of Systematic Reviews from inception to January 2009 for relevant English-language reports of studies, systematic reviews, and meta-analyses. We also manually reviewed reference lists, citations for major trials in Web of Science, and clinical trial registries. We requested scientific information packets from manufacturers of the 3 medications, although the only packet provided was for raloxifene. We contacted investigators of the major trials for additional unpublished data specifically addressing population subgroups but received none.

    Study Selection

    Before reviewing abstracts and articles, we developed inclusion and exclusion criteria for studies based on the key questions and target population. We included studies with treatment durations of 3 months or more that enrolled 100 or more participants to ensure adequate drug exposure and power to support results.

    For comparative benefits, we included only double-blind, placebo-controlled, or head-to-head randomized, controlled trials (RCTs) of tamoxifen, raloxifene, or tibolone to reduce risk for breast cancer that enrolled women without preexisting breast cancer. We included trials that were designed and powered to demonstrate invasive breast cancer incidence as a primary or secondary outcome. The technical expert panel for this project advised including only RCTs because of the lack of observational studies of tamoxifen and raloxifene with breast cancer outcomes in women without preexisting cancer and concerns for bias in observational studies in which women were using these medications for other indications.

    We defined our inclusion criteria for comparative harms more broadly. We included RCTs and observational studies of tamoxifen, raloxifene, or tibolone in women without breast cancer that were designed for multiple types of outcomes. However, studies must have had a nonuser comparison group or direct comparisons between tamoxifen, raloxifene, or tibolone to be included. We considered all adverse outcomes at all reported follow-up times to capture potential short- and long-term adverse effects. However, because the National Surgical Adjuvant Breast and Bowel Project P-1 (NSABP P-1) trial was unblinded after investigators reported initial results in 1998, we focused on data from the earlier 1998 publication (8) and then compared these results with data from the subsequent 2005 publication (7). Appendix Figure 2 provides detailed inclusion and exclusion criteria for benefits and harms.

    Appendix Figure 2.
    View larger version:
    Appendix Figure 2. Inclusion and exclusion criteria for studies.

    RCT = randomized, controlled trial.

    * Benefit outcomes are defined by key question 1 and include invasive breast cancer; noninvasive breast cancer, including ductal carcinoma in situ; breast cancer mortality; all-cause mortality; and osteoporotic fractures.

    † Population subgroups are defined by key question 3 and include but are not limited to those based on age, menopausal status (pre-, peri-, postmenopausal), hysterectomy status, use of exogenous estrogen, level of risk for breast cancer (based on family history, body mass index, parity [number of pregnancies], age at first live birth, age at menarche, personal history of breast abnormalities, previous breast biopsy, estradiol levels, breast density), ethnicity and race, metabolism status (CYP 2D6 mutation), and risk for thromboembolic events (obesity and other risk factors).

    ‡ Definitions of types of outcomes: primary outcome—the main outcome of a study that the study was designed and powered to demonstrate; secondary outcome—major outcome of a study that the study was designed and powered to demonstrate, but not the primary outcome of the study; health outcomes—signs, symptoms, conditions, or events that individuals experience, such as myocardial infarction, death, or hot flashes; intermediate outcomes—health measures that individuals do not personally experience, such as laboratory test results or bone mineral density.

    § Harms outcomes are defined by key question 2 and may include but are not limited to thromboembolic events (deep venous thrombosis, pulmonary embolism), cardiovascular events (coronary heart disease, stroke and transient ischemic attack, arrhythmias), metabolic disorders (diabetes) , musculoskeletal symptoms (myalgia, leg cramps), mental health (depression, mood changes), genitourinary outcomes (vaginal dryness, uterine bleeding, hysterectomy, endometrial cancer, urinary symptoms), adverse breast outcomes (biopsies), other malignancies (incidence, death), ophthalmologic disorders (cataracts), gastrointestinal/hepatobiliary disorders (abdominal pain, nausea), and other adverse events affecting quality of life (vasomotor symptoms, sexual function, sleep disturbances, headaches, cognitive changes, peripheral edema).

    After an initial review of abstracts, we retrieved full-text articles of potentially relevant material and conducted a second review to determine inclusion. A second reviewer confirmed results of the initial reviewer, and discrepancies were resolved by group consensus.

    Data Abstraction and Quality Assessment

    From the included studies, investigators abstracted study design, setting, participant characteristics, enrollment criteria, interventions, numbers enrolled and lost to follow-up, methods of outcome ascertainment, and results for each outcome. A second investigator confirmed the accuracy of abstracted information.

    We used predefined criteria developed by the U.S. Preventive Services Task Force to assess the quality of individual studies (good, fair, poor) (13). We assessed study applicability by following the population, intervention, comparator, outcomes, timing of outcomes measurement, and setting format (good, fair, poor) (11). Two investigators independently rated quality and applicability of each study, and final ratings were determined by consensus. Investigators determined the overall strength of the body of evidence through group consensus using the Evidence-based Practice Center GRADE (Grading of Recommendations Assessment, Development and Evaluation) approach (high, moderate, low, insufficient) (11).

    Data Synthesis and Analysis

    We combined results of eligible trials to obtain more precise estimates of the major health outcomes. To determine the appropriateness of meta-analysis, we considered clinical and methodologic diversity and assessed statistical heterogeneity. We abstracted or calculated estimates of risk ratios (rate ratio, hazard ratio, or relative risk) and their SEs from each trial and used them as the effect measures (Appendix). We assessed the presence of statistical heterogeneity among the studies by using standard chi-square tests and evaluated the magnitude of heterogeneity by using the I2 statistic (14). We used a random-effects model to account for variation among studies (15). When there is no variation among studies, the random-effects model yields the same results as a fixed-effects model. For all meta-analyses, we combined results separately for tamoxifen and raloxifene and provided 95% CIs.

    To explore whether combined estimates differ among subpopulations, we performed subgroup analysis by age (≤50 vs. >50 years), family history of breast cancer (yes vs. no), use of menopausal hormone therapy (yes vs. no), menopausal status (premenopausal vs. postmenopausal), and body mass index (≤25 vs. >25 mg/kg2), when at least 2 studies reported results. We also performed subgroup analysis for tamoxifen trials stratified by active versus posttreatment periods when studies reported these data. We used Stata software, version 9.1, for all these analyses (Stata Corp., College Station, Texas).

    To facilitate the evaluation of benefits and harms across trials, we abstracted or calculated event rates per 1000 women-years for both treatment and placebo groups using steps similar to those used to obtain risk ratios. When the event rates were not reported or calculable, we indicate them as such. To obtain the combined event rates, we conducted a meta-analysis of the placebo event rates by using a random-effects Poisson model and raw data of number of events and women-years of follow-up. We used PROC NLMIXED, SAS 9, 1.3., software for this analysis (SAS Institute, Inc., Cary, North Carolina).

    Role of the Funding Source

    The Agency for Healthcare Research and Quality provided the initial key questions and copyright release for this manuscript but did not participate in the literature search, data analysis, or interpretation of the results.

    Results

    From electronic database searches, secondary references, and the scientific information packet, we identified 4230 abstracts and included 58 full-text articles. Fourteen articles from 6 trials provided data for the meta-analysis of major health outcomes (Appendix Figure 3).

    Appendix Figure 3.
    View larger version:
    Appendix Figure 3. Study flow diagram.

    RCT = randomized, controlled trial.

    Characteristics of Primary Prevention Trials

    Eight large randomized, controlled trials that enrolled women without preexisting breast cancer and reported breast cancer outcomes provided most of the data for this review and all of the data for the meta-analysis (Appendix Tables 1 and 2). Additional outcomes from these trials include mortality, fractures, thromboembolic events, cardiovascular disease events, uterine abnormalities, cataracts, and other adverse effects. Trials include a head-to-head trial of tamoxifen and raloxifene—the Study of Tamoxifen and Raloxifene (STAR) (4, 16); 4 placebo-controlled trials of tamoxifen—the International Breast Cancer Intervention Study (IBIS-I) (17, 18), the NSABP P-1 (5–8), the Royal Marsden Hospital Trial (19, 20), and the Italian Tamoxifen Prevention Trial (21–24); 2 placebo-controlled trials of raloxifene—the Multiple Outcomes of Raloxifene Evaluation (MORE) study with long-term follow-up in the Continuing Outcomes Relevant to Evista (CORE) study (25–39) and the Raloxifene Use for the Heart (RUTH) trial (40, 41); and 1 placebo-controlled trial of tibolone—the Long-Term Intervention on Fractures with Tibolone (LIFT) (9).

    Appendix Table 1. Evidence Table for Randomized, Controlled Trials of Benefits
    Appendix Table 2. Evidence Table for Studies of Harms

    The tamoxifen trials, including STAR, were designed to determine breast cancer incidence as the primary outcome (4, 7, 8, 17–24). Inclusion criteria considered breast cancer risk in all these trials except the Italian trial (22). Two trials, STAR and NSABP P-1, used the modified Gail model to select participants (1, 2). In STAR, women were eligible if they were postmenopausal and had a Gail model 5-year predicted risk for breast cancer of 1.66% or more (4). The NSABP P-1 used this same threshold as well as additional criteria, such as age 60 years and older or a history of lobular carcinoma in situ (8). Most women age 60 years or older have a Gail model risk of at least 1.66% without additional risk factors because age is a dominant predictor in the model. The IBIS-I and Royal Marsden trials defined eligibility criteria on the basis of numbers of relatives with breast cancer as well as personal history of previous benign breast biopsy findings (17, 19).

    Breast cancer incidence was 1 of 2 primary outcomes in RUTH and a secondary outcome in MORE and LIFT. The latter two studies enrolled women with osteoporosis to determine the efficacy of raloxifene or tibolone in preventing fractures (9, 32). Eligibility criteria for both trials included bone mineral density T score of −2.5 or less at the femoral neck or lumbar spine or low bone mineral density with preexisting vertebral fractures at baseline. The RUTH trial was designed to determine the efficacy of raloxifene in preventing coronary events and enrolled women with coronary heart disease or multiple risk factors for coronary heart disease (40).

    Differences in trial designs led to the enrollment of dissimilar groups of women into trials. The mean age of participants at entry ranged from 47 (19) to 51 (22, 42) years in the tamoxifen trials and from 67 (28) to 68 (9, 40) years in the raloxifene and tibolone trials. Mean age in the STAR trial was 59 years (4). Risks for most outcomes measured in these trials increase with age, including risks for such adverse events as thromboembolic events and strokes. The 15- to 20-year age difference between participants in different trials would be expected to influence results and limit comparisons between medications.

    Trials also varied by follow-up times. For placebo-controlled trials of tamoxifen, median treatment duration was approximately 4 years (8, 23). In the MORE trial, results were reported after 3 and 4 years of treatment (25–33, 35, 38); results of CORE, a continuation study of MORE (43), were reported for 4-year and combined 8-year outcomes (MORE and CORE) (36, 37, 39). Median treatment durations were 2.8 years in LIFT (9), 3.1 to 3.2 years in STAR (44), and 5.1 years in RUTH (40).

    Although most trials reported similar main outcomes, the ascertainment of outcomes varied by trial. The diagnostic criteria for several outcomes were not well described in the trials, and differences in results between trials for some of these outcomes may be due, at least in part, to how the outcomes were determined and measured.

    Quality and Applicability of Primary Prevention Trials

    All the primary prevention trials met criteria for fair or good quality for major outcomes. The most important methodologic limitation was the inclusion of women using estrogen in the Italian (14% of women), Royal Marsden (15% to 27%), and IBIS (40%) tamoxifen trials. Estrogen use could modify or confound breast cancer risk and other outcomes, such as thromboembolic events (45, 46).

    Trials generally met criteria for good applicability, except for the Italian trial, which exclusively enrolled women who had previously undergone hysterectomy (22). Women with oophorectomies may be at lower than average risk for breast cancer (47, 48). In addition, most participants in all trials were white, and none of the trials provided outcomes specific to racial or ethnic groups. The trials were multicenter, relevant to primary care, and large, ranging from the Royal Marsden trial (19), which enrolled 2471 women, to STAR (4), which enrolled 19 747. Participants were recruited from clinics and communities located in many countries, with North America, Europe, and the United Kingdom most represented.

    Comparative Effectiveness of Tamoxifen, Raloxifene, and Tibolone to Reduce Risk for Breast Cancer, Death, and Fractures

    In placebo-controlled trials, tamoxifen (risk ratio, 0.70 [CI, 0.59 to 0.82]; 4 trials) (7, 18, 20, 23), raloxifene (risk ratio, 0.44 [CI, 0.27 to 0.71]; 2 trials) (40, 43), and tibolone (risk ratio, 0.32 [CI, 0.13 to 0.80]; 1 trial) (9) reduced the incidence of invasive breast cancer in women in midlife and older women by approximately 30% to 68% (Table; Figure 1). Tamoxifen and raloxifene had similar effects in the STAR head-to-head trial (4). Reduction of invasive breast cancer continued at least 3 to 5 years after discontinuation of tamoxifen therapy in the 2 trials that provided posttreatment follow-up data (Figure 1) (18, 20). Rates of invasive cancer were low and risk was not reduced in the Italian trial compared with the other trials, possibly reflecting the underlying lower risk among the women enrolled in this trial (23).

    Table. Results of Primary Prevention Trials
    Figure 1.
    View larger version:
    Figure 1. Meta-analysis of risk ratios for incidence of invasive breast cancer.

    Error bars represent 95% CIs. CORE = Continuing Outcomes Relevant to Evista; IBIS-1 = International Breast Cancer Intervention Study; MORE = Multiple Outcomes of Raloxifene Evaluation; NSABP P-1 = National Surgical Adjuvant Breast and Bowel Project P-1 Study; RUTH = Raloxifene Use for the Heart.

    * Per 1000 women-years.

    † Italian Tamoxifen Prevention Trial and RUTH reported mean or median duration of actual treatment period.

    ‡ Analysis included data from both MORE and CORE. Participants from MORE had 4-year treatment, and those who continued in CORE had 4 additional years of treatment. Total follow-up time is averaged over both MORE and CORE for 7705 participants.

    In placebo-controlled trials, tamoxifen (7, 18, 20, 23) and raloxifene (40, 43) reduced estrogen receptor–positive but not estrogen receptor–negative invasive breast cancer and had similar effects in the STAR head-to-head trial (4) (Table). Tamoxifen (7, 18, 20, 23) and raloxifene (40, 43) did not reduce noninvasive breast cancer, including ductal carcinoma in situ, in combined estimates (Figure 2). However, tamoxifen reduced noninvasive breast cancer in the NSABP P-1 trial (risk ratio, 0.63 [CI, 0.45 to 0.89]) (7). The STAR head-to-head trial suggested a higher incidence of noninvasive breast cancer for raloxifene compared with tamoxifen (risk ratio, 1.40 [CI, 0.98 to 2.00]) (4).

    Figure 2.
    View larger version:
    Figure 2. Meta-analysis of risk ratios for incidence of noninvasive breast cancer.

    Error bars represent 95% CIs. CORE = Continuing Outcomes Relevant to Evista; IBIS-1 = International Breast Cancer Intervention Study; MORE = Multiple Outcomes of Raloxifene Evaluation; NSABP P-1 = National Surgical Adjuvant Breast and Bowel Project P-1 Study; RUTH = Raloxifene Use for the Heart.

    * Per 1000 women-years.

    † Italian Tamoxifen Prevention Trial and MORE reported mean or median duration of actual treatment period.

    ‡ Analysis included data from both MORE and CORE. Participants from MORE had 4-year treatment, and those who continued in CORE had 4 additional years of treatment. Total follow-up time is averaged over both MORE and CORE for 7705 participants.

    All-cause mortality was similar for women using raloxifene compared with those using tamoxifen in the STAR trial (4) and for those receiving tamoxifen (7, 18, 20, 23), raloxifene (40, 43), and tibolone (9) compared with those in the placebo group (Table). Tamoxifen did not reduce breast cancer mortality compared with placebo (risk ratio, 1.07 [CI, 0.66 to 1.74]; 4 trials) (7, 18, 20, 23). Trials of raloxifene and tibolone did not report breast cancer mortality.

    In placebo-controlled trials, raloxifene (29, 40) and tibolone (9) reduced vertebral fractures (Table); tamoxifen (7) and tibolone (9) reduced nonvertebral fractures (Table); and tibolone reduced wrist but not hip fractures (9). Tamoxifen and raloxifene had similar effects on fractures at multiple sites in the STAR head-to-head trial (4). In LIFT, tibolone reduced risk for colon cancer (risk ratio, 0.31 [CI, 0.10 to 0.96]) (9).

    Adverse Effects of Tamoxifen, Raloxifene, and Tibolone

    Tamoxifen (risk ratio, 1.93 [CI, 1.41 to 2.64]; 4 trials) (8, 18, 20, 21) and raloxifene (risk ratio, 1.60 [CI, 1.15 to 2.23]; 2 trials) (33, 40) caused more thromboembolic events than placebo (Table; Figure 3). Risk returned to normal after discontinuation of tamoxifen therapy in the 2 trials providing posttreatment data (Figure 3) (18, 20). Raloxifene caused fewer thromboembolic events than tamoxifen in the STAR head-to-head trial (4). Tibolone did not increase risk for thromboembolic events in the LIFT trial, although data are limited (9).

    Figure 3.
    View larger version:
    Figure 3. Meta-analysis of risk ratios for incidence of venous thromboembolism.

    Error bars represent 95% CIs. IBIS-1 = International Breast Cancer Intervention Study; LIFT = Long-Term Intervention on Fractures with Tibolone; MORE = Multiple Outcomes of Raloxifene Evaluation; NSABP P-1 = National Surgical Adjuvant Breast and Bowel Project P-1 Study; RUTH = Raloxifene Use for the Heart.

    * Per 1000 women-years.

    † For tamoxifen trials, venous thromboembolic events include deep venous thrombosis and pulmonary embolism only. For other trials, additional thromboembolic events may be included.

    ‡ Events were reported from at least 3 months after treatment was stopped until the end of follow-up.

    Tamoxifen (7, 18, 20, 23), raloxifene (26, 40), and tibolone (9) did not increase risk for coronary heart disease events in placebo-controlled trials (Table), although data for tibolone are limited. Most trials used composite measures that included myocardial infarction, acute coronary syndrome, and severe angina (12). Tibolone caused more strokes than placebo in the LIFT trial, and these occurred more often in women older than age 70 years (9). Tamoxifen (7, 18, 20, 23) and raloxifene (26, 40) did not increase stroke incidence in placebo-controlled trials (Figure 4). However, in the RUTH trial, women randomly assigned to raloxifene had higher stroke mortality than those assigned to placebo (risk ratio, 1.49 [CI, 1.00 to 2.24]) (40).

    Figure 4.
    View larger version:
    Figure 4. Meta-analysis of risk ratios for incidence of stroke.

    Error bars represent 95% CIs. IBIS-1 = International Breast Cancer Intervention Study; LIFT = Long-Term Intervention on Fractures with Tibolone; MORE = Multiple Outcomes of Raloxifene Evaluation; NSABP P-1 = National Surgical Adjuvant Breast and Bowel Project P-1 Study; RUTH = Raloxifene Use for the Heart.

    * Per 1000 women-years.

    † Events were reported from at least 3 months after treatment was stopped until the end of follow-up.

    In placebo-controlled trials, tamoxifen caused more cases of endometrial cancer (risk ratio, 2.13 [CI, 1.36 to 3.32]; 3 trials) (7, 18, 20) (Figure 5). Women using tamoxifen also had more benign gynecologic conditions (18, 49); surgical procedures, including hysterectomy (18, 20, 49); and uterine bleeding (18, 49) than women using placebo. Raloxifene did not increase risk for endometrial cancer (33, 40) (Figure 5) or uterine bleeding (27, 40, 50–58). In the STAR head-to-head trial, raloxifene caused fewer cases of endometrial hyperplasia (risk ratio, 0.16 [CI, 0.09 to 0.29]) and was associated with fewer hysterectomies (risk ratio, 0.44 [CI, 0.35 to 0.56]) than was tamoxifen but did not reduce endometrial cancer (risk ratio, 0.62 [CI, 0.35 to 1.08]) (4).

    Figure 5.
    View larger version:
    Figure 5. Meta-analysis of risk ratios for incidence of endometrial cancer and cataracts.

    Error bars represent 95% CIs. LIFT = Long-Term Intervention on Fractures with Tibolone; MORE = Multiple Outcomes of Raloxifene Evaluation; NSABP P-1 = National Surgical Adjuvant Breast and Bowel Project P-1 Study.

    * Per 1000 women-years.

    † Rates were based on number of women with an intact uterus.

    ‡ The rate and risk ratio were recalculated on the basis of the number of women at risk (having an intact uterus). The values reported by the study were based on all randomly assigned participants.

    § The number of women at risk (having an intact uterus) was not reported, and the risk ratio is calculated on the basis of the number of randomly assigned participants at baseline.

    In the NSABP P-1 trial (8), women using tamoxifen had more cataract surgeries than those using placebo (risk ratio, 1.57 [CI, 1.16 to 2.14]), although risk for cataracts was not increased in combined estimates (8, 18, 20) (Figure 5). Raloxifene did not increase risk for cataracts (Figure 5) or cataract surgery in placebo-controlled trials (33, 40). Raloxifene caused fewer cataracts (risk ratio, 0.79 [CI, 0.68 to 0.92]) and cataract surgeries (risk ratio, 0.82 [CI, 0.68 to 0.99]) than tamoxifen in the STAR head-to-head trial (4).

    Most common side effects for tamoxifen are hot flashes and other vasomotor symptoms (8, 18, 20, 23) and vaginal discharge, itching, or dryness (8, 18, 20, 23). For raloxifene, vasomotor symptoms (27, 40, 52, 53, 55) and leg cramps (27, 40, 55) are most common. In STAR, raloxifene users reported more musculoskeletal problems, dyspareunia, and weight gain, whereas tamoxifen users had more gynecologic problems, vasomotor symptoms, leg cramps, and bladder control symptoms (16). Tibolone increases vaginal bleeding (59–61), but in contrast to tamoxifen and raloxifene, it reduces the number and severity of hot flashes (60, 62, 63).

    Variability of Outcomes in Population Subgroups

    Tamoxifen and raloxifene had similar effects on breast cancer outcomes regardless of age and family history of breast cancer in the STAR head-to-head trial (4).

    Tamoxifen reduced breast cancer incidence in several subgroups of women evaluated in placebo-controlled primary prevention trials (12). Results are similar for women age 50 years or younger (risk ratio, 0.68 [CI, 0.55 to 0.85]; 3 trials) versus those older than age 50 years (risk ratio, 0.68 [CI, 0.51 to 0.90]; 3 trials) (7, 18, 23); premenopausal (risk ratio, 0.63 [CI, 0.46 to 0.85]; 2 trials) versus postmenopausal (risk ratio, 0.68 [CI, 0.44 to 1.05]; 2 trials) women (18, 20); estrogen users (risk ratio, 0.75 [CI, 0.47 to 1.20]; 2 trials) versus nonusers (risk ratio, 0.68 [CI, 0.54 to 0.86]; 3 trials) (7, 18, 23); and those with a family history of breast cancer (risk ratio, 0.58 [CI, 0.46 to 0.73]; 1 trial) versus those with no family history (risk ratio, 0.54 [CI, 0.34 to 0.83]; 1 trial) (7). In the NSABP P-1 trial, cancer rates were highest and risk reduction greatest among women in the highest modified Gail model risk category (>5% 5-year risk) and among women with previous atypical hyperplasia (7). Some harms, including thromboembolic events, strokes, and endometrial cancer, were more common in older (age >50 years) than younger women in the NSABP P-1 trial (7).

    Breast cancer incidence results for raloxifene are similar among women who had previously used estrogen (risk ratio, 0.45 [CI, 0.19 to 1.07]; 2 trials) versus those who had not (risk ratio, 0.44 [CI, 0.30 to 0.65]; 2 trials) (36, 41) and among those with a body mass index of 25 mg/kg2 or less (risk ratio, 0.47 [CI, 0.17 to 1.33]; 2 trials) versus those with an index greater than 25 mg/kg2 (risk ratio, 0.43 [CI, 0.30 to 0.63]; 2 trials) (36, 41). Results also do not vary by age (36, 41), age at menarche (41), parity (41), and age at first live birth (41). Estimates from subgroups based on family history of breast cancer and previous hysterectomy or oophorectomy are limited by smaller numbers of participants (36, 41).

    Discussion

    In midlife and older women without preexisting breast cancer, tamoxifen, raloxifene, and tibolone reduce the risk for invasive breast cancer by 30% to 68%. Effects were similar for tamoxifen and raloxifene in the STAR head-to-head trial. Reduction of invasive breast cancer continued after discontinuation of tamoxifen therapy in the 2 trials that provided follow-up data. Tamoxifen and raloxifene reduced estrogen receptor–positive but not estrogen receptor–negative invasive breast cancer and had similar effects on these subtypes when directly compared. Tamoxifen reduced noninvasive breast cancer in the NSABP P-1 trial but not in the other tamoxifen trials. Raloxifene did not decrease noninvasive cancer, and the STAR trial suggested that more women using raloxifene had noninvasive breast cancer than those using tamoxifen. Tamoxifen and raloxifene reduced invasive breast cancer for all population subgroups evaluated; these subgroups have not been studied for tibolone.

    On the basis of our meta-analysis of placebo-controlled primary prevention trials, the number of cases of invasive breast cancer reduced is approximately 7 to 10/1000 women per year, assuming 5 years of use. Conclusions about the comparative effectiveness of medications are limited by the heterogeneity of participants in the trials and the existence of only 1 head-to-head trial. Trials demonstrate risk reduction not only for breast cancer but also for fractures, providing additional benefits for women. In the United States, the current choices of medications are raloxifene for postmenopausal women and tamoxifen for premenopausal and postmenopausal women.

    These medications also increase risks for serious and potentially life-threatening adverse events. Thromboembolic events are the most common serious complications, more so with tamoxifen than raloxifene in the STAR trial. Risk was increased by 60% to 90% in the placebo-controlled primary prevention trials that enrolled women with no history of thromboembolic events (4 to 7/1000 women per year). Clinicians considering these medications need to assess history and risk factors for thromboembolic events in treatment candidates. The effects of tamoxifen on endometrial cancer (4/1000 women per year), endometrial hyperplasia, and hysterectomy are also important. These problems could be avoided if use of the drug were limited to women who have had a hysterectomy. However, because tamoxifen is the only medication approved for use in premenopausal women, close monitoring of adverse uterine effects would be required for some users. Raloxifene and tamoxifen also cause adverse effects that could affect quality of life, such as hot flashes, vaginal symptoms, and musculoskeletal symptoms. Although tibolone is not currently available in the United States, its effect on stroke raises concern.

    This review is limited by potential biases and evidence gaps. These include publication bias and biases resulting from our selection criteria, such as using English-only reports. Trials may not have been truly blinded because side effects of active medications may have lead to differential ascertainment of outcomes. Active surveillance ended with completion of therapy in most trials, and important long-term outcomes may have been underreported. For some tamoxifen trials, participants randomly assigned to placebo switched to active medications after closure of the trial, compromising long-term tracking of outcomes. All efficacy trials were powered to detect statistical differences in breast cancer incidence, not adverse outcomes. Risks for some adverse outcomes may be underestimated because of lack of statistical power. Underestimation of adverse outcomes may also relate to inadequate ascertainment. For example, rates of cataracts and cataract surgery in the NSABP P-1 trial are substantially higher than rates in the other trials, most likely because the trial enlisted a more aggressive detection method (8).

    Evidence gaps include determination of optimal doses, duration and timing of use, persistence of effects after treatment, and outcomes in population subgroups. Data are lacking for nonwhite women, premenopausal women, and women who have comorbid conditions or are taking additional medications for other indications. Follow-up of women enrolled in existing trials would provide needed data on long-term outcomes. Additional evaluation of tibolone is necessary to determine important clinical outcomes, particularly regarding its safety. Trials of other emerging medications to reduce breast cancer risk, such as aromatase inhibitors and retinoids, will be needed as these are developed. Controlled trials of lifestyle modification interventions to reduce risk for breast cancer, such as weight loss and exercise, should also be explored.

    Clinical applications require caution. Women enrolled in clinical trials tend to be healthier than the general population and probably have fewer adverse outcomes. The heterogeneity of study populations must be considered in interpreting results of the trials. Results may best apply to patients with characteristics similar to those of the study participants. In general, tamoxifen results apply to younger premenopausal and postmenopausal women meeting breast cancer risk criteria; tibolone results, to older postmenopausal women with osteoporosis; and raloxifene results, to postmenopausal women meeting breast cancer risk criteria and to older postmenopausal women with osteoporosis, coronary heart disease, or risk factors for coronary heart disease. Before applying these findings to practice, clinicians must ensure that women understand their individual risks for breast cancer and can favorably balance these with the unwanted effects of risk-reducing medications.

    Appendix: Steps to Obtain Estimates for the Meta-AnalysisGraphic

    We abstracted or calculated estimates of risk ratios (rate ratio, hazard ratio, or relative risk) and their SEs from each study and used them as effect measures. For each outcome, we adopted the following steps to obtain the risk ratio and to account for the varying follow-up periods of the trials:

    1. If a study reported a rate ratio based on a Poisson model, wherein women-years of follow-up was incorporated in the estimates, or a hazard ratio from a Cox regression model, the reported estimate was used.

    2. If not, but the study reported the number of events and women-years of follow-up, or women-years of follow-up could be calculated from reported data, we calculated the rate ratio based on a Poisson distribution using the number of events and women-years of follow-up.

    3. If both 1 and 2 were not possible, we used the reported or calculated relative risk, which does not take into account the women-years of follow-up. However, the estimate of relative risk would be expected to be very close to the estimate of rate ratio because the mean or median follow-up time was similar between the treatment and control groups in the trials.

    We used similar steps to obtain risk ratios to determine event rates for both treatment and control groups. We provided the reported event rates per 1000 women-years if the study reported such data. Otherwise, if the study reported the number of events and women-years of follow-up, or women-years of follow-up could be calculated from the reported data, we calculated the event rates per 1000 women-years. When the event rates were not reported or calculable, we indicated them as such in our results.

    Article and Author Information

    • Disclaimer: The findings and conclusions in this document are those of the authors, who are responsible for its content, and do not necessarily represent the views of the Agency for Healthcare Research and Quality. No statement in this report should be construed as an official position of the Agency or of the U.S. Department of Health and Human Services.

    • Acknowledgment: The authors thank Andrew Hamilton, MLS, MS, and Rose Campbell, MLIS, MS, for literature searches and Jennifer Nguyen for administrative assistance at the Oregon Evidence-based Practice Center at the Oregon Health & Science University. They also acknowledge the contributions of Agency for Healthcare Research and Quality Officers Shilpa Amin, MD, MBsc, and Kenneth Lin, MD, and members of the Technical Expert Panel and expert reviewers.

    • Grant Support: This manuscript is based on research conducted by the Oregon Evidence-based Practice Center under contract to the Agency for Healthcare Research and Quality, Rockville, Maryland (contract no. 290-2007-10057-1).

    • Potential Conflicts of Interest: None disclosed.

    • Requests for Single Reprints: Heidi Nelson, MD, MPH, Oregon Evidence-based Practice Center, Oregon Health & Science University, Mailcode BICC, 3181 SW Sam Jackson Park Road, Portland, OR 97239-3098; e-mail, nelsonh{at}ohsu.edu.

    • Current Author Addresses: Drs. Nelson and Humphrey, Ms. Griffin, and Ms. Nygren: Oregon Evidence-based Practice Center, Oregon Health & Science University, Mailcode BICC, 3181 Southwest Sam Jackson Park Road, Portland, OR 97239-3098.

    • Dr. Fu: Oregon Evidence-based Practice Center, Oregon Health & Science University, Mailcode CB669, 3181 Southwest Sam Jackson Park Road, Portland, OR 97239-3098.

    • Dr. Smith: Oregon Evidence-based Practice Center, Oregon Health & Science University, Mailcode L475, 3181 Southwest Sam Jackson Park Road, Portland, OR 97239-3098.

    • Author Contributions: Conception and design: H.D. Nelson, J.C. Griffin, P. Nygren, L. Humphrey.

    • Analysis and interpretation of the data: H.D. Nelson, R. Fu, J.C. Griffin, P. Nygren, M.E.B. Smith, L. Humphrey.

    • Drafting of the article: H.D. Nelson, R. Fu, J.C. Griffin, P. Nygren, M.E.B. Smith, L. Humphrey.

    • Critical revision of the article for important intellectual content: H.D. Nelson, R. Fu, J.C. Griffin, P. Nygren, M.E.B. Smith, L. Humphrey.

    • Final approval of the article: H.D. Nelson, R. Fu, J.C. Griffin, P. Nygren, M.E.B. Smith, L. Humphrey.

    • Provision of study materials or patients: H.D. Nelson, M.E.B. Smith.

    • Statistical expertise: R. Fu.

    • Obtaining of funding: H.D. Nelson.

    • Administrative, technical, or logistic support: H.D. Nelson, J.C. Griffin, P. Nygren.

    • Collection and assembly of data: H.D. Nelson, R. Fu, J.C. Griffin, P. Nygren, M.E.B. Smith, L. Humphrey.

    References

    1. 1.
    2. 2.
    3. 3.
    4. 4.
    5. 5.
    6. 6.
    7. 7.
    8. 8.
    9. 9.
    10. 10.
    11. 11.
    12. 12.
    13. 13.
    14. 14.
    15. 15.
    16. 16.
    17. 17.
    18. 18.
    19. 19.
    20. 20.
    21. 21.
    22. 22.
    23. 23.
    24. 24.
    25. 25.
    26. 26.
    27. 27.
    28. 28.
    29. 29.
    30. 30.
    31. 31.
    32. 32.
    33. 33.
    34. 34.
    35. 35.
    36. 36.
    37. 37.
    38. 38.
    39. 39.
    40. 40.
    41. 41.
    42. 42.
    43. 43.
    44. 44.
    45. 45.
    46. 46.
    47. 47.
    48. 48.
    49. 49.
    50. 50.
    51. 51.
    52. 52.
    53. 53.
    54. 54.
    55. 55.
    56. 56.
    57. 57.
    58. 58.
    59. 59.
    60. 60.
    61. 61.
    62. 62.
    63. 63.
    64. 64.
    65. 65.
    66. 66.
    67. 67.
    68. 68.
    69. 69.
    70. 70.
    71. 71.
    72. 72.
    73. 73.