Home |
Current Issue |
Past Issues |
In the Clinic |
ACP Journal Club |
CME |
Collections |
Audio/Video |
Mobile |
Subscribe |
Tools |
Help |
ACP Online
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
15 October 1997 | Volume 127 Issue 5 Part 2 | Pages 757-763
The aim of many analyses of large databases is to draw causal inferences about the effects of actions, treatments, or interventions. Examples include the effects of various options available to a physician for treating a particular patient, the relative efficacies of various health care providers, and the consequences of implementing a new national health care policy. A complication of using large databases to achieve such aims is that their data are almost always observational rather than experimental. That is, the data in most large data sets are not based on the results of carefully conducted randomized clinical trials, but rather represent data collected through the observation of systems as they operate in normal practice without any interventions implemented by randomized assignment rules. Such data are relatively inexpensive to obtain, however, and often do represent the spectrum of medical practice better than the settings of randomized experiments. Consequently, it is sensible to try to estimate the effects of treatments from such large data sets, even if only to help design a new randomized experiment or shed light on the generalizability of results from existing randomized experiments. However, standard methods of analysis using available statistical software (such as linear or logistic regression) can be deceptive for these objectives because they provide no warnings about their propriety. Propensity score methods are more reliable tools for addressing such objectives because the assumptions needed to make their answers appropriate are more assessable and transparent to the investigator.
Propensity score technology, introduced by Rosenbaum and Rubin [1], addresses this situation by reducing the entire collection of background characteristics to a single composite characteristic that appropriately summarizes the collection. This reduction from many characteristics to one composite characteristic allows the straightforward assessment of whether the treatment and control groups overlap enough with respect to background characteristics to allow a sensible estimation of treatment versus control effects from the data set. Moreover, when such overlap is present, the propensity score approach allows a straightforward estimation of treatment versus control effects that reflects adjustment for differences in all observed background characteristics.
The potential for a large database to suggest causal effects of treatments is indicated in Table 1, adapted from Cochran's work [2], which concerns mortality rates per 1000 person-years for nonsmokers, cigarette smokers, and cigar and pipe smokers drawn from three large databases in the United States, the United Kingdom, and Canada. The treatment factor here involves three levels of smoking. The unadjusted mortality rates in Table 1 make it seem that cigarette smoking is good for health, especially relative to cigar and pipe smoking; clearly, this result is contrary to current wisdom. A problem with this naive conclusion is exposed in Table 1, where the average ages of the subpopulations are given. Age correlates with both mortality rates and smoking behavior. In this example, age is a confounding covariate, and conclusions about the effects of smoking should be adjusted for its effects. STATISTICAL METHODS
Estimating Causal Effects from Large Data Sets Using Propensity Scores
Many observational studies based on large databases attempt to estimate the causal effects of some new treatment or exposure relative to a control condition, such as the effect of smoking on mortality. In most such studies, it is necessary to control for naturally occurring systematic differences in background characteristics between the treatment group and the control group, such as age or sex distributions, that would not occur in the context of a randomized experiment. Typically, many background characteristics need to be controlled.
Subclassification on One Confounding Variable
![]()
Before describing the use of propensity scores in the statistical analysis of observational studies with many confounding background characteristics, I begin with an example showing how subclassification adjusts for a single confounding covariate, such as age, in a study of smoking and mortality. I then show how propensity score methods generalize subclassification in the presence of many confounding covariates, such as age, region of the country, and sex.
|
A straightforward way of adjusting for age is to 1) divide the population into age categories of approximately equal size [such as younger and older if two categories are appropriate; younger, middle-aged, and older if three are appropriate; and so on], 2) compare mortality rates within an age category [for example, compare mortality rates for the three treatment groups within the younger population and similarly for the older population], and 3) average the age-group-specific comparisons to obtain overall estimates of the age-adjusted mortality rates per 1000 person-years for each of the three groups. Table 1 shows the results for different numbers of age categories where the subclass-age boundaries were defined to have equal numbers of nonsmokers in each subclass. These results align better than the unadjusted mortality rates with our current understanding of the effects of smoking, especially when 9 to 11 subclasses are used. Incidentally, having approximately equal numbers of nonsmokers within each subclass is not necessary, but if the nonsmokers are considered the baseline group, it is a convenient and efficient choice because then the overall estimated effect is the simple unweighted average of the subclass-specific results. That is, the mortality rates in all three groups are being standardized [3] to the age distribution of nonsmokers as defined by their subclass counts.
Cochran [2] calls this method subclassification and offers theoretical results showing that as long as the treatment and exposure groups overlap in their age distributions (that is, as long as a reasonable number of persons from each treatment group are in each subclass), comparisons using five or six subclasses will typically remove 90% or more of the bias present in the raw comparisons shown in Table 1. More than five subclasses were used for the adjusted mortality rates because the large size of the data sets made it possible to do so.
A particular statistical model, such as a linear regression (or a logistic regression model; or in other settings, a hazard model) could have been used to adjust for age, but subclassification has two distinct advantages over such models, at least for offering initial trustworthy comparisons that are easy to communicate. First, if the treatment or exposure groups do not adequately overlap on the confounding covariate age, the investigator will see it immediately and be warned. Thus, if members of one group have ages outside the range of another group's ages, it will be obvious because one or more age-specific subclasses will consist almost solely of members exposed to one treatment. In contrast, nothing in the standard output of any regression modeling software will display this critical fact; the reason is that models predict an outcome (such as death) from regressors (such as age and treatment indicators), and standard regression diagnostics do not include careful analysis of the joint distribution of the regressors (such as a comparison of the distributions of age across treatment groups). When the overlap on age is too limited, the database, no matter how large, cannot support any causal conclusions about the differential effects of the treatments. For example, comparing 5-year survival rates among 70-year-old smokers and 40-year-old nonsmokers gives essentially no information about the effect of smoking or nonsmoking for either 70-year-old or 40-year-old persons.
The second reason for preferring subclassification to models concerns situations such as that found in Table 1, in which the groups overlap enough on the confounding covariate to make a comparison possible. Subclassification does not rely on any particular functional form, such as linearity, for the relation between the outcome (death) and the covariate (age) within each treatment group, whereas models do. If the groups have similar distributions of the covariate, such specific assumptions like linearity are usually harmless, but when the groups have different covariate distributions, model-based methods of adjustment are dependent on the specific form of the model (for example, linearity or log linearity) and their results are determined by untrustworthy extrapolations.
If standard models can be so dangerous, why are they commonly used for such adjustments when large databases are examined for estimates of causal effects? One reason is the ease with which automatic data analysis can be done using existing, pervasive software on plentiful, speedy hardware. A second reason is the seeming difficulty of using subclassification when many confounding covariates need adjustment, which is the common case. Standard modeling software can automatically handle many regressor variables and produce results, although they can be remarkably misleading. With many confounding covariates, however, the issues of lack of adequate overlap and reliance on untrustworthy model-based extrapolations are even more serious than with only one confounding covariate. The reason is that small differences in many covariates can accumulate into a substantial overall difference. For example, if members of one treatment or exposure group are slightly older, have slightly higher cholesterol levels, and have slightly more familial history of cancer, that group may be substantially less healthy. Moreover, although standard comparisons of means between the groups like those in Table 1, or comparisons of histograms for each confounding covariate among groups are adequate with one covariate, they are inadequate with more than one. The groups may differ in a multivariate direction to an extent that cannot be discerned from separate analyses of each covariate. This multivariate direction is closely related to the statistical concept of the best linear discriminant and intuitively is the single combination of the covariates on which the treatment groups are farthest apart.
Subclassification techniques can be applied with many covariates with almost the same reliability as with only one covariate. The key idea is to use propensity score techniques, as developed by Rosenbaum and Rubin [1]. These methods can be viewed as important extensions of discriminant matching techniques, which calculate the best linear discriminant between the treatment groups and match on it [4].
Since their introduction approximately 15 years ago, propensity score methods have been used in various applied problems in medical and other research disciplines [5-23] but not nearly as frequently as they should have been relative to model-based methods.
Propensity Score Methods
|
|---|
The basic idea of propensity score methods is to replace the collection of confounding covariates in an observational study with one function of these covariates, called the propensity score (that is, the propensity to receive treatment 1 rather than treatment 2). This score is then used just as if it were the only confounding covariate. Thus, the collection of predictors is collapsed into a single predictor. The propensity score is found by predicting treatment group membership (that is, the indicator variable for being in treatment group 1 as opposed to treatment group 2) from the confounding covariates, for example, by a logistic regression or discriminant analysis. In this prediction of treatment group measurement, it is critically important that the outcome variable (for example, death) play no role; the prediction of treatment group must involve only the covariates. Each person in the database then has an estimated propensity score, which is the estimated probability (as determined by that person's covariate values) of being exposed to treatment 1 rather than treatment 2. This propensity score is then the single summarized confounding covariate to be used for subclassification.
Subclassification into about five groups on the basis of the propensity score then has the rather remarkable property of adjusting for all of the covariates that went into its estimation, no matter how many there are. This is a large-sample claim that relies on certain conditions dealt with in technical statistical publications, but it is nevertheless an extremely useful guide for practice. The intuition behind the validity of this claim is fairly straightforward and proceeds as follows.
If two persons, one exposed to treatment 1 and the other exposed to treatment 2, had the same value of the propensity score, these two persons would then have the same predicted probability of being assigned to treatment 1 or treatment 2. Thus, as far as we can tell from the values of the confounding covariates, a coin was tossed to decide who received treatment 1 and who received treatment 2. Now suppose that we have a collection of persons receiving treatment 1 and a collection of persons receiving treatment 2 and that the distributions of the propensity scores are the same in both groups (as is approximately true within each propensity subclass). In subclass 1, the persons who received treatment 1 were essentially chosen randomly from the pool of all persons in subclass 1, and analogously for each subclass. As a result, within each subclass, the multivariate distribution of the covariates used to estimate the propensity score differs only randomly between the two treatment groups.
The formal proof of this result appears in Rosenbaum and Rubin [1]. Research on how well this theoretical result is satisfied when using estimated rather than true propensity scores is the topic of technical statistical publications [24-28]. Generally, the conclusion is that using estimated propensity scores in place of true propensity scores works very well.
Propensity Subclassification
|
|---|
4 cm [29]) were applied to match criteria for the randomized experiments. These restrictions reduced the database to 1106 women who received breast conservation therapy and 4220 who received mastectomy, for a total of 5326 women.
|
The U.S. Government Accounting Office used propensity score methods on the SEER database to compare the two treatments for breast cancer. First, approximately 30 potential confounding covariates and interactions were identified: year of diagnosis (1983-1985), age category (4 levels), tumor size, geographical registry (9 levels), race (4 levels), marital status (4 levels), and interactions of year and registry. A logistic regression was then used to predict treatment (mastectomy compared with conservation therapy) from these confounding covariates on the basis of data from the 5326 women. Each woman was then assigned an estimated propensity score, which was her probability, on the basis of her covariate values, of receiving breast conservation therapy rather than mastectomy. The group was then divided into five subclasses of approximately equal size on the basis of the womens' individual propensity scores: 1064 in the most mastectomy-oriented subclass, 1070 in the next subclass, 1059 in the middle subclass, 1067 in the next subclass, and 1066 in the most breast conservation-oriented subclass.
Before examining any outcomes (5-year survival results), the subclasses were checked for balance with respect to the covariates. Propensity score theory claims that if the propensity scores are relatively constant within each subclass, then within each subclass, the distribution of all covariates should be approximately the same in both treatment groups. This balance was found to be satisfactory. If important within-subclass differences between treatment groups had been found on some covariates, then either the propensity score prediction model would need to be reformulated or it would have been concluded that the covariate distributions did not overlap sufficiently to allow subclassification to adjust for these covariates. This process of cycling between checking for balance on the covariates and reformulating the propensity score model is described by Rosenbaum and Rubin [18] in the context of a study investigating coronary bypass surgery. For example, when the variances of an important covariate were found to differ importantly between treatment and control groups, then the square of that covariate was included in the revised propensity score model. For another example, if the correlations between two important covariates differed between the groups, then the product of the covariates was added to the propensity score model.
The estimates of 5-year survival rates made on basis of the resulting propensity score subclassification are given in Table 3 [29]. Total rates and rates excluding deaths unrelated to cancer are shown. Several features of Table 3 are particularly striking, especially when compared with the results of the randomized experiments shown in Table 2. First, the general conclusion of similar performance of both treatments is maintained. Second, although overall survival is similar across treatment groups, the results indicate that survival in general practice may be slightly lower than suggested by data from the population of women and types of clinics participating in the randomized clinical trials, especially in the single-clinic studies.
|
Third, results slightly indicate that, in general practice, women and their physicians may be making beneficial choices. More precisely, women in propensity subclasses 1 to 3, composed of patients whose characteristics (including age, size of tumor, and region of country) make them relatively more likely to receive mastectomy than with breast conservation therapy. In contrast, for women in propensity subclasses 4 and 5 (whose characteristics make them relatively more likely to receive breast conservation therapy than mastectomy), there seems to be no advantage to mastectomy and possibly a slight advantage to breast conservation therapy. Of course, this last interpretation is subject to two caveats. First, we only adjusted for the covariates that were used to estimate the propensity score; hence, other hidden covariates may alter this interpretation. In a randomized experiment, the effects of these hidden covariates are reflected in the SEs of the estimates, but in an observational study, these effects can create bias not reflected in the SEs. Second, the sampling variability (that is, SEs) of the results do not permit firm conclusions, even if the collection of confounding covariates was sufficient to remove bias in this observational study.
Although there is no randomized assignment in the SEER database, the propensity score analyses seem to provide useful suggestive results, especially when coupled with the results of the randomized experiments, with which they are consistent.
More Than Two Treatment Conditions
|
|---|
First, consider what we could have learned if the nonsmokers and cigarette smokers had had adequately overlapping age distributions, but the cigar and pipe smokers had been substantially older than persons in either of the other groups, with essentially no overlap with the cigarette smokers or the nonsmokers. Even with only one covariate, with more than two groups, the groups in one two-group comparison (nonsmokers compared with cigarette smokers) may overlap adequately, whereas for all other comparisons (in this example, those involving cigar and pipe smokers), the overlap may be inadequate. A typical model-based analysis would use all the data to provide estimates for all three two-group comparisons, even using the data from the cigar and pipe smokers to influence the comparison between the nonsmokers and the cigarette smokers, with no warning of either the extreme extrapolations involved in two of the three two-group comparisons or the use of data on cigar and pipe smokers to help estimate the comparison of nonsmokers and cigarette smokers.
Let us again modify the Cochran [2] smoking example but now include an additional covariate: an index of socioeconomic status. We assume that nonsmokers and cigarette smokers have adequate overlap in their age distributions but not much overlap in their socioeconomic status distributions, with nonsmokers having higher socioeconomic status values. In contrast, we suppose that nonsmokers and cigar and pipe smokers have substantial overlap in their socioeconomic distributions but have essentially no overlap in their age distributions. This scenario illustrates that with more than two groups and more than one covariate, the comparison of one pair of groups can be compromised by one covariate and the comparison of another pair of groups can be compromised by a different covariate. As discussed earlier, typical model-based analyses provide no warning that comparisons may be based on extreme extrapolations, nor do they show that the extrapolations include data from groups that are not in the pair of groups being compared.
Now suppose that the nonsmokers and cigarette smokers have the same age distributions and adequately overlapping socioeconomic status distributions. For this comparison, age needs no adjustment but socioeconomic status does need to be adjusted. The propensity score for the comparison would essentially equal socioeconomic status because it, and not age, would predict being a cigarette smoker as opposed to being a nonsmoker. Thus, for this comparison, adjusting for the propensity score would be the same as adjusting for socioeconomic status. Now also assume that the nonsmokers and cigar and pipe smokers have the same socioeconomic status distributions, so that socioeconomic status needs no adjustment, and have adequately overlapping age distributions that need adjustment. Then the propensity score for this comparison would equal age, and therefore, adjusting for the propensity score would be the same as adjusting for age. Thus, the propensity score for a comparison of one pair of groups generally needs to be different from that for a comparison of a different pair of groups. To complete the current scenario, assume that cigarette smokers and cigar and pipe smokers have adequate overlap in both age and socioeconomic status and that both need adjustment. The propensity score for this comparison would involve both age and socioeconomic status because both help to predict cigarette group membership, as opposed to cigar and pipe smoking group membership, and adjusting for this propensity score would adjust for both age and socioeconomic status. Clearly, different propensity score models are needed to adjust appropriately for different comparisons. Estimating all effects by using one model in our example with three groups and adequate overlap on all covariates can be even more deceptive than estimation in the two-group setting because the model being used to compare one pair of groups (for example, nonsmokers compared with cigarette smokers) is affected by the data from the third group (here cigar and pipe smokers), which probably has covariate values that differ from those in either one of the other two groups being compared.
Limitations of Propensity Scores
|
|---|
In observational studies, confidence in causal conclusions must be built by seeing how consistent the obtained answers are with other evidence (such as results from related experiments) and how sensitive the conclusions are to reasonable deviations from assumptions, as illustrated by Connors and colleagues [20], who used techniques from Rosenbaum and Rubin's work [30]. Such sensitivity analyses suppose that a relevant but unobserved covariate has been left out of the propensity score model. By explicating how this hypothetical unmeasured covariate is related to treatment assignment and outcome, we can obtain an estimate of the treatment effect that adjusts for it as well as for measured covariates and hereby investigate how answers might change if such a covariate were available for adjustment. Of course, medical knowledge is needed when assessing whether the posited relations involving the hypothetical unmeasured covariate are realistic or extreme. Clarifications of nomenclature and extended sensitivity analyses reported by Lin and colleagues [31] moderate the initial conclusions of Connors and colleagues [20].
Another limitation of propensity score methods is that they work better in larger samples for the following reason. The distributional balance of observed covariates created by subclassifying on the propensity score is an expected balance, just as the balance of all covariates in a randomized experiment is an expected balance. In a small randomized experiment, random imbalances of some covariates can be substantial despite randomization; analogously, in a small observational study, substantial imbalances of some covariates may be unavoidable despite subclassification using a sensibly estimated propensity score. The larger the study, the more minor are such imbalances.
A final possible limitation of propensity score methods is that a covariate related to treatment assignment but not to outcome is handled the same as a covariate with the same relation to treatment assignment but strongly related to outcome. This feature can be a limitation of propensity scores because inclusion of irrelevant covariates reduces the efficiency of the control on the relevant covariates. However, recent work [28] suggests that, at least in modest or large studies, the biasing effects of leaving out even a weakly predictive covariate dominate the efficiency gains from not using such a covariate. Thus, in practice, this limitation may not be substantial if investigators use some judgment.
Conclusion
|
|---|
|
|
|---|
Author and Article Information
|
|---|
|
|
|---|
References
|
|---|
|
|
|---|
1. Rosenbaum P, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983; 70:41-55.
2. Cochran WG. The effectiveness of adjustment by subclassification in removing bias in observational studies. Biometrics. 1968; 24:295-313.
3. Finch PE. Standardization. In: Kotz S, Johnson NL, eds. Encyclopedia of Statistical Sciences. v 8. New York: Wiley; 1988:629-32.
4. Rubin DB. Bias reduction using Mahalanobis' metric matching. Biometrics. 1980:36; 295-8.
5. Aiken LH, Smith HL, Lake ET. Lower Medicare mortality among a set of hospitals known for good nursing care. Med Care. 1994; 32:771-87.
6. Cook EF, Goldman L. Asymmetric stratification. An outline for an efficient method for controlling confounding in cohort studies. Am J Epidemiol. 1989; 127:626-39.
7. Cook EF, Goldman L. Performance of tests of significance based on stratification by a multivariate confounder score or by a propensity score. J Clin Epidemiol. 1989; 42:317-24.
8. Eastwood EA, Fisher GA. Skills acquisition among matched samples of institutionalized and community-based persons with mental retardation. Am J Ment Retard. 1988; 93:75-83.
9. Fiebach NH, Cook EF, Lee TH, Brand DA, Rouan GW, Weisberg M, et al. Outcomes in patients with myocardial infarction who are initially admitted to stepdown units: data from the Multicenter Chest Pain Study. Am J Med. 1990; 89:15-20.
10. Harrell FE Jr, Marcus SE, Layde PM, Broste SK, Cook EF, Wagner DP, et al. Statistical methods in SUPPORT. J Clin Epidemiol. 1990; 43(Suppl):89S-98S.
11. Kane R, Garrad J, Buchanan J, Rosenfeld A, Skay C, McDermott S. Improving primary care in nursing homes. J Am Geriatr Soc. 1991; 39:359-67.
12. Lavori PW, Keller MB, Endicott J. Improving the validity of FH-RDC diagnosis of major affective disorder in uninterviewed relatives in family studies: a model based approach. J Psychiatr Res. 1988; 22:249-59.
13. Lavori PW, Keller MB. Improving the aggregate performance of psychiatric diagnostic methods when not all subjects receive the standard test. Stat Med. 1988; 7:727-37.
14. Myers WO, Gersh BJ, Fisher LD, Mock MB, Holmes DR, Schaff HV, et al. Medical versus early surgical therapy in patients with triple-vessel disease and mild angina pectoris: a CASS registry study of survival. Ann Thorac Surg. 1987; 44:471-86.
15. Stone RA, Obrosky DS, Singer DE, Kapoor WN, Fine MJ. Propensity score adjustment for pretreatment differences between hospitalized and ambulatory patients with community-acquired pneumonia. Pneumonia Patient Outcomes Research Team (PORT) Investigators. Med Care. 1995; 33(4 Suppl):AS56-66.
16. Willoughby A, Graubard BI, Hocker A, Storr C, Vietze P, Thackaberry JM, et al. Population-based study of the developmental outcome of children exposed to chloride-deficient infant formula. Pediatrics. 1990; 85:485-90.
17. Drake C, Fisher L. Prognostic models and the propensity score. Int J Epidemiol. 1995; 24:183-7.
18. Rosenbaum PR, Rubin DB. Reducing bias in observational studies using subclassification on the propensity score. Journal of the American Statistical Association. 1984; 79:516-24.
19. Reinisch J, Sanders SA, Mortensen EL, Rubin DB. In utero exposure to phenobarbital and intelligence deficits in adult men. JAMA. 1995; 274:1518-25.
20. Connors AF Jr, Speroff T, Dawson NV, Thomas C, Harrell FE Jr, Wagner D, et al. The effectiveness of right heart catheterization in the initial care of critically ill patients. SUPPORT Investigators. JAMA. 1996; 276:889-97.
21. Gu XS, Rosenbaum PR. Comparison of multivariate matching methods: structures, distances, and algorithms. Journal of Computational and Graphical Statistics. 1993; 2:405-520.
22. Rubin, DB, Rosenbaum PR. Constructing a control group using multivariate matched sampling methods that incorporate the propensity score. The American Statistician. 1985; 39:33-8.
23. Rosenbaum PR, Rubin DB. The bias due to incomplete matching. Biometrics. 1985; 41:103-16.
24. Rubin DB. Assessing the fit of logistic regressions using the implied discriminant analysis. Discussion of "Graphical Methods for Assessing Logistic Regression Models" by Landwehr, Pregibone, and Smith. Journal of the American Statistical Association. 1984; 79:79-80.
25. Drake C. Effects of misspecification of the propensity score on estimators of treatment effect. Biometrics. 1993; 49:1231-36.
26. Rubin DB, Thomas N. Affinely invariant matching methods with ellipsoidal distributions. The Annals of Statistics. 1992; 20:1079-93.
27. Rubin DB, Thomas N. Characterizing the effect of matching using linear propensity score methods with normal covariates. Biometrika. 1992; 79:797-809.
28. Rubin DB, Thomas N. Matching using estimated propensity scores: relating theory to practice. Biometrics. 1996; 52:249-64.
29. U.S. General Accounting Office. Breast conservation versus mastectomy: patient survival in day-to-day medical practice and randomized studies: report to the chairman, Subcommitee on Human Resources and Intergovernmental Relations, Committee on Government Operations, House of Representatives Washington, DC: U.S. General Accounting Office; 1994. Report #GAO-PEMD-95-9.
30. Rosenbaum PR, Rubin DB. Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. The Journal of the Royal Statistical Society. Series B. 1983; 45:212-8.
31. Lin DY, Psaty BM, Kronmal RA. Assessing the sensitivity of regression results to unmeasured confounders in observational studies. Seattle: Univ Washington School of Public Health; 1997. Technical Report #144.
This article has been cited by other articles:
![]() |
J. B. Christian, K. L. Lapane, A. L. Hume, C. B. Eaton, and M. A. Weinstock Association of ACE Inhibitors and Angiotensin Receptor Blockers with Keratinocyte Cancer Prevention in the Randomized VATTC Trial J Natl Cancer Inst, September 3, 2008; 100(17): 1223 - 1232. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Ensrud, T. S. Walczak, T. L. Blackwell, E. R. Ensrud, E. Barrett-Connor, E. S. Orwoll, and For the Osteoporotic Fractures in Men (MrOS) Study Antiepileptic drug use and rates of hip bone loss in older men: A prospective study Neurology, September 2, 2008; 71(10): 723 - 730. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. M. Lattouf, V. H. Thourani, P. D. Kilgo, M. E. Halkos, K. T. Baio, R. Myung, W. A. Cooper, R. A. Guyton, and J. D. Puskas Influence of On-Pump Versus Off-Pump Techniques and Completeness of Revascularization on Long-Term Survival After Coronary Artery Bypass Ann. Thorac. Surg., September 1, 2008; 86(3): 797 - 805. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Lin, R. A. Weinstein, and B. Hota Delay of Active Antimicrobial Therapy and Mortality among Patients with Bacteremia: Impact of Severe Neutropenia Antimicrob. Agents Chemother., September 1, 2008; 52(9): 3188 - 3194. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. C. Campbell, X. Sui, G. Filippatos, T. E. Love, C. Wahle, P. W. Sanders, and A. Ahmed Association of chronic kidney disease with outcomes in chronic heart failure: a propensity-matched study Nephrol. Dial. Transplant., August 18, 2008; (2008) gfn445v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. S. Khalafi, D. W. Bradford, and M. G. Wilson Topical application of autologous blood products during surgical closure following a coronary artery bypass graft Eur. J. Cardiothorac. Surg., August 1, 2008; 34(2): 360 - 364. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Forsyth, M. Hearst, J. M. Oakes, and K. H. Schmitz Design and Destinations: Factors Influencing Walking and Total Physical Activity Urban Stud, August 1, 2008; 45(9): 1973 - 1996. [Abstract] [PDF] |
||||
![]() |
A. S. Kazley and Y. A. Ozcan Do Hospitals With Electronic Medical Records (EMRs) Provide Higher Quality Care?: An Examination of Three Clinical Conditions Med Care Res Rev, August 1, 2008; 65(4): 496 - 513. [Abstract] [PDF] |
||||
![]() |
J.-C. Roze, S. Denizot, R. Carbajal, P.-Y. Ancel, M. Kaminski, C. Arnaud, P. Truffert, S. Marret, J. Matis, G. Thiriez, et al. Prolonged Sedation and/or Analgesia and 5-Year Neurodevelopment Outcome in Very Preterm Infants: Results From the EPIPAGE Cohort Arch Pediatr Adolesc Med, August 1, 2008; 162(8): 728 - 733. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Cummings Propensity Scores Arch Pediatr Adolesc Med, August 1, 2008; 162(8): 734 - 737. [Full Text] [PDF] |
||||
![]() |
B. Pitt, A. Ahmed, T. E. Love, H. Krum, J. Nicolau, J. S. Cardoso, A. Parkhomenko, M. Aschermann, R. Corbalan, H. Solomon, et al. History of Hypertension and Eplerenone in Patients With Acute Myocardial Infarction Complicated by Heart Failure Hypertension, August 1, 2008; 52(2): 271 - 278. [Abstract] [Full Text] [PDF] |
||||
![]() |
I M Tleyjeh, J M Steckelberg, G Georgescu, H M K Ghomrawi, T L Hoskin, F B Enders, F Mookadam, W C Huskins, W R Wilson, and L M Baddour The association between the timing of valve surgery and 6-month mortality in left-sided infective endocarditis Heart, July 1, 2008; 94(7): 892 - 896. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Rahme PhD, K. Dasgupta MD MSc, M. Burman MD, H. Yin PhD, S. Bernatsky MD PhD, G. Berry MD, H. Nedjar MSc, and S. R. Kahn MD MSc Postdischarge thromboprophylaxis and mortality risk after hip-or knee-replacement surgery Can. Med. Assoc. J., June 3, 2008; 178(12): 1545 - 1554. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. F. Mamoun, M. Xu, D. I. Sessler, J. F. Sabik, and C. A. Bashour Propensity matched comparison of outcomes in older and younger patients after coronary artery bypass graft surgery. Ann. Thorac. Surg., June 1, 2008; 85(6): 1974 - 1979. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ali, Y. Abu-Omar, A. Patel, Z. Ali, A. Y. Sheikh, A. Akhtar, A. Pavlovic, P. Theodorou, T. Athanasiou, and J. Pepper Valve failure following homograft aortic valve replacement: does implantation technique have an effect? Eur. Heart J., June 1, 2008; 29(11): 1454 - 1462. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D Graham, A. G Hartzema, I. S Sketris, and A. G Winterstein Effect of an Academic Detailing Intervention on the Utilization Rate of Cyclooxygenase-2 Inhibitors in the Elderly Ann. Pharmacother., June 1, 2008; 42(6): 749 - 756. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Mujahid, A. V. D. Roux, M. Shen, D. Gowda, B. Sanchez, S. Shea, D. R. Jacobs Jr., and S. A. Jackson Relation between Neighborhood Environments and Obesity in the Multi-Ethnic Study of Atherosclerosis Am. J. Epidemiol., June 1, 2008; 167(11): 1349 - 1357. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. W. Groeneveld, M. A. Matta, A. P. Greenhut, and F. Yang Drug-Eluting Compared With Bare-Metal Coronary Stents Among Elderly Patients J. Am. Coll. Cardiol., May 27, 2008; 51(21): 2017 - 2024. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. K. Miller, F. D. Wolinsky, E. M. Andresen, T. K. Malmstrom, and J. P. Miller Adverse Outcomes and Correlates of Change in the Short Physical Performance Battery Over 36 Months in the African American Health Project J. Gerontol. A Biol. Sci. Med. Sci., May 1, 2008; 63(5): 487 - 494. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. P. Barth, Shenyang Guo, and J. S. McCrae Propensity Score Matching Strategies for Evaluating the Success of Child and Family Service Programs Research on Social Work Practice, May 1, 2008; 18(3): 212 - 222. [Abstract] [PDF] |
||||
![]() |
E. Spaepen, N. Demarteau, S. Van Belle, and L. Annemans Health Economic Evaluation of Treating Anemia in Cancer Patients Receiving Chemotherapy: A Study in Belgian Hospitals Oncologist, May 1, 2008; 13(5): 596 - 607. [Abstract] [Full Text] [PDF] |
||||
![]() |
D.-W. Park, S.-C. Yun, S.-W. Lee, Y.-H. Kim, C. W. Lee, M.-K. Hong, J.-J. Kim, S. J. Choo, H. Song, C. H. Chung, et al. Long-Term Mortality After Percutaneous Coronary Intervention With Drug-Eluting Stent Implantation Versus Coronary Artery Bypass Surgery for the Treatment of Multivessel Coronary Artery Disease Circulation, April 22, 2008; 117(16): 2079 - 2086. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Nathan and T. M. Pawlik Limitations of Claims and Registry Data in Surgical Oncology Research Ann. Surg. Oncol., February 1, 2008; 15(2): 415 - 423. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. A. Vohra, T. Bahrami, S. Farid, A. Mafi, G. Dreyfus, M. Amrani, and J. A.R. Gaer Propensity score analysis of early and late outcome after redo off-pump and on-pump coronary artery bypass grafting Eur. J. Cardiothorac. Surg., February 1, 2008; 33(2): 209 - 214. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Farwell, R. E. Scranton, E. V. Lawler, R. A. Lew, M. T. Brophy, L. D. Fiore, and J. M. Gaziano The Association Between Statins and Cancer Incidence in a Veterans Population J Natl Cancer Inst, January 16, 2008; 100(2): 134 - 139. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Olsen, S. Chu-Ongsakul, K. E. Brandt, J. R. Dietz, J. Mayfield, and V. J. Fraser Hospital-Associated Costs Due to Surgical Site Infection After Breast Surgery Arch Surg, January 1, 2008; 143(1): 53 - 60. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. From, B. J. Bartholmai, A. W. Williams, S. S. Cha, A. Pflueger, and F. S. McDonald Sodium Bicarbonate is Associated with an Increased Incidence of Contrast Nephropathy: A Retrospective Cohort Study of 7977 Patients at Mayo Clinic Clin. J. Am. Soc. Nephrol., January 1, 2008; 3(1): 10 - 18. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ahmed, I. B Aban, V. Vaccarino, D. M Lloyd-Jones, D. C Goff Jr, J. Zhao, T. E Love, C. Ritchie, F. Ovalle, G. Gambassi, et al. A propensity-matched study of the effect of diabetes on the natural history of heart failure: variations by sex and age Heart, December 1, 2007; 93(12): 1584 - 1590. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Tinklenberg, H. C. Kraemer, K. Yaffe, L. Ross, J. Sheikh, J. W. Ashford, J. A. Yesavage, and J. L. Taylor Donepezil Treatment and Alzheimer Disease: Can the Results of Randomized Clinical Trials Be Applied to Alzheimer Disease Patients in Clinical Practice? Am J Geriatr Psychiatry, November 1, 2007; 15(11): 953 - 960. [Abstract] [Full Text] [PDF] |
||||