Home |
Current Issue |
Past Issues |
In the Clinic |
ACP Journal Club |
CME |
Collections |
Audio/Video |
Mobile |
Subscribe |
Tools |
Help |
ACP Online
|
1 July 1998 | Volume 129 Issue 1 | Pages 42-48
This paper reviews methods commonly used to assess the clinical competence of residents in internal medicine, including the In-Training Examination, medical record audits, rating scales, clinical evaluation exercises, and the use of standardized patients. Studies were identified through a MEDLINE search (1966 to present) and from the bibliographies of relevant articles and were selected for inclusion according to consensus between the authors. Whenever possible, original studies were chosen over reviews and editorials.
No single assessment method can successfully evaluate the clinical competence of residents in internal medicine, and educators need to be cognizant of the most appropriate applications and the advantages and disadvantages of the available evaluation tools.A combination of assessment tools provides the best opportunity to evaluate and educate physicians-in-training.
This article reviews the major instruments available to educators for the assessment of clinical competence during residency: the In-Training Examination, the ABIM Longitudinal Evaluation Form (rating scale), medical record audits, the clinical evaluation exercise (CEX), and the use of standardized patients (Table 1). Because use of these evaluation tools is voluntary and is not required for certification, we focus on the advantages and disadvantages of these tools in an attempt to help educators decide which combination best provides the information they need to assess their residents' knowledge, skills, and attitudes. ACADEMIA AND CLINIC
Methods for Evaluating the Clinical Competence of Residents in Internal Medicine: A Review
The evaluation of clinical competence is a major responsibility of medical educators, and forces within and outside of organized medicine are pushing training programs to establish and enforce standards of clinical competence [1-5]. Both the American Board of Internal Medicine (ABIM) and the Accreditation Council for Graduate Medical Education have developed specific guidelines for the evaluation of housestaff [1, 6]. Although it is considered a benchmark for certification, the ABIM certifying examination is inadequate as the sole measure of clinical competence. For this reason, the ABIM actively encourages the use of other forms of competency assessment.
|
The term competence is often used broadly to incorporate the domains of knowledge, skills, and attitudes. We should note, however, that other educators define competence more narrowly. Miller [4] divides clinical ability into a four-step pyramid: knowledge (knows), competence (knows how), performance (shows how), and action (does). Regardless of the framework chosen, clinical ability-or competence-is a multidimensional construct, and, as Miller notes [4], "no single assessment method can provide all the data required for judgment of anything so complex as the delivery of professional services by a successful physician."
Methods
|
|---|
|
|
|---|
Medical Record Audits
|
|---|
Chart review can change the clinical practice of residents. When combined with feedback, it has been used successfully to improve the preventive care practices of residents, assess physical examination skills, promote the cost-effective use of laboratory tests, and improve record documentation [8-16]. Reviews can also be conducted longitudinally over time, providing many opportunities for evaluation. Audits can obtain a high level of reliability when explicit criteria are used [8-13]. Chart review is also an excellent method with which to involve housestaff in self-assessment, peer assessment, and quality improvement [17, 18].
The major disadvantages of chart audit are related to the quality of the documentation. Medical records may not accurately reflect what occurred during the clinical encounter; may not include pertinent information that was, in fact, collected; and may fail to justify impressions and plans. Tugwell and Dok [7] summarized the main weakness of chart review as "the fact that records are used more as an aide-de-memoir rather than a documentation of the justification for management decisions, which continues to compromise the validity of the medical record." Norman and colleagues [19] found that physicians often failed to completely record information obtained and procedures performed; diagnosis, patient education, and procedures were undocumented up to 50% of the time. In addition, lack of direct observation of resident-patient interactions makes a valid assessment of the "quality" of the visit difficult, particularly with regard to the accuracy of physical examination findings, patient education, interviewing skills, and judgment [20, 21].
The nature of the criteria used for chart audit is important. Studies using implicit review (for example, subjective or qualitative impressions) have reported low reliability among reviewers [22]. Ognibene and Norman and their colleagues [15, 19] noted that many charts are needed to ensure reasonable reliability even when explicit criteria are used. This ultimately means that chart review requires a substantial time commitment by faculty. Trained ancillary personnel can perform certain reviews, but the more complex the target of review, the more likely it is that physician involvement will be necessary.
In summary, chart audit is a useful evaluation tool that can provide important information about a trainee's clinical skills. When accompanied by specific feedback, it can lead to changes in behavior. However, the achievement of optimal results may require a substantial time commitment from faculty, and problems with the assessment of accuracy limit the value of chart audit for summative resident evaluations.
The In-Training Examination
|
|---|
The examination has several advantages. It is a 1-day test that primarily measures a resident's knowledge base, and its overall reliability coefficients are consistently greater than 0.9. However, reliability coefficients for the subsections of the test range from 0.54 to 0.80; this is probably because each section has fewer questions [23].
Several studies [24-27] have shown the predictive validity of the In-Training Examination for subsequent performance on the ABIM certifying examination [24-27]. Grossman and colleagues [25] found that the In-Training Examination results for 109 postgraduate year 2 (PGY-2) residents explained up to 70% of the variation in scores on the ABIM certifying examination. Scores above and below the 35th percentile (relative to a national peer group) had a positive predictive value of 89% and a negative predictive value of 83% for a pass or fail, respectively, on the ABIM certifying examination [25]. Waxman and coworkers [24], in a study of eight programs and 223 PGY-2 residents, found that a percentile score greater than 70 predicted a 100% pass rate [24]. However, sensitivity and specificity for the examination vary depending on the composition of the group studied.
The In-Training Examination also complements other aspects of resident evaluation by serving as an objective measure of aspects of a resident's knowledge that are not accurately assessed by other methods. In our own program, faculty could identify residents at risk for failing the ABIM certifying examination only about one third of the time (Sumption KF. The In-Training Examination in internal medicine. Presented at Navy ACP meeting, 4 November 1996, San Diego, California). In a related study, family practice program directors accurately predicted only 25% of their residents' scores on a licensing examination [28].
Although the In-Training Examination has predictive power, studies show that earlier examinations, such as the National Board of Medical Examiners tests, also have predictive accuracy for the ABIM certifying examination [29, 30]. Is the information gained through the In-Training Examination worth the cost of the test (approximately $75.00 per examinee)? The Examination may measure certain important abilities, such as analytical and interpretive skills, less accurately and comprehensively than other instruments, and it may not identify deficiencies in subspecialty knowledge.
Despite these limitations, the number of residents who take the examination continues to increase, and many residents take the examination in all 3 years of their training. In our experience, residents are increasingly aware of the examination's predictive value for the ABIM certifying examination, and more than 90% of our residents modified their schedules or study habits on the basis of the examination results (Sumption KF. The In-Training Examination in internal medicine). Therefore, the In-Training Examination is important to housestaff and is a useful tool for knowledge assessment for program directors and residents alike. It remains to be shown whether feedback from the In-Training Examination leads to better performance on the ABIM certifying examination or improves the competence of residents in any other way.
The American Board of Internal Medicine Longitudinal Evaluation Form
|
|---|
The ABIM rating form uses a 9-point scale: Scores 1 to 3 denote unsatisfactory performance, scores 4 to 6 denote satisfactory performance, and scores 7 to 9 denote superior performance. The categories evaluated are clinical judgment, medical knowledge, history taking and physical examination skills, procedural skills, interpersonal skills, medical care, attitudes and professionalism, and overall competence. For each domain, the form provides "behaviorally anchored" descriptors at each end of the scale to guide the rater.
The rating scale is easy to use and time friendly. The form, which is less intrusive than other evaluation tools, provides an opportunity for a composite evaluation of a resident's knowledge, skills, and attitudes over a period of time [31]. The rating scale is an excellent template for communicating expectations and objectives or giving feedback to residents; most forms allow space for written feedback and evaluation. Ratings forms for use by peers and patients have also been developed [32-35]. Evaluations by peers and patients appear to complement faculty ratings in the areas of professionalism and humanism [32-35].
Unfortunately, rating scales as they are generally used do not effectively discriminate among the domains of clinical competence in individual residents or even between the competence levels of different trainees. The "halo effect," under which all ratings are unduly influenced in a positive way by a single characteristic, is evident in evaluations of residents. Thompson and colleagues [36] noted that 96% of all rating scores for 85 residents fell between 6 and 9. Factor analysis shows that ratings tend to be weighted heavily on perceived knowledge and interpersonal skills [36, 37]. Standardizing the observation of the behavior of interest and defining the nomenclature for the desired expectations may help reduce errors resulting from the halo effect [38].
Why are rating scales associated with such difficulties? Important domains, such as history taking, physical examination, and interpersonal skills, are often not directly observed by faculty. Evaluations, which are based on limited direct observation, usually reflect only skills used in case presentations and team management. Thus, the major limitation of the ABIM form is really a reflection of the skill of the raters, and major changes in the forms alone do not improve reliability or discrimination [39]. More research is needed on the training of raters in the effective use of these forms, especially given the central and vital role of the ABIM evaluation form for the assessment of residents' competence.
Performance-Based Assessment
|
|---|
Program directors may not feel a compelling need to directly observe the above skills and attitudes because they believe that these skills and attitudes have been sufficiently mastered in medical school. However, trainees often enter residencies with significant deficiencies in clinical skills [40-42]. Furthermore, recent work clearly shows the striking lack of proficiency in physical examination skills among residents [43, 44]. The following sections review evaluation methods that involve direct observation: the CEX and standardized-patient-based testing.
The Clinical Evaluation Exercise
The CEX was conceived as part of the ABIM's clinical competency program after the oral portion of the ABIM certifying examination was abandoned. It is designed to introduce the direct observation of a trainee's clinical competence and to assess integration of clinical skills. The ABIM recommends that the CEX be performed during the first 6 months of internship so that deficiencies can be detected early. In the CEX, which is used by 80% to 88% of training programs, the resident is observed while taking a comprehensive history and doing a physical examination. He or she then presents the case and discusses management plans with the faculty observer [41].
Recently, the ABIM introduced the "mini-CEX," a new format designed to evaluate residents in a setting that better reflects day-to-day practice [42]. In this exercise, which is designed to last approximately 20 minutes, a resident is directly observed while taking a focused history and doing a physical examination in a clinic, an emergency department, or an inpatient ward. On completion of the exercise, the observer gives the resident feedback and completes a 9-point rating scale.
Both the CEX and the mini-CEX use direct observation of residents and give the opportunity for immediate feedback. These exercises are less expensive than the use of standardized patients because actual patients are used. The CEX also promotes one-on-one "bonding" with a faculty member, and resident satisfaction with the exercise is high [42].
For the evaluation of competency, however, the CEX has several important problems, most notably inter-rater variability [41, 45-47]. Noel and colleagues [46] found a wide range of ratings produced by faculty from 12 teaching hospitals. Problems included frequent failure to note both excellent and poor clinical skills and substantial disagreement in global assessments of resident performance, including pass-fail determinations. Kroboth and coworkers [48] concluded that the CEX has to be done 6 to 10 times to achieve a reliability coefficient of 0.8.
Other difficulties with the CEX include limited reliability due to the "content specificity" of clinical skills. The quality of physician performance varies from patient to patient, and a single observation with the traditional format of the CEX is not sufficient to form an accurate impression of an examinee's clinical skills. The mini-CEX may alleviate this problem by sampling more cases with variable medical content, but more data are needed on this point [42].
The traditional CEX requires substantial time commitments from faculty. Although the mini-CEX requires less time per encounter, 12 to 14 encounters are needed to reach a reliability coefficient of 0.8. Finally, a modest correlation has been noted with the CEX in relation to evaluation rating forms and clinical competency committee ratings [41, 49]. However, the lack of correlation between seemingly unrelated measurement tools may actually be a strength in that the CEX may measure components of competence not captured by these other two evaluation methods.
In conclusion, the direct observation of clinical skills is important and the CEX is a valuable teaching and feedback tool. Variability in resident performance in specific content areas may compromise assessment reliability when the number of evaluation encounters is limited; here, the mini-CEX is clearly an improvement.
Use of Standardized Patients
Since it was first introduced by Barrows more than 30 years ago, standardized-patient-based evaluation has gradually assumed a significant role in the assessment of clinical competence [50]. A standardized patient is someone other than a physician who is trained to portray a patient in a standardized and reproducible fashion [51]. Standardized patients may use checklists or rating scales that can be used to teach or to evaluate clinical skills [52-54]. A broad range of persons have been used as standardized patients, including asymptomatic persons with normal findings on physical examination, real patients with stable physical findings, and persons who can simulate various physical findings [52, 53]. Standardized patients are most often used in the teaching and evaluation of history taking, physical examination, patient education, and counseling or in focused encounters involving combinations of these skills [55, 56].
The incorporation of supplementary clinical material (such as radiographs, electrocardiograms, and laboratory results) into multistation standardized-patient examinations has allowed the teaching and evaluation of a broad spectrum of clinical skills. In multistation examinations, examinees perform single or related tasks (such as taking a focused history, doing a physical examination, or counseling a patient) at a series of stations, with or without ancillary data. In contrast to the more comprehensive nature of a single standardized-patient encounter, these exercises focus on distinct skills during brief encounters but sample skills more broadly overall, improving reliability [50, 51, 57, 58]. Such examinations are most often referred to as OSCEs (Objective Structured Clinical Examinations), although other descriptive acronyms have been suggested [50]. The OSCE format is widely used in the performance-based assessment of clinical competence [59]. Recent studies have described a wide variety of applications for OSCEs, including the assessment of the efficacy of an alcohol and drug curriculum; the teaching or evaluating of specific outpatient management skills; clinical breast examination skills; and the ability to address clinical ethical situations, counsel patients about risk factor modification, deliver bad news, and assess pain control in patients with cancer [60-70]. In addition, multistation standardized-patient exercises are used in "high-stakes" testing, including the Medical Council of Canada's qualifying examination for licensure and programs rating international medical graduates for Canadian and U.S. program directors [71-73].
The psychometric qualities of standardized-patient exercises have been studied extensively. The reliability of standardized-patient-based evaluation varies from 0.41 to 0.85, and coefficients of reliability are better for pass-fail determinations than for absolute scores [53, 54]. Depending on the number of cases, the duration of encounters, and the complexity of individual cases, a reliability coefficient of 0.80 may be achieved [53, 54]. Stations of shorter duration (5 to 10 minutes), with focused instruction and evaluation of discrete skills, may provide optimal reproducibility with shorter testing times [72, 74, 75]. Standardized-patient exercises give more reliable results in the evaluation of history taking, physical examination, or communication skills than in the measurement of problem-solving or clinical reasoning skills [52, 53, 65]. Recent studies, however, suggest that this technique may adequately evaluate problem-solving skills if interstation progress notes are incorporated, postencounter questions focusing on key diagnostic findings are added, or oral presentation and problem-solving stations are included [76-78]. Further investigation is needed because older data suggest that linking written follow-up questions to standardized-patient encounters may detract from overall reproducibility [53].
Standardized-patient exercises have face validity because the skills and behaviors taught or evaluated are intrinsic to those involved with clinical practice. Practicing physicians and residents cannot differentiate between real and standardized patients when these patients are sent into their offices unannounced [54, 79]. Examinees presumed to have different levels of competence perform as expected on standardized-patient exercises: Resident performance is better than medical student performance, senior residents score higher than junior residents, and residents from programs with stronger academic records do better than residents from programs with lesser reputations [51, 52, 54, 58]. The results of standardized-patient-based evaluations show variable correlation with written standardized examinations, faculty ratings, rotation evaluations, or clinical competency committee evaluations [51-5358, 61, 80, 81]. In the absence of a gold standard for measuring overall clinical competence, many experts suggest that the variable correlation noted may occur because standardized-patient exercises measure unique components of clinical competence and may be considered complementary to the other evaluation tools.
Two important theoretical and practical advantages of standardized-patient-based teaching and evaluation are the patient-centered nature of these evaluation methods and their incorporation of direct observation of clinical skills. In addition, educators can control the clinical encounter and its educational content, allowing them to guarantee uniform presentation of material (ensuring exposure to important but infrequently encountered clinical situations) or to tailor the test to the level of the examinee. Trainees can practice skills in a non-threatening and low-risk environment; this is particularly important when patient education skills, counseling skills, or the ability to deal with certain emergency situations are being taught and evaluated. The quality of the feedback can be high; it can be provided immediately and can be directed from the patient's perspective [50].
The principal disadvantages of standardized-patient-based evaluation are its expense and the lack of uniform access to experienced staff who can train standardized patients and develop clinical scenarios. The development of consortia for training standardized patients, conducting exercises, and sharing costs may allow more programs to participate [50]. In addition, although standardized patients can be trained to portray a wide range of clinical content and to simulate many physical findings, they remain a supplement to real patients. A high volume of patient contact is still required for an appreciation of the diversity of clinical presentations.
In conclusion, standardized-patient-based methods allow for high-quality teaching and evaluation of the basic clinical skills of history taking, physical examination, communication, and interpersonal relating. They are less efficient in the assessment of knowledge, clinical reasoning, or judgment. However, reports describing the inclusion of techniques that focus on problem solving seem promising, and the requisite integration of clinical skills in these stations may further endorse the validity and relevance of this evaluation tool. Concerns about reliability are important when standardized-patient-based testing is used for "high-stakes" summative testing but are less critical when it is used primarily for formative assessment and teaching.
Conclusions
|
|---|
If effectively used, the In-Training Examination (knowledge), rating scales (skill and attitude), and the CEX (skills) can serve as the core of a successful assessment program. However, the reliability and accuracy of faculty evaluation with these measures are consistently shown to be suboptimal, and programs may need to use such techniques as standardized-patient encounters (to better evaluate skills and attitudes) or medical record audit (to examine clinical care practices more effectively). We clearly need to find ways to equip faculty better for their role as evaluators as we struggle to find the optimal strategy with which to assess clinical competence. Regardless, all programs should choose the combination of tools that best meets the needs of their trainees.
Author and Article Information
|
|---|
|
|
|---|
References
|
|---|
|
|
|---|
1. American Board of Internal Medicine. Guide to the Evaluation of Residents in Internal Medicine 1990-92. Philadelphia: American Board of Internal Medicine; 1992.
2. Neufeld V, Norman GR, eds. Assessing Clinical Competence. New York: Springer; 1985.
3. Eisenberg JM. Evaluating internists' clinical competence. J Gen Intern Med. 1989; 4:139-43.
4. Miller GE. The assessment of clinical skills/competence/performance. Acad Med. 1990; 65(9 Suppl):S63-7.
5. American Board of Internal Medicine. Project Professionalism. Philadelphia: American Board of Internal Medicine; 1994.
6. American Medical Association. Directory of Graduate Medical Education Programs. Chicago: American Med Assoc; 1990.
7. Tugwell P, Dok C. Medical record review. In: Neufeld V, Norman GR, eds. Assessing Clinical Competence. New York: Springer; 1985:142-81.
8. Holmboe ES, Gross RM, Hawkins RE. Chart review for residents [Letter]. Acad Med. 1996; 71:1139-40.
9. Scranton R, Holmboe ES, Sumption KF, Hawkins RE. Impact of feedback from medical record audit on resident compliance with preventive health measures. Acad Med. 1998; [In press].
10. Winickoff RN, Coltin KL, Morgan MM, Buxbaum RC, Barnett GO. Improving physician performance through peer comparison feedback. Med Care. 1984; 22:527-34.
11. Wayland MT, Culik D, Goldsby C, Carion W, Orzame N. Screening by family practice and internal medicine residents and the effect of an intervention program. J Med Educ. 1987; 62:519-22.
12. Kern DE, Harris WL, Boekeloo BO, Barker LR, Hogeland P. Use of an outpatient medical record audit to achieve educational objectives: changes in residents' performances over six years. J Gen Intern Med. 1990; 5:218-24.
13. Ramsdell JW, Berry CC. Evaluation of general and traditional internal medicine residents utilizing a medical records audit based on educational objectives. Med Care. 1983; 21:1144-53.
14. Martin AR, Wolf MA, Thibodeau LA, Dzau V, Braunwald E. A trial of two strategies to modify the test-ordering behavior of medical residents. N Engl J Med. 1980; 303:1330-6.
15. Ognibene AJ, Jarjoura DG, Illera VA, Blend DA, Cugino AE, Whittier FC. Using chart reviews to assess residents' performances of components of physical examinations: a pilot study. Acad Med. 1994; 69:583-7.
16. Opila DA. The impact of feedback to medical housestaff on chart documentation and quality of care in the outpatient setting. J Gen Intern Med. 1997; 12:352-6.
17. Engebretsen B. Peer review in graduate education. N Engl J Med. 1977; 296:1230-1.
18. Ashton CM. "Invisible" doctors: making a case for involving medical residents in hospital quality improvement programs. Acad Med. 1993; 68:823-4.
19. Norman GR, Neufeld VR, Walsh A, Woodward CA, McConvey GA. Measuring physicians' performances by using simulated patients. J Med Educ. 1985; 60:925-34.
20. Leaf DA, Neighbor WE, Schaad D, Scott CS. A comparison of self-report and chart audit in studying resident physician assessment of cardiac risk factors. J Gen Intern Med. 1995; 10:194-8.
21. Gennis VM, Gennis MA. Supervision in the outpatient clinic: effects on teaching and patient care. J Gen Intern Med. 1993; 9:116-7.
22. Brook RH, Lohr KN. Monitoring quality of care in the Medicare program. Two proposed systems. JAMA. 1987; 258:3138-41.
23. Garibaldi RA, Trontell MC, Waxman H, Holbrook JH, Kanya DT, Khoshbin S, et al. The In-Training Examination in internal medicine. Ann Intern Med. 1994; 121:117-23.
24. Waxman H, Braunstein G, Dantzker D, Goldberg S, Lefrak S, Lichstein E, et al. Performance on the internal medicine second-year residency In-Training Examination predicts the outcome of the ABIM certifying examination. J Gen intern Med. 1994; 9:692-4.
25. Grossman RS, Fincher RM, Layne RD, Seelig CB, Berkowitz LR, Levine MA. Validity of the In-Training Examination for predicting American Board of Internal Medicine certifying examination scores. J Gen Intern Med. 1992; 7:63-7.
26. Wolfsthal SD, Hooper FJ. In-Training Examination in internal medicine [Letter]. Ann Intern Med. 1995; 122:73.
27. O'Dell JR. In-Training Examination in internal medicine [Letter]. Ann Intern Med. 1995; 122:73-4.
28. Brailovsky CA, Grand'Maison P, Lescop J. Residency directors' predictions of candidates' performances on a licensing examination. Acad Med. 1995; 70:410-4.
29. Yindra KJ, Rosenfeld PS, Donnelly MB. Medical school achievements as predictors of residency performance. J Med Educ. 1988; 63:356-63.
30. Norcini JJ, Grosso LJ, Shea JA, Webster GD. The relationship between features of residency training and ABIM certifying examination performance. J Gen Intern Med. 1987; 2:330-6.
31. Streiner DL. Global rating scales. In: Neufeld V, Norman GR, eds. Assessing Clinical Competence. New York: Springer; 1985:119-41.
32. Van Rosendaal GM, Jennett PA. Comparing peer and faculty evaluations in an internal medicine residency. Acad Med. 1994; 69:299-303.[Medline]
33. Matthews DA, Feinstein AR. A new instrument for patients' ratings of physician performance in the hospital setting. J Gen Intern Med. 1989; 4:14-22.
34. Klessig J, Robbins AS, Wieland D, Rubenstein L. Evaluating humanistic attributes of internal medicine residents. J Gen Intern Med. 1989; 4:514-21.
35. Tamblyn R, Benaroya S, Snell L, McLeod P, Schnarch B, Abrahamowicz M. The feasibility and value of using patient satisfaction ratings to evaluate internal medicine residents. J Gen Intern Med. 1994; 9:146-52.
36. Thompson WG, Lipkin M Jr, Gilbert DA, Guzzo RA, Roberson L. Evaluating evaluation: assessment of the American Board of Internal Medicine Resident Evaluation Form. J Gen Intern Med. 1990; 5:214-7.
37. Haber RJ, Avins AL. Do ratings on the American Board of Internal Medicine Resident Evaluation Form detect differences in clinical competence? J Gen Intern Med. 1994; 9:140-5.
38. Borman WC. Effects of instructions to avoid halo error on reliability and validity of performance evaluation ratings. J Appl Psychol. 1975; 60:556-60.
39. Gray JD. Global rating scales in residency education. Acad Med. 1996; 71(1 Suppl): S55-63.
40. Wray NP, Friedland JA. Detection and correction of house staff error in physical diagnosis. JAMA. 1983; 249:1035-7.
41. Woolliscroft JO, Stross JK, Silva J Jr. Clinical competence certification: a critical appraisal. J Med Educ. 1984; 59:799-805.
42. Norcini JJ, Blank LL, Arnold GK, Kimball HR. The mini-CEX (clinical evaluation exercise): a preliminary investigation. Ann Intern Med. 1995; 123:795-9.
43. Fletcher SW, O'Malley MS, Bunce LA. Physicians' abilities to detect lumps in silicone breast models. JAMA. 1985; 253:2224-8.
44. Mangione S, Nieman LZ. Cardiac auscultatory skills of internal medicine and family practice trainees. JAMA. 1997; 278:717-22.
45. Blank LL, Grosso LJ, Benson JA Jr. A survey of clinical skills evaluation practices in internal medicine residency programs. J Med Educ. 1984; 59:401-6.
46. Noel GL, Herbers JE Jr, Caplow MP, Cooper GS, Pangaro LN, Harvey J. How well do internal medicine faculty members evaluate the clinical skills of residents? Ann Intern Med. 1992; 117:757-65.
47. Herbers JE Jr, Noel GL, Cooper GS, Harvey J, Pangaro LN, Weaver MJ. How accurate are faculty evaluations of clinical competence? J Gen Intern Med. 1989; 4:202-8.
48. Kroboth FJ, Hanusa BH, Parker S, Coulehan JL, Kapoor WN, Brown FH, et al. The inter-rater reliability and internal consistency of a clinical evaluation exercise. J Gen Intern Med. 1992; 7:174-9.
49. Kroboth FJ, Kapoor W, Brown FH, Karpf M, Levey GS. A comparative trial of the clinical evaluation exercise. Arch Intern Med. 1985; 145:1121-3.
50. Barrows HS. An overview of the uses of standardized patients for teaching and evaluating clinical skills. Acad Med. 1993; 68:443-53.
51. Stillman P, Swanson D, Regan MB, Philbin MM, Nelson V, Ebert T, et al. Assessment of clinical skills of residents utilizing standardized patients. A follow-up study and recommendations for application. Ann Intern Med. 1991; 114:393-401.
52. Stillman PL, Swanson DB, Smee S, Stillman AE, Ebert TH, Emmel VS, et al. Assessing clinical skills of residents with standardized patients. Ann Intern Med. 1986; 105:762-71.
53. Van der Vleuten CP, Swanson DB. Assessment of clinical skills with standardized patients: state of the art. Teaching and Learning in Internal Medicine. 1990; 2:58-76.
54. Colliver JA, Williams RG. Technical issues: test application. AAMC. Acad Med. 1993; 68:454-60.
55. Anderson MB, Stillman PL, Wang Y. Growing use of standardized patients in teaching and evaluation in medical education. Teach Learn Med. 1994; 6:15-22.
56. Day SC, Grosso LJ, Norcini JJ Jr, Blank LL, Swanson DB, Horne MH. Residents' perception of evaluation procedures used by their training program. J Gen Intern Med. 1990; 5:421-6.
57. Ainsworth MA, Rogers LP, Markus JF, Dorsey NK, Blackwell TA, Petrusa ER. Standardized patient encounters. A method for teaching and evaluation. JAMA. 1991; 266:1390-6.
58. Petrusa ER, Blackwell TA, Ainsworth MA. Reliability and validity of an objective structured clinical examination for assessing the clinical performance of residents. Arch Intern Med. 1990; 150:573-7.
59. Martin JA, Reznick RK, Rothman A, Tamblyn RM, Regehr G. Who should rate candidates in an objective structured clinical examination? Acad Med. 1996; 71:170-5.
60. Schwartz RW, Donnelly MB, Sloan DA, Johnson SB, Strodel WE. The relationship between faculty ward evaluations, OSCE, and ABSITE as measures of surgical intern performance. Am J Surg. 1995; 169:414-7.
61. Joorabchi B, Devries JM. Evaluation of clinical competence: the gap between expectation and performance. Pediatrics. 1996; 97:179-84.
62. Kokotailo PK, Langhough R, Neary EJ, Matson SC, Fleming MF. Improving pediatric residents' alcohol and other drug use clinical skills: use of an experiential curriculum. Pediatrics. 1995; 96(1 pt 1):99-104.
63. Singer PA, Cohen R, Robb A, Rothman A. The ethics objective structured clinical examination. J Gen Intern Med. 1993; 8:23-8.
64. Pololi LH, Potter SP. Behavioral change in preventive medicine. An efficacy assessment of a physician education module. J Gen Intern Med. 1996; 11:545-7.
65. Burdick WP, Ben-David MF, Swisher L, Becher J, Magee D, McNamara R, et al. Reliability of performance-based clinical skill assessment of emergency medicine residents. Acad Emerg Med. 1996; 3:1119-23.
66. Eggly S, Afonso N, Rojas G, Baker M, Cardozo L, Robertson RS. An assessment of residents' competence in the delivery of bad news to patients. Acad Med. 1997; 72:397-9.
67. Boudreau D, Tamblyn R, Dufresne L. Evaluation of consultative skills in respiratory medicine using a structured medical consultation. Am J Respir Crit Care Med. 1994; 150(5 pt 1):1298-304.
68. Sloan PA, Donnelly MB, Schwartz RW, Sloan DA. Cancer pain assessment and management by housestaff. Pain. 1996; 67:475-81.
69. Chalabian J, Garman K, Wallace P, Dunnington G. Clinical breast evaluation skills of house officers and students. Am Surg. 1996; 62:840-5.
70. Day RP, Hewson MG, Kindy P Jr, Van Kirk J. Evaluation of resident performance in an outpatient internal medicine clinic using standardized patients. J Gen Intern Med. 1993; 8:193-8.
71. Reznick RK, Blackmore D, Dauphinee WD, Rothman AI, Smee S. Large-scale high-stakes testing with an OSCE: report from the Medical Council of Canada. Acad Med. 1996; 71:S19-21.
72. Cohen R, Rothman AI, Bilan S, Ross J. Analysis of the psychometric properties of eight administrations of an objective structured clinical examination used to assess international medical graduates. Acad Med. 1996; 71:S22-4.
73. Sutnick AI, Stillman PL, Norcini JJ, Friedman M, Williams RG, Trace DA, et al. Pilot study of the use of the ECFMG clinical competence assessment to provide profiles of clinical competencies of graduates of foreign medical schools for residency directors. Educational Commission for Foreign Medical Graduates. Acad Med. 1994; 69:65-7.
74. Rothman AI, Cohen R, Bilan S. A comparison of short- and long-case stations in a multiple-station test of clinical skills. Acad Med. 1996; 71(1 Suppl):S110-2.
75. Shatzer JH, Wardrop JL, Williams RG, Hatch TF. Generalizability of performance on different-station-length standardized patient cases. Teach Learn Med. 1994; 6:54-8.
76. Stillman PL, Regan MB, Haley HL, Norcini JJ, Friedman M, Sutnick AI. The use of a patient note to evaluate clinical skills of first-year residents who are graduates of foreign medical schools. Acad Med. 1992; 76(10 Suppl):S57-9.
77. Solomon DJ, Speer AJ, Perkowski LC, DiPette D. Evaluating problem solving based on the use of history findings in a standardized-patient examination. Acad Med. 1994; 69:754-7.
78. Rosebraugh CJ, Speer AJ, Ainsworth MA, Solomon DJ, Callaway MR, Holden MD. Developing a presentation and problem-solving station in a multistation standardized-patient examination. Acad Med. 1996; 71(1 Suppl):S102-4.
79. Norman GR, Tugwell P, Feightner JW. A comparison of resident performance on real and simulated patients. J Med Educ. 1982; 57:708-15.
80. Hull AL, Hodder S, Berger B, Ginsberg D, Lindheim N, Quan J, et al. Validity of three clinical performance assessments of internal medicine clerks. Acad Med. 1995; 70:517-22.
81. Dupras DM, Li JT. Use of an objective structured clinical examination to determine clinical competence. Acad Med. 1995; 70:1029-34.
Related articles in Annals:
This article has been cited by other articles:
![]() |
S R Kirsh and D C Aron Integrating the chronic-care model and the ACGME competencies: using shared medical appointments to focus on systems-based practice Qual. Saf. Health Care, February 1, 2008; 17(1): 15 - 19. [Full Text] [PDF] |
||||
![]() |
S. D. Sisson, D. A. Rastegar, T. N. Rice, and M. T. Hughes Multicenter Implementation of a Shared Graduate Medical Education Resource Arch Intern Med, December 10, 2007; 167(22): 2476 - 2480. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Brown and M. Doshi Assessing professional and clinical competence: the way forward Advan. Psychiatr. Treat., March 1, 2006; 12(2): 81 - 89. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. H. Goroll, C. Sirio, F. D. Duffy, R. F. LeBlond, P. Alguire, T. A. Blackwell, W. E. Rodak, T. Nasca, and the Residency Review Committee for Internal Medici A New Model for Accreditation of Residency Programs in Internal Medicine Ann Intern Med, June 1, 2004; 140(11): 902 - 909. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. K. Cassel Quality of Care and Quality of Training: A Shared Vision for Internal Medicine? Ann Intern Med, June 1, 2004; 140(11): 927 - 928. [Full Text] [PDF] |
||||
![]() |
A. K. Wong Full scale computer simulators in anesthesia training and evaluation: [Des simulateurs informatises grandeur nature pour la formation et l'evaluation en anesthesie] Can J Anesth, May 1, 2004; 51(5): 455 - 464. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Jennings, L. A. Thompson, C. L. Townsend, C. A. Stashwick, and D. C. Goodman The Relationship Between Pediatric Residency Program Size and Inpatient Illness Severity and Diversity Arch Pediatr Adolesc Med, July 1, 2003; 157(7): 676 - 680. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. E. Cox, S. S. Carson, E. W. Ely, J. A. Govert, J. M. Garrett, R. G. Brower, D. G. Morris, E. Abraham, V. Donnabella, A. Spevetz, et al. Effectiveness of Medical Resident Education in Mechanical Ventilation Am. J. Respir. Crit. Care Med., January 1, 2003; 167(1): 32 - 38. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Garibaldi, R. Subhiyah, M. E. Moore, and H. Waxman The In-Training Examination in Internal Medicine: An Analysis of Resident Performance over Time Ann Intern Med, September 17, 2002; 137(6): 505 - 510. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Cornwall Teaching Professionalism in Orthopaedic Residency J. Bone Joint Surg. Am., April 1, 2001; 83(4): 626 - 626. [Full Text] |
||||
![]() |
R. A. Kearney, P. Sullivan, and E. Skakun Performance on ABA-ASA In-training Examination predicts success for RCPSC certification Can J Anesth, September 1, 2000; 47(9): 914 - 918. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Smith Evidence for the Effectiveness of Techniques To Change Physician Behavior Chest, August 1, 2000; 118(2_suppl): 8S - 17S. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. S. Siegel and L. W. Greenberg Effective Evaluation of Residency Education: How Do We Know It When We See It? Pediatrics, April 1, 2000; 105(4): 964 - 965. [Full Text] |
||||
![]() |
C. P. Gross, G. B. Donnelly, A. B. Reisman, K. A. Sepkowitz, and M. A. Callahan Resident Expectations of Morning Report: A Multi-institutional Study Arch Intern Med, September 13, 1999; 159(16): 1910 - 1914. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. B. Issenberg, W. C. McGaghie, and R. A. Waugh Computers and Evaluation of Clinical Competence Ann Intern Med, February 2, 1999; 130(3): 244 - 244. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||