
      Evaluation of an OSCE’s implementation and a two-step approach for a theoretical and practical training program in Obstetrics and Gynecology

      research-article


          Abstract

          Objective structured clinical examination (OSCE) is a well-known assessment method to evaluate clinical skills and competence in healthcare. Following the recently reformed National Competence-Based Catalog of Learning Objectives in Medicine, the implementation of this assessment method in the training program for medical students is now obligatory in Germany. This major change requires a reorganization not only of the training programs but also of the students themselves and the way they learn. We performed a poll evaluating the students’ opinions regarding these major changes and the implementation of the OSCE with a new training program. To implement this assessment method and to evaluate the OSCE, Kern’s six-step approach comprising (1) problem identification and general needs assessment, (2) needs assessment of the targeted learners, (3) goals and objectives, (4) educational strategies, (5) implementation, and (6) evaluation and feedback was applied. To evaluate and gather feedback, a poll was used to analyze the students’ opinions regarding OSCE in gynecology and obstetrics and OSCE in general, in addition to the regular analysis of the students’ results. To reform the educational strategy, a two-step approach was developed: First, the students completed the regular training program and a written examination, and second, they participated in a 1-week clerkship, in small group teaching, and in the OSCE. The OSCE stations were developed primarily based on the National Competence-Based Catalog and the German Catalog of Learning Objectives in Medicine, as well as on the feedback of experts reflecting their expectations for physicians beginning their careers. The students performed well in the OSCE and gave positive feedback regarding this examination method. Furthermore, they welcomed the upcoming changes by considering OSCE a valuable assessment tool, and they showed appreciation for the two-step approach by supporting the combination of an OSCE and a written examination. Thus, this article presents the implementation of an OSCE and a strategy for the adaptation of the curriculum to fulfill the new OSCE requirements and, to our knowledge, reveals students’ primary opinions regarding the changes in their medical training program for the first time.


          Most cited references (58)


          Making sense of Cronbach's alpha

          Medical educators attempt to create reliable and valid tests and questionnaires in order to enhance the accuracy of their assessment and evaluations. Validity and reliability are two fundamental elements in the evaluation of a measurement instrument. Instruments can be conventional knowledge, skill or attitude tests, clinical simulations or survey questionnaires. Instruments can measure concepts, psychomotor skills or affective values. Validity is concerned with the extent to which an instrument measures what it is intended to measure. Reliability is concerned with the ability of an instrument to measure consistently [1]. It should be noted that the reliability of an instrument is closely associated with its validity. An instrument cannot be valid unless it is reliable. However, the reliability of an instrument does not depend on its validity [2].

          It is possible to objectively measure the reliability of an instrument and in this paper we explain the meaning of Cronbach’s alpha, the most widely used objective measure of reliability. Calculating alpha has become common practice in medical education research when multiple-item measures of a concept or construct are employed. This is because it is easier to use in comparison to other estimates (e.g. test-retest reliability estimates) [3] as it only requires one test administration. However, in spite of the widespread use of alpha in the literature, the meaning, proper use and interpretation of alpha are not clearly understood [2, 4, 5]. We feel it is important, therefore, to further explain the underlying assumptions behind alpha in order to promote its more effective use. It should be emphasised that the purpose of this brief overview is just to focus on Cronbach’s alpha as an index of reliability. Alternative methods of measuring reliability based on other psychometric methods, such as generalisability theory or item-response theory, can be used for monitoring and improving the quality of OSCE examinations [6-10], but will not be discussed here.

          What is Cronbach alpha?

          Alpha was developed by Lee Cronbach in 1951 [11] to provide a measure of the internal consistency of a test or scale; it is expressed as a number between 0 and 1. Internal consistency describes the extent to which all the items in a test measure the same concept or construct and hence it is connected to the inter-relatedness of the items within the test. Internal consistency should be determined before a test can be employed for research or examination purposes to ensure validity. In addition, reliability estimates show the amount of measurement error in a test. Put simply, this interpretation of reliability is the correlation of a test with itself. Squaring this correlation and subtracting from 1.00 produces the index of measurement error. For example, if a test has a reliability of 0.80, there is 0.36 error variance (random error) in the scores (0.80 × 0.80 = 0.64; 1.00 – 0.64 = 0.36) [12]. As the estimate of reliability increases, the fraction of a test score that is attributable to error will decrease [2]. It is of note that the reliability of a test reveals the effect of measurement error on the observed score of a student cohort rather than on an individual student. To calculate the effect of measurement error on the observed score of an individual student, the standard error of measurement (SEM) must be calculated [13].

          If the items in a test are correlated to each other, the value of alpha is increased. However, a high coefficient alpha does not always mean a high degree of internal consistency. This is because alpha is also affected by the length of the test. If the test length is too short, the value of alpha is reduced [2, 14]. Thus, to increase alpha, more related items testing the same concept should be added to the test. It is also important to note that alpha is a property of the scores on a test from a specific sample of testees. Therefore investigators should not rely on published alpha estimates and should measure alpha each time the test is administered [14].

          Use of Cronbach’s alpha

          Improper use of alpha can lead to situations in which either a test or scale is wrongly discarded or the test is criticised for not generating trustworthy results. To avoid this situation, an understanding of the associated concepts of internal consistency, homogeneity or unidimensionality can help to improve the use of alpha. Internal consistency is concerned with the interrelatedness of a sample of test items, whereas homogeneity refers to unidimensionality. A measure is said to be unidimensional if its items measure a single latent trait or construct. Internal consistency is a necessary but not sufficient condition for measuring homogeneity or unidimensionality in a sample of test items [5, 15]. Fundamentally, the concept of reliability assumes that unidimensionality exists in a sample of test items [16] and if this assumption is violated it does cause a major underestimate of reliability. It has been well documented that a multidimensional test does not necessarily have a lower alpha than a unidimensional test. Thus a more rigorous view of alpha is that it cannot simply be interpreted as an index for the internal consistency of a test [5, 15, 17].

          Factor Analysis can be used to identify the dimensions of a test [18]. Other reliable techniques have been used and we encourage the reader to consult the paper “Applied Dimensionality and Test Structure Assessment with the START-M Mathematics Test” and to compare methods for assessing the dimensionality and underlying structure of a test [19]. Alpha, therefore, does not simply measure the unidimensionality of a set of items, but can be used to confirm whether or not a sample of items is actually unidimensional [5]. On the other hand, if a test has more than one concept or construct, it may not make sense to report alpha for the test as a whole, as the larger number of questions will inevitably inflate the value of alpha. In principle, therefore, alpha should be calculated for each of the concepts rather than for the entire test or scale [2, 3]. The implication for a summative examination containing heterogeneous, case-based questions is that alpha should be calculated for each case.

          More importantly, alpha is grounded in the ‘tau equivalent model’, which assumes that each test item measures the same latent trait on the same scale. Therefore, if multiple factors/traits underlie the items on a scale, as revealed by Factor Analysis, this assumption is violated and alpha underestimates the reliability of the test [17]. If the number of test items is too small it will also violate the assumption of tau-equivalence and will underestimate reliability [20]. When test items meet the assumptions of the tau-equivalent model, alpha approaches a better estimate of reliability. In practice, Cronbach’s alpha is a lower-bound estimate of reliability because heterogeneous test items would violate the assumptions of the tau-equivalent model [5]. If the calculation of “standardised item alpha” in SPSS is higher than “Cronbach’s alpha”, a further examination of the tau-equivalent measurement in the data may be essential.

          Numerical values of alpha

          As pointed out earlier, the number of test items, item inter-relatedness and dimensionality affect the value of alpha [5]. There are different reports about the acceptable values of alpha, ranging from 0.70 to 0.95 [2, 21, 22]. A low value of alpha could be due to a low number of questions, poor inter-relatedness between items or heterogeneous constructs. For example, if a low alpha is due to poor correlation between items then some should be revised or discarded. The easiest method to find them is to compute the correlation of each test item with the total test score; items with low correlations (approaching zero) are deleted. If alpha is too high it may suggest that some items are redundant as they are testing the same question but in a different guise. A maximum alpha value of 0.90 has been recommended [14].

          Summary

          High quality tests are important to evaluate the reliability of data supplied in an examination or a research study. Alpha is a commonly employed index of test reliability. Alpha is affected by the test length and dimensionality. Alpha as an index of reliability should follow the assumptions of the essentially tau-equivalent approach. A low alpha appears if these assumptions are not met. Alpha does not simply measure test homogeneity or unidimensionality as test reliability is a function of test length. A longer test increases the reliability of a test regardless of whether the test is homogeneous or not. A high value of alpha (> 0.90) may suggest redundancies and show that the test length should be shortened.

          Conclusions

          Alpha is an important concept in the evaluation of assessments and questionnaires. It is mandatory that assessors and researchers should estimate this quantity to add validity and accuracy to the interpretation of their data. Nevertheless, alpha has frequently been reported in an uncritical way and without adequate understanding and interpretation. In this editorial we have attempted to explain the assumptions underlying the calculation of alpha, the factors influencing its magnitude and the ways in which its value can be interpreted. We hope that investigators in future will be more critical when reporting values of alpha in their studies.
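
          The quantities discussed in this overview can be illustrated with a short calculation. The following is a minimal Python sketch, not taken from the editorial: it assumes the standard textbook formula for Cronbach's alpha, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), which the text itself does not spell out. The score matrix and all function names are hypothetical. It also reproduces the 0.80 to 0.36 error-variance arithmetic and the item-total correlation check described for finding weak items.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical data: 5 examinees (rows) scored on 3 items (columns).
EXAMPLE_SCORES = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cronbach_alpha(scores):
    """Cronbach's alpha (standard formula, an assumption of this sketch).

    scores: list of rows, one row per examinee, one column per item.
    """
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # transpose to per-item columns
    item_var_sum = sum(variance(col) for col in items)   # sample variances
    total_var = variance([sum(row) for row in scores])   # variance of totals
    return k / (k - 1) * (1 - item_var_sum / total_var)


def error_variance_index(reliability):
    """Index of measurement error as described in the text: 1 - r squared."""
    return 1.0 - reliability ** 2


def item_total_correlations(scores):
    """Correlation of each item with the total score (item included in total)."""
    totals = [sum(row) for row in scores]
    return [pearson(list(col), totals) for col in zip(*scores)]


if __name__ == "__main__":
    print("alpha:", round(cronbach_alpha(EXAMPLE_SCORES), 3))
    print("error index at r = 0.80:", round(error_variance_index(0.80), 2))
    print("item-total r:", [round(r, 3) for r in item_total_correlations(EXAMPLE_SCORES)])
```

          Items whose item-total correlation approaches zero would be candidates for revision or removal. Because alpha belongs to a specific set of scores, it would be recomputed for every administration, and per construct when a test covers more than one.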

            The assessment of clinical skills/competence/performance

            G E Miller (1990)

              How to use the nominal group and Delphi techniques

              Introduction: The Nominal Group Technique (NGT) and Delphi Technique are consensus methods used in research that is directed at problem-solving, idea-generation, or determining priorities. While consensus methods are commonly used in health services literature, few studies in pharmacy practice use these methods. This paper provides an overview of the NGT and Delphi technique, including the steps involved and the types of research questions best suited to each method, with examples from the pharmacy literature.

              Methodology: The NGT entails face-to-face discussion in small groups, and provides a prompt result for researchers. The classic NGT involves four key stages: silent generation, round robin, clarification and voting (ranking). Variations have occurred in relation to generating ideas, and how ‘consensus’ is obtained from participants. The Delphi technique uses a multistage self-completed questionnaire with individual feedback, to determine consensus from a larger group of ‘experts.’ Questionnaires have been mailed, or more recently, e-mailed to participants.

              When to use: The NGT has been used to explore consumer and stakeholder views, while the Delphi technique is commonly used to develop guidelines with health professionals. Method choice is influenced by various factors, including the research question, the perception of consensus required, and associated practicalities such as time and geography.

              Limitations: The NGT requires participants to personally attend a meeting. This may prove difficult to organise and geography may limit attendance. The Delphi technique can take weeks or months to conclude, especially if multiple rounds are required, and may be complex for lay people to complete.

                Author and article information

                Contributors
                URI: https://loop.frontiersin.org/people/2279984/overview
                URI: https://loop.frontiersin.org/people/704006/overview
                URI: https://loop.frontiersin.org/people/1283897/overview
                URI: https://loop.frontiersin.org/people/2389032/overview
                URI: https://loop.frontiersin.org/people/1916362/overview
                URI: https://loop.frontiersin.org/people/1417282/overview
                Journal
                Title: Frontiers in Medicine
                Abbreviated titles: Front. Med.; Front Med (Lausanne)
                Publisher: Frontiers Media S.A.
                ISSN: 2296-858X
                Published: 21 December 2023
                Volume: 10
                Article number: 1263862
                Affiliations
                [1] Department of Obstetrics and Gynecology, University Hospital Bonn, Bonn, Germany
                [2] Department of Senology, University Hospital Bonn, Bonn, Germany
                [3] Department of Gynecology and Gynecological Oncology, University Hospital Bonn, Bonn, Germany
                [4] Department of Gynecological Endocrinology and Reproductive Medicine, University Hospital Bonn, Bonn, Germany
                [5] Department of Neonatology and Pediatric Intensive Care, University Hospital Bonn, Bonn, Germany
                [6] Division of Prenatal Medicine, Gynecological Ultrasound and Fetal Surgery, Department of Obstetrics and Gynecology, University of Cologne, Cologne, Germany
                [7] Department of Obstetrics and Perinatal Medicine, University Hospital Bonn, Bonn, Germany
                Author notes

                Edited by: Jacqueline G. Bloomfield, The University of Sydney, Australia

                Reviewed by: Mohd Nasri Awang Besar, National University of Malaysia, Malaysia; Majed Wadi, Qassim University, Saudi Arabia

                *Correspondence: Florian Recker, florian.recker@ukbonn.de
                Article
                DOI: 10.3389/fmed.2023.1263862
                PMC ID: 10765409
                PubMed ID: 38179276
                00b3bac8-0dac-4f3e-9775-e1378b81e6f6
                Copyright © 2023 Plöger, Abramian, Egger, Mustea, Sänger, Plöger, Weber, Gembruch, Walter, Strizek and Recker.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                Received: 15 August 2023
                Accepted: 13 November 2023
                Page count
                Figures: 4, Tables: 1, Equations: 0, References: 61, Pages: 10, Words: 7169
                Funding
                The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
                Categories
                Medicine
                Original Research
                Custom metadata
                Healthcare Professions Education

                Keywords: OSCE, implementation, interprofessional training, undergraduate medical education, national curriculum reform, transition, gynecology and obstetrics
