scholarly journals Assessment by Comparative Judgement: An Application to Secondary Statistics and English in New Zealand

2020 ◽  
Vol 55 (1) ◽  
pp. 49-71
Author(s):  
Neil Marshall ◽  
Kirsten Shaw ◽  
Jodie Hunter ◽  
Ian Jones

Abstract There is growing interest in using comparative judgement to assess student work as an alternative to traditional marking. Comparative judgement requires no rubrics and is instead grounded in experts making pairwise judgements about the relative ‘quality’ of students’ work according to a high level criterion. The resulting decision data are fitted to a statistical model to produce a score for each student. Cited benefits of comparative judgement over traditional methods include increased reliability, validity and efficiency of assessment processes. We investigated whether such claims apply to summative statistics and English assessments in New Zealand. Experts comparatively judged students’ responses to two national assessment tasks, and the reliability and validity of the outcomes were explored using standard techniques. We present evidence that the comparative judgement process efficiently produced reliable and valid assessment outcomes. We consider the limitations of the study, and make suggestions for further research and potential applications.

Author(s):  
Mathias Kyelem ◽  
Amadou Tamboura ◽  
Daniel Favre

To prove the quality of their teaching activities, school leaders and teachers are almost always resort to summative assessments; the level of the scores obtained by the largest number of students for assays or exams is the best indicator of the quality of learning achieved by students. Summative assessments become sufficiently numerous to help establish report cards and rankings monthly to the detriment of a formative assessment needed to regulate and guide the educational activity. All this does not take into account the dynamics of the error in learning and the stress state in which the learner is then subjected to a strong emotional pressure. Research in neuroscience show that a high level of anxiety causes a deficit in the ability to perform tasks involving solving non-routine problems. In this study, most of the respondents have had several years of professional training and a long teaching practice. It was interesting to explore their relationship to error and their level of apprehension of the summative evaluation in a context where they are in the process of "exercise" the student work. In general, the results show that the dominant fundamental emotion among all respondents is the fear as with most recurrent words anxiety, worry, fear, feelings or emotions that inhibit the action.


Author(s):  
Prae Keerasuntonpong ◽  
Keitha Dunstan ◽  
Bhagwan Khanna

The statement of service performance is a mandatory report provided by local governments in New Zealand. Despite 20 years' reporting experience, the Office of the Auditor-General (2008) criticised the poor quality of these reports. Past theoretical literature has attempted to develop a framework for the accountability expectations of documents provided by public-sector entities (Stewart, 1984). The purpose of this paper is to measure the consistency of the statements of service performance about wastewater services made by New Zealand local governments with the accountability expectations, using an accountability disclosure index. The paper reveals a moderately high level of consistency. “Probity” and “legality” accountability disclosures are high while “process/efficiency” and “performance programme-effectiveness” accountability are less emphasised. The results suggest that accountability expectations provide a useful tool for evaluating statements of service performance.


2018 ◽  
Vol 1 (80) ◽  
Author(s):  
Audrius Gocentas ◽  
Anatoli Landõr ◽  
Aleksandras Kriščiūnas

Research background and hypothesis. Replete schedule of competitions and intense training are features of contemporary team sports. Athletes, especially the most involved ones, may not have enough time to recover. As a consequence, aggregated fatigue can manifest in some undesirable form and affect athlete’s performance and health.Research aim. The aim of this study was to evaluate the changes in heart rate recovery (HRR) and investigate possible relations with sport-specifi c measures of effi cacy in professional basketball players during competition season.Research methods. Eight male high-level basketball players (mean ± SD, body mass, 97.3 ± 11.33 kg; height 2.02 ± 0.067 m, and age 23 ± 3.12 years) were investigated. The same basketball specifi c exercise was replicated several times from September till April during the practice sessions in order to assess the personal trends of HRR. Heart rate monitoring was performed using POLAR TEAM SYSTEM. Investigated athletes were ranked retrospectively according to the total amount of minutes played and the coeffi cients of effi cacy. Research results. There were signifi cant differences in the trends of HRR between the investigated players. The most effective players showed decreasing trends of HRR in all cases of ranking.Discussion and conclusions. Research fi ndings have shown that the quality of heart rate recovery differs between basketball players of the same team and could be associated with sport-specifi c effi cacy and competition playing time.Keywords: adaptation, autonomic control, monitoring training.


2019 ◽  
Vol 118 (11) ◽  
pp. 552-562
Author(s):  
Nguyen Thi Ngan ◽  
Bui Huy Khoi

This research aims to assess the service quality of industrial parks (IP) in the view of FDI (foreign direct investment) firms in Vietnam. Data was collected from 270 FDI firms in Vietnam - Singapore Industrial Parks (VSIP) in Vietnam. The proposed research model was based on researches on service quality. Cronbach's Alpha Average Variance Extracted (Pvc),rho (ρA), and Composite Reliability (Pc) tested the reliability and validity of the scale. The analysis results showed that four factors were affecting the servicequality of industrial park in Vietnam being tangibleof VSIP, reliability of VSIP, the empathyof FDI investors, and their assurance. The responsivenessof VSIP did not affect the servicequality of the industrial park. Contents of the article focus on two main issues: the analysis framework of the quantitative model and implicating results todevelop the industrial park services. The limitation of the research was only in VSIP in Vietnam.


2019 ◽  
Vol 28 (10) ◽  
pp. 106-117
Author(s):  
R. M. Asadullin

The continuous modernization of the education system makes the problems of the quality of teacher training increasingly relevant. Moreover, the measures taken to improve the system of teacher education are largely confined to the introduction of new organizational and managerial mechanisms and practically do not affect the internal content and technological structure of the teacher training process.Modern pedagogical universities are constantly looking for innovative models of training teachers that will be able to solve non-standard social and professional tasks. However, recent studies in this area do not fully take into account the nature of pedagogical activity and conditions of its formation. Thus, the need arises for a special study of the processes and means of updating the content and technologies of teacher training in order to control the level of students’ professional competencies development, as required by educational and professional standards. This means the creation of a special educational system in a pedagogical university, which can provide a harmonious and synchronous mastering by future specialists of both subject knowledge and methods of pedagogical activity.The article provides a theoretical study aimed at identifying key patterns of designing a new content for teacher education, the basis of which is the formation of a future teacher as a subject of his own professional activity. The author describes the experience of using a subject-oriented model of education, implemented at Bashkir State Pedagogical University n.a. M. Akmulla. The effectiveness of this model is confirmed by the high level of students’ mastery of designing methods and constructing the educational process, as well as their positive experience in the implementation of educational activities.


2020 ◽  
Vol 2 (2) ◽  
pp. 112-129
Author(s):  
Lelly Oktafiana ◽  
Iis Holisin ◽  
Himmatul Mursyidah

This study aims to describe the quality of the 2018 Mathematics National Examination (UN) in the HOTS types at the junior high level in terms of the level of validity, reliability, problem differentiation power, level of difficulty and distractor. This type of research is a descriptive study. The research was conducted at SMP Muhammadiyah 4 Surabaya and SMP Negeri 13 Surabaya for students in class VIII. The data collection technique used is a test. The test was taken from the 2018 math UN questions in odd semester VIII grade material including HOTS type. The number of UN mathematics questions in 2018 in the odd semester VIII class material consisted of 12 questions with 25% including LOTS types and 75% including HOTS types. The results showed: (1) 100% valid test questions, (2) high question reliability, (3) good problem differentiation power, (4) the difficuly level of the question 77,77% categorized as moderate and 2 question 22,23% are categorized as difficult, (5) there are 2 questions with one of the answer options do not work.


2020 ◽  
Author(s):  
Hiran Thabrew ◽  
Karolina Stasiak ◽  
Harshali Kumar ◽  
Tarique Naseem ◽  
Christopher Frampton ◽  
...  

BACKGROUND Approximately 10% to 12% of New Zealand children and young people have long-term physical conditions (also known as chronic illnesses) and are more likely to develop psychological problems, particularly anxiety and depression. Delayed treatment leads to worse physical and mental healthcare, school absence, and poorer long-term outcomes. Recently, electronic health (eHealth) interventions, especially those based on the principles of Cognitive Behavior Therapy (CBT), have been shown to be as good as face-to-face therapy. Biofeedback techniques have also been shown to enhance relaxation during the treatment of anxiety. However, these modalities have rarely been combined. Young people with long-term physical conditions have expressed a preference for well-designed and technologically-based support to deal with psychological issues, especially anxiety. OBJECTIVE This study aimed to co-design and evaluate the (i) acceptability and (ii) usability of a CBT and biofeedback-based, 5-module eHealth game called ‘Starship Rescue’ and (iii) to provide preliminary evidence regarding its effectiveness in addressing anxiety and quality of life in young people with long-term physical conditions. METHODS Starship Rescue was co-designed with children and young people from a tertiary hospital in Auckland, New Zealand. Following this, 24 young people aged 10 to 17 years were enrolled in an open trial, during which they were asked to use the game for an 8-week period. Acceptability of the game to all participants was assessed using a brief, open-ended questionnaire, and more detailed feedback was obtained from a subset of 10 participants via semi-structured interviews. Usability was evaluated via the System Usability Scale (SUS) and device-recorded frequency and duration of access on completion of the game. Anxiety levels were measured prior to commencement, on completion of the game, and 3 months later using the Generalized Anxiety Disorder 7-item scale (GAD-7) and Spence Child Anxiety Scales (SCAS), and at the start of each module and at the end of the game using an embedded Likert/visual analog scale. Quality of life was measured prior to commencement and on completion of the game using the Pediatric Quality of Life Scale (PEDS-QL). RESULTS Users gave Starship Rescue an overall rating of 5.9 out of 10 (range 3-10 and a mean score of 71 out of 100 (SD 11.7; min 47.5; max 90) on the System Usability Scale (SUS). The mean time period for use of the game was just over 11-weeks (78.8 days, 13.5 hours, 40 minutes). Significant reductions in anxiety were noted between the start and end of the game on the GAD-7 (-4.6 (p=0.000)), SCAS (-9.6 (p=0.005)), and the Likert/visual analogue scales (-2.4 (p=0.001)). Quality of life also improved on the PedsQL scale (+4.3 (p=0.042)). All changes were sustained at 3-month follow-up. CONCLUSIONS This study provides preliminary evidence for Starship Rescue being an acceptable, usable and effective eHealth intervention for addressing anxiety in young people with long-term physical conditions. Further evaluation is planned via a more formal randomized controlled trial. CLINICALTRIAL Australian New Zealand Clinical Trials Network Registry (ANZCTR): ACTRN12616001253493p;https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=371443 (Archived by WebCite at http://www.webcitation.org/6sYB716lf)


2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Hua-hong Wu ◽  
Feng-qi Wu ◽  
Yang Li ◽  
Jian-ming Lai ◽  
Gai-xiu Su ◽  
...  

Abstract Background Juvenile idiopathic arthritis (JIA) may seriously affects patients’ quality of life (QoL), but it was rarely focused and studied in China, so we explore JIA children’s QoL using Chinese version of the PedsQL4.0 Generic Core and PedsQL3.0 Rheumatology Module scale, and analyzed the psychometric properties of these two Scales among Chinese JIA children. Methods We recruited 180 JIA patients from Children's Hospital Affiliated to Capital Institute of Pediatrics and Hebei Yanda Hospital from July 2018 to August 2019. The questionnaires include information related on JIA, PedsQL4.0 generic core and PedsQL3.0 Rheumatology Module scales. According to the disease type, onset age of and course of JIA, we divided them into different groups, then compared the QoL status among different groups. Moreover, we analyzed the reliability and validity of these two scales in these 180 JIA children. Results The mean score of PedsQL4.0 generic core scale on these 180 patients was 82.85 ± 14.82, for these in active period was 72.05 ± 15.29, in remission period was 89.77 ± 9.23; the QoL score of systemic, polyarticular and oligoarticular JIA patients were 77.05 ± 19.11, 84.33 ± 12.46 and 87.12 ± 10.23. The mean score of PedsQL3.0 Rheumatology Module scale on 180 patients was 91.22 ± 9.45, for these in active period was 84.70 ± 11.37, in remission period was 95.43 ± 4.48; the QoL score of systemic, polyarticular and oligoarticular JIA patients were 89.41 ± 11.54, 89.38 ± 10.08 and 93.71 ± 6.92. In the PedsQL 4.0 Generic Core scale, the α coefficients of total scale and almost every dimension are all greater than 0.8 except for the school activity dimension of 0.589; the correlation coefficients of 22 items’ scores (total 23 items) with the scores of dimensions they belong to are greater than 0.5 (maximum value is 0.864), and the other one is 0.406. In PedsQL3.0 Rheumatology Module scale, except for the treatment and worry dimensions of 0.652 and 0.635, the α coefficients of other dimensions and the total scale are all greater than 0.7; the correlation coefficients of all items’ score were greater than 0.5 (the maximum is 0.933, the minimum is 0.515). Conclusions The QoL of Chinese JIA children is worse than their healthy peers, these in active period and diagnosed as systemic type were undergoing worst quality of life. The reliability and validity of PedsQL 4.0 Generic Core and PedsQL3.0 Rheumatology Module scale in Chinese JIA children are satisfactory, and can be used in clinical and scientific researches.


Sign in / Sign up

Export Citation Format

Share Document