D-test: A New Test for Analyzing Scale Invariance Using Symbolic Dynamics and Symbolic Entropy

The aim of this study is to improve measurement in marketing research by constructing a new, simple, nonparametric, consistent, and powerful test to study scale invariance. The test is called the D-test. It is constructed using symbolic dynamics and symbolic entropy as a measure of the difference between the response patterns that come from two measurement scales. We also give a standard asymptotic distribution of our statistic. Because the test is based on entropy measures, it avoids smoothed nonparametric estimation. We applied the D-test to a real marketing research study to examine whether scale invariance holds when measuring service quality in a sports service. We took a free scale as the reference scale and compared it with three widely used rating scales: Likert-type scales from 1 to 5 and from 1 to 7, and a semantic-differential scale from −3 to +3. Scale invariance holds for the two latter scales. This test overcomes the shortcomings of other procedures for analyzing scale invariance; it gives researchers a tool for deciding on the appropriate rating scale for specific marketing problems and for assessing how the results of prior studies can be questioned.
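
As a rough illustration of the entropy idea mentioned in this abstract, the sketch below computes the Shannon entropy of the relative frequencies of response patterns treated as symbols. It is not the D-test statistic, whose definition and asymptotic distribution the abstract does not give; the function name and the two sets of response patterns are hypothetical.

```python
from collections import Counter
from math import log


def symbolic_entropy(patterns):
    """Shannon entropy (in nats) of the relative frequencies of the symbols."""
    counts = Counter(patterns)
    n = len(patterns)
    return -sum((c / n) * log(c / n) for c in counts.values())


# Hypothetical response patterns: each respondent's answers to two items,
# coded as a tuple so that identical patterns map to the same symbol.
scale_a = [(1, 2), (1, 2), (2, 1), (2, 1)]  # two patterns, equally frequent
scale_b = [(1, 2), (1, 2), (1, 2), (1, 2)]  # a single pattern

entropy_a = symbolic_entropy(scale_a)
entropy_b = symbolic_entropy(scale_b)
```

With these made-up data, the first scale spreads respondents evenly over two patterns and the second concentrates them on one, so the first has the higher entropy. A formal comparison would need the test statistic from the article itself.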


A Rasch-Based Validation of ELT Certificate-LORT

English Language Teaching ◽

10.5539/elt.v13n9p94 ◽

2020 ◽

Vol 13 (9) ◽

pp. 94

Author(s):

Xin Qu

Keyword(s):

Teacher Candidates ◽

Rating Scales ◽

Rating Scale ◽

Qualitative Interviews ◽

Structured Interviews ◽

High Stakes ◽

Efl Teacher ◽

The Difference ◽

Group Interviews

The present study aimed to validate the ELT Certificate Lesson Observation and Report Task (ELTC-LORT), which was developed by China Language Assessment to certify China’s EFL teachers through performance-based testing. The ELT Certificate is high-stakes, given its impact on candidates’ recruitment, ELT in China, and the quality of education, so validation is crucial to guarantee fairness and justice. The validity of the task construct and rating rubric was examined using many-facet Rasch measurement supplemented with qualitative interviews. Participants (N = 40) were shown a video excerpt from a real EFL lesson and required to deliver a report on the teacher’s performance. Two raters graded the records of the candidates’ reports using rating scales developed to measure EFL teacher candidates’ oral English proficiency and ability to analyze and evaluate teaching. Many-facet Rasch analysis produced a successful estimation, with a noticeable spread among the participants and their traits, indicating that the task functioned well in measuring candidates’ performance and reflecting differences in their ability. The raters were found to have good internal self-consistency, but not the same leniency. The rating scales worked well, with the average measures advancing largely in line with Rasch expectations. Semi-structured interviews and focus group interviews were conducted to provide insight into the raters’ performance and the functioning of the rating scale items. The findings have implications for further research on and practice with the Certificate.


Nonadjectival Rating Scales in Human Response Experiments

Human Factors The Journal of the Human Factors and Ergonomics Society ◽

10.1177/001872087301500311 ◽

1973 ◽

Vol 15 (3) ◽

pp. 275-280 ◽

Cited By ~ 17

Author(s):

Ronald A. Hess

Keyword(s):

System Dynamics ◽

Input Signal ◽

Rating Scales ◽

Rating Scale ◽

Analog Computer ◽

Error Signal ◽

Human Response ◽

Oscilloscope Screen ◽

The Difference ◽

Degree Of Instability

Twenty-two subjects participated in two tracking experiments for the purpose of determining the utility of a nonordinal, nonadjectival rating scale. The scale was devised in an effort to allow a human to quantify his subjective opinions of the characteristics of a system in situations where an adjectival scale would be inappropriate. The tracking task in both experiments was a compensatory one in which the human operator attempted to minimize the difference between a random-appearing input signal and the output of an unstable, controlled element. The system dynamics and input signal were mechanized on an analog computer. The error signal was viewed by the operator on an oscilloscope screen. Control was effected by a small isometric manipulator. In the first experiment, ratings were generated by changing the degree of instability of the controlled element. In the second, the manipulator sensitivity was varied. The nonadjectival rating concept shows definite potential for use in a wide variety of situations in which human opinion is elicited.


Application Of Rating Scale Model In Conversion Of Rating Scales' Points To The Form Of Triangular Fuzzy Numbers

Folia Oeconomica Stetinensia ◽

10.1515/foli-2015-0010 ◽

2014 ◽

Vol 14 (2) ◽

pp. 7-18

Author(s):

Bartłomiej Jefmański

Keyword(s):

Rating Scales ◽

Fuzzy Numbers ◽

Rating Scale ◽

Transformation Method ◽

Economic Research ◽

Scale Model ◽

Triangular Fuzzy Numbers ◽

Measurement Scales ◽

Suggested Approach ◽

Rating Scale Model

A new application of fuzzy set theory in social and economic research is the fuzzy measurement of respondents' opinions. The literature applies either fuzzy rating scales or fuzzy conversion scales. In the latter case, a key stage is choosing the parameter values of the fuzzy numbers that best reflect how respondents perceive the linguistic values that make up the points of measurement scales. Item response theory models can be applied in the construction of fuzzy conversion scales. This article proposes a method for transforming verbal categories into triangular fuzzy numbers using the rating scale model. The usefulness of the suggested approach is illustrated with an analysis of selected research results on inhabitants' quality of life in one of the districts of the Lower Silesian Voivodship. The results showed considerable ambiguity in particular verbal categories and, consequently, support the use of fuzzy conversion scales.
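
For readers unfamiliar with triangular fuzzy numbers, the following minimal sketch shows the standard membership function of a triangular fuzzy number (a, b, c). The parameter values for the verbal category are hypothetical and are not taken from the article, which estimates them with the rating scale model; that estimation step is not reproduced here.

```python
def triangular_membership(x, a, b, c):
    """Membership degree of x in the triangular fuzzy number (a, b, c), with a < b < c."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        # Rising edge: 0 at a, 1 at the peak b.
        return (x - a) / (b - a)
    # Falling edge: 1 at the peak b, 0 at c.
    return (c - x) / (c - b)


# Hypothetical fuzzy number for one verbal category on a 1-5 scale.
neutral = (2.0, 3.0, 4.0)

at_peak = triangular_membership(3.0, *neutral)
halfway = triangular_membership(2.5, *neutral)
outside = triangular_membership(5.0, *neutral)
```

A wide triangle (a and c far from b) expresses an ambiguous category, which is the kind of ambiguity the abstract reports.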


Going Back to Kahlbaum’s Psychomotor (and GABAergic) Origins: Is Catatonia More Than Just a Motor and Dopaminergic Syndrome?

Schizophrenia Bulletin ◽

10.1093/schbul/sbz074 ◽

2019 ◽

Cited By ~ 6

Author(s):

Dusan Hirjak ◽

Katharina M Kubera ◽

R Christian Wolf ◽

Georg Northoff

Keyword(s):

Systematic Review ◽

Rating Scales ◽

Rating Scale ◽

Clinical Syndrome ◽

Aminobutyric Acid ◽

Gamma Aminobutyric Acid ◽

Glutamatergic Transmission ◽

Mechanistic Insight ◽

The Difference

In 1874, Karl Kahlbaum described catatonia as an independent syndrome characterized by motor, affective, and behavioral anomalies. In the following years, various catatonia concepts were established, all sharing a primary focus on motor and behavioral symptoms while largely neglecting affective changes. In the 21st century, catatonia is a well-characterized clinical syndrome. Yet its neurobiological origin is still not clear because methodological shortcomings of previous studies have hampered this challenging effort. To fully capture the clinical picture of catatonia as emphasized by Karl Kahlbaum, a new catatonia scale was developed 2 decades ago (Northoff Catatonia Rating Scale [NCRS]). Since then, studies have used the NCRS to allow for more mechanistic insight into catatonia. Here, we undertook a systematic review searching for neuroimaging studies using motor/behavioral catatonia rating scales/criteria and NCRS published up to March 31, 2019. We included 19 neuroimaging studies. Studies using motor/behavioral catatonia rating scales/criteria depict cortical and subcortical motor regions mediated by dopamine as neuronal and biochemical substrates of catatonia. In contrast, studies relying on NCRS found rather aberrant higher-order frontoparietal networks which, biochemically, are insufficiently modulated by gamma-aminobutyric acid (GABA)-ergic and glutamatergic transmission. This is further supported by the high therapeutic efficacy of GABAergic agents in acute catatonia. In sum, this systematic review points out the difference between motor/behavioral and NCRS-based classification of catatonia on both neuronal and biochemical grounds. This highlights the importance of Kahlbaum’s original truly psychomotor concept of catatonia for guiding both research and clinical diagnosis and therapy.


AAC Collaboration Using the Self-Anchored Rating Scales (SARS): An Aphasia Case Study

Perspectives on Augmentative and Alternative Communication ◽

10.1044/aac21.4.136 ◽

2012 ◽

Vol 21 (4) ◽

pp. 136-143

Author(s):

Lynn E. Fox

Keyword(s):

Augmentative And Alternative Communication ◽

Rating Scales ◽

Rating Scale ◽

Therapeutic Process ◽

Family Counseling ◽

The Self ◽

Alternative Communication ◽

Her Family ◽

Solution Focused

The self-anchored rating scale (SARS) is a technique that augments collaboration between Augmentative and Alternative Communication (AAC) interventionists, their clients, and their clients' support networks. SARS is a technique used in Solution-Focused Brief Therapy, a branch of systemic family counseling. It has been applied to treating speech and language disorders across the life span, and recent case studies show it has promise for promoting adoption and long-term use of high and low tech AAC. I will describe 2 key principles of solution-focused therapy and present 7 steps in the SARS process that illustrate how clinicians can use the SARS to involve a person with aphasia and his or her family in all aspects of the therapeutic process. I will use a case study to illustrate the SARS process and present outcomes for one individual living with aphasia.


Childbirth and Posttraumatic Stress Responses

European Journal of Psychological Assessment ◽

10.1027/1015-5759.22.4.259 ◽

2006 ◽

Vol 22 (4) ◽

pp. 259-267 ◽

Cited By ~ 40

Author(s):

Eelco Olde ◽

Rolf J. Kleber ◽

Onno van der Hart ◽

Victor J.M. Pop

Keyword(s):

Posttraumatic Stress Disorder ◽

Posttraumatic Stress ◽

Stress Responses ◽

Rating Scales ◽

Rating Scale ◽

Stress Disorder ◽

Traumatic Experience ◽

Event Scale ◽

Self Rating ◽

The Impact

Childbirth has been identified as a possible traumatic experience, leading to traumatic stress responses and even to the development of posttraumatic stress disorder (PTSD). The current study investigated the psychometric properties of the Dutch version of the Impact of Event Scale-Revised (IES-R) in a group of women who recently gave birth (N = 435). In addition, a comparison was made between the original IES and the IES-R. The scale showed high internal consistency (α = 0.88). Using confirmatory factor analysis no support was found for a three-factor structure of an intrusion, an avoidance, and a hyperarousal factor. Goodness of fit was only reasonable, even after fitting one intrusion item on the hyperarousal scale. The IES-R correlated significantly with scores on depression and anxiety self-rating scales, as well as with scores on a self-rating scale of posttraumatic stress disorder. Although the IES-R can be used for studying posttraumatic stress reactions in women who recently gave birth, the original IES proved to be a better instrument compared to the IES-R. It is concluded that adding the hyperarousal scale to the IES-R did not make the scale stronger.
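
The internal consistency figure reported here (α = 0.88) is Cronbach's alpha. As a minimal sketch of how that coefficient is computed, the code below applies the standard formula, α = k/(k − 1) · (1 − Σ item variances / variance of total scores), to a small made-up score matrix. The data and the function name are hypothetical and unrelated to the IES-R sample.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: 3 respondents, 2 items that agree perfectly.
perfectly_consistent = [[1, 1], [2, 2], [3, 3]]
alpha = cronbach_alpha(perfectly_consistent)
```

When every item ranks respondents identically, as in this toy matrix, alpha reaches its maximum of 1; real questionnaire data fall below that.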


Management Measurement Scale As A Reference To Determine Interval In A Variable

Aptisi Transactions on Management (ATM) ◽

10.33050/atm.v2i1.775 ◽

2018 ◽

Vol 2 (1) ◽

pp. 45-54

Author(s):

Qurotul Aini ◽

Siti Ria Zuliana ◽

Nuke Puji Lestari Santoso

Keyword(s):

Information Technology ◽

Quantitative Data ◽

Rating Scale ◽

Semantic Differential ◽

Short Length ◽

Measurement Scale ◽

Measuring Instrument ◽

Guttman Scale ◽

Measurement Scales ◽

Thurstone Scale

A scale is usually used to check and determine the value of a qualitative factor in quantitative terms. A measurement scale is an agreed convention used as a reference for determining the length of the intervals on a measuring instrument, so that the instrument produces quantitative data when used in measurement. The results of scale calculations must be interpreted carefully because, in addition to giving only a rough picture, respondents' answers cannot simply be taken at face value. Types of measurement scales include the Likert scale, Guttman scale, semantic differential scale, rating scale, Thurstone scale, Bogardus scale, and various others. One of the most difficult tasks for information technology researchers who need to measure variables is finding direction among the many existing measures. If a good measure already exists for a particular variable, there seems to be little reason to construct a new one. Keywords: Scale, Measurement, Variables.


Finding Talent Among Elementary English Learners: A Validity Study of the HOPE Teacher Rating Scale

Gifted Child Quarterly ◽

10.1177/0016986220985942 ◽

2021 ◽

pp. 001698622098594

Author(s):

Nielsen Pereira

Keyword(s):

English Language Learners ◽

English Learners ◽

English Language ◽

Rating Scales ◽

Rating Scale ◽

Teacher Rating ◽

Esl Teachers ◽

Invariance Testing ◽

Scale Scores ◽

The One

The purpose of this study was to investigate the validity of the HOPE Scale for identifying gifted English language learners (ELs) and how classroom and English as a second language (ESL) teacher HOPE Scale scores differ. Seventy teachers completed the HOPE Scale on 1,467 students in grades K-5 and four ESL teachers completed the scale on 131 ELs. Measurement invariance tests indicated that the HOPE Scale yields noninvariant latent means across EL and English proficient (EP) samples. However, confirmatory factor analysis results support the use of the scale with ELs or EP students separately. Results also indicate that the rating patterns of classroom and ESL teachers were different and that the HOPE Scale does not yield valid data when used by ESL teachers. Caution is recommended when using the HOPE Scale and other teacher rating scales to compare ELs to EP students. The importance of invariance testing before using an instrument with a population that is different from the one(s) for which the instrument was developed is discussed.


Assessment of Agreement Between Human Ratings and Lexicon-Based Sentiment Ratings of Open-Ended Responses on a Behavioral Rating Scale

Assessment ◽

10.1177/1073191121996466 ◽

2021 ◽

pp. 107319112199646

Author(s):

Olivia Gratz ◽

Duncan Vos ◽

Megan Burke ◽

Neelkamal Soares

Keyword(s):

Sentiment Analysis ◽

Language Processing ◽

Rating Scales ◽

Clinical Decision Making ◽

Rating Scale ◽

Clinical Decision ◽

Assessment System ◽

Word Count ◽

Positive Correlation ◽

Sentiment Score

To date, there is a paucity of research conducting natural language processing (NLP) on the open-ended responses of behavior rating scales. Using three NLP lexicons for sentiment analysis of the open-ended responses of the Behavior Assessment System for Children-Third Edition, the researchers discovered a moderately positive correlation between the human composite rating and the sentiment score using each of the lexicons for strengths comments and a slightly positive correlation for the concerns comments made by guardians and teachers. In addition, the researchers found that as the word count increased for open-ended responses regarding the child’s strengths, there was a greater positive sentiment rating. Conversely, as word count increased for open-ended responses regarding child concerns, the human raters scored comments more negatively. The authors offer a proof-of-concept to use NLP-based sentiment analysis of open-ended comments to complement other data for clinical decision making.


Pre-Hospital Pain Management in Children with Injuries: A Retrospective Cohort Study

Journal of Clinical Medicine ◽

10.3390/jcm10143056 ◽

2021 ◽

Vol 10 (14) ◽

pp. 3056

Author(s):

Ada Holak ◽

Michał Czapla ◽

Marzena Zielińska

Keyword(s):

Pain Management ◽

Pain Intensity ◽

Pain Assessment ◽

Rating Scales ◽

Rating Scale ◽

Low Frequency ◽

Numeric Rating Scale ◽

Health Concern ◽

School Age Children ◽

Medical Teams

Background: The all-too-frequent failure to rate pain intensity, resulting in the lack of or inadequacy of pain management, has long ceased to be an exclusive problem of the young patient, becoming a major public health concern. This study aimed to evaluate the methods used for reducing post-traumatic pain in children and the frequency of use of such methods. Additionally, the methods of pain assessment and the frequency of their application in this age group were analysed. Methods: A retrospective analysis of 2452 medical records of emergency medical teams dispatched to injured children aged 0–18 years in the area around Warsaw (Poland). Results: Of all injured children, 1% (20 out of 2432) had their pain intensity rated, and the only tool used for this assessment was the numeric rating scale (NRS). Children with burns most frequently received a single analgesic drug or cooling (56.2%), whereas the least frequently used method was multimodal treatment combining pharmacotherapy and cooling (13.5%). Toddlers constituted the largest percentage of patients who were provided with cooling (12%). Immobilisation was most commonly used in adolescents (29%) and school-age children (n = 186; 24%). Conclusions: Low frequency of pain assessment emphasises the need to provide better training in the use of various pain rating scales and protocols. What is more, non-pharmacological methods (cooling and immobilisation) used for reducing pain in injured children still remain underutilized.
