Interpreting published effect sizes in behavioral science: a thought-experiment

2021
Author(s):
Erich H. Witte
Frank Zenker

Standardized effect size measures (e.g., Cohen’s d) state the observed mean difference, m1-m0, relative to the observed standard deviation, s. In behavioral science today, these measures are commonly used in meta-analytical research to quantify the observed m1-m0 across object-level studies that use different measurement scales, and in theory-construction research to point-specify m1-m0 as a theoretically predicted parameter. Since standardization conceptually relates to the quality of measurement, m1-m0 can be interpreted fully only relative to whichever error-theory determines s. The error-theory, however, is something behavioral scientists must typically choose freely, because a theoretically motivated measurement scale is normally unavailable. Using a thought-experiment, we show that differentially sophisticated error-theories let the standardized m1-m0 vary massively given identical observations. This makes the common practice of publishing m1-m0 “nakedly” (without a transparent error-theory) appear problematic, because it undermines the goals of a cumulative science of human behavior. We advocate reporting standardized effect sizes along with a transparent error-theory.
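The dependence of a standardized effect size on the choice of s can be illustrated with a small numerical sketch. This is not the authors' thought-experiment: the observations are invented, and the two choices of s (the pooled standard deviation of individual scores versus the standard error of a group mean) are assumptions picked only to show that identical observations can yield very different standardized values.

```python
import math
import statistics


def pooled_sd(a, b):
    """Pooled standard deviation of two independent samples."""
    na, nb = len(a), len(b)
    va, vb = statistics.variance(a), statistics.variance(b)
    return math.sqrt(((na - 1) * va + (nb - 1) * vb) / (na + nb - 2))


def standardized_difference(m1, m0, s):
    """Mean difference m1 - m0 expressed in units of s."""
    return (m1 - m0) / s


# Hypothetical observations, invented for illustration only.
control = [10.0, 12.0, 14.0, 16.0, 18.0]
treatment = [12.0, 14.0, 16.0, 18.0, 20.0]

m0 = statistics.mean(control)
m1 = statistics.mean(treatment)

# Choice A: s is the pooled SD of individual observations (Cohen's d).
s_individual = pooled_sd(control, treatment)
d_individual = standardized_difference(m1, m0, s_individual)

# Choice B (assumed alternative): s is the standard error of a group mean,
# i.e. the pooled SD divided by sqrt(n).
n = len(control)
s_of_mean = s_individual / math.sqrt(n)
d_of_mean = standardized_difference(m1, m0, s_of_mean)

print(f"m1 - m0 = {m1 - m0:.2f}")
print(f"d with s of individual scores: {d_individual:.3f}")
print(f"d with s of group means:       {d_of_mean:.3f}")
```

With these made-up numbers, the same raw difference of 2 points corresponds to roughly 0.63 under choice A and roughly 1.41 under choice B, so a reported value is hard to interpret unless the way s was obtained is stated alongside it.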

Author(s):  
Danièle Roberge
Francine Ducharme
Paule Lebel
Raynald Pineault
Jacynthe Loiselle

Abstract. Until now, family caregivers have been involved very little in the processes of assessing the quality of care delivered to a hospitalized relative. This study is the second phase of a broader research project whose aim is to develop measurement scales intended for elderly patients and their caregivers on their perceptions of the quality of services delivered in Geriatric Assessment Units. More specifically, the goal of this phase of the research is to document the criteria that caregivers use to judge the quality of these services: these criteria should constitute the content of the measurement scale that is intended for them. Four focus groups, bringing together 21 caregivers, allowed for the identification of 31 criteria of quality. These criteria have been classified according to six dimensions of quality: information, communication, attitude of staff, technical quality, continuity, and physical resources. The study highlights the dual concerns of participants: the well-being of the patient and support for caregivers. It shows that caregivers consider themselves to be clients of geriatric services.


2021
Vol 13 (10)
pp. 5734
Author(s):
Željko Stević
Ilija Tanackov
Adis Puška
Goran Jovanov
Jovica Vasiljević
...  

To run a business successfully, quality determination and customer relations are very important factors. It is therefore necessary to measure quality and identify critical points of the business. In this paper, an original integrated model for measuring the service quality of reverse logistics (RL) was developed, using the company Komunalac Teslić as an example. The Delphi method and the Full Consistency Method (FUCOM) were applied to determine the significance of the quality dimensions, while a modified SERVQUAL (SQ) model was used to measure the service quality of the logistics. An original SQ questionnaire was formed with a total of 21 statements arranged in five standard dimensions. Examining the reliability of the questionnaire for the quality dimensions using the Cronbach Alpha coefficient showed that the measurement scales for the dimensions are appropriate in terms of user expectations, while in terms of quality perception there is no measurement scale for the empathy dimension. An extensive statistical analysis was then performed to verify the results. A Signum test was applied to identify the relationship between the responses in terms of expectations and perceptions, i.e., to examine their differences. The findings show that expectations were higher than the perceived quality of the services and that there was a statistically significant difference for 12 of the SQ statements. For two statements, there was a statistically significant difference in favor of perceived quality compared to expectations. Based on the results obtained, the company must improve its services in order for service quality to be at a satisfactory level.
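The two statistical checks named in this abstract can be sketched in a few lines. The following is a minimal illustration, not the authors' analysis: it assumes the standard formula for Cronbach's alpha and reads the "Signum test" as the ordinary two-sided sign test on paired responses; the function names and all scores are hypothetical.

```python
import statistics
from math import comb


def cronbach_alpha(items):
    """Cronbach's alpha; `items` holds one list of scores per statement,
    aligned by respondent."""
    k = len(items)
    item_var_sum = sum(statistics.variance(col) for col in items)
    totals = [sum(row) for row in zip(*items)]  # total score per respondent
    return k / (k - 1) * (1 - item_var_sum / statistics.variance(totals))


def sign_test_p(expectations, perceptions):
    """Two-sided sign test p-value for paired responses; ties are dropped."""
    diffs = [p - e for e, p in zip(expectations, perceptions) if p != e]
    n = len(diffs)
    positives = sum(1 for d in diffs if d > 0)
    k = min(positives, n - positives)
    # Probability of k or fewer signs of the rarer kind under a fair coin.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Hypothetical scores for three statements of one dimension, five respondents.
dimension_items = [
    [4, 5, 3, 4, 2],
    [5, 5, 3, 4, 1],
    [4, 4, 2, 5, 2],
]
print(f"alpha = {cronbach_alpha(dimension_items):.3f}")

# Hypothetical paired ratings for one statement, eight respondents.
expected = [5, 5, 4, 5, 4, 5, 5, 4]
perceived = [4, 4, 3, 4, 4, 3, 4, 3]
print(f"sign test p = {sign_test_p(expected, perceived):.4f}")
```

In this sketch, a low p-value for a statement where most differences are negative would correspond to expectations exceeding perceived quality, which is the pattern the abstract reports for 12 statements.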


2018
Vol 30 (2)
pp. 168-180
Author(s):
Francesca Bassi
Renata Clerici
Debora Aquario

Purpose: Students’ evaluation of teaching quality plays a major role in higher education. Satisfaction is not directly observable; nevertheless, it can be measured through multi-item measurement scales. These instruments are extremely useful, and their importance requires accurate development and validation procedures. The purpose of this paper is to show how latent class (LC) analysis can improve the procedures for developing and validating a multi-item measurement scale for measuring students’ evaluation of teaching and, at the same time, provide deeper insight into the phenomenon under investigation. Design/methodology/approach: The traditional literature highlights specific protocols, along with the statistical instruments to be used, for achieving this goal. However, these tools are suited to metric variables, yet they are adopted even when the observed variables are of a different nature, as often occurs, since in many cases the items are ordinal. LC analysis explicitly takes into account the ordinal nature of the variables and the fact that the object of interest is unobservable. Findings: The data come from the questionnaire used to evaluate teaching, administered to students of the University of Padua. LC analysis provides insight into scale properties such as dimensionality, validity and reliability. Moreover, the results provide a deeper view of the way students use the scale to report satisfaction, suggesting that the instrument be revised in line with the suggestion of the National Agency for University Evaluation. Originality/value: The paper makes an original contribution on two sides. On the side of methods, it introduces a more accurate methodology for evaluating scales that measure students’ satisfaction. On the side of applications, it provides important suggestions to university management for improving the process of evaluating teaching quality.


2018
Vol 2 (1)
pp. 45-54
Author(s):
Qurotul Aini
Siti Ria Zuliana
Nuke Puji Lestari Santoso

A scale is usually used to check and determine the value of a qualitative factor in quantitative terms. A measurement scale is an agreed convention used as a reference to determine the length of the intervals on a measuring instrument, so that the instrument produces quantitative data when it is used. The results of scale calculations must be interpreted carefully because, in addition to producing only a rough picture, respondents' answers cannot simply be taken at face value. Types of measurement scales include the Likert scale, Guttman scale, semantic differential scale, rating scale, Thurstone scale, Bogardus scale, and various other measurement scales. One of the most difficult tasks for information technology researchers who need to measure variables is finding their way among the many existing measures. If a good measure already exists for a particular variable, there seem to be few reasons to construct a new one. Keywords: Scale, Measurement, Variables.


2013
pp. 77-90
Author(s):  
Yen Nguyen Thi Hoang

This paper focuses on the understanding of service quality in the context of Vietnamese universities. It proposes an approach for measuring the quality of the higher education service provided by universities in Vietnam. First, an exploratory study was conducted. The items it generated were then compiled into a questionnaire administered to 675 students of a Vietnamese university to determine the dimensions of higher education service quality in this context. The results allow us to adopt a measurement scale that differs slightly from the SERVQUAL scale, which is widely known as the standard for measuring service quality. The results also show that tangible elements, responsiveness and assurance seem to be three specific dimensions of the higher education service of Vietnamese universities.


2020
Vol 20 (4)
pp. 207-218
Author(s):
Won Seok Lee
Joon Moon

This study aims to develop cross-cultural value measurement scales that can overcome established methodological problems and test the dimensional frameworks of the scale with non-Asian respondents. It applies a mixed-method approach to observe intrinsic, nationally distinct values, and develop a generalized values measurement scale. This study found new value dimensions that were not present in the previous value studies (i.e., life balance, emotional growth, family union, and friendship) and provided segmented subdimensions (i.e., balancing between work and rest, time management, rewards of investment, and self-examination). This complements and enhances the current body of knowledge on value measurement.


Author(s):  
Shiv Visvanathan

This chapter is an attempt to look at the question of quality within a wider vision of diversity and democracy. It is an effort to show how epistemological approaches to knowledge and democracy help to determine the quality of knowledge, life and well-being in a society. The chapter also discusses the problems of science and examines the current nature of discourse in scientific education in India.


2020
Vol 18 (1)
Author(s):
Jia-Xi Li
Yan-Mei Shi
Li-Ya An
Jin-Xu Yang
Yu-Xing Qi
...  

Abstract. Objectives: To fully assess the quality of the guidelines for the management of malignant pleural effusions (MPE) and ascites and reveal the heterogeneity of recommendations and possible reasons among guidelines. Methods: A systematic search was performed in the database to obtain guidelines for the management of MPE and ascites. The AGREE II tool was used to assess the quality of these guidelines. The Measurement Scale of Rate of Agreement (MSRA) was introduced to assess the scientific agreement of formulated recommendations for the management of MPE and ascites among guidelines, and evidence supporting these recommendations was extracted and analyzed. Results: Nine guidelines were identified. Only 4 guidelines scored more than 60% and are worth recommending. Recommendations were also heterogeneous among guidelines for the management of MPE, and the main reasons were the different emphases of the recommendations for the treatment of MPE, the contradictions in recommendations, and the unreasonably cited evidence for MPE. Conclusions: The quality of the management guidelines for patients with MPE and malignant ascites was highly variable. Specific improvement of the factors leading to the heterogeneity of recommendations will be a reasonable and effective way for developers to upgrade the recommendations in the guidelines for MPE.

