test bias
Recently Published Documents

Total documents: 153 (12 in the last five years)
H-index: 21 (1 in the last five years)

2021 ◽  
Vol 11 (1) ◽  
pp. 1-11
Author(s):  
Ngoc Nhat Minh Nguyen

This paper aims to explore the relationship between how language teachers perceive test bias and where they work, how long they have worked, and where they were professionally trained. The data were collected from 19 in-service English teachers from Eastern and Western settings. They completed a questionnaire in which they were asked to respond to test bias stimuli and to answer questions about their teaching background and training. Each stimulus contained one of two forms of bias: unfair penalization or offensiveness. Qualitative and quantitative analyses showed that teachers were not fully informed about the possible forms of test bias or about the ways potential biases can unfairly penalize or offend students. They were better able to recognize biases of unfair penalization than of offensiveness. Statistical analyses revealed that teachers with over 10 years of experience were better able to recognize potential test bias than those with less experience (at the 90% confidence level). The findings contribute to the currently limited literature on bias in classroom language testing and assessment, with implications for bias review in teacher-developed assessments and for teacher training.
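As an illustrative sketch only (the abstract does not specify the test used, only the confidence level), a comparison of bias-recognition scores between the two experience groups could be run as follows; the scores and the choice of a Mann-Whitney U test, which suits a sample as small as 19 teachers, are assumptions:

```python
import numpy as np
from scipy import stats

# Hypothetical bias-recognition scores (number of biased stimuli correctly
# flagged) for the two experience groups; invented for illustration.
over_10_years = np.array([8, 7, 9, 6, 8, 7, 9, 8])
under_10_years = np.array([6, 5, 7, 6, 5, 4, 6, 5, 7, 6, 5])

# One-sided Mann-Whitney U test: do more experienced teachers score higher?
u, p = stats.mannwhitneyu(over_10_years, under_10_years, alternative="greater")
print(f"U = {u:.1f}, p = {p:.3f}; significant at alpha = 0.10: {p < 0.10}")
```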


2021 ◽  
Author(s):  
Jordan Lasker ◽  
Emil O. W. Kirkegaard ◽  
Helmuth Nyborg

There are few empirically derived theories explaining group differences in cognitive ability. Spearman's hypothesis is one such theory; it holds that group differences are a function of a given test's relationship to general intelligence, g. Research into this hypothesis has generally been limited to the application of a single method lacking sensitivity, specificity, and the ability to assess test bias: Jensen's method of correlated vectors. To overcome the resulting empirical gap, we applied three different psychometrically sound methods to examine the hypothesis among American blacks and whites in the Vietnam Experience Study (VES) and the National Longitudinal Survey of Youth 1979 (NLSY '79). First, we used multi-group confirmatory factor analysis to assess bias and evaluate the hypothesis directly; we found that strict factorial invariance was tenable in both samples and that either the strong or the weak form of the hypothesis was supported, with 87% and 78% of the group differences attributable to g in the VES and the NLSY '79, respectively. Second, using item response theory metrics to avoid pass-rate confounding, we observed a strong relationship between g loadings and group differences (r = 0.80 and 0.79). Finally, assessing differential item functioning with item-level data revealed that a handful of items functioned differently, but their removal did not affect gap sizes much beyond what would be expected from shortening the tests; when the effect of the differential functioning on scores was assessed with an anchoring method, it was found to be negligible in size. In aggregate, the results supported Spearman's hypothesis but not test bias as an explanation for the cognitive differences between the groups we studied.
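For readers unfamiliar with the baseline method the abstract criticizes, a minimal sketch of Jensen's method of correlated vectors follows; the subtest g loadings and standardized group gaps are invented for illustration, not taken from the VES or NLSY '79:

```python
import numpy as np
from scipy import stats

# Hypothetical per-subtest g loadings and standardized group differences (d).
g_loadings = np.array([0.82, 0.75, 0.68, 0.61, 0.55, 0.49])
group_gaps = np.array([0.95, 0.88, 0.70, 0.62, 0.50, 0.41])

# MCV: correlate each subtest's g loading with its group difference.
r, p = stats.pearsonr(g_loadings, group_gaps)
print(f"MCV correlation: r = {r:.2f} (p = {p:.3f})")

# A strong positive r is conventionally read as support for Spearman's
# hypothesis, but, as the abstract notes, MCV alone cannot detect test bias,
# which is why the study adds MGCFA, IRT, and DIF analyses.
```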


Author(s):  
Michele Goyette-Ewing

Author(s):  
Agnieszka Mikołajczyk ◽  
Michał Grochowski ◽  
Arkadiusz Kwasigroch

The paper proposes summarized attribution-based post-hoc explanations for the detection and identification of bias in data. A global explanation is proposed, and a step-by-step framework on how to detect and test for bias is introduced. Since removing unwanted bias is often a complicated and laborious task, bias is instead inserted automatically and then evaluated with the proposed counterfactual approach. The obtained results are validated on a sample skin lesion dataset. Using the proposed method, a number of possible bias-causing artifacts are successfully identified and confirmed in dermoscopy images. In particular, it is confirmed that black frames have a strong influence on the Convolutional Neural Network's predictions: in 22% of cases, an inserted frame changed the prediction from benign to malignant.
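A minimal sketch of the described counterfactual test, assuming a hypothetical `model` with a Keras-style `predict` method that returns a malignancy probability per image; the frame width and decision threshold are placeholders:

```python
import numpy as np

def add_black_frame(image: np.ndarray, width: int = 10) -> np.ndarray:
    # Return a copy of an HxWxC image with a black border `width` pixels wide.
    framed = image.copy()
    framed[:width, :] = 0    # top edge
    framed[-width:, :] = 0   # bottom edge
    framed[:, :width] = 0    # left edge
    framed[:, -width:] = 0   # right edge
    return framed

def flip_rate(model, images: np.ndarray, threshold: float = 0.5) -> float:
    # Fraction of images whose prediction flips from benign to malignant
    # once the suspected artifact (a black frame) is inserted.
    p_before = model.predict(images)
    p_after = model.predict(np.stack([add_black_frame(img) for img in images]))
    flipped = (p_before < threshold) & (p_after >= threshold)
    return float(flipped.mean())

# In the paper's skin-lesion experiment this rate was 22%, confirming the
# black-frame artifact as a source of bias.
```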


2020 ◽  
pp. 144-161
Author(s):  
Kenneth S. Shultz ◽  
David J. Whitney ◽  
Michael J. Zickar

Author(s):  
Eveline Wuttke ◽  
Christin Siegfried ◽  
Carmela Aprea

Due to current trends in society and the economy, financial literacy is often considered an important twenty-first-century skill. Regardless of this postulated relevance, however, studies suggest that financial illiteracy is a widespread phenomenon in many nations. Some studies also show that certain groups perform particularly poorly (e.g. women, persons with a migration background, and/or persons with a low level of education). These differences are often attributed to individual characteristics such as abilities, dispositions, or socialisation patterns. However, available research also suggests that even after controlling for these characteristics, a rather large portion of the performance differences between the various groups of test-takers remains unexplained. One explanation for performance gaps in financial literacy might be that differences in test scores are partly evoked by the test instrument itself and may thus, at least in part, be interpreted as test bias. In this paper, we present a newly developed Situational Judgement Test focused on financial competence. For this test, we examine whether differences between groups are attributable to individual differences or to a test bias. To analyse a possible test bias, we tested one facet of financial literacy related to everyday money management (with three factors: control of one's financial situation, budgeting, and handling of money) for measurement invariance across different groups. Where measurement invariance could be assumed, we analysed group differences with t-tests. Results show that two factors of the test exhibit measurement invariance for all groups considered (gender, migration and educational background, opportunities to learn). Group comparisons are thus possible, and potential differences are not due to test bias. For the third factor, measurement invariance can only be assumed for the groups with/without a migration background and with/without opportunities to learn about financial topics. When we look at group differences, we find that, in contrast to the findings of many previous studies, the analysis of mean differences does not show any systematic deficits in financial literacy for specific groups.
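The invariance testing itself is typically done with multi-group CFA software; the subsequent group-comparison step the abstract describes can be sketched as below on simulated factor scores (the group labels, sample sizes, and effect size are assumptions, not the study's data):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Simulated factor scores (e.g. the "budgeting" factor) for two groups.
scores_group_a = rng.normal(loc=0.00, scale=1.0, size=120)  # e.g. without migration background
scores_group_b = rng.normal(loc=0.05, scale=1.0, size=110)  # e.g. with migration background

# Welch's t-test, which does not assume equal group variances.
t, p = stats.ttest_ind(scores_group_a, scores_group_b, equal_var=False)
print(f"t = {t:.2f}, p = {p:.3f}")

# Because invariance was established first, a non-significant difference can
# be read as a genuine absence of group deficits rather than as test bias.
```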


Methodology ◽  
2020 ◽  
Vol 16 (3) ◽  
pp. 241-257
Author(s):  
Bruce W. Austin ◽  
Brian F. French

Methods to assess measurement invariance in constructs have received much attention, as invariance is critical for accurate group comparisons. Less attention has been given to the identification and correction of the sources of non-invariance in predictive equations. This work developed correction factors for structural intercept and slope bias in common regression equations to address calls in the literature to revive test bias research. We demonstrated the correction factors in regression analyses within the context of a large international dataset containing 68 countries and regions (groups). A mathematics achievement score was predicted by a math self-efficacy score, which exhibited a lack of invariance across groups. The proposed correction factors significantly corrected structural intercept and slope bias across groups. The impact of the correction factors was greatest for groups with the largest amount of bias. Implications for both practice and methodological extensions are discussed.
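The paper's correction factors are derived analytically; the sketch below only illustrates the quantities they target, namely per-group deviations of regression intercept and slope from the pooled prediction equation, on simulated data (variable names and effect sizes are invented):

```python
import numpy as np

rng = np.random.default_rng(1)

def fit_line(x, y):
    # Ordinary least squares for y = intercept + slope * x.
    slope, intercept = np.polyfit(x, y, 1)
    return intercept, slope

# Two groups whose prediction equations differ, i.e. a lack of invariance:
# math self-efficacy (x) predicts achievement (y) with group-specific parameters.
x1 = rng.normal(size=200)
y1 = 500 + 30 * x1 + rng.normal(scale=40, size=200)
x2 = rng.normal(size=200)
y2 = 470 + 20 * x2 + rng.normal(scale=40, size=200)

a_pool, b_pool = fit_line(np.concatenate([x1, x2]), np.concatenate([y1, y2]))
for label, (x, y) in {"group 1": (x1, y1), "group 2": (x2, y2)}.items():
    a_g, b_g = fit_line(x, y)
    # A correction factor would offset this group's predictions by the
    # intercept and slope deviations from the pooled equation.
    print(f"{label}: intercept bias = {a_g - a_pool:+.1f}, "
          f"slope bias = {b_g - b_pool:+.1f}")
```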


2020 ◽  
Vol 10 (1) ◽  
pp. 1-13
Author(s):  
Amir Mahshanian ◽  
Mohammadtaghi Shahnazari

Given the importance of testing in general, and of scoring writing tasks in particular, the negative effect of fatigue on human raters is important to investigate. This study aimed to (1) explore the relationship between fatigue and the scoring of composition tasks written by upper-intermediate EFL learners and (2) investigate the discrepancy in the frequency of comments among EFL raters while scoring composition tasks. Four raters were selected, and each was given 28 composition tasks to score and comment on. The data were analyzed in SPSS using ANOVA, Pearson correlation coefficients, and post-hoc tests. Results suggested that the scores assigned to the first 16 tasks were significantly lower than those assigned to the last 12 tasks, and that the last four tasks were scored highest. Based on the results obtained from the questionnaire, this observed variation is argued to be rooted in rater fatigue and to result in test bias. Furthermore, the findings indicated that the frequency of comments given by the raters on the first 12 essays was significantly higher than on the last 16 essays (the highest and lowest frequencies of comments were observed in the first four and the last four scored essays, respectively).
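The study ran its analyses in SPSS; an equivalent sketch in Python, on simulated scores given a mild upward drift over scoring order to mimic the reported fatigue pattern, might look like this (all numbers are invented):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)

order = np.arange(1, 29)                          # 28 tasks scored in sequence
scores = 14 + 0.08 * order + rng.normal(scale=1.0, size=28)

# One-way ANOVA over scoring-position bins (first 16 / next 8 / last 4).
f, p_anova = stats.f_oneway(scores[:16], scores[16:24], scores[24:])

# Pearson correlation between scoring order and assigned score.
r, p_corr = stats.pearsonr(order, scores)

print(f"ANOVA: F = {f:.2f}, p = {p_anova:.3f}")
print(f"order-score correlation: r = {r:.2f}, p = {p_corr:.3f}")
# A positive order-score correlation (later essays scored higher) would be
# consistent with rater fatigue introducing test bias.
```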

