Test Validation Without Measurement

2016 ◽  
Vol 32 (3) ◽  
pp. 204-214 ◽  
Author(s):  
Emilie Lacot ◽  
Mohammad H. Afzali ◽  
Stéphane Vautier

Abstract. Test validation based on usual statistical analyses is paradoxical, as, from a falsificationist perspective, they do not test that test data are ordinal measurements, and, from the ethical perspective, they do not justify the use of test scores. This paper (i) proposes some basic definitions, where measurement is a special case of scientific explanation; starting from the examples of memory accuracy and suicidality as scored by two widely used clinical tests/questionnaires. Moreover, it shows (ii) how to elicit the logic of the observable test events underlying the test scores, and (iii) how the measurability of the target theoretical quantities – memory accuracy and suicidality – can and should be tested at the respondent scale as opposed to the scale of aggregates of respondents. (iv) Criterion-related validity is revisited to stress that invoking the explanative power of test data should draw attention on counterexamples instead of statistical summarization. (v) Finally, it is argued that the justification of the use of test scores in specific settings should be part of the test validation task, because, as tests specialists, psychologists are responsible for proposing their tests for social uses.

2021 ◽  
Author(s):  
Mohamad Mustaqim Mokhlis ◽  
Nurdini Alya Hazali ◽  
Muhammad Firdaus Hassan ◽  
Mohd Hafiz Hashim ◽  
Afzan Nizam Jamaludin ◽  
...  

Abstract In this paper we will present a process streamlined for well-test validation that involves data integration between different database systems, incorporated with well models, and how the process can leverage real-time data to present a full scope of well-test analysis to enhance the capability for assessing well-test performance. The workflow process demonstrates an intuitive and effective way for analyzing and validating a production well test via an interactive digital visualization. This approach has elevated the quality and integrity of the well-test data, as well as improved the process cycle efficiency that complements the field surveillance engineers to keep track of well-test compliance guidelines through efficient well-test tracking in the digital interface. The workflow process involves five primary steps, which all are conducted via a digital platform: Well Test Compliance: Planning and executing the well test Data management and integration Well Test Analysis and Validation: Verification of the well test through historical trending, stability period checks, and well model analysis Model validation: Correcting the well test and calibrating the well model before finalizing the validity of the well test Well Test Re-testing: Submitting the rejected well test for retesting and final step Integrating with corporate database system for production allocation This business process brings improvement to the quality of the well test, which subsequently lifts the petroleum engineers’ confidence level to analyze well performance and deliver accurate well-production forecasting. A well-test validation workflow in a digital ecosystem helps to streamline the flow of data and system integration, as well as the way engineers assess and validate well-test data, which results in minimizing errors and increases overall work efficiency.


1968 ◽  
Vol 11 (3) ◽  
pp. 475-488 ◽  
Author(s):  
Norman R. Draper ◽  
Irwin Guttman

In a recent paper Box and Cox (1964) considered the problem of transforming dependent variables in regression and analysis of variance problems, in order to achieve the usual assumptions of Normality, constant variance and additivity of effects. Here we adopt the same approach to investigate transformations of data which allow the transformed observations to follow a Gamma distribution. A special case of this is the exponential distribution, valuable in life-testing, for which examples are given.


1989 ◽  
Vol 69 (1) ◽  
pp. 64-66 ◽  
Author(s):  
Marilyn Kline ◽  
Gary L. Sapp

When mean Carolina Picture Vocabulary Test standard scores were compared with WISC—R Performance Scale IQs significant differences were observed. The Carolina test scores were also correlated with both Performance IQs and the Performance Scale subtest scores. Most correlations were minimal as the two tests were independent; only WISC—R Picture Arrangement scores correlated significantly with Carolina scores. These outcomes coupled with the technical limitations of the Carolina raise serious questions regarding its utility for hearing-impaired children.


1982 ◽  
Vol 54 (3_suppl) ◽  
pp. 1339-1342 ◽  
Author(s):  
Otto Haller ◽  
Eugene S. Edgington

Rod-and-frame test data of 100 undergraduates were analyzed by a recently developed pattern analysis and several interpretations of their test scores were considered. Pattern analysis showed (a) that most subjects (about 65%) tilt toward the spatial position of the frame and (b) that some (about 30%) utilize two frame cues, i.e., the nearest to vertical side and corner of the frame. Tilting to the starting position of the rod and tilting away from the frame or the starting position of the rod were not viable interpretations.


2020 ◽  
Vol 23 (2) ◽  
pp. 1-19
Author(s):  
Michelle Chen ◽  
Jennifer J. Flasko

Seeking evidence to support content validity is essential to test validation. This is especially the case in contexts where test scores are interpreted in relation to external proficiency standards and where new test content is constantly being produced to meet test administration and security demands. In this paper, we describe a modified scale- anchoring approach to assessing the alignment between the Canadian English Language Proficiency Index Program (CELPIP) test and the Canadian Language Benchmarks (CLB), the proficiency framework to which the test scores are linked. We discuss how proficiency frameworks such as the CLB can be used to support the content validation of large-scale standardized tests through an evaluation of the alignment between the test content and the performance standards. By sharing both the positive implications and challenges of working with the CLB in high-stakes language test validation, we hope to help raise the profile of this national language framework among scholars and practitioners.


1970 ◽  
Vol 33 (9) ◽  
pp. 380-382
Author(s):  
C. Jensen ◽  
L. D. Beck ◽  
Donald G. Rusness

A curvillinear regression resulted when the Feulgen-DNA test scores were plotted against the average concentration of leucocytes. Application of a logarithmic transformation to the average test data resulted in a linear correlation which slightly raised the correlation between the Feulgen-DNA Test and the leucocyte count. But, the application of a sample per cow transformation to the average test data from each of 30 representative cows, resulted in an approximate linear relationship and in significant correlations between the Feulgen, California, and Whiteside tests and the leucocyte count. Correlations in descending order were Feulgen> California> Modified Whiteside Test with respect to the leucocyte count.


1987 ◽  
Vol 12 (1) ◽  
pp. 62-75 ◽  
Author(s):  
Wim J. van der Linden

The classification problem consists of assigning subjects to one of several available treatments on the basis of their test scores, where the success of each treatment is measured by a different criterion. It is indicated how this problem can be formulated as an (empirical) Bayes decision problem. As an example, the case of classification with a threshold utility function is analyzed, and optimal assignment rules are derived. The results are illustrated empirically with data from a classification problem in which achievement test data are used to assign students to appropriate continuation schools.


2018 ◽  
Vol 41 ◽  
Author(s):  
Daniel Crimston ◽  
Matthew J. Hornsey

AbstractAs a general theory of extreme self-sacrifice, Whitehouse's article misses one relevant dimension: people's willingness to fight and die in support of entities not bound by biological markers or ancestral kinship (allyship). We discuss research on moral expansiveness, which highlights individuals’ capacity to self-sacrifice for targets that lie outside traditional in-group markers, including racial out-groups, animals, and the natural environment.


Author(s):  
Dr. G. Kaemof

A mixture of polycarbonate (PC) and styrene-acrylonitrile-copolymer (SAN) represents a very good example for the efficiency of electron microscopic investigations concerning the determination of optimum production procedures for high grade product properties.The following parameters have been varied:components of charge (PC : SAN 50 : 50, 60 : 40, 70 : 30), kind of compounding machine (single screw extruder, twin screw extruder, discontinuous kneader), mass-temperature (lowest and highest possible temperature).The transmission electron microscopic investigations (TEM) were carried out on ultra thin sections, the PC-phase of which was selectively etched by triethylamine.The phase transition (matrix to disperse phase) does not occur - as might be expected - at a PC to SAN ratio of 50 : 50, but at a ratio of 65 : 35. Our results show that the matrix is preferably formed by the components with the lower melting viscosity (in this special case SAN), even at concentrations of less than 50 %.


Sign in / Sign up

Export Citation Format

Share Document