Test Validation Without Measurement

Abstract. Test validation based on usual statistical analyses is paradoxical, as, from a falsificationist perspective, they do not test that test data are ordinal measurements, and, from the ethical perspective, they do not justify the use of test scores. This paper (i) proposes some basic definitions, where measurement is a special case of scientific explanation; starting from the examples of memory accuracy and suicidality as scored by two widely used clinical tests/questionnaires. Moreover, it shows (ii) how to elicit the logic of the observable test events underlying the test scores, and (iii) how the measurability of the target theoretical quantities – memory accuracy and suicidality – can and should be tested at the respondent scale as opposed to the scale of aggregates of respondents. (iv) Criterion-related validity is revisited to stress that invoking the explanative power of test data should draw attention on counterexamples instead of statistical summarization. (v) Finally, it is argued that the justification of the use of test scores in specific settings should be part of the test validation task, because, as tests specialists, psychologists are responsible for proposing their tests for social uses.

Download Full-text

Connecting Data, Model and Process to Streamline and Enhance Single-Point Production Well Test Validation

10.2118/207270-ms ◽

2021 ◽

Author(s):

Mohamad Mustaqim Mokhlis ◽

Nurdini Alya Hazali ◽

Muhammad Firdaus Hassan ◽

Mohd Hafiz Hashim ◽

Afzan Nizam Jamaludin ◽

...

Keyword(s):

Test Data ◽

Single Point ◽

Database Systems ◽

Test Validation ◽

Well Test ◽

Production Well ◽

Well Test Analysis ◽

Test Analysis ◽

Digital Ecosystem ◽

Well Model

Abstract In this paper we will present a process streamlined for well-test validation that involves data integration between different database systems, incorporated with well models, and how the process can leverage real-time data to present a full scope of well-test analysis to enhance the capability for assessing well-test performance. The workflow process demonstrates an intuitive and effective way for analyzing and validating a production well test via an interactive digital visualization. This approach has elevated the quality and integrity of the well-test data, as well as improved the process cycle efficiency that complements the field surveillance engineers to keep track of well-test compliance guidelines through efficient well-test tracking in the digital interface. The workflow process involves five primary steps, which all are conducted via a digital platform: Well Test Compliance: Planning and executing the well test Data management and integration Well Test Analysis and Validation: Verification of the well test through historical trending, stability period checks, and well model analysis Model validation: Correcting the well test and calibrating the well model before finalizing the validity of the well test Well Test Re-testing: Submitting the rejected well test for retesting and final step Integrating with corporate database system for production allocation This business process brings improvement to the quality of the well test, which subsequently lifts the petroleum engineers’ confidence level to analyze well performance and deliver accurate well-production forecasting. A well-test validation workflow in a digital ecosystem helps to streamline the flow of data and system integration, as well as the way engineers assess and validate well-test data, which results in minimizing errors and increases overall work efficiency.

Download Full-text

Transformation of Life-Test Data

Canadian Mathematical Bulletin ◽

10.4153/cmb-1968-057-3 ◽

1968 ◽

Vol 11 (3) ◽

pp. 475-488 ◽

Cited By ~ 2

Author(s):

Norman R. Draper ◽

Irwin Guttman

Keyword(s):

Exponential Distribution ◽

Analysis Of Variance ◽

Gamma Distribution ◽

Test Data ◽

Life Testing ◽

Life Test ◽

Constant Variance ◽

Dependent Variables ◽

Special Case

In a recent paper Box and Cox (1964) considered the problem of transforming dependent variables in regression and analysis of variance problems, in order to achieve the usual assumptions of Normality, constant variance and additivity of effects. Here we adopt the same approach to investigate transformations of data which allow the transformed observations to follow a Gamma distribution. A special case of this is the exponential distribution, valuable in life-testing, for which examples are given.

Download Full-text

Carolina Picture Vocabulary Test: Validation with Hearing-Impaired Students

Perceptual and Motor Skills ◽

10.2466/pms.1989.69.1.64 ◽

1989 ◽

Vol 69 (1) ◽

pp. 64-66 ◽

Cited By ~ 1

Author(s):

Marilyn Kline ◽

Gary L. Sapp

Keyword(s):

Test Scores ◽

Hearing Impaired ◽

Test Validation ◽

Vocabulary Test ◽

Picture Arrangement ◽

Impaired Children ◽

Performance Scale ◽

Hearing Impaired Children ◽

Hearing Impaired Students

When mean Carolina Picture Vocabulary Test standard scores were compared with WISC—R Performance Scale IQs significant differences were observed. The Carolina test scores were also correlated with both Performance IQs and the Performance Scale subtest scores. Most correlations were minimal as the two tests were independent; only WISC—R Picture Arrangement scores correlated significantly with Carolina scores. These outcomes coupled with the technical limitations of the Carolina raise serious questions regarding its utility for hearing-impaired children.

Download Full-text

Relationship of National Merit Scholarship screening test scores to test data obtained earlier in high school.

Journal of Applied Psychology ◽

10.1037/h0048434 ◽

1959 ◽

Vol 43 (1) ◽

pp. 32-34 ◽

Cited By ~ 1

Author(s):

Edward O. Swanson ◽

Wilbur L. Layton

Keyword(s):

High School ◽

Test Data ◽

Test Scores ◽

Screening Test ◽

Merit Scholarship ◽

Relationship Of

Download Full-text

Interpretations of Rod-and-Frame Test Scores: An Application of Pattern Analysis

Perceptual and Motor Skills ◽

10.2466/pms.1982.54.3c.1339 ◽

1982 ◽

Vol 54 (3_suppl) ◽

pp. 1339-1342 ◽

Cited By ~ 5

Author(s):

Otto Haller ◽

Eugene S. Edgington

Keyword(s):

Test Data ◽

Test Scores ◽

Pattern Analysis ◽

Spatial Position ◽

Vertical Side ◽

Starting Position ◽

Rod And Frame Test ◽

Rod And Frame

Rod-and-frame test data of 100 undergraduates were analyzed by a recently developed pattern analysis and several interpretations of their test scores were considered. Pattern analysis showed (a) that most subjects (about 65%) tilt toward the spatial position of the frame and (b) that some (about 30%) utilize two frame cues, i.e., the nearest to vertical side and corner of the frame. Tilting to the starting position of the rod and tilting away from the frame or the starting position of the rod were not viable interpretations.

Download Full-text

Investigating the Alignment Between the CELPIP-General Reading Test and the Canadian Language Benchmarks: A Content Validation Study

Canadian Journal of Applied Linguistics ◽

10.37213/cjal.2020.30649 ◽

2020 ◽

Vol 23 (2) ◽

pp. 1-19

Author(s):

Michelle Chen ◽

Jennifer J. Flasko

Keyword(s):

Language Proficiency ◽

Test Scores ◽

Large Scale ◽

English Language ◽

Performance Standards ◽

National Language ◽

Test Validation ◽

Content Validation ◽

High Stakes ◽

Scale Anchoring

Seeking evidence to support content validity is essential to test validation. This is especially the case in contexts where test scores are interpreted in relation to external proficiency standards and where new test content is constantly being produced to meet test administration and security demands. In this paper, we describe a modified scale- anchoring approach to assessing the alignment between the Canadian English Language Proficiency Index Program (CELPIP) test and the Canadian Language Benchmarks (CLB), the proficiency framework to which the test scores are linked. We discuss how proficiency frameworks such as the CLB can be used to support the content validation of large-scale standardized tests through an evaluation of the alignment between the test content and the performance standards. By sharing both the positive implications and challenges of working with the CLB in high-stakes language test validation, we hope to help raise the profile of this national language framework among scholars and practitioners.

Download Full-text

EVALUATION OF CERTAIN MASTITIS DETECTION TESTS

Journal of Milk and Food Technology ◽

10.4315/0022-2747-33.12.380 ◽

1970 ◽

Vol 33 (9) ◽

pp. 380-382

Author(s):

C. Jensen ◽

L. D. Beck ◽

Donald G. Rusness

Keyword(s):

Linear Relationship ◽

Test Data ◽

Linear Correlation ◽

Test Scores ◽

Leucocyte Count ◽

Average Concentration ◽

Logarithmic Transformation ◽

Dna Test

A curvillinear regression resulted when the Feulgen-DNA test scores were plotted against the average concentration of leucocytes. Application of a logarithmic transformation to the average test data resulted in a linear correlation which slightly raised the correlation between the Feulgen-DNA Test and the leucocyte count. But, the application of a sample per cow transformation to the average test data from each of 30 representative cows, resulted in an approximate linear relationship and in significant correlations between the Feulgen, California, and Whiteside tests and the leucocyte count. Correlations in descending order were Feulgen> California> Modified Whiteside Test with respect to the leucocyte count.

Download Full-text

The Use of Test Scores for Classification Decisions with Threshold Utility

Journal of Educational Statistics ◽

10.3102/10769986012001062 ◽

1987 ◽

Vol 12 (1) ◽

pp. 62-75 ◽

Cited By ~ 3

Author(s):

Wim J. van der Linden

Keyword(s):

Utility Function ◽

Test Data ◽

Test Scores ◽

Decision Problem ◽

Empirical Bayes ◽

Classification Problem ◽

Achievement Test ◽

Optimal Assignment ◽

Threshold Utility ◽

Continuation Schools

The classification problem consists of assigning subjects to one of several available treatments on the basis of their test scores, where the success of each treatment is measured by a different criterion. It is indicated how this problem can be formulated as an (empirical) Bayes decision problem. As an example, the case of classification with a threshold utility function is analyzed, and optimal assignment rules are derived. The results are illustrated empirically with data from a classification problem in which achievement test data are used to assign students to appropriate continuation schools.

Download Full-text

Extreme self-sacrifice beyond fusion: Moral expansiveness and the special case of allyship

Behavioral and Brain Sciences ◽

10.1017/s0140525x18001620 ◽

2018 ◽

Vol 41 ◽

Cited By ~ 2

Author(s):

Daniel Crimston ◽

Matthew J. Hornsey

Keyword(s):

General Theory ◽

Biological Markers ◽

Natural Environment ◽

Relevant Dimension ◽

Special Case

AbstractAs a general theory of extreme self-sacrifice, Whitehouse's article misses one relevant dimension: people's willingness to fight and die in support of entities not bound by biological markers or ancestral kinship (allyship). We discuss research on moral expansiveness, which highlights individuals’ capacity to self-sacrifice for targets that lie outside traditional in-group markers, including racial out-groups, animals, and the natural environment.

Download Full-text

Determination of the phase structure of polymerblends by electron microscopy

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100109604 ◽

1978 ◽

Vol 36 (1) ◽

pp. 492-493

Author(s):

Dr. G. Kaemof

Keyword(s):

Screw Extruder ◽

Electron Microscopic ◽

Thin Sections ◽

Single Screw Extruder ◽

Transmission Electron ◽

The Matrix ◽

Twin Screw ◽

Special Case ◽

Microscopic Investigations

A mixture of polycarbonate (PC) and styrene-acrylonitrile-copolymer (SAN) represents a very good example for the efficiency of electron microscopic investigations concerning the determination of optimum production procedures for high grade product properties.The following parameters have been varied:components of charge (PC : SAN 50 : 50, 60 : 40, 70 : 30), kind of compounding machine (single screw extruder, twin screw extruder, discontinuous kneader), mass-temperature (lowest and highest possible temperature).The transmission electron microscopic investigations (TEM) were carried out on ultra thin sections, the PC-phase of which was selectively etched by triethylamine.The phase transition (matrix to disperse phase) does not occur - as might be expected - at a PC to SAN ratio of 50 : 50, but at a ratio of 65 : 35. Our results show that the matrix is preferably formed by the components with the lower melting viscosity (in this special case SAN), even at concentrations of less than 50 %.

Download Full-text