Performance Evaluation Tests for Environmental Research (PETER): Landolt C Reading Test

1980 ◽  
Vol 24 (1) ◽  
pp. 335-339 ◽  
Author(s):  
John C. Guignard ◽  
Alvah C. Bittner ◽  
Steven W. Einbender ◽  
Robert S. Kennedy

A Landolt C visual acuity test was evaluated for inclusion in a battery of Performance Evaluation Tests for Environmental Research (PETER) for use in repeated measures experiments. Eight subjects were tested for 12 days to determine stability of means, standard deviations and cross-day correlations. Mixed results for these cross-day correlation analyses made questionable the stability of speed and error measures. Fine-structure analyses indicated that the task was more a measure of response accuracy than acuity. Fine-structure analyses in future performance test evaluation and exploration of alternative visual acuity tests are recommended. It is concluded that the Landolt C test in the form used is not stable and cannot be recommended as a test of visual acuity for inclusion in PETER.

1980 ◽  
Vol 24 (1) ◽  
pp. 344-348 ◽  
Author(s):  
Robert S. Kennedy ◽  
Robert C. Carter ◽  
Alvah C. Bittner

Performance Evaluation Tests for Environmental Research (PETER) are under development at the Naval Biodynamics Laboratory and supporting organizations. The tests, or tasks, studied in this program have been largely derived from the literature. Each task was evaluated for suitability for repeated measures experimental designs which are almost universally used in environmental research. Suitability criteria included the “stability” of task means, standard deviations, and between-trial correlations. The magnitude of the “stabilized” between-trial correlations, task definition, was also examined with respect to the administration time. There are 60 active tasks in the present program. All tasks examined to date exhibit stable means and variances after adequate practice but: (a) less than 30% meet minimal stability criteria for intertrial correlations; and (b) substantial practice (typically more than an hour over five days) is required to achieve stability. A tabular catalogue of the research findings and background for 15 tasks is presented and discussed.


1986 ◽  
Vol 63 (2) ◽  
pp. 683-708 ◽  
Author(s):  
Alvah C. Bittner ◽  
Robert C. Carter ◽  
Robert S. Kennedy ◽  
Mary M. Harbeson ◽  
Michele Krause

The goal of the Performance Evaluation Tests for Environmental Research (PETER) Program was to identify a set of measures of human capabilities for use in the study of environmental and other time-course effects. 114 measures studied in the PETER Program were evaluated and categorized into four groups based upon task stability and task definition. The Recommended category contained 30 measures that clearly obtained total stabilization and had an acceptable level of reliability efficiency. The Acceptable-But-Redundant category contained 15 measures. The 37 measures in the Marginal category, which included an inordinate number of slope and other derived measures, usually had desirable features which were outweighed by faults. The 32 measures in the Unacceptable category had either differential instability or weak reliability efficiency. It is our opinion that the 30 measures in the Recommended category should be given first consideration for environmental research applications. Further, it is recommended that information pertaining to preexperimental practice requirements and stabilized reliabilities should be utilized in repeated-measures environmental studies.


1980 ◽  
Vol 24 (1) ◽  
pp. 320-324 ◽  
Author(s):  
Robert C. Carter ◽  
Robert S. Kennedy ◽  
Alvah C. Bittner

A battery of Performance Evaluation Tests for Environmental Research (PETER) that is suitable for use in repeated measures experiments is being developed at the Naval Biodynamics Laboratory. This paper describes the sources of tasks which have been considered for inclusion in PETER. It also lists the tests in the source batteries which have or have not yet been considered for inclusion in PETER. The performance content of the tests that have been considered is compared with the content of those that have not. Recommendations are made for selection of additional tests from the source batteries which will not be redundant with tests that already have been considered. This report puts PETER into the context of the tests and test batteries which came before it.


1983 ◽  
Vol 57 (1) ◽  
pp. 283-293 ◽  
Author(s):  
M. M. Harbeson ◽  
A. C. Bittner ◽  
R. S. Kennedy ◽  
R. C. Carter ◽  
M. Krause

Listed are 90 reports of the Performance Evaluation Tests for Environmental Research (PETER) Program. Conducted from 1977 to 1982, the programs' purpose was to develop a test battery for use in repeated measures investigations of environmental effects on human performance, e.g., vehicle motion, toxic substances, aging, etc. The battery also has applications to training, selection, and research on equipment design. The PETER Program concentrated on selecting tests which remained stable with repeated measurements, as environmental research usually involves testing before, during, and after exposure. Stability of the means, variances, and intertrial correlations ensures that simple analyses may be applied with minimal complications and without difficulties of attribution of effect. Over 80 measures were evaluated, 30% were found suitable for repeated measures applications, 20% were acceptable for limited use, and 50% could not be recommended. The unsuitability of many tasks brings into question the validity of portions of the literature on environmental effects. The reports describe program rationale, development of statistical methodology, and stable tasks. PETER reports are available from published sources, authors, or the Naval Biodynamics Laboratory.


1988 ◽  
Vol 32 (14) ◽  
pp. 838-842
Author(s):  
Kennedy ◽  
D. R. Baltzley ◽  
M. K. Osteen ◽  
J. J. Turnage

The Automated Performance Test System (APTS), a microcomputer-based performance test battery, has been under development for over five years. The emphasis has been on psychometric theory and utility for repeated-measures applications during extended exposure to various environmental stressors. Stability of means and variances and retest reliability have been the criteria for suitability of a test. In addition, differential stability (i.e., parallelism of individual differences over sessions) is a unique requirement for test acceptability in this battery. In the present menu of “qualified” mental tests, there are presently more than 30 fully up-and-running on portable Zenith and NEC microcomputers as well as floppy disks for IBM compatible systems. Qualified tests stabilize in less than 10 minutes and possess test-retest reliabilities of r 0.70 for a three-minute test/work period. The battery includes tests of cognition, information processing, psychomotor skill, memory, mood, and others. More than a dozen normative laboratory and field studies have been conducted to validate the stability and reliability requirements of the tests in the battery and more than a dozen “sensitivity” studies have been or are in the process of being completed. To date, one or more tests have been shown sensitive to chemoradiotherapy, sleep loss, hypoxia, amphetamine, hyoscine, mood disorders, thermal stress, sensory deprivation, motion stress, altitude, fatigue, and alcohol use. The present paper describes our experiences with these tests and reports on a recommended menu for a short battery (6 minutes), a middle length battery (12 minutes), and a longer battery (22 minutes). Other test batteries which are presently under development do not pay as much attention to individual differences. The consequence of this inattention is low sensitivity if test-retest reliability is poor and inability to properly attribute effects if instability occurs.


1980 ◽  
Vol 51 (2) ◽  
pp. 655-665 ◽  
Author(s):  
Michael E. Mc Cauley ◽  
Robert S. Kennedy ◽  
Alvah C. Bittner

A time-estimation task was considered for inclusion in the Performance Evaluation Tests for Environmental Research (PETER) battery. As part of this consideration, the effects of repeated testing on the reliability of time judgments were studied. The method of production was used to estimate eight time intervals. Five trials per day at each interval were administered individually to each of 19 subjects for 15 consecutive workdays. Two scores, constant error and variable error, were reported. The effect of days was not significant for constant error and was moderate for variable error ( p < .04). The standard deviations were relatively stable across trials. A pronounced decline in reliability over repeated days of testing was found for both errors. It was concluded that this time-estimation test would be a poor candidate for inclusion in PETER, but further research is warranted because of the potential unique contribution of a time-estimation task in a performance test battery.


1984 ◽  
Vol 28 (1) ◽  
pp. 11-15 ◽  
Author(s):  
A. C. Bittner ◽  
R. C. Carter ◽  
R. S. Kennedy ◽  
M. M. Harbeson ◽  
M. Krause

The goal of the Performance Evaluation Tests for Environmental Research (PETER) Program was to identify a set of measures of human cognitive, perceptual, and motor capabilities for use in the study of environmental and other time-course effects. Tasks were evaluated as suitable for repeated measures applications when their intertrial means, variances and correlations were well-behaved under constant baseline conditions. This report provides an evaluation of 112 test measures studied in the program. They are categorized into four groups based upon joint consideration of task stability and task definition. Thirty test measures were categorized as Good, 15 as Good-But-Redundant, 35 as Ugly (flawed), and 32 as Bad.


1980 ◽  
Vol 24 (1) ◽  
pp. 330-334 ◽  
Author(s):  
Denise B. McCafferty ◽  
Alvah C. Bittner ◽  
Robert C. Carter

Auditory digit span was evaluated as an instrument for repeated measurements experimentation. Twelve subjects were tested for one hour on each of 12 consecutive workdays in a standard environment. Both forward and backward digit span were measured. It was found that forward digit span was suitable for repeated measures after ten days of practice at 30 minutes per day. The criteria for suitability were predictability of the mean scores, constancy of the standard deviations and differential stability of the intertrial correlations. These criteria are sufficient conditions both for repeated measures Analysis of Variance, and for interpretation of experimental effects. Although the backward digit span scores did not meet these criteria, they became more and more correlated with the forward digit span scores as the experiment progressed. This indicates that the mental content of the two tests of memory converged with practice. One implication of this finding is to question the meaningfulness of factor structure after only limited practice. The forward auditory digit span test was recommended for inclusion in a battery of Performance Evaluation Tests for Environmental Research (PETER).


2019 ◽  
Vol 14 (4) ◽  
pp. 515-519
Author(s):  
Chen Ai Hong ◽  
Nurulain Muhamad

Our study aimed to explore the effect of positive and negative polarities on visual acuity measurements by utilizing black and white as a text against background with three distinct colours. Visual acuity was recorded as logarithm of minimum angle of resolution (LogMAR) using the detection of the gap in a four-position Landolt-C. The 2x3 (polarity x background color) two way repeated measures ANOVA showed a statistically significant interaction between polarity and colour background on visual resolution [F (2, 16) = 23.704, p < 0.001, η2 = 0.744]. Among the three primary background colour, shorter-wavelength (blue background) showed statistically significant findings between both positive and negative polarity [F (1, 9) = 39.875, p < 0.001, η2 = 0.818].  Visual acuity measurements improved with negative polarity but decreased with positive polarity with blue colour background. However, visual acuity was not statistically significantly different with the green (medium-wavelength) [F (1, 11) = 0.625, p = 0.446, η2 = 0.053] and the red (long-wavelength) backgrounds [F (1, 9) = 4.021, p = 0.856, η2 = 0.000]. In conclusion, black text against shorter-wavelength (blue) background apparently more difficult to be resolved by human eyes compared with white text. These findings suggest colour element might be an advantage for negative polarity colour combinations.


Sign in / Sign up

Export Citation Format

Share Document