scholarly journals Investigating the validity of subjective workload rating (NASA TLX) and subjective situation awareness rating (SART) for cognitively complex human–machine work

2021 ◽  
Vol 86 ◽  
pp. 103233
Author(s):  
Per Øivind Braarud
2021 ◽  
Vol 36 (6) ◽  
pp. 1139-1139
Author(s):  
Kristina E Smith ◽  
Daniel W Lopez-Hernandez ◽  
Alexis Bueno ◽  
Rachel A Rugh-Fraser ◽  
Bethany A Nordberg ◽  
...  

Abstract Objective We examined perceived workload as it is related to Brief Visual Memory Test-Revised (BVMT-R) short-delay and long-delay performance in traumatic brain injury (TBI) and healthy comparison (HC) participants. Method The sample consisted of 39 TBI participants and 54 HC participants. Demographically corrected BVMT-R scores were used to evaluate short-delay and long-delay performances. The perceived workload was measured using the NASA-TLX. Results ANOVA revealed that the HC group outperformed the TBI group on the BVMT-R short-delay and long-delay score, p < 05, η p 2 = 0.05. ANCOVAs controlling for age were used to evaluate NASA-TLX group differences. In regards to the NASA-TLX, TBI participants reported higher levels of physical demand, effort, frustration and overall subjective workload on the BVMT-R short-delay compared to HC participants, p < 05, η p 2 = 0.01–0.09. Furthermore, on the long-delay of the BVMT-R, the NASA-TLX revealed that the TBI group reported higher levels of temporal demand, effort, frustration and overall subjective workload compared to the HC group, p < 0.05, η p 2 = 0.05–0.14. Conclusions Results revealed that TBI participants demonstrated worse BVMT-R performances than HC participants. However, TBI survivors reported higher perceived workload demands compared to the HC group in both short-delay and long-delay of the BVMT-R. Our findings suggest that TBI impacts non-verbal memory performance in both BVMT-R short-delay and long-delay. Also, brain injury may be impacting TBI survivors’ awareness of their non-verbal memory performance. Further work is required to determine what drives the impaired perception of non-verbal memory performance among TBI survivors.


Author(s):  
Brittany E. Holthausen ◽  
Zoe M. Becerra ◽  
M. David Keller ◽  
John M. Ziriax ◽  
Bruce N. Walker

The relationship between workload and situation awareness (SA) has been researched generally; however, the relationship between measures of SA and workload, SPAM and NASA TLX respectively, has not been established. This study was completed using a tactical military scenario in which two participants acted as they would standing watch on a US Navy ship. One participant, the Tactical Action Officer (TAO) had reduced speech intelligibility in some trials. The results showed that the relationship between workload and SA differed by role (TAO or CICWO) and by the speech intelligibility level. This work shows the complex relationship between workload and SA and shows the need further investigation in this area, especially in teams.


1988 ◽  
Vol 32 (2) ◽  
pp. 162-163 ◽  
Author(s):  
Mark Nataupsky ◽  
Alan T. Pope ◽  
Mary L. McManus ◽  
Daniel X. Burdette

With the advent of digital avionics and cathode ray tube (CRT) displays, the use of computer-generated flight displays has become increasingly prevalent in both commercial and military cockpits. These flight displays, however, typically are renditions of the electromechanical displays that are being replaced. But there is great potential for display enhancement and integration through formats such as pictorial, real-world formats that could enhance situation awareness and reduce crew workload. This study had as its objective the exploration of workload measures in conjunction with a larger study of a computer-generated, integrated pictorial primary flight display presented in stereo and non-stereo modes. Of interest in the workload aspect of the study was the ability to differentiate workload attendant with use of several versions of the pictorial display having alternate implementations of 3-D symbology cues, presented in stereo and non-stereo modes. Both physiological and subjective measures were used with a pilot-in-the-loop flight simulation task. The physiological measure was visually-evoked brain wave potentials; the subjectve measure was the Subjective Workload Assessment Technique (SWAT) (Reid, Shingledecker, and Eggemeier, 1981a and Reid, Shingledecker, Clark, and Eggemeier, 1981b). A number of physiological studies have demonstrated the relationship of the P300 component of event-related brain potentials to controlled stimuli (e.g., Zacharias, 1988). Further physiological studies by Kramer, Donchin, and Wickens (In Press) have shown that the amplitude of the P300 can be related to levels of pilot workload. The SWAT is a validated measure of subjective workload. Nataupsky and Abbott (1987) used SWAT to differentiate levels of a pilot's perception of workload in a piloted simulator flight task. The workload portion of the study was accomplished in the following manner. Pilots were initialized on the nominal flight path (as defined within the pictorial display). After approximately 2 seconds, they were suddenly offset to one of eight positions. The sudden display shift was the stimulus to which pilots had been asked to respond with control inputs and was the trigger for the visually-evoked response. Thus, contrary to previous studies conducted at Langley which used secondary tasks to elicit evoked potentials, this study had the primary task as the trigger for the evoked response. The pilots' task was to make the initial pitch and/or roll input necessary to correct back to the nominal flight path. Data was collected on 216 trials for each of eight Air Force pilots. Analysis of the workload data is in progress.


Author(s):  
Logan T. Hale ◽  
Preston A. Long

Subjective workload assessments are used often in human factors, almost always from the perspective of the one performing the task. How well would an observer or experimenter be able to subjectively rate the mental workload of a task being performed by a subject? In this experiment, participants formed groups of two in which one acted as an experimenter and the other as a subject. The experimenter administered three types of distractor tasks to the subject while the subject held a consonant triad in their working memory. The three tasks were a high difficulty task (counting), a medium difficulty task (writing), and a low difficulty task (drawing). Both experimenter and subject then filled out a NASA-TLX for all three task types. Roles were switched and the process repeated. Significant differences in rating were found for the high difficulty but not the low difficulty task, with the medium difficulty task in between.


2019 ◽  
Author(s):  
Olivier Garot ◽  
Julian Rössler ◽  
Juliane Pfarr ◽  
Michael T. Ganter ◽  
Donat R. Spahn ◽  
...  

Abstract Background Maintaining adequate situation awareness is crucial for patient safety. Previous studies found that the use of avatar-based monitoring (Visual Patient Technology) improved the perception of vital signs compared to conventional monitoring showing numerical and waveform data; and was further associated with a reduction of perceived workload. In this study, we aimed to evaluate the effectiveness of Visual Patient Technology on perceptive performance and perceived workload when monitoring multiple patients at the same time, such as in central station monitors in intensive care units or operating rooms.Methods A prospective, within-subject, computer-based laboratory study was performed in two tertiary care hospitals in Switzerland in 2018. Thirty-eight physician and nurse anesthetists volunteered for the study. The participants were shown four different central monitor scenarios in sequence, where each scenario displayed two critical and four healthy patients simultaneously for 10 or 30 seconds. After each scenario, participants had to recall the vital signs of the critical patients. Perceived workload was assessed with the National Aeronautics and Space Association Task-Load-Index (NASA TLX) questionnaire.Results In the 10-second scenarios, the median number of perceived vital signs significantly improved from 7 to 11 using avatar-based versus conventional monitoring with a median of differences of 5 vital signs, p < 0.001. At the same time, the median NASA TLX scores were significantly lower for avatar-based monitoring (67 vs. 77) with a median of differences of 5 points, p = 0.024. In the 30-second scenarios, vital sign perception and workload did not differ significantly.Conclusions In central monitor multiple patient monitoring, we found a significant improvement of vital sign perception and reduction of perceived cognitive workload using Visual Patient Technology, compared to conventional monitoring. The technology enabled improved assessment of patient status and may, thereby, increase situation awareness and enhance patient safety.


Author(s):  
Justin G. Hollands ◽  
Tzvi Spivak ◽  
Eric W. Kramkowski

Objective: We sought to determine the influence of message presentation rate (MPR) and sensory modality on soldier cognitive load. Background: Soldiers commonly communicate tactical information by radio. The Canadian Army is equipping soldiers with a battle management system (BMS), which also allows them to communicate by text. Method: We varied presentation modality (auditory vs. visual) and MPR (fast or slow) in an experiment involving a tactical scenario. Participants (soldiers) received messages and periodically provided situation reports to higher level command, and the scored reports were used to provide a measure of situation awareness (SA). The detection response task (DRT) and NASA-TLX were used to measure cognitive load. Results: The fast MPR reduced DRT accuracy and increased response times relative to slow MPR. The NASA-TLX results also showed higher subjective workload ratings for several subscales with fast MPR. Messages presented visually produced greater cognitive load, with slower DRT response times for the visual than the auditory condition. SA scores were higher with slower MPR and auditory presentation. There was no statistical interaction of presentation modality and rate for any measure. Conclusion: Fast MPR and visual presentation increased cognitive load and degraded SA. Application: These findings show that the DRT can be used to measure workload effectively in a tactical military context and that the method of information presentation affects how soldiers process information in a BMS.


Author(s):  
Travis M. Kent ◽  
Bo Fu ◽  
Brittany D. Walls ◽  
Will Seidelman ◽  
M.A. Sublette ◽  
...  

The domain of welding recently has seen the introduction of numerous potential aids to the practitioners who monitor welding robots (“welbots”) or perform welds themselves. One such aid is the visual augmentation or visualization of particular weld pool parameters. The purpose of this study was to examine the potential effects of a simple, geometric weld pool visualization on the ability of novice welders to relate what they witnessed in welding videos to the completed welds on physically-present pipes (a situation awareness outcome). In addition to participants’ accuracies on the situation awareness measure, perceived usability and subjective workload were also assessed. There was no reliable benefit for the augmented condition compared to the raw, unedited video condition. The results may indicate a task specificity for the type of augmentation utilized or possible limitations of the current rendering methods.


Author(s):  
Jia hui Ma ◽  
Bernadette McCrory ◽  
David Claudio

Background: The National Aeronautics and Space Administration Task Load Index (NASA-TLX) a nd Surgery Task Load Index (SURG-TLX) are two subjective workload m easurement instruments. Both instruments provide unweighted andweightedworkload measurements. Weighted TLX m easures have seldomly been used byresearchers typically becauseof its high correlation with unweighted TLX a nd addedexperimentaltime to collect pairwise weightings. Therefore, theaim of thestudy wa s to compare weighted TLX scores with unweighted TLX scores and each’s sensitivity for demographic and experimentalfactors. Methods: Two simulated la paroscopic single site surgery tasks were completedusing four surgicalmethods by 25 participants. Pearson correlations, principle componentanalyses and mixed effect models were used to compare the unweightedandweighted TLXscores across tasks a nd methods. Results: There was a high correlation(r > 0.950, p < 0.001) betweentheunweighted TLXscores and the weighted TLX scores. The weighted TLX scores showed better sensitivity to both demographic and experimental factors for both TLX instruments. Conclusion: The overall weighted NASA-TLX and SURG-TLX scores differed from their respective unweighted scores even with a high correlation. The weighted TLXshowed potentialto better discriminate subjective workloadfor single-site ta sks. Future research is needed to identify and validatea more streamlinedweightingmethodparticularly for complex experimental taskslike single-site surgery.


Author(s):  
Anthony J. Aretz ◽  
Chris Johannsen ◽  
Keith Ober

A correlational design was used to regress NASA TLX subjective workload ratings onto several potential independent variables (i.e., the number of concurrent tasks, task combination, task resource demands, and flight experience) to determine task characteristics that influence pilot subjective workload ratings. A part task simulator was used to present up to six concurrent tasks, in different combinations, to 27 cadets at the USAF Academy. The results indicated the number of concurrent tasks had the largest impact on subjective workload ratings. In terms of multiple resource theory, spatial, verbal, and visual demands (in that order) contributed the most variance. The implication for theoreticians and designers is that the number of concurrent tasks, mental resource demands, and time constraints seem to be key contributors to subjective workload ratings.


Sign in / Sign up

Export Citation Format

Share Document