Adductory Vocal Fold Kinematic Trajectories During Conventional Versus High-Speed Videoendoscopy

2019 ◽  
Vol 62 (6) ◽  
pp. 1685-1706 ◽  
Author(s):  
Manuel Diaz-Cadiz ◽  
Victoria S. McKenna ◽  
Jennifer M. Vojtech ◽  
Cara E. Stepp

Objective: Prephonatory vocal fold angle trajectories may supply useful information about the laryngeal system but were examined in previous studies using sigmoidal curves fit to data collected at 30 frames per second (fps). Here, high-speed videoendoscopy (HSV) was used to investigate the impacts of video frame rate and sigmoidal fitting strategy on vocal fold adductory patterns for voicing onsets.

Method: Twenty-five participants with healthy voices performed /ifi/ sequences under flexible nasendoscopy at 1,000 fps. Glottic angles were extracted during adduction for voicing onset; resulting vocal fold trajectories (i.e., changes in glottic angle over time) were down-sampled to simulate different frame rate conditions (30–1,000 fps). Vocal fold adduction data were fit with asymmetric sigmoids using 5 fitting strategies with varying parameter restrictions. Adduction trajectories and maximum adduction velocities were compared between the fits and the actual HSV data. Adduction trajectory errors between HSV data and fits were evaluated using root-mean-square error and maximum angular velocity error.

Results: Simulated data were generally well fit by sigmoid models; however, when compared to the actual 1,000-fps data, sigmoid fits were found to overestimate maximum angle velocities. Errors decreased as frame rate increased, reaching a plateau by 120 fps.

Conclusion: In healthy adults, vocal fold kinematic behavior during adduction is generally sigmoidal, although such fits can produce substantial errors when data are acquired at frame rates lower than 120 fps.
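The down-sampling step and the two error measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the trajectory is synthetic, the model is a symmetric logistic curve rather than the asymmetric sigmoids used in the study, no curve fitting is performed, and every function name and parameter value below is a hypothetical chosen for illustration. Down-sampling is done by plain decimation, so the simulated rate is 1,000/33 ≈ 30.3 fps rather than exactly 30 fps.

```python
import math

def sigmoid_angle(t, start_angle, end_angle, t_mid, slope):
    """Symmetric logistic glottic-angle model (degrees) at time t (seconds).

    Assumption: the study used asymmetric sigmoids; this simpler form is
    only a stand-in to generate and evaluate example trajectories.
    """
    return end_angle + (start_angle - end_angle) / (1.0 + math.exp(slope * (t - t_mid)))

def downsample(samples, source_fps, target_fps):
    """Keep every k-th sample, with k = source_fps // target_fps.

    Returns the decimated samples and the effective frame rate.
    """
    step = source_fps // target_fps
    if step < 1:
        raise ValueError("target_fps must not exceed source_fps")
    return samples[::step], source_fps / step

def rmse(observed, predicted):
    """Root-mean-square error between two equally long angle sequences."""
    if len(observed) != len(predicted):
        raise ValueError("sequences must have the same length")
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed))

def max_adduction_velocity(angles, fps):
    """Largest frame-to-frame closing speed, in degrees per second."""
    return max((a - b) * fps for a, b in zip(angles, angles[1:]))

# Synthetic 0.5 s adduction gesture sampled at 1,000 fps (hypothetical values).
hsv = [sigmoid_angle(i / 1000, 40.0, 0.0, 0.25, 40.0) for i in range(501)]
low, low_fps = downsample(hsv, 1000, 30)

# Evaluate a deliberately mismatched candidate curve at the low-rate time points.
candidate = [sigmoid_angle(i / low_fps, 40.0, 0.0, 0.25, 50.0) for i in range(len(low))]
print("effective fps:", low_fps)
print("RMSE (deg):", rmse(low, candidate))
print("max velocity at 1,000 fps (deg/s):", max_adduction_velocity(hsv, 1000))
print("max velocity at low rate (deg/s):", max_adduction_velocity(low, low_fps))
```

In a fuller analysis, the candidate curve would come from a nonlinear least-squares fit to the down-sampled points, and the maximum velocity of that fitted curve would be compared with the value measured from the 1,000-fps data.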

2011 ◽  
Vol 61 (3) ◽  
pp. 769-786 ◽  
Author(s):  
Lucjan Janowski ◽  
Piotr Romaniak ◽  
Zdzisław Papir

1994 ◽  
Vol 37 (5) ◽  
pp. 1204-1210 ◽  
Author(s):  
Melanie Vitkovitch ◽  
Paul Barber

In a study addressing future use of video-telephone systems, the ability of 52 young adults with normal hearing to shadow verbal passages was assessed when they could both hear and observe the speaker. This performance was compared to performance in an audio-alone condition. The passages were presented against an irrelevant background message. Effects of varying the video frame rate (i.e., the rate at which frames were sampled) were examined, using rates of 8.3, 12.5, 16.7, and 25 Hz. The presence of the visual image of the relevant speaker always improved performance when compared with a baseline audio-alone condition. The motion of the speaker’s face may generally support the focusing of attention on the target message. However, effects of video frame rate were also apparent, suggesting that specific visual cues became available as the temporal resolution improved. When frame rates of 8.3 Hz and the maximum available rate of 25 Hz were compared, shadowing performance was significantly better across the subject group at the higher frame rate. The comparison of frame rates of 12.5 and 25 Hz did not show reliably improved performance across the whole subject group at 25 Hz, although a small number of subjects seemed to benefit. This suggests there may be some differences in the visual cues used by subjects and consequent differences in the way individuals perform under different frame rates. Performance at 16.7 and 25 Hz did not differ, and this is consistent with previous research that tested people with hearing loss. A frame rate of 16.7 Hz may therefore be adequate for the transmission of facial images via a video communication link to a broad range of users; at the lower frame rates, the performance of users is likely to suffer.


2020 ◽  
Vol 59 (7) ◽  
pp. 2157
Author(s):  
Saher Junaid ◽  
Peter Tidemand-Lichtenberg ◽  
Christian Pedersen ◽  
Peter John Rodrigo

2018 ◽  
Vol 8 (12) ◽  
pp. 2666 ◽  
Author(s):  
Patrick Schlegel ◽  
Marion Semmler ◽  
Melda Kunduk ◽  
Michael Döllinger ◽  
Christopher Bohr ◽  
...  

Laryngeal high-speed videoendoscopy (HSV) allows objective quantification of vocal fold vibratory characteristics. However, it is unknown how the analyzed sequence length affects some of the computed parameters. To examine whether varying sequence lengths influence parameter calculation, 20 HSV recordings of healthy females during sustained phonation were investigated. The clinically prevalent Photron Fastcam MC2 camera, with a frame rate of 4000 fps and a spatial resolution of 512 × 256 pixels, was used to collect HSV data. The glottal area waveform (GAW), describing the increase and decrease of the area between the vocal folds during phonation, was extracted. Based on the GAW, 16 perturbation parameters were computed for sequences of 5, 10, 20, 50 and 100 consecutive cycles. Statistical analysis was performed using SPSS Statistics, version 21. Only three parameters (18.8%) were statistically significantly influenced by changing sequence lengths. Of these parameters, one changed until 10 cycles were reached, one until 20 cycles were reached and one, namely Amplitude Variability Index (AVI), changed between almost all groups of different sequence lengths. Moreover, visually observable, but not statistically significant, changes within parameters were observed. These changes were often most prominent between shorter sequence lengths. Hence, we suggest using a sequence length of at least 20 cycles and discarding the parameter AVI.
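How a perturbation parameter can depend on the number of analyzed cycles can be sketched as follows. The measure used here is a generic, shimmer-like relative amplitude perturbation; it is an assumption made for illustration and is not claimed to be one of the study's 16 parameters, nor the AVI. The function names and the input (a list of per-cycle GAW peak values) are likewise hypothetical.

```python
def relative_amplitude_perturbation(peaks):
    """Mean absolute difference between consecutive cycle peaks,
    expressed as a percentage of the mean peak value."""
    if len(peaks) < 2:
        raise ValueError("need at least two cycles")
    mean_abs_diff = sum(abs(b - a) for a, b in zip(peaks, peaks[1:])) / (len(peaks) - 1)
    mean_peak = sum(peaks) / len(peaks)
    return 100.0 * mean_abs_diff / mean_peak

def perturbation_by_length(peaks, lengths=(5, 10, 20, 50, 100)):
    """Evaluate the measure on the first n cycles for each requested n.

    Lengths longer than the available recording are skipped.
    """
    return {n: relative_amplitude_perturbation(peaks[:n])
            for n in lengths if n <= len(peaks)}

# Hypothetical per-cycle peak areas (arbitrary units) for 100 cycles.
example_peaks = [100.0 + (3.0 if i % 7 == 0 else -1.0) for i in range(100)]
for n, value in perturbation_by_length(example_peaks).items():
    print(n, "cycles:", round(value, 3), "%")
```

Comparing the values across the five sequence lengths mirrors the study's design, in which each parameter was recomputed for 5, 10, 20, 50 and 100 consecutive cycles.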

