scholarly journals Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers

2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Karyna Isaieva ◽  
Yves Laprie ◽  
Justine Leclère ◽  
Ioannis K. Douros ◽  
Jacques Felblinger ◽  
...  

AbstractThe study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus consisting of synthetic sentences was used to ensure a good coverage of the French phonetic context. A real-time MRI technology with temporal resolution of 20 ms was used to acquire vocal tract images of the participants speaking. The sound was recorded simultaneously with MRI, denoised and temporally aligned with the images. The speech was transcribed to obtain phoneme-wise segmentation of sound. We also acquired static 3D MR images for a wide list of French phonemes. In addition, we include annotations of spontaneous swallowing.

Author(s):  
Asterios Toutios ◽  
Shrikanth S. Narayanan

Real-time magnetic resonance imaging (rtMRI) of the moving vocal tract during running speech production is an important emerging tool for speech production research providing dynamic information of a speaker's upper airway from the entire midsagittal plane or any other scan plane of interest. There have been several advances in the development of speech rtMRI and corresponding analysis tools, and their application to domains such as phonetics and phonological theory, articulatory modeling, and speaker characterization. An important recent development has been the open release of a database that includes speech rtMRI data from five male and five female speakers of American English each producing 460 phonetically balanced sentences. The purpose of the present paper is to give an overview and outlook of the advances in rtMRI as a tool for speech research and technology development.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Yongwan Lim ◽  
Asterios Toutios ◽  
Yannick Bliesener ◽  
Ye Tian ◽  
Sajan Goud Lingala ◽  
...  

AbstractReal-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to-date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically-relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 participants performing linguistically motivated speech tasks, alongside the corresponding public domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each participant.


1971 ◽  
Vol 36 (3) ◽  
pp. 397-409 ◽  
Author(s):  
Rachel E. Stark

Real-time amplitude contour and spectral displays were used in teaching speech production skills to a profoundly deaf, nonspeaking boy. This child had a visual attention problem, a behavior problem, and a poor academic record. In individual instruction, he was first taught to produce features of speech, for example, friction, nasal, and stop, which are present in vocalizations of 6- to 9-month-old infants, and then to combine these features in syllables and words. He made progress in speech, although sign language and finger spelling were taught at the same time. Speech production skills were retained after instruction was terminated. The results suggest that deaf children are able to extract information about the features of speech from visual displays, and that a developmental sequence should be followed as far as possible in teaching speech production skills to them.


Author(s):  
Tanner Sorensen ◽  
Asterios Toutios ◽  
Louis Goldstein ◽  
Shrikanth S. Narayanan
Keyword(s):  

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Sherif M. Hanafy ◽  
Hussein Hoteit ◽  
Jing Li ◽  
Gerard T. Schuster

AbstractResults are presented for real-time seismic imaging of subsurface fluid flow by parsimonious refraction and surface-wave interferometry. Each subsurface velocity image inverted from time-lapse seismic data only requires several minutes of recording time, which is less than the time-scale of the fluid-induced changes in the rock properties. In this sense this is real-time imaging. The images are P-velocity tomograms inverted from the first-arrival times and the S-velocity tomograms inverted from dispersion curves. Compared to conventional seismic imaging, parsimonious interferometry reduces the recording time and increases the temporal resolution of time-lapse seismic images by more than an order-of-magnitude. In our seismic experiment, we recorded 90 sparse data sets over 4.5 h while injecting 12-tons of water into a sand dune. Results show that the percolation of water is mostly along layered boundaries down to a depth of a few meters, which is consistent with our 3D computational fluid flow simulations and laboratory experiments. The significance of parsimonious interferometry is that it provides more than an order-of-magnitude increase of temporal resolution in time-lapse seismic imaging. We believe that real-time seismic imaging will have important applications for non-destructive characterization in environmental, biomedical, and subsurface imaging.


2021 ◽  
Vol 13 (10) ◽  
pp. 1884
Author(s):  
Jingjing Hu ◽  
Yansong Bao ◽  
Jian Liu ◽  
Hui Liu ◽  
George P. Petropoulos ◽  
...  

The acquisition of real-time temperature and relative humidity (RH) profiles in the Arctic is of great significance for the study of the Arctic’s climate and Arctic scientific research. However, the operational algorithm of Fengyun-3D only takes into account areas within 60°N, the innovation of this work is that a new technique based on Neural Network (NN) algorithm was proposed, which can retrieve these parameters in real time from the Fengyun-3D Hyperspectral Infrared Radiation Atmospheric Sounding (HIRAS) observations in the Arctic region. Considering the difficulty of obtaining a large amount of actual observation (such as radiosonde) in the Arctic region, collocated ERA5 data from European Centre for Medium-Range Weather Forecasts (ECMWF) and HIRAS observations were used to train the neural networks (NNs). Brightness temperature and training targets were classified using two variables: season (warm season and cold season) and surface type (ocean and land). NNs-based retrievals were compared with ERA5 data and radiosonde observations (RAOBs) independent of the NN training sets. Results showed that (1) the NNs retrievals accuracy is generally higher on warm season and ocean; (2) the root-mean-square error (RMSE) of retrieved profiles is generally slightly higher in the RAOB comparisons than in the ERA5 comparisons, but the variation trend of errors with height is consistent; (3) the retrieved profiles by the NN method are closer to ERA5, comparing with the AIRS products. All the results demonstrated the potential value in time and space of NN algorithm in retrieving temperature and relative humidity profiles of the Arctic region from HIRAS observations under clear-sky conditions. As such, the proposed NN algorithm provides a valuable pathway for retrieving reliably temperature and RH profiles from HIRAS observations in the Arctic region, providing information of practical value in a wide spectrum of practical applications and research investigations alike.All in all, our work has important implications in broadening Fengyun-3D’s operational implementation range from within 60°N to the Arctic region.


Sign in / Sign up

Export Citation Format

Share Document