scholarly journals Chinese Tone Recognition Based on 3D Dynamic Muscle Information

2020 ◽  
Vol 2020 ◽  
pp. 1-9
Author(s):  
JianRong Wang ◽  
Li Wan ◽  
Ju Zhang ◽  
Qiang Fang ◽  
Fan Yang ◽  
...  

To advance the study of lip-reading recognition in accordance with Chinese pronunciation norms, we carefully investigated Mandarin tone recognition based on visual information, in contrast to that of the previous character-based Chinese lip reading technique. In this paper, we mainly studied the vowel tonal transformation in Chinese pronunciation and designed a lightweight skipping convolution network framework (SCNet). And, the experimental results showed that the SCNet was sensitive to the more detailed description of the pitch change than that of the traditional model and achieved a better tone recognition effect and outstanding antiinterference performance. In addition, we conducted a more detailed study on the assistance of the deep texture information in lip-reading recognition. We found that the deep texture information has a significant effect on tone recognition, and the possibility of multimodal lip reading in Chinese tone recognition was confirmed. Similarly, we verified the role of the SCNet syllable tone recognition and found that the vowel and syllable tone recognition accuracy of our model was as high as 97.3%, which also showed the robustness of our proposed method for Chinese tone recognition and it can be widely used for tone recognition.

Author(s):  
Mahesh K. Joshi ◽  
J.R. Klein

New technologies like artificial intelligence, robotics, machine intelligence, and the Internet of Things are seeing repetitive tasks move away from humans to machines. Humans cannot become machines, but machines can become more human-like. The traditional model of educating workers for the workforce is fast becoming irrelevant. There is a massive need for the retooling of human workers. Humans need to be trained to remain focused in a society which is constantly getting bombarded with information. The two basic elements of physical and mental capacity are slowly being taken over by machines and artificial intelligence. This changes the fundamental role of the global workforce.


Author(s):  
Vincenzo Butticè ◽  
Silvio Vismara

AbstractNowadays equity crowdfunding plays an important role in the entrepreneurial finance markets. To better understand the functioning of the industry, it is important to consider the entire equity crowdfunding process and all the actors involved. Equity crowdfunding platforms match indeed the demand of capital from entrepreneurial ventures with the supply of capital by investors. This manuscript is a first step in this direction, by (1) comparing equity crowdfunding with traditional sources of entrepreneurial finance; (2) discussing the potential and the perils of equity crowdfunding for inclusivity and democratization; (3) highlighting the role of visual information in digital finance; and (4) providing first insights on the industrial dynamics in equity crowdfunding. The paper gives researchers and practitioners orientation about recent developments in equity crowdfunding literature and provides relevant research directions.


Author(s):  
Lorenzo Cangiano ◽  
Sabrina Asteriti

AbstractIn the vertebrate retina, signals generated by cones of different spectral preference and by highly sensitive rod photoreceptors interact at various levels to extract salient visual information. The first opportunity for such interaction is offered by electrical coupling of the photoreceptors themselves, which is mediated by gap junctions located at the contact points of specialised cellular processes: synaptic terminals, telodendria and radial fins. Here, we examine the evolutionary pressures for and against interphotoreceptor coupling, which are likely to have shaped how coupling is deployed in different species. The impact of coupling on signal to noise ratio, spatial acuity, contrast sensitivity, absolute and increment threshold, retinal signal flow and colour discrimination is discussed while emphasising available data from a variety of vertebrate models spanning from lampreys to primates. We highlight the many gaps in our knowledge, persisting discrepancies in the literature, as well as some major unanswered questions on the actual extent and physiological role of cone-cone, rod-cone and rod-rod communication. Lastly, we point toward limited but intriguing evidence suggestive of the ancestral form of coupling among ciliary photoreceptors.


Author(s):  
Yuxia Wang ◽  
Xiaohu Yang ◽  
Hongwei Ding ◽  
Can Xu ◽  
Chang Liu

Purpose The purpose of this study was to examine the aging effects on the categorical perception (CP) of Mandarin lexical Tones 1–4 and Tones 1–2 in noise. It also investigated whether listeners' categorical tone perception in noise correlated with their general tone identification of 20 natural vowel-plus-tone signals in noise. Method Twelve younger and 12 older listeners with normal hearing were recruited in both tone identification and discrimination tasks in a CP paradigm where fundamental frequency contours of target stimuli varied systematically from the flat tone (Tone 1) to the rising/falling tones (Tones 2/4). Both tasks were conducted in quiet and noise with signal-to-noise ratios set at −5 and −10 dB, respectively, and general tone identification of natural speech signals was also tested in noise conditions. Results Compared with younger listeners, older listeners had shallower identification slopes and smaller discrimination peakedness in Tones 1–2/4 perception in all listening conditions, except for Tones 1–4 perception in quiet where no group differences were found. Meanwhile, noise affected Tones 1–2/4 perception: The signal-to-noise ratio condition at −10 dB brought shallower slope in Tones 1–2/4 identification and less peakedness in Tones 1–4 discrimination for both listener groups. Older listeners' CP in noise, the identification slopes in particular, positively correlated with their general tone identification in noise, but such correlations were partially missing for younger listeners. Conclusions Both aging and the presence of speech-shaped noise significantly reduced the CP of Mandarin Tones 1–2/4. Listeners' Mandarin tone recognition may be related to their CP of Mandarin tones.


2012 ◽  
Author(s):  
Rui Cao ◽  
Ratree Wayland ◽  
Edith Kaan
Keyword(s):  

2018 ◽  
Vol 40 (1) ◽  
pp. 93-109
Author(s):  
YI ZHENG ◽  
ARTHUR G. SAMUEL

AbstractIt has been documented that lipreading facilitates the understanding of difficult speech, such as noisy speech and time-compressed speech. However, relatively little work has addressed the role of visual information in perceiving accented speech, another type of difficult speech. In this study, we specifically focus on accented word recognition. One hundred forty-two native English speakers made lexical decision judgments on English words or nonwords produced by speakers with Mandarin Chinese accents. The stimuli were presented as either as videos that were of a relatively far speaker or as videos in which we zoomed in on the speaker’s head. Consistent with studies of degraded speech, listeners were more accurate at recognizing accented words when they saw lip movements from the closer apparent distance. The effect of apparent distance tended to be larger under nonoptimal conditions: when stimuli were nonwords than words, and when stimuli were produced by a speaker who had a relatively strong accent. However, we did not find any influence of listeners’ prior experience with Chinese accented speech, suggesting that cross-talker generalization is limited. The current study provides practical suggestions for effective communication between native and nonnative speakers: visual information is useful, and it is more useful in some circumstances than others.


Sign in / Sign up

Export Citation Format

Share Document