Study on Multichannel Speech Enhancement Technology in Voice Human-Computer Interaction

2011 ◽  
Vol 267 ◽  
pp. 762-767
Author(s):  
Ji Xiang Lu ◽  
Ping Wang ◽  
Hong Zhong Shi ◽  
Xin Wang

As the primary research area of the Multimoda1 Human-computer Interaction, Voice Interaction mainly involves extraction and identification of the natural speech signal, where the former provides the reliable signal sources, which are analyzed by the latter. The multichannel speech enhancement technology is studied in this paper, aiming at the Voice Interactive. The simulated results show the effectiveness and superiority of the improved algorithm proposed in the paper.

2010 ◽  
Vol 139-141 ◽  
pp. 2154-2157
Author(s):  
Ji Xiang Lu ◽  
Ping Wang ◽  
Long Yi

The voice interaction in cockpit mainly includes speech recognition, enhancement and synthesis. This interaction transfers the speech information to the corresponding orders to make machines in cockpit work unmistaken, also feedback the execution results to users by speech output devices or some other ways. The speech enhancement technology is studied in this paper, aiming at the Voice Interactive. We propose an improved spectral subtraction (SS) algorithm based on auditory masking effect, by using two steps SS. The simulated results based on the segment SNR compared to the traditional SS show the effectiveness and superiority of the improved algorithm.


2019 ◽  
Vol 34 (1) ◽  
pp. 28-47 ◽  
Author(s):  
Emna Chérif ◽  
Jean-François Lemoine

Virtual assistants are increasingly common on commercial websites. In view of the benefits they offer to businesses for improving navigation and interaction with the consumers, researchers and practitioners agree on the value of providing them with anthropomorphic characteristics. This study focuses on the effect of the voice of the virtual assistant. Although there are some studies of human–computer interaction in this field, there is no work that addresses the topic from a marketing perspective and compares the effect of a human voice versus a synthetic voice. Our findings show that consumers who interact with a virtual assistant with a human voice have a stronger impression of social presence than those interacting with a virtual assistant with a synthetic voice. The human voice also builds trust in the virtual assistant and generates stronger behavioural intentions.


2014 ◽  
Vol 556-562 ◽  
pp. 3408-3411
Author(s):  
Jing Xin Xiao ◽  
Lin Deng ◽  
Yun He ◽  
You Yi Li

Speech signal can be effect by the external interruption in the process of flight simulation, which results in the deteriorated speech processing performance in flight simulation system. To solve the problem, speech enhancement module has been developed using the spectral subtraction algorithm based on the flight simulation system developed by the research team. The module will improve the recognition rates and provide advanced anti-jam capabilities for the speech processing performance of the flight simulation system. The feasibility and validity of this speech enhancement module is validated by processing the interrupted speech signal. The work in this paper has laid foundation for the application of the speech enhancement technology in flight simulation system.


2014 ◽  
Vol 716-717 ◽  
pp. 1272-1276
Author(s):  
Yan Ping Wei ◽  
Hai Liu Xiao

With the development of computer technology and information technology, voice interaction has become a necessary means of human-computer interaction, and voice signal acquisition and processing is the precondition and foundation of human-computer interaction. This paper introduces the MATLAB visualization method into voice signal acquisition system, and uses MATLAB programming method to drive sound card directly, which realizes the identification and acquisition of voice signal and designs a new voice signal visualization acquisition system. In order to optimize the system, this paper introduces the variance analysis algorithm into the design of visualization system, which realizes the optimization of voice signal recognition model with different level parameters. At the end this paper does numerical simulation on the speech signal acquisition system; through signal acquisition 2D and 3D visualization voice signals are obtained. It extracts single signal characteristics, which provides a theoretical reference for the design of signal acquisition system.


Author(s):  
Douglas J. Gillan

Human-computer interaction (HCI) has been identified as a rich task for the real-world study of psychology; however, the theoretical approaches to the psychology of HCI have narrowly focused on problem-solving (e.g., GOMS and CE+), memory (e.g., mental models and metaphors), and social interaction (e.g., perceived control). An attempt to create a broader theoretical framework integrates the three approaches to the psychology of HCI with a theory, IP3. This paper (1) discusses each of the three psychologies of HCI, (2) describes the integrative theory, IP3 (verbally, as well as by a graphical representation), (3) applies the theory to one representative research area—transfer of training, and (4) applies the theory to the interpretation of selected HCI design guidelines.


Sign in / Sign up

Export Citation Format

Share Document