voice activity
Recently Published Documents


TOTAL DOCUMENTS

925
(FIVE YEARS 152)

H-INDEX

35
(FIVE YEARS 5)

2022 ◽  
Vol 40 (1) ◽  
pp. 1-23
Author(s):  
Jiaxing Shen ◽  
Jiannong Cao ◽  
Oren Lederman ◽  
Shaojie Tang ◽  
Alex “Sandy” Pentland

User profiling refers to inferring people’s attributes of interest ( AoIs ) like gender and occupation, which enables various applications ranging from personalized services to collective analyses. Massive nonlinguistic audio data brings a novel opportunity for user profiling due to the prevalence of studying spontaneous face-to-face communication. Nonlinguistic audio is coarse-grained audio data without linguistic content. It is collected due to privacy concerns in private situations like doctor-patient dialogues. The opportunity facilitates optimized organizational management and personalized healthcare, especially for chronic diseases. In this article, we are the first to build a user profiling system to infer gender and personality based on nonlinguistic audio. Instead of linguistic or acoustic features that are unable to extract, we focus on conversational features that could reflect AoIs. We firstly develop an adaptive voice activity detection algorithm that could address individual differences in voice and false-positive voice activities caused by people nearby. Secondly, we propose a gender-assisted multi-task learning method to combat dynamics in human behavior by integrating gender differences and the correlation of personality traits. According to the experimental evaluation of 100 people in 273 meetings, we achieved 0.759 and 0.652 in F1-score for gender identification and personality recognition, respectively.


2022 ◽  
Vol 5 (1) ◽  
pp. 23-31
Author(s):  
Al smadi Takialddin ◽  
Ahmed Handam

Currently, the direction of voice biometrics is actively developing, which includes two related tasks of recognizing the speaker by voice: the verification task, which consists in determining the speaker's personality, and the identification task, which is responsible for checking the belonging of the phonogram to a particular speaker. An open question remains related to improving the quality of the verification identification algorithms in real conditions and reducing the probability of error. In this work study Voice activity detection algorithm is proposed, which is a modification of the algorithm based on pitch statistics; VAD is investigated as a component of a speaker recognition system by voice, and therefore the main purpose of its work is to improve the quality of the system as a whole. On the example of the proposed modification of the VAD algorithm and the energy-based VAD algorithm, the analysis of the influence of the choice on the quality of the speaker recognition system is carried out.  


2021 ◽  
Author(s):  
Joon Gyu Maeng ◽  
Min Kyu Lee ◽  
Seung Yun ◽  
Sang Hun Kim

2021 ◽  
Vol 181 ◽  
pp. 108116
Author(s):  
Binh Thien Nguyen ◽  
Yukoh Wakabayashi ◽  
Kenta Iwai ◽  
Takanobu Nishiura

Sign in / Sign up

Export Citation Format

Share Document