speech activity detection Latest Research Papers

In speech technology, a pivotal role is being played by the Speaker diarization mechanism. In general, speaker diarization is the mechanism of partitioning the input audio stream into homogeneous segments based on the identity of the speakers. The automatic transcription readability can be improved with the speaker diarization as it is good in recognizing the audio stream into the speaker turn and often provides the true speaker identity. In this research work, a novel speaker diarization approach is introduced under three major phases: Feature Extraction, Speech Activity Detection (SAD), and Speaker Segmentation and Clustering process. Initially, from the input audio stream (Telugu language) collected, the Mel Frequency Cepstral coefficient (MFCC) based features are extracted. Subsequently, in Speech Activity Detection (SAD), the music and silence signals are removed. Then, the acquired speech signals are segmented for each individual speaker. Finally, the segmented signals are subjected to the speaker clustering process, where the Optimized Convolutional Neural Network (CNN) is used. To make the clustering more appropriate, the weight and activation function of CNN are fine-tuned by a new Self Adaptive Sea Lion Algorithm (SA-SLnO). Finally, a comparative analysis is made to exhibit the superiority of the proposed speaker diarization work. Accordingly, the accuracy of the proposed method is 0.8073, which is 5.255, 2.45%, and 0.075, superior to the existing works.

Download Full-text

Speech Activity Detection from Stereotactic EEG

10.1109/smc52423.2021.9659058 ◽

2021 ◽

Author(s):

P. Z. Soroush ◽

M. Angrick ◽

J. Shih ◽

T. Schultz ◽

D. J. Krusienski

Keyword(s):

Activity Detection ◽

Speech Activity ◽

Speech Activity Detection

Download Full-text

Speech Activity Detection Based on Multilingual Speech Recognition System

10.21437/interspeech.2021-1058 ◽

2021 ◽

Author(s):

Seyyed Saeed Sarfjoo ◽

Srikanth Madikeri ◽

Petr Motlicek

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Activity Detection ◽

Speech Activity ◽

Multilingual Speech Recognition ◽

Speech Activity Detection

Download Full-text

EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III

10.21437/interspeech.2021-1456 ◽

2021 ◽

Author(s):

Omid Ghahabi ◽

Volker Fischer

Keyword(s):

Phase Iii ◽

Activity Detection ◽

Speech Activity ◽

Speech Activity Detection

Download Full-text

Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021

10.21437/interspeech.2021-309 ◽

2021 ◽

Author(s):

Pablo Gimeno ◽

Alfonso Ortega ◽

Antonio Miguel ◽

Eduardo Lleida

Keyword(s):

Representation Learning ◽

Activity Detection ◽

Speech Activity ◽

Speech Activity Detection

Download Full-text

Using X-Vectors for Speech Activity Detection in Broadcast Streams

10.21437/interspeech.2021-192 ◽

2021 ◽

Author(s):

Lukas Mateju ◽

Frantisek Kynych ◽

Petr Cerva ◽

Jindrich Zdansky ◽

Jiri Malek

Keyword(s):

Activity Detection ◽

Speech Activity ◽

Speech Activity Detection

Download Full-text

Speech Activity Detection under Adverse Noisy Conditions at Low SNRs

2021 6th International Conference on Communication and Electronics Systems (ICCES) ◽

10.1109/icces51350.2021.9488934 ◽

2021 ◽

Author(s):

Rahul Jaiswal

Keyword(s):

Activity Detection ◽

Speech Activity ◽

Noisy Conditions ◽

Speech Activity Detection

Download Full-text

Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions

10.21437/iberspeech.2021-6 ◽

2021 ◽

Author(s):

Pablo Gimeno ◽

Dayana Ribas ◽

Alfonso Ortega ◽

Antonio Miguel ◽

Eduardo Lleida

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Activity Detection ◽

Speech Activity ◽

Speech Activity Detection

Download Full-text

A Novel Approach to EEG Speech Activity Detection with Visual Stimuli and Mobile BCI

Applied Sciences ◽

10.3390/app11020674 ◽

2021 ◽

Vol 11 (2) ◽

pp. 674

Author(s):

Marianna Koctúrová ◽

Jozef Juhár

Keyword(s):

Visual Stimuli ◽

Computer Interface ◽

Single Subject ◽

Detection Accuracy ◽

The Novel ◽

Activity Detection ◽

Speech Activity ◽

Novel Approach ◽

Eeg Data ◽

Speech Activity Detection

With the ever-progressing development in the field of computational and analytical science the last decade has seen a big improvement in the accuracy of electroencephalography (EEG) technology. Studies try to examine possibilities to use high dimensional EEG data as a source for Brain to Computer Interface. Applications of EEG Brain to computer interface vary from emotion recognition, simple computer/device control, speech recognition up to Intelligent Prosthesis. Our research presented in this paper was focused on the study of the problematic speech activity detection using EEG data. The novel approach used in this research involved the use visual stimuli, such as reading and colour naming, and signals of speech activity detectable by EEG technology. Our proposed solution is based on a shallow Feed-Forward Artificial Neural Network with only 100 hidden neurons. Standard features such as signal energy, standard deviation, RMS, skewness, kurtosis were calculated from the original signal from 16 EEG electrodes. The novel approach in the field of Brain to computer interface applications was utilised to calculated additional set of features from the minimum phase signal. Our experimental results demonstrated F1 score of 86.80% and 83.69% speech detection accuracy based on the analysis of EEG signal from single subject and cross-subject models respectively. The importance of these results lies in the novel utilisation of the mobile device to record the nerve signals which can serve as the stepping stone for the transfer of Brain to computer interface technology from technology from a controlled environment to the real-life conditions.

Download Full-text

A neural network approach for speech activity detection for Apollo corpus

Computer Speech & Language ◽

10.1016/j.csl.2020.101137 ◽

2021 ◽

Vol 65 ◽

pp. 101137

Author(s):

Vishala Pannala ◽

B. Yegnanarayana

Keyword(s):

Neural Network ◽

Network Approach ◽

Activity Detection ◽

Neural Network Approach ◽

Speech Activity ◽

Speech Activity Detection

Download Full-text

speech activity detection
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Metaheuristic adapted convolutional neural network for Telugu speaker diarization

Speech Activity Detection from Stereotactic EEG

Speech Activity Detection Based on Multilingual Speech Recognition System

EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III

Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021

Using X-Vectors for Speech Activity Detection in Broadcast Streams

Speech Activity Detection under Adverse Noisy Conditions at Low SNRs

Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions

A Novel Approach to EEG Speech Activity Detection with Visual Stimuli and Mobile BCI

A neural network approach for speech activity detection for Apollo corpus

Export Citation Format

speech activity detectionRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Metaheuristic adapted convolutional neural network for Telugu speaker diarization

Speech Activity Detection from Stereotactic EEG

Speech Activity Detection Based on Multilingual Speech Recognition System

EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III

Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021

Using X-Vectors for Speech Activity Detection in Broadcast Streams

Speech Activity Detection under Adverse Noisy Conditions at Low SNRs

Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions

A Novel Approach to EEG Speech Activity Detection with Visual Stimuli and Mobile BCI

A neural network approach for speech activity detection for Apollo corpus

speech activity detection
Recently Published Documents