Robust Speaker Localization Guided by Deep Learning-Based Time-Frequency Masking

2019 ◽  
Vol 27 (1) ◽  
pp. 178-188 ◽  
Author(s):  
Zhong-Qiu Wang ◽  
Xueliang Zhang ◽  
DeLiang Wang
2016 ◽  
Vol 27 (02) ◽  
pp. 1650039 ◽  
Author(s):  
Francesco Carlo Morabito ◽  
Maurizio Campolo ◽  
Nadia Mammone ◽  
Mario Versaci ◽  
Silvana Franceschetti ◽  
...  

A novel technique of quantitative EEG for differentiating patients with early-stage Creutzfeldt–Jakob disease (CJD) from other forms of rapidly progressive dementia (RPD) is proposed. The discrimination is based on the extraction of suitable features from the time-frequency representation of the EEG signals through continuous wavelet transform (CWT). An average measure of complexity of the EEG signal obtained by permutation entropy (PE) is also included. The dimensionality of the feature space is reduced through a multilayer processing system based on the recently emerged deep learning (DL) concept. The DL processor includes a stacked auto-encoder, trained by unsupervised learning techniques, and a classifier whose parameters are determined in a supervised way by associating the known category labels to the reduced vector of high-level features generated by the previous processing blocks. The supervised learning step is carried out by using either support vector machines (SVM) or multilayer neural networks (MLP-NN). A subset of EEG from patients suffering from Alzheimer’s Disease (AD) and healthy controls (HC) is considered for differentiating CJD patients. When fine-tuning the parameters of the global processing system by a supervised learning procedure, the proposed system is able to achieve an average accuracy of 89%, an average sensitivity of 92%, and an average specificity of 89% in differentiating CJD from RPD. Similar results are obtained for CJD versus AD and CJD versus HC.


2020 ◽  
Vol 309 ◽  
pp. 03037
Author(s):  
Dongqiu Xing ◽  
Rui Chen ◽  
Lihua Qi ◽  
Jing Zhao ◽  
Yi Wang

This study establishes a multi-source fault identification method based on a combined deep learning strategy to identify a multi-source fault effectively in the fault diagnosis of complex industrial systems. This framework is composed of feature extraction and classifier design. In the first state, the signal is transformed to the time-frequency domain and the time-frequency feature is learned using stacked denoising autoencoders. A learning method that consists of unsupervised pre-learning and supervised fine-tuning is used to train this deep model. In the second state, a model for an ensemble multiple support vector machine classifier is created to recognize fault information. Ten types of rolling bearing signals were adopted in a simulation experiment to validate the effectiveness of the proposed framework. The results demonstrate that the joint model helps to obtain higher recognition accuracy.


2020 ◽  
pp. 107754632094971 ◽  
Author(s):  
Shoucong Xiong ◽  
Shuai He ◽  
Jianping Xuan ◽  
Qi Xia ◽  
Tielin Shi

Modern machinery becomes more precious with the advance of science, and fault diagnosis is vital for avoiding economical losses or casualties. Among massive diagnosis methods, deep learning algorithms stand out to open an era of intelligent fault diagnosis. Deep residual networks are the state-of-the-art deep learning models which can continuously improve performance by deepening the network structures. However, in vibration-based fault diagnosis, the transient property instability of vibration signal usually calls for time–frequency analysis methods, and the characters of time–frequency matrices are distinct from standard images, which brings some natural limitations for the diagnosis performance of deep learning algorithms. To handle this issue, an enhanced deep residual network named the multilevel correlation stack-deep residual network is proposed in this article. Wavelet packet transform is used to preprocess the sensor signal, and then the proposed multilevel correlation stack-deep residual network uses kernels with different shapes to fully dig various kinds of useful information from any local regions of the processed input. Experiments on two rolling bearing datasets are carried out. Test results show that the multilevel correlation stack-deep residual network exhibits a more satisfactory classification performance than original deep residual networks and other similar methods, revealing significant potentials for realistic fault diagnosis applications.


2020 ◽  
Vol 10 (20) ◽  
pp. 7068
Author(s):  
Minh Tuan Pham ◽  
Jong-Myon Kim ◽  
Cheol Hong Kim

Recent convolutional neural network (CNN) models in image processing can be used as feature-extraction methods to achieve high accuracy as well as automatic processing in bearing fault diagnosis. The combination of deep learning methods with appropriate signal representation techniques has proven its efficiency compared with traditional algorithms. Vital electrical machines require a strict monitoring system, and the accuracy of these machines’ monitoring systems takes precedence over any other factors. In this paper, we propose a new method for diagnosing bearing faults under variable shaft speeds using acoustic emission (AE) signals. Our proposed method predicts not only bearing fault types but also the degradation level of bearings. In the proposed technique, AE signals acquired from bearings are represented by spectrograms to obtain as much information as possible in the time–frequency domain. Feature extraction and classification processes are performed by deep learning using EfficientNet and a stochastic line-search optimizer. According to our various experiments, the proposed method can provide high accuracy and robustness under noisy environments compared with existing AE-based bearing fault diagnosis methods.


2020 ◽  
Vol 14 ◽  
Author(s):  
Yaqing Zhang ◽  
Jinling Chen ◽  
Jen Hong Tan ◽  
Yuxuan Chen ◽  
Yunyi Chen ◽  
...  

Emotion is the human brain reacting to objective things. In real life, human emotions are complex and changeable, so research into emotion recognition is of great significance in real life applications. Recently, many deep learning and machine learning methods have been widely applied in emotion recognition based on EEG signals. However, the traditional machine learning method has a major disadvantage in that the feature extraction process is usually cumbersome, which relies heavily on human experts. Then, end-to-end deep learning methods emerged as an effective method to address this disadvantage with the help of raw signal features and time-frequency spectrums. Here, we investigated the application of several deep learning models to the research field of EEG-based emotion recognition, including deep neural networks (DNN), convolutional neural networks (CNN), long short-term memory (LSTM), and a hybrid model of CNN and LSTM (CNN-LSTM). The experiments were carried on the well-known DEAP dataset. Experimental results show that the CNN and CNN-LSTM models had high classification performance in EEG-based emotion recognition, and their accurate extraction rate of RAW data reached 90.12 and 94.17%, respectively. The performance of the DNN model was not as accurate as other models, but the training speed was fast. The LSTM model was not as stable as the CNN and CNN-LSTM models. Moreover, with the same number of parameters, the training speed of the LSTM was much slower and it was difficult to achieve convergence. Additional parameter comparison experiments with other models, including epoch, learning rate, and dropout probability, were also conducted in the paper. Comparison results prove that the DNN model converged to optimal with fewer epochs and a higher learning rate. In contrast, the CNN model needed more epochs to learn. As for dropout probability, reducing the parameters by ~50% each time was appropriate.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 172692-172706
Author(s):  
Peng Cheng ◽  
Zhencheng Chen ◽  
Quanzhong Li ◽  
Qiong Gong ◽  
Jianming Zhu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document