Speech Separation Using Convolutional Neural Network and Attention Mechanism

Speech information is the most important means of human communication, and it is crucial to separate the target voice from the mixed sound signals. This paper proposes a speech separation model based on convolutional neural networks and attention mechanism. The magnitude spectrum of the mixed speech signals, as the input, has its high dimensionality. By analyzing the characteristics of the convolutional neural network and attention mechanism, it can be found that the convolutional neural network can effectively extract low-dimensional features and mine the spatiotemporal structure information in the speech signals, and the attention mechanism can reduce the loss of sequence information. The accuracy of speech separation can be improved effectively by combining two mechanisms. Compared to the typical speech separation model DRNN-2 + discrim, this method achieves 0.27 dB GNSDR gain and 0.51 dB GSIR gain, which illustrates that the speech separation model proposed in this paper has achieved an ideal separation effect.

Download Full-text

CNNDLP: A Method Based on Convolutional Autoencoder and Convolutional Neural Network with Adjacent Edge Attention for Predicting lncRNA–Disease Associations

International Journal of Molecular Sciences ◽

10.3390/ijms20174260 ◽

2019 ◽

Vol 20 (17) ◽

pp. 4260 ◽

Cited By ~ 6

Author(s):

Ping Xuan ◽

Nan Sheng ◽

Tiangang Zhang ◽

Yong Liu ◽

Yahong Guo

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Attention Mechanism ◽

Bipartite Networks ◽

Adjacent Edge ◽

Dimensional Network ◽

Disease Associations ◽

Convolutional Autoencoder ◽

New Type ◽

Low Dimensional

It is well known that the unusual expression of long non-coding RNAs (lncRNAs) is closely related to the physiological and pathological processes of diseases. Therefore, inferring the potential lncRNA–disease associations are helpful for understanding the molecular pathogenesis of diseases. Most previous methods have concentrated on the construction of shallow learning models in order to predict lncRNA-disease associations, while they have failed to deeply integrate heterogeneous multi-source data and to learn the low-dimensional feature representations from these data. We propose a method based on the convolutional neural network with the attention mechanism and convolutional autoencoder for predicting candidate disease-related lncRNAs, and refer to it as CNNDLP. CNNDLP integrates multiple kinds of data from heterogeneous sources, including the associations, interactions, and similarities related to the lncRNAs, diseases, and miRNAs. Two different embedding layers are established by combining the diverse biological premises about the cases that the lncRNAs are likely to associate with the diseases. We construct a novel prediction model based on the convolutional neural network with attention mechanism and convolutional autoencoder to learn the attention and the low-dimensional network representations of the lncRNA–disease pairs from the embedding layers. The different adjacent edges among the lncRNA, miRNA, and disease nodes have different contributions for association prediction. Hence, an attention mechanism at the adjacent edge level is established, and the left side of the model learns the attention representation of a pair of lncRNA and disease. A new type of lncRNA similarity and a new type of disease similarity are calculated by incorporating the topological structures of multiple bipartite networks. The low-dimensional network representation of the lncRNA-disease pairs is further learned by the autoencoder based convolutional neutral network on the right side of the model. The cross-validation experimental results confirm that CNNDLP has superior prediction performance compared to the state-of-the-art methods. Case studies on stomach cancer, breast cancer, and prostate cancer further show the ability of CNNDLP for discovering the potential disease lncRNAs.

Download Full-text

A Multi-Scale Fusion Convolutional Neural Network based on Attention Mechanism for the Visualization Analysis of EEG Signals Decoding

IEEE Transactions on Neural Systems and Rehabilitation Engineering ◽

10.1109/tnsre.2020.3037326 ◽

2020 ◽

pp. 1-1

Author(s):

Donglin Li ◽

Jiacan Xu ◽

Jianhui Wang ◽

Xiaoke Fang ◽

Ji Ying

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Attention Mechanism ◽

Eeg Signals ◽

Multi Scale ◽

Visualization Analysis

Download Full-text

Recognition of Robot Based on Attention Mechanism and Convolutional Neural Network

2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) ◽

10.1109/itnec.2019.8728976 ◽

2019 ◽

Cited By ~ 1

Author(s):

Hexi Li ◽

Jihua Li

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Attention Mechanism

Download Full-text

Non-tumorous Facial Pigmentation Classification based on Multi-View Convolutional Neural Network with Attention Mechanism

Neurocomputing ◽

10.1016/j.neucom.2022.01.011 ◽

2022 ◽

Author(s):

Yingjie Tian ◽

Shiding Sun ◽

Zhiquan Qi ◽

Ying Liu ◽

Zeyuan Wang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Attention Mechanism

Download Full-text

Lw-TISNet: Light-Weight Convolutional Neural Network Incorporating Attention Mechanism and Multiple Supervision Strategy for Tongue Image Segmentation

Sensing and Imaging ◽

10.1007/s11220-021-00375-x ◽

2022 ◽

Vol 23 (1) ◽

Author(s):

Xiaodong Huang ◽

Li Zhuo ◽

Hui Zhang ◽

Xiaoguang Li ◽

Jing Zhang

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Attention Mechanism ◽

Light Weight

Download Full-text

Slimming Convolutional Neural Network Based on Attention Mechanism for Pavement Crack Detection

10.1109/cyber53097.2021.9588247 ◽

2021 ◽

Author(s):

Wenning Huang ◽

Guijie Zhu ◽

Zhun Fan ◽

Wenji Li ◽

Yibiao Rong ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Crack Detection ◽

Attention Mechanism ◽

Pavement Crack Detection

Download Full-text

Network Embedding via a Bi-Mode and Deep Neural Network Model

10.20944/preprints201712.0156.v1 ◽

2017 ◽

Author(s):

Yang Fang ◽

Xiang Zhao ◽

Zhen Tan

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Semantic Information ◽

Dimensional Space ◽

Relation Extraction ◽

Network Embedding ◽

Structure Information ◽

Second Mode ◽

Real World Datasets ◽

Low Dimensional

Network Embedding (NE) is an important method to learn the representations of network via a low-dimensional space. Conventional NE models focus on capturing the structure information and semantic information of vertices while neglecting such information for edges. In this work, we propose a novel NE model named BimoNet to capture both the structure and semantic information of edges. BimoNet is composed of two parts, i.e., the bi-mode embedding part and the deep neural network part. For bi-mode embedding part, the first mode named add-mode is used to express the entity-shared features of edges and the second mode named subtract-mode is employed to represent the entity-specific features of edges. These features actually reflect the semantic information. For deep neural network part, we firstly regard the edges in a network as nodes, and the vertices as links, which will not change the overall structure of the whole network. Then we take the nodes' adjacent matrix as the input of the deep neural network as it can obtain similar representations for nodes with similar structure. Afterwards, by jointly optimizing the objective function of these two parts, BimoNet could preserve both the semantic and structure information of edges. In experiments, we evaluate BimoNet on three real-world datasets and task of relation extraction, and BimoNet is demonstrated to outperform state-of-the-art baseline models consistently and significantly.

Download Full-text

Predicting gene regulatory regions with a convolutional neural network for processing double-strand genome sequence information

PLoS ONE ◽

10.1371/journal.pone.0235748 ◽

2020 ◽

Vol 15 (7) ◽

pp. e0235748

Author(s):

Koh Onimaru ◽

Osamu Nishimura ◽

Shigehiro Kuraku

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Genome Sequence ◽

Sequence Information ◽

Regulatory Regions ◽

Genome Sequence Information ◽

Gene Regulatory

Download Full-text

Attention-mechanism-based tracking method for intelligent Internet of vehicles

International Journal of Distributed Sensor Networks ◽

10.1177/1550147718805946 ◽

2018 ◽

Vol 14 (10) ◽

pp. 155014771880594 ◽

Cited By ~ 1

Author(s):

Xu Kang ◽

Bin Song ◽

Jie Guo ◽

Xiaojiang Du ◽

Mohsen Guizani

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Sensor ◽

Attention Mechanism ◽

Vehicle Tracking ◽

Background Region ◽

Internet Of Vehicles ◽

Tracking Method ◽

Siamese Network ◽

Selection Of

Vehicle tracking task plays an important role on the Internet of vehicles and intelligent transportation system. Beyond the traditional Global Positioning System sensor, the image sensor can capture different kinds of vehicles, analyze their driving situation, and can interact with them. Aiming at the problem that the traditional convolutional neural network is vulnerable to background interference, this article proposes vehicle tracking method based on human attention mechanism for self-selection of deep features with an inter-channel fully connected layer. It mainly includes the following contents: (1) a fully convolutional neural network fused attention mechanism with the selection of the deep features for convolution; (2) a separation method for template and semantic background region to separate target vehicles from the background in the initial frame adaptively; (3) a two-stage method for model training using our traffic dataset. The experimental results show that the proposed method improves the tracking accuracy without an increase in tracking time. Meanwhile, it strengthens the robustness of algorithm under the condition of the complex background region. The success rate of the proposed method in overall traffic datasets is higher than Siamese network by about 10%, and the overall precision is higher than Siamese network by 8%.

Download Full-text

Determining Number of Speakers from Single Microphone Speech Signals by Multi-Label Convolutional Neural Network

IECON 2018 - 44th Annual Conference of the IEEE Industrial Electronics Society ◽

10.1109/iecon.2018.8592773 ◽

2018 ◽

Cited By ~ 1

Author(s):

Haoran Wei ◽

Nasser Kehtarnavaz

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Speech Signals

Download Full-text