Automatic Lip-Reading System Based on Deep Convolutional Neural Network and Attention-Based Long Short-Term Memory

As computer performance has improved, virtual reality (VR), a new mode of visual operation and interaction, has opened broad prospects for automatic lip-reading technology based on visual features. In an immersive VR environment, the user’s state can be captured through lip movements and used to analyze the user’s real-time thinking. Traditional lip-reading recognition systems struggle to meet the requirements of practical applications because they rely on complex image processing, hard-to-train classifiers, and long recognition processes. In this paper, a convolutional neural network (CNN) used for image feature extraction is combined with an attention-based recurrent neural network (RNN) for automatic lip-reading recognition. The proposed method has three steps. First, we extract keyframes from our own independent database (English pronunciation of the numbers zero to nine by three males and three females). Second, we use the Visual Geometry Group (VGG) network to extract lip image features, and we find the extracted features to be fault-tolerant and effective. Finally, we compare two lip-reading models: (1) a fusion model with an attention mechanism and (2) a fusion model of two networks. On the test dataset, the proposed model reaches an accuracy of 88.2%, compared with 84.9% for the contrastive model. Therefore, our proposed method is superior to the traditional lip-reading recognition methods and the general neural networks.
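The attention step described above can be illustrated with a minimal sketch. It assumes generic dot-product soft attention over per-keyframe recurrent states; the abstract does not specify the scoring function, so the `query` vector, the function names, and the toy values are all hypothetical and not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, query):
    """Weight each per-frame state by its dot-product score against `query`.

    hidden_states: list of T vectors (e.g. one recurrent output per keyframe)
    query: one vector of the same length (hypothetical learned parameter)
    Returns (context_vector, attention_weights).
    """
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(query)
    context = [
        sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)
    ]
    return context, weights


# Three toy "frame" states; the second one aligns best with the query.
states = [[1.0, 0.0], [0.0, 2.0], [0.5, 0.5]]
context, weights = attention_pool(states, query=[0.0, 1.0])
```

The weights sum to one, so the context vector is a convex combination of the frame states; frames that score higher against the query contribute more to the vector passed on to the classifier.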


Hyperspectral image feature extraction method based on sparse constraint convolutional neural network

10.1117/12.2268499 ◽

2017 ◽

Author(s):

Peiyuan Jia ◽

Miao Zhang ◽

Wenbo Yu ◽

Yi Shen

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Extraction Method ◽

Hyperspectral Image ◽

Image Feature ◽

Sparse Constraint ◽

Image Feature Extraction ◽

Feature Extraction Method


A Convolutional Neural Network Based on Grouping Structure for Scene Classification

Remote Sensing ◽

10.3390/rs13132457 ◽

2021 ◽

Vol 13 (13) ◽

pp. 2457

Author(s):

Xuan Wu ◽

Zhijie Zhang ◽

Wanchang Zhang ◽

Yaning Yi ◽

Chuanrong Zhang ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Data Augmentation ◽

Remote Sensing Image ◽

Difficult Problem ◽

Image Features ◽

Attention Mechanism ◽

Scene Classification

Convolutional neural networks (CNNs) can automatically extract image features and are widely used in remote sensing image classification. Feature extraction remains an important and difficult problem in current research. In this paper, data augmentation was used to avoid overfitting and to enrich the features of samples, improving the performance of a newly proposed convolutional neural network on the UC-Merced and RSI-CB datasets for remotely sensed scene classification. A multiple grouped convolutional neural network (MGCNN) for self-learning, capable of improving the efficiency of CNNs, was proposed, and the method of grouping multiple convolutional layers was developed so that it can be applied elsewhere as a plug-in model. A hyper-parameter C is introduced in MGCNN to probe the influence of different grouping strategies on feature extraction. Experiments on the two selected datasets, RSI-CB and UC-Merced, were carried out to verify the effectiveness of the proposed network; the accuracy obtained by MGCNN was 2% higher than that of ResNet-50. An attention mechanism was then incorporated into the grouping process, yielding a multiple grouped attention convolutional neural network (MGCNN-A) intended to enhance the generalization capability of MGCNN. Additional experiments indicate that incorporating the attention mechanism into MGCNN only slightly improved the accuracy of scene classification but considerably enhanced the robustness of the network in remote sensing image classification.
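As a rough illustration of why grouping convolutional layers can improve efficiency, the sketch below counts the weights of a standard grouped convolution. This is the generic grouped-convolution arithmetic, not the MGCNN architecture itself; the function name and channel sizes are hypothetical, and the abstract's hyper-parameter C is not modeled.

```python
def conv_params(c_in, c_out, kernel, groups=1):
    """Weight count of a 2-D convolution layer (bias ignored).

    With groups > 1, the input and output channels are split into `groups`
    independent slices, and each output slice only sees its own input slice.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    return (c_in // groups) * c_out * kernel * kernel


standard = conv_params(64, 128, 3)           # 64 * 128 * 9 weights
grouped = conv_params(64, 128, 3, groups=4)  # 16 * 128 * 9 weights
```

With four groups, each output channel connects to a quarter of the input channels, so the layer holds a quarter of the weights of its ungrouped counterpart.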


Learning Methods of Convolutional Neural Network Combined With Image Feature Extraction in Brain Tumor Detection

IEEE Access ◽

10.1109/access.2020.3016282 ◽

2020 ◽

Vol 8 ◽

pp. 152659-152668

Author(s):

Weiguang Wang ◽

Fanlong Bu ◽

Ziyi Lin ◽

Shuangqing Zhai

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Brain Tumor ◽

Convolutional Neural Network ◽

Tumor Detection ◽

Image Feature ◽

Learning Methods ◽

Image Feature Extraction


Gender Recognition from Human-Body Images Using Visible-Light and Thermal Camera Videos Based on a Convolutional Neural Network for Image Feature Extraction

Sensors ◽

10.3390/s17030637 ◽

2017 ◽

Vol 17 (3) ◽

pp. 637 ◽

Cited By ~ 16

Author(s):

Dat Nguyen ◽

Ki Kim ◽

Hyung Hong ◽

Ja Koo ◽

Min Kim ◽

...

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Visible Light ◽

Convolutional Neural Network ◽

Human Body ◽

Image Feature ◽

Gender Recognition ◽

Thermal Camera ◽

Body Images ◽

Image Feature Extraction


Dilated convolutional neural network for hyperspectral image feature extraction and classification

Eleventh International Conference on Graphics and Image Processing (ICGIP 2019) ◽

10.1117/12.2558057 ◽

2020 ◽

Author(s):

Fengzhe Zhang ◽

Lu Xiao ◽

Haibin Wang ◽

Huayu Gao ◽

Junxiang Wang ◽

...

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Hyperspectral Image ◽

Image Feature ◽

Image Feature Extraction


Convolutional Neural Network for Image Feature Extraction Based on Concurrent Nested Inception Modules

2019 15th International Conference on Computational Intelligence and Security (CIS) ◽

10.1109/cis.2019.00028 ◽

2019 ◽

Author(s):

Zhengyan Wang ◽

Junfeng Chen ◽

Xiaolin Wang

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Image Feature ◽

Image Feature Extraction


Research on image feature extraction and retrieval algorithms based on convolutional neural network

Journal of Visual Communication and Image Representation ◽

10.1016/j.jvcir.2019.102705 ◽

2020 ◽

Vol 69 ◽

pp. 102705 ◽

Cited By ~ 1

Author(s):

Xushan Peng ◽

Xiaoming Zhang ◽

Yongping Li ◽

Bangquan Liu

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Image Feature ◽

Image Feature Extraction ◽

Retrieval Algorithms


An Advanced Relevance Feedback Method to Improve Performance of CBIR using Convolutional Neural Network and Comprehensive Values

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.b2741.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 5427-5438

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Image Retrieval ◽

Convolutional Neural Network ◽

Large Scale ◽

Activation Function ◽

Image Feature ◽

Similarity Measurement ◽

Query Image ◽

Image Production

Content-Based Image Retrieval (CBIR) is an extensively used technique for retrieving images from large image databases. However, users are not satisfied with conventional image retrieval techniques. In addition, with the advent of web development and transmission networks, the number of images available to users continues to increase, and digital images are produced continuously and in considerable volume in many areas. Quick access to images similar to a given query image in this extensive collection poses great challenges and requires proficient techniques. From query by image to retrieval of relevant images, CBIR has key phases such as feature extraction, similarity measurement, and retrieval of relevant images, of which feature extraction is one of the most important. Recently, Convolutional Neural Networks (CNNs) have shown good results in computer vision owing to their ability to extract features from images. AlexNet is a classical deep CNN for image feature extraction. We have modified the AlexNet architecture with a few changes and proposed a novel framework to improve its ability for feature extraction and similarity measurement. The proposed approach optimizes the pooling layers of AlexNet. In particular, average pooling is replaced by max-avg pooling, and the non-linear activation function Maxout is used after every convolution layer for better feature extraction. This paper introduces a CNN for feature extraction from images in a CBIR system and also uses Euclidean distance along with the Comprehensive Values for better results. The proposed framework goes beyond image retrieval, including the large-scale database. The performance of the proposed work is evaluated using precision. The proposed work shows better results than existing works.
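The similarity-measurement and evaluation phases can be sketched as follows, assuming feature vectors have already been extracted. The sketch ranks database items by Euclidean distance and scores the result with precision; the toy vectors, labels, and function names are hypothetical, and the paper's Comprehensive Values are not modeled.

```python
import math


def retrieve(query_vec, database, k):
    """Return labels of the k database items closest to query_vec (Euclidean)."""
    ranked = sorted(database, key=lambda item: math.dist(query_vec, item[0]))
    return [label for _, label in ranked[:k]]


def precision_at_k(retrieved_labels, relevant_label):
    """Fraction of retrieved items whose label matches the query's label."""
    hits = sum(1 for label in retrieved_labels if label == relevant_label)
    return hits / len(retrieved_labels)


# Toy feature vectors standing in for CNN features of database images.
database = [
    ([0.0, 0.0], "cat"),
    ([0.1, 0.2], "cat"),
    ([5.0, 5.0], "dog"),
    ([0.3, 0.1], "dog"),
]
top = retrieve([0.0, 0.1], database, k=3)
p = precision_at_k(top, "cat")
```

Here two of the three nearest items share the query's class, so precision at 3 is two thirds. A full sort is used for clarity; a large-scale database would typically need an index or partial sort instead.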


Multi-Regional Online Car-Hailing Order Quantity Forecasting Based on the Convolutional Neural Network

Information ◽

10.3390/info10060193 ◽

2019 ◽

Vol 10 (6) ◽

pp. 193 ◽

Cited By ~ 1

Author(s):

Zihao Huang ◽

Gang Huang ◽

Zhijun Chen ◽

Chaozhong Wu ◽

Xiaofeng Ma ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Travel Demand ◽

Short Term Memory ◽

Demand Forecasting ◽

Image Feature ◽

Support Vector ◽

Data Set ◽

Demand Distribution ◽

Demand Forecasting Model

With the development of online car-hailing, demand for travel prediction is increasing as a way to reduce the information asymmetry between passengers and drivers. This paper proposes a travel demand forecasting model named OC-CNN, based on the convolutional neural network. To make full use of the spatial characteristics of the travel demand distribution, this paper meshes the prediction area and creates a travel demand data set with a graphical structure that preserves its spatial properties. Taking advantage of the convolutional neural network in image feature extraction, the historical demand data of the first twenty-five minutes of the entire region are used as the model input to predict the travel demand for the next five minutes. To verify the performance of the proposed method, one month of online car-hailing data from the Chengdu Fourth Ring Road is used. The results show that the model successfully extracts the spatiotemporal features of the data, and the prediction accuracies of the proposed method are superior to those of the representative methods, including the Bayesian Ridge Model, Linear Regression, Support Vector Regression, and Long Short-Term Memory networks.
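The meshing and windowing described above can be sketched as below. The sketch assumes the twenty-five minutes of history are split into five 5-minute demand grids, which the abstract does not state; the bounding box, grid size, order tuples, and function names are likewise hypothetical, and all points are assumed to lie inside the bounding box.

```python
def to_cell(lon, lat, bounds, rows, cols):
    """Map a coordinate to a (row, col) cell; bounds = (lon0, lat0, lon1, lat1)."""
    lon0, lat0, lon1, lat1 = bounds
    col = min(int((lon - lon0) / (lon1 - lon0) * cols), cols - 1)
    row = min(int((lat - lat0) / (lat1 - lat0) * rows), rows - 1)
    return row, col


def demand_frames(orders, bounds, rows, cols, n_slots, slot_minutes=5):
    """Count orders per cell per time slot; orders = [(minute, lon, lat), ...]."""
    frames = [[[0] * cols for _ in range(rows)] for _ in range(n_slots)]
    for minute, lon, lat in orders:
        slot = int(minute // slot_minutes)
        if 0 <= slot < n_slots:
            r, c = to_cell(lon, lat, bounds, rows, cols)
            frames[slot][r][c] += 1
    return frames


def make_samples(frames, history=5):
    """Pair each run of `history` frames (25 min) with the frame that follows."""
    return [
        (frames[i:i + history], frames[i + history])
        for i in range(len(frames) - history)
    ]


bounds = (0.0, 0.0, 1.0, 1.0)  # hypothetical bounding box
orders = [(2, 0.1, 0.1), (7, 0.9, 0.9), (27, 0.1, 0.9)]
frames = demand_frames(orders, bounds, rows=2, cols=2, n_slots=7)
samples = make_samples(frames)
```

Each sample is a stack of five demand grids, which a CNN can treat like a five-channel image, paired with the grid for the following five minutes as the prediction target.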
