A parametric texture model based on deep convolutional features closely matches texture appearance for humans

Our visual environment is full of texture—“stuff” like cloth, bark or gravel as distinct from “things” like dresses, trees or paths—and humans are adept at perceiving subtle variations in material properties. To investigate image features important for texture perception, we psychophysically compare a recent parameteric model of texture appearance (CNN model) that uses the features encoded by a deep convolutional neural network (VGG-19) with two other models: the venerable Portilla and Simoncelli model (PS) and an extension of the CNN model in which the power spectrum is additionally matched. Observers discriminated model-generated textures from original natural textures in a spatial three-alternative oddity paradigm under two viewing conditions: when test patches were briefly presented to the near-periphery (“parafoveal”) and when observers were able to make eye movements to all three patches (“inspection”). Under parafoveal viewing, observers were unable to discriminate 10 of 12 original images from CNN model images, and remarkably, the simpler PS model performed slightly better than the CNN model (11 textures). Under foveal inspection, matching CNN features captured appearance substantially better than the PS model (9 compared to 4 textures), and including the power spectrum improved appearance matching for two of the three remaining textures. None of the models we test here could produce indiscriminable images for one of the 12 textures under the inspection condition. While deep CNN (VGG-19) features can often be used to synthesise textures that humans cannot discriminate from natural textures, there is currently no uniformly best model for all textures and viewing conditions.

Download Full-text

A multi-objective opposition-based barnacles mating optimization for image super resolution using hyper-Spectral images

Journal of Engineering Design and Technology ◽

10.1108/jedt-01-2021-0030 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Valli Bhasha A. ◽

Venkatramana Reddy B.D.

Keyword(s):

Neural Network ◽

Wavelet Transform ◽

Convolutional Neural Network ◽

Super Resolution ◽

Discrete Wavelet ◽

Content Type ◽

Resolution Model ◽

Deep Cnn ◽

Image Super Resolution ◽

Better Than

Purpose The problems of Super resolution are broadly discussed in diverse fields. Rather than the progression toward the super resolution models for real-time images, operating hyperspectral images still remains a challenging problem. Design/methodology/approach This paper aims to develop the enhanced image super-resolution model using “optimized Non-negative Structured Sparse Representation (NSSR), Adaptive Discrete Wavelet Transform (ADWT), and Optimized Deep Convolutional Neural Network”. Once after converting the HR images into LR images, the NSSR images are generated by the optimized NSSR. Then the ADWT is used for generating the subbands of both NSSR and HRSB images. The residual image with this information is obtained by the optimized Deep CNN. All the improvements on the algorithms are done by the Opposition-based Barnacles Mating Optimization (O-BMO), with the objective of attaining the multi-objective function concerning the “Peak Signal-to-Noise Ratio (PSNR), and Structural similarity (SSIM) index”. Extensive analysis on benchmark hyperspectral image datasets shows that the proposed model achieves superior performance over typical other existing super-resolution models. Findings From the analysis, the overall analysis of the suggested and the conventional super resolution models relies that the PSNR of the improved O-BMO-(NSSR+DWT+CNN) was 38.8% better than bicubic, 11% better than NSSR, 16.7% better than DWT+CNN, 1.3% better than NSSR+DWT+CNN, and 0.5% better than NSSR+FF-SHO-(DWT+CNN). Hence, it has been confirmed that the developed O-BMO-(NSSR+DWT+CNN) is performing well in converting LR images to HR images. Originality/value This paper adopts a latest optimization algorithm called O-BMO with optimized Non-negative Structured Sparse Representation (NSSR), Adaptive Discrete Wavelet Transform (ADWT) and Optimized Deep Convolutional Neural Network for developing the enhanced image super-resolution model. This is the first work that uses O-BMO-based Deep CNN for image super-resolution model enhancement.

Download Full-text

A Neural Network for Image Anomaly Detection with Deep Pyramidal Representations and Dynamic Routing

International Journal of Neural Systems ◽

10.1142/s0129065720500604 ◽

2020 ◽

Vol 30 (10) ◽

pp. 2050060

Author(s):

Pankaj Mishra ◽

Claudio Piciarelli ◽

Gian Luca Foresti

Keyword(s):

Neural Network ◽

Anomaly Detection ◽

State Of The Art ◽

Structural Similarity ◽

Input Image ◽

Image Features ◽

Anomalous Data ◽

Scale Levels ◽

Anomaly Detector ◽

Better Than

Image anomaly detection is an application-driven problem where the aim is to identify novel samples, which differ significantly from the normal ones. We here propose Pyramidal Image Anomaly DEtector (PIADE), a deep reconstruction-based pyramidal approach, in which image features are extracted at different scale levels to better catch the peculiarities that could help to discriminate between normal and anomalous data. The features are dynamically routed to a reconstruction layer and anomalies can be identified by comparing the input image with its reconstruction. Unlike similar approaches, the comparison is done by using structural similarity and perceptual loss rather than trivial pixel-by-pixel comparison. The proposed method performed at par or better than the state-of-the-art methods when tested on publicly available datasets such as CIFAR10, COIL-100 and MVTec.

Download Full-text

Using a Reinforcement Q-Learning-Based Deep Neural Network for Playing Video Games

Electronics ◽

10.3390/electronics8101128 ◽

2019 ◽

Vol 8 (10) ◽

pp. 1128 ◽

Cited By ~ 1

Author(s):

Lin ◽

Jhang ◽

Lee ◽

Lin ◽

Young

Keyword(s):

Neural Network ◽

Video Games ◽

Video Game ◽

Deep Neural Network ◽

Principal Component ◽

Image Features ◽

Convolution Kernel ◽

Training Time ◽

Q Learning ◽

Better Than

This study proposed a reinforcement Q-learning-based deep neural network (RQDNN) that combined a deep principal component analysis network (DPCANet) and Q-learning to determine a playing strategy for video games. Video game images were used as the inputs. The proposed DPCANet was used to initialize the parameters of the convolution kernel and capture the image features automatically. It performs as a deep neural network and requires less computational complexity than traditional convolution neural networks. A reinforcement Q-learning method was used to implement a strategy for playing the video game. Both Flappy Bird and Atari Breakout games were implemented to verify the proposed method in this study. Experimental results showed that the scores of our proposed RQDNN were better than those of human players and other methods. In addition, the training time of the proposed RQDNN was also far less than other methods.

Download Full-text

A Stochastic Max Pooling Strategy for Convolutional Neural Network Trained by Noisy Samples

International Journal of Computers Communications & Control ◽

10.15837/ijccc.2020.1.3712 ◽

2020 ◽

Vol 15 (1) ◽

Author(s):

Shuai Sun ◽

Bin Hu ◽

Zhou Yu ◽

Xiaona Song

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Features ◽

Noisy Image ◽

Training Strategy ◽

Max Pooling ◽

Network Training ◽

Pooling Strategy ◽

Remarkable Progress ◽

Better Than

The deep convolutional neural network (CNN) has made remarkable progress in image classification. However, this network performs poorly and even cannot converge in many actual applications, where the training and test samples contain lots of noises. To solve the problems, this paper puts forward a network training strategy based on stochastic max pooling. Unlike the traditional max pooling, the proposed strategy first ranks all the values in each receptive field, and then selects a random value from the top-n values as the pooling result. Compared with common pooling methods, stochastic max pooling can limit the pooling selection to a larger value that represents the main information of the pooling area which reduces the chance of introducing noises into the network, and enhances the robustness of extracting noisy image features. Experimental results show that the CNN used stochastic max pooling Strategy can converge better than traditional CNN and classified noisy images much more accurately than traditional pooling methods.

Download Full-text

Digitalization system of ancient architecture decoration art based on neural network and image features

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189251 ◽

2020 ◽

pp. 1-12

Author(s):

Wu Xin ◽

Qiu Daping

Keyword(s):

Neural Network ◽

Construction Industry ◽

Three Dimensional ◽

Performance Testing ◽

Image Features ◽

Three Dimensional Model ◽

Performance Effect ◽

Data Process ◽

And Performance ◽

Construction Mode

The inheritance and innovation of ancient architecture decoration art is an important way for the development of the construction industry. The data process of traditional ancient architecture decoration art is relatively backward, which leads to the obvious distortion of the digitalization of ancient architecture decoration art. In order to improve the digital effect of ancient architecture decoration art, based on neural network, this paper combines the image features to construct a neural network-based ancient architecture decoration art data system model, and graphically expresses the static construction mode and dynamic construction process of the architecture group. Based on this, three-dimensional model reconstruction and scene simulation experiments of architecture groups are realized. In order to verify the performance effect of the system proposed in this paper, it is verified through simulation and performance testing, and data visualization is performed through statistical methods. The result of the study shows that the digitalization effect of the ancient architecture decoration art proposed in this paper is good.

Download Full-text

CirBiTree: Citrullination Site Inference Based on a Fuzzy Neural Network and Flexible Neural Tree

Scientific Programming ◽

10.1155/2020/8847694 ◽

2020 ◽

Vol 2020 ◽

pp. 1-8

Author(s):

Chuandong Song ◽

Haifeng Wang

Keyword(s):

Neural Network ◽

Fuzzy Neural Network ◽

Classification Model ◽

Peptide Sequence ◽

Sequence Information ◽

Post Translational Modification ◽

Fuzzy Neural ◽

Human Complex ◽

Better Than

Emerging evidence demonstrates that post-translational modification plays an important role in several human complex diseases. Nevertheless, considering the inherent high cost and time consumption of classical and typical in vitro experiments, an increasing attention has been paid to the development of efficient and available computational tools to identify the potential modification sites in the level of protein. In this work, we propose a machine learning-based model called CirBiTree for identification the potential citrullination sites. More specifically, we initially utilize the biprofile Bayesian to extract peptide sequence information. Then, a flexible neural tree and fuzzy neural network are employed as the classification model. Finally, the most available length of identified peptides has been selected in this model. To evaluate the performance of the proposed methods, some state-of-the-art methods have been employed for comparison. The experimental results demonstrate that the proposed method is better than other methods. CirBiTree can achieve 83.07% in sn%, 80.50% in sp, 0.8201 in F1, and 0.6359 in MCC, respectively.

Download Full-text

Classification of papillary thyroid carcinoma histological images based on deep learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-210100 ◽

2021 ◽

pp. 1-11

Author(s):

Yaning Liu ◽

Lin Han ◽

Hexiang Wang ◽

Bo Yin

Keyword(s):

Neural Network ◽

Differential Diagnosis ◽

Deep Learning ◽

Papillary Thyroid Carcinoma ◽

Thyroid Carcinoma ◽

Image Features ◽

Papillary Thyroid ◽

Histological Image ◽

Histological Images

Papillary thyroid carcinoma (PTC) is a common carcinoma in thyroid. As many benign thyroid nodules have the papillary structure which could easily be confused with PTC in morphology. Thus, pathologists have to take a lot of time on differential diagnosis of PTC besides personal diagnostic experience and there is no doubt that it is subjective and difficult to obtain consistency among observers. To address this issue, we applied deep learning to the differential diagnosis of PTC and proposed a histological image classification method for PTC based on the Inception Residual convolutional neural network (IRCNN) and support vector machine (SVM). First, in order to expand the dataset and solve the problem of histological image color inconsistency, a pre-processing module was constructed that included color transfer and mirror transform. Then, to alleviate overfitting of the deep learning model, we optimized the convolution neural network by combining Inception Network and Residual Network to extract image features. Finally, the SVM was trained via image features extracted by IRCNN to perform the classification task. Experimental results show effectiveness of the proposed method in the classification of PTC histological images.

Download Full-text

Image Restoration by Learning Morphological Opening-Closing Network

Mathematical Morphology - Theory and Applications ◽

10.1515/mathm-2020-0103 ◽

2020 ◽

Vol 4 (1) ◽

pp. 87-107

Author(s):

Ranjan Mondal ◽

Moni Shankar Dey ◽

Bhabatosh Chanda

Keyword(s):

Neural Network ◽

Image Restoration ◽

State Of The Art ◽

Source Code ◽

Back Propagation ◽

Image Features ◽

Main Difficulty ◽

The Right ◽

Right Order ◽

Morphological Opening

AbstractMathematical morphology is a powerful tool for image processing tasks. The main difficulty in designing mathematical morphological algorithm is deciding the order of operators/filters and the corresponding structuring elements (SEs). In this work, we develop morphological network composed of alternate sequences of dilation and erosion layers, which depending on learned SEs, may form opening or closing layers. These layers in the right order along with linear combination (of their outputs) are useful in extracting image features and processing them. Structuring elements in the network are learned by back-propagation method guided by minimization of the loss function. Efficacy of the proposed network is established by applying it to two interesting image restoration problems, namely de-raining and de-hazing. Results are comparable to that of many state-of-the-art algorithms for most of the images. It is also worth mentioning that the number of network parameters to handle is much less than that of popular convolutional neural network for similar tasks. The source code can be found here https://github.com/ranjanZ/Mophological-Opening-Closing-Net

Download Full-text

Sparse-FCM and Deep Convolutional Neural Network for the segmentation and classification of acute lymphoblastic leukaemia

Biomedical Engineering / Biomedizinische Technik ◽

10.1515/bmt-2018-0213 ◽

2020 ◽

Vol 65 (6) ◽

pp. 759-773

Author(s):

Segu Praveena ◽

Sohan Pal Singh

Keyword(s):

Neural Network ◽

Acute Lymphoblastic Leukaemia ◽

Convolutional Neural Network ◽

Optimization Algorithm ◽

Lymphoblastic Leukaemia ◽

Input Image ◽

Deep Convolutional Neural Network ◽

Grey Wolf ◽

Deep Cnn

AbstractLeukaemia detection and diagnosis in advance is the trending topic in the medical applications for reducing the death toll of patients with acute lymphoblastic leukaemia (ALL). For the detection of ALL, it is essential to analyse the white blood cells (WBCs) for which the blood smear images are employed. This paper proposes a new technique for the segmentation and classification of the acute lymphoblastic leukaemia. The proposed method of automatic leukaemia detection is based on the Deep Convolutional Neural Network (Deep CNN) that is trained using an optimization algorithm, named Grey wolf-based Jaya Optimization Algorithm (GreyJOA), which is developed using the Grey Wolf Optimizer (GWO) and Jaya Optimization Algorithm (JOA) that improves the global convergence. Initially, the input image is applied to pre-processing and the segmentation is performed using the Sparse Fuzzy C-Means (Sparse FCM) clustering algorithm. Then, the features, such as Local Directional Patterns (LDP) and colour histogram-based features, are extracted from the segments of the pre-processed input image. Finally, the extracted features are applied to the Deep CNN for the classification. The experimentation evaluation of the method using the images of the ALL IDB2 database reveals that the proposed method acquired a maximal accuracy, sensitivity, and specificity of 0.9350, 0.9528, and 0.9389, respectively.

Download Full-text

Convolutional Neural Network for Crowd Counting on Metro Platforms

Symmetry ◽

10.3390/sym13040703 ◽

2021 ◽

Vol 13 (4) ◽

pp. 703

Author(s):

Jun Zhang ◽

Jiaze Liu ◽

Zhizhong Wang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Estimation Error ◽

Image Features ◽

Urban Rail Transit ◽

Crowd Counting ◽

Passenger Flow ◽

Urban Rail ◽

Density Map ◽

Flow Detection

Owing to the increased use of urban rail transit, the flow of passengers on metro platforms tends to increase sharply during peak periods. Monitoring passenger flow in such areas is important for security-related reasons. In this paper, in order to solve the problem of metro platform passenger flow detection, we propose a CNN (convolutional neural network)-based network called the MP (metro platform)-CNN to accurately count people on metro platforms. The proposed method is composed of three major components: a group of convolutional neural networks is used on the front end to extract image features, a multiscale feature extraction module is used to enhance multiscale features, and transposed convolution is used for upsampling to generate a high-quality density map. Currently, existing crowd-counting datasets do not adequately cover all of the challenging situations considered in this study. Therefore, we collected images from surveillance videos of a metro platform to form a dataset containing 627 images, with 9243 annotated heads. The results of the extensive experiments showed that our method performed well on the self-built dataset and the estimation error was minimum. Moreover, the proposed method could compete with other methods on four standard crowd-counting datasets.

Download Full-text