A neural network incorporating adaptive Gabor filters for image texture classification

Author(s):  
K. Kameyama ◽  
K. Mori ◽  
Y. Kosugi
2020 ◽  
Vol 2020 (10) ◽  
pp. 310-1-310-7
Author(s):  
Khalid Omer ◽  
Luca Caucci ◽  
Meredith Kupinski

This work reports on convolutional neural network (CNN) performance on an image texture classification task as a function of linear image processing and number of training images. Detection performance of single and multi-layer CNNs (sCNN/mCNN) are compared to optimal observers. Performance is quantified by the area under the receiver operating characteristic (ROC) curve, also known as the AUC. For perfect detection AUC = 1.0 and AUC = 0.5 for guessing. The Ideal Observer (IO) maximizes AUC but is prohibitive in practice because it depends on high-dimensional image likelihoods. The IO performance is invariant to any fullrank, invertible linear image processing. This work demonstrates the existence of full-rank, invertible linear transforms that can degrade both sCNN and mCNN even in the limit of large quantities of training data. A subsequent invertible linear transform changes the images’ correlation structure again and can improve this AUC. Stationary textures sampled from zero mean and unequal covariance Gaussian distributions allow closed-form analytic expressions for the IO and optimal linear compression. Linear compression is a mitigation technique for high-dimension low sample size (HDLSS) applications. By definition, compression strictly decreases or maintains IO detection performance. For small quantities of training data, linear image compression prior to the sCNN architecture can increase AUC from 0.56 to 0.93. Results indicate an optimal compression ratio for CNN based on task difficulty, compression method, and number of training images.


Author(s):  
Miroslav Benco ◽  
Patrik Kamencay ◽  
Robert Hudec ◽  
Martina Radilova ◽  
Peter Sykora

Author(s):  
Pankaj H. Chandankhede

Texture can be considered as a repeating pattern of local variation of pixel intensities. Cosine Transform (DCT) coefficients of texture images. As DCT works on gray level images, the color scheme of each image is transformed into gray levels. For classifying the images using DCT, two popular soft computing techniques namely neurocomputing and neuro-fuzzy computing are used. A feedforward neural network is used to train the backpropagation learning algorithm and an evolving fuzzy neural network to classify the textures. The soft computing models were trained using 80% of the texture data and the remaining was used for testing and validation purposes. A performance comparison was made among the soft computing models for the texture classification problem. In texture classification the goal is to assign an unknown sample image to a set of known texture classes. It is observed that the proposed neuro-fuzzy model performed better than the neural network.


Sign in / Sign up

Export Citation Format

Share Document