A novel ResNet101 model based on dense dilated convolution for image classification

2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Qi Zhang

Abstract
Image classification plays an important role in computer vision. Existing convolutional neural network methods suffer from several problems in image classification, such as low tumor-classification accuracy and weak feature expression and extraction. We therefore propose a novel ResNet101 model based on dense dilated convolution for medical liver tumor classification. A multi-scale feature extraction module extracts multi-scale image features and enlarges the network's receptive field. A depth feature extraction module reduces background noise and focuses on effective features of the focal region. To obtain broader and deeper semantic information, a dense dilated convolution module is deployed in the network. This module combines the advantages of Inception, the residual structure, and multi-scale dilated convolution to obtain deeper feature information without causing gradient explosion or vanishing gradients. To address the feature loss common in classification networks, the up-/down-sampling modules are improved, and multiple convolution kernels of different scales are cascaded to widen the network, which effectively avoids feature loss. Finally, experiments are carried out on the proposed method. Compared with existing mainstream classification networks, the proposed method improves classification performance and achieves accurate classification of liver tumors. Its effectiveness is further verified by ablation experiments.
Highlights
The multi-scale feature extraction module extracts multi-scale image features; it captures deep contextual information about the lesion region and surrounding tissue to enhance the network's feature extraction ability.
The depth feature extraction module focuses on local features of the lesion region in both the channel and spatial dimensions, weakens the influence of irrelevant information, and strengthens recognition of the lesion region. The feature extraction module is enhanced by the parallel structure of dense dilated convolution, obtaining deeper feature information without losing image feature information and thereby improving classification accuracy.
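The key property of dilated convolution used here is that it enlarges the receptive field without adding parameters. The minimal sketch below (not the paper's implementation; a plain NumPy illustration) shows the effective-kernel-size formula and a naive dilated 2D convolution:

```python
import numpy as np

def effective_kernel_size(k, d):
    """Effective span of a k x k kernel with dilation rate d: k + (k-1)(d-1)."""
    return k + (k - 1) * (d - 1)

def dilated_conv2d(image, kernel, d):
    """Naive 'valid' 2D convolution with dilation rate d (illustration only)."""
    k = kernel.shape[0]
    ke = effective_kernel_size(k, d)
    h, w = image.shape
    out = np.zeros((h - ke + 1, w - ke + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            # Sample the input with stride = dilation inside the effective window.
            patch = image[i:i + ke:d, j:j + ke:d]
            out[i, j] = np.sum(patch * kernel)
    return out
```

Cascading branches with rates such as 1, 2 and 4, as the dense dilated module does, covers progressively wider context while each branch keeps the same 3x3 parameter count.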

2021 ◽  
Vol 13 (22) ◽  
pp. 4621
Author(s):  
Dongxu Liu ◽  
Guangliang Han ◽  
Peixun Liu ◽  
Hang Yang ◽  
Xinglong Sun ◽  
...  

Various hyperspectral image (HSI) classification methods based on convolutional neural networks (CNN) have been proposed and achieve promising classification performance. However, hyperspectral image classification still faces several challenges, including abundant redundant information, insufficient spectral-spatial representation, and irregular class distribution. To address these issues, we propose a novel 2D-3D CNN with spectral-spatial multi-scale feature fusion for hyperspectral image classification, which consists of two feature extraction streams, a feature fusion module, and a classification scheme. First, we employ two distinct backbone modules for feature representation: a spectral and a spatial feature extraction stream. The former uses a hierarchical feature extraction module to capture multi-scale spectral features, while the latter extracts multi-stage spatial features through a multi-level fusion structure. With these network units, the class-discriminative information in the HSI can be fully exploited. Then, to output more complete and robust information for classification, a multi-scale spectral-spatial-semantic feature fusion module is presented based on a Decomposition-Reconstruction structure. Finally, we design a new classification scheme to raise the classification accuracy. Experimental results on three public datasets demonstrate that the proposed method outperforms state-of-the-art methods.
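The core idea of multi-scale feature fusion — pool a feature map at several scales and concatenate the resulting descriptors — can be sketched as follows. This is a generic NumPy illustration, not the paper's Decomposition-Reconstruction module; the scale set (1, 2, 4) is an assumption:

```python
import numpy as np

def avg_pool2d(x, s):
    """Average-pool a (H, W, C) map over non-overlapping s x s windows."""
    h, w, c = x.shape
    return x[:h // s * s, :w // s * s].reshape(h // s, s, w // s, s, c).mean(axis=(1, 3))

def multiscale_fuse(feat, scales=(1, 2, 4)):
    """Concatenate flattened per-scale descriptors into one fused vector."""
    parts = [avg_pool2d(feat, s).reshape(-1) for s in scales]
    return np.concatenate(parts)
```

The fused vector carries both fine-grained (scale 1) and coarse, context-level (scale 4) statistics, which a classifier head can then consume jointly.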


Electronics ◽  
2021 ◽  
Vol 10 (3) ◽  
pp. 319
Author(s):  
Yi Wang ◽  
Xiao Song ◽  
Guanghong Gong ◽  
Ni Li

With the rapid development of deep learning and artificial intelligence techniques, denoising via neural networks has drawn great attention for its flexibility and excellent performance. However, in most convolutional denoising methods the convolution kernel is only one layer deep and features at distinct scales are neglected. Moreover, the convolution operation treats all channels equally; the relationships between channels are not considered. In this paper, we propose a multi-scale feature extraction-based normalized attention neural network (MFENANN) for image denoising. In MFENANN, we define a multi-scale feature extraction block to extract and combine features at distinct scales of the noisy image. In addition, we propose a normalized attention network (NAN) to learn the relationships between channels, which smooths the optimization landscape and speeds up convergence when training an attention model. We then introduce the NAN into convolutional denoising, where each channel receives its own gain so that channels can play different roles in subsequent convolutions. To verify the effectiveness of the proposed MFENANN, we ran experiments on both grayscale and color image sets with noise levels ranging from 0 to 75. The results show that, compared with several state-of-the-art denoising methods, the images restored by MFENANN achieve higher peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) values and a better overall appearance.
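The "each channel receives its own gain" idea can be illustrated with a squeeze-and-excitation-style channel attention in NumPy. This is a hedged sketch, not the NAN itself (its exact normalization is not given in the abstract); the weight vector `w` is a hypothetical learned parameter, and the softmax stands in for the normalization step:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def channel_attention(feat, w):
    """Reweight channels of a (H, W, C) map by normalized attention gains.

    w is a hypothetical learned (C,) parameter; scores are softmax-normalized
    and rescaled so the gains average to 1 across channels."""
    desc = feat.mean(axis=(0, 1))      # squeeze: one global statistic per channel
    gain = softmax(desc * w) * len(w)  # normalized per-channel gains
    return feat * gain                 # excite: channel-wise rescaling
```

With zero weights every channel gets gain 1 and the map is unchanged; training pushes the gains apart so informative channels dominate.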


2021 ◽  
Vol 21 (S2) ◽  
Author(s):  
Daobin Huang ◽  
Minghui Wang ◽  
Ling Zhang ◽  
Haichun Li ◽  
Minquan Ye ◽  
...  

Abstract
Background
Accurately segmenting the tumor region in MRI images is important for brain tumor diagnosis and radiotherapy planning. At present, manual segmentation is widely adopted in clinical practice, and there is a strong need for an automatic and objective system to alleviate the workload of radiologists.
Methods
We propose a parallel multi-scale feature fusing architecture to generate rich feature representations for accurate brain tumor segmentation. It comprises two parts: (1) a Feature Extraction Network (FEN) for brain tumor feature extraction at different levels and (2) a Multi-scale Feature Fusing Network (MSFFN) to merge features of all scales in a parallel manner. In addition, we use two hybrid loss functions to optimize the proposed network against the class imbalance issue.
Results
We validate our method on BRATS 2015, achieving Dice scores of 0.86, 0.73 and 0.61 for the three tumor regions (complete, core and enhancing), with a model parameter size of only 6.3 MB. Without any post-processing, our method still outperforms published state-of-the-art methods on complete tumor regions and obtains competitive performance on the other two regions.
Conclusions
The proposed parallel structure effectively fuses multi-level features into rich representations for high-resolution results. Moreover, the hybrid loss functions alleviate the class imbalance issue and guide the training process. The proposed method can also be used in other medical segmentation tasks.
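Hybrid losses against class imbalance in segmentation commonly combine a soft Dice term with cross-entropy. The abstract does not specify the paper's exact formulation, so the sketch below shows a standard Dice + BCE combination as an assumption, in plain NumPy:

```python
import numpy as np

def dice_loss(pred, target, eps=1e-6):
    """Soft Dice loss for binary masks; pred holds probabilities in [0, 1]."""
    inter = np.sum(pred * target)
    return 1.0 - (2.0 * inter + eps) / (np.sum(pred) + np.sum(target) + eps)

def bce_loss(pred, target, eps=1e-6):
    """Binary cross-entropy, clipped for numerical stability."""
    p = np.clip(pred, eps, 1 - eps)
    return -np.mean(target * np.log(p) + (1 - target) * np.log(1 - p))

def hybrid_loss(pred, target, alpha=0.5):
    """Weighted sum of Dice and BCE (alpha is a hypothetical mixing weight)."""
    return alpha * dice_loss(pred, target) + (1 - alpha) * bce_loss(pred, target)
```

The Dice term measures region overlap and is insensitive to how small the foreground is, which is exactly what helps when tumor voxels are a tiny fraction of the volume; the BCE term keeps per-voxel gradients well behaved.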


Author(s):  
Xiaoqian Yuan ◽  
Chao Chen ◽  
Shan Tian ◽  
Jiandan Zhong

To improve the contrast of the difference image and reduce speckle-noise interference in synthetic aperture radar (SAR) images, this paper proposes a SAR image change detection algorithm based on multi-scale feature extraction. A weighted kernel matrix is first used to extract features from the two original images; the logarithmic ratio method then produces their difference image, from which the changed regions are extracted. Next, kernel matrices of different sizes extract abstract features of the difference image at different scales, giving the difference image higher contrast. Finally, a cumulative weighted average yields the final difference image, which further suppresses the speckle noise.
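The pipeline above — log-ratio difference image, multi-scale smoothing, cumulative average — can be sketched in NumPy. The exact kernel weights are not given in the abstract, so uniform mean kernels of sizes 3, 5 and 7 stand in as an assumption:

```python
import numpy as np

def log_ratio(img1, img2, eps=1e-6):
    """Log-ratio difference image, the standard operator for SAR change detection."""
    return np.abs(np.log((img1 + eps) / (img2 + eps)))

def fused_difference(img1, img2, sizes=(3, 5, 7)):
    """Average the difference image smoothed at several kernel scales."""
    d = log_ratio(img1, img2)
    acc = np.zeros_like(d)
    for s in sizes:
        pad = s // 2
        padded = np.pad(d, pad, mode='edge')
        sm = np.zeros_like(d)
        for i in range(d.shape[0]):
            for j in range(d.shape[1]):
                sm[i, j] = padded[i:i + s, j:j + s].mean()  # s x s mean kernel
        acc += sm
    return acc / len(sizes)
```

The log-ratio turns multiplicative speckle into additive noise, and averaging across window sizes keeps coarse change regions while damping isolated speckle responses.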


2021 ◽  
Vol 2078 (1) ◽  
pp. 012008
Author(s):  
Hui Liu ◽  
Keyang Cheng

Abstract
To address false and missed detections of small and occluded targets in pedestrian detection, a pedestrian detection algorithm based on improved multi-scale feature fusion is proposed. First, because the YOLOv4 multi-scale feature fusion module PANet does not consider the interaction between scales, PANet is improved to reduce the semantic gap between scales, and an attention mechanism is introduced to learn the importance of different layers and strengthen feature fusion. Then, dilated convolution is introduced to reduce information loss during downsampling. Finally, the K-means clustering algorithm is used to redesign the anchor boxes, and the loss function is modified to detect a single category. Experimental results show that, on the INRIA and WiderPerson datasets under different congestion conditions, the improved algorithm reaches an AP of 96.83% and 59.67%, respectively, improvements of 2.41% and 1.03% over the YOLOv4 model. False and missed detections of small and occluded targets are significantly reduced.
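Anchor redesign via K-means follows the YOLO convention of clustering ground-truth (width, height) pairs with 1 - IoU as the distance, so anchor shape rather than absolute size drives the grouping. A minimal NumPy sketch (the paper's cluster count and dataset statistics are not given here):

```python
import numpy as np

def iou_wh(box, clusters):
    """IoU between one (w, h) box and k cluster boxes, all anchored at a corner."""
    w = np.minimum(box[0], clusters[:, 0])
    h = np.minimum(box[1], clusters[:, 1])
    inter = w * h
    return inter / (box[0] * box[1] + clusters[:, 0] * clusters[:, 1] - inter)

def kmeans_anchors(boxes, k, iters=100, seed=0):
    """K-means over (w, h) boxes using 1 - IoU as the distance metric."""
    rng = np.random.default_rng(seed)
    clusters = boxes[rng.choice(len(boxes), k, replace=False)]
    for _ in range(iters):
        dists = np.stack([1 - iou_wh(b, clusters) for b in boxes])
        assign = dists.argmin(axis=1)
        new = np.array([boxes[assign == i].mean(axis=0) if np.any(assign == i)
                        else clusters[i] for i in range(k)])
        if np.allclose(new, clusters):
            break
        clusters = new
    return clusters
```

Clustering on IoU rather than Euclidean distance keeps small pedestrian boxes from being absorbed into clusters dominated by large ones, which is what makes the redesigned anchors help with small-target recall.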

