Adaptive Localizing Region-Based Level Set for Segmentation of Maxillary Sinus Based on Convolutional Neural Networks

2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Xianglong Qi ◽  
Jie Zhong ◽  
Shengjia Cui

In this paper, we propose a novel method, an adaptive localizing region-based level set using a convolutional neural network, to improve the performance of maxillary sinus segmentation. A healthy sinus with no lesion inside is easy for conventional algorithms to segment; in practice, however, most cases are filled with highly heterogeneous lesions, which leads to lower accuracy. We therefore provide a strategy that prevents the active contour from being trapped in a non-target area. First, features of the lesion and the maxillary sinus are learned by a convolutional neural network (CNN) with two convolutional and three fully connected layers. The CNN outputs are then used to estimate the probability that a point on the zero level set lies close to a lesion. Finally, the method determines stable points on the contour through an iterative process: if a point lies inside a lesion, it receives a speed compensation proportional to the probability given by the CNN, helping it escape the local minimum; otherwise, the point keeps its current state until convergence. The capabilities of our method have been demonstrated on a dataset of 200 CT images with possible lesions. To illustrate its strength, we evaluated it against the state-of-the-art methods FLS and CRF-FCN. For all cases, our method, as assessed by the Dice similarity coefficient, performed significantly better than the currently available methods, with average Dice improvements of 0.25 over FLS and 0.12 over CRF-FCN.
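Below is a minimal sketch, in PyTorch, of a patch classifier with the two-convolutional, three-fully-connected layout described above; the patch size, channel widths, and the binary lesion-vs-sinus output are assumptions rather than the authors' exact configuration.

```python
import torch
import torch.nn as nn

class LesionProbabilityCNN(nn.Module):
    """Sketch: estimates the probability that an image patch around a contour point lies in a lesion."""
    def __init__(self, patch_size: int = 32):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),   # convolutional layer 1
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),  # convolutional layer 2
            nn.ReLU(),
            nn.MaxPool2d(2),
        )
        flat = 32 * (patch_size // 4) ** 2
        self.classifier = nn.Sequential(
            nn.Linear(flat, 128), nn.ReLU(),               # fully connected layer 1
            nn.Linear(128, 32), nn.ReLU(),                 # fully connected layer 2
            nn.Linear(32, 2),                              # fully connected layer 3: lesion vs. sinus
        )

    def forward(self, x):
        x = torch.flatten(self.features(x), 1)
        return torch.softmax(self.classifier(x), dim=1)    # per-patch lesion probability
```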

Author(s):  
Saihui Hou ◽  
Zilei Wang

In this work, we propose a novel method named Weighted Channel Dropout (WCD) for the regularization of deep Convolutional Neural Networks (CNNs). Unlike Dropout, which randomly selects neurons to set to zero in the fully connected layers, WCD operates on the channels in the stack of convolutional layers. Specifically, WCD consists of two steps, i.e., Rating Channels and Selecting Channels, and three modules, i.e., Global Average Pooling, Weighted Random Selection, and Random Number Generator. It filters the channels according to their activation status and can be plugged into any two consecutive layers, which unifies the original Dropout and Channel-Wise Dropout. WCD is entirely parameter-free, is deployed only in the training phase with very slight computational cost, and leaves the network at test time unchanged, so no inference cost is added. Moreover, when combined with existing networks, it requires no re-pretraining on ImageNet and is thus well suited for application to small datasets. Finally, WCD is experimentally evaluated with VGGNet-16, ResNet-101, and Inception-V3 on multiple datasets. The extensive results demonstrate that WCD brings consistent improvements over the baselines.
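A minimal sketch of the two WCD steps (Rating Channels via global average pooling, then Selecting Channels with score-weighted random keeping) is shown below; the keep ratio, the exact form of the Weighted Random Selection and Random Number Generator modules, and the rescaling are assumptions, not the paper's precise formulation.

```python
import torch

def weighted_channel_dropout(x: torch.Tensor, keep_ratio: float = 0.8) -> torch.Tensor:
    """x: feature map of shape (N, C, H, W); intended for use at training time only."""
    n, c, h, w = x.shape
    # Step 1: Rating Channels -- score each channel by its global average activation.
    scores = x.mean(dim=(2, 3))                                    # (N, C)
    probs = scores / (scores.sum(dim=1, keepdim=True) + 1e-8)
    # Step 2: Selecting Channels -- keep a channel with probability tied to its score
    # (assumption: higher-rated channels are kept more often, average rate ~ keep_ratio).
    keep = torch.rand_like(probs) < probs * c * keep_ratio
    mask = keep.float().view(n, c, 1, 1)
    # Rescale so the expected activation magnitude is roughly preserved, as in standard Dropout.
    return x * mask / keep_ratio
```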


Diagnostics ◽  
2021 ◽  
Vol 11 (4) ◽  
pp. 688
Author(s):  
Seok-Ki Jung ◽  
Ho-Kyung Lim ◽  
Seungjun Lee ◽  
Yongwon Cho ◽  
In-Seok Song

The aim of this study was to segment the maxillary sinus into maxillary bone, air, and lesion, and to evaluate the accuracy of the segmentation by comparing it with results produced by experts. We randomly selected 83 cases for deep active learning. Our active learning framework consists of three steps. The framework adds new volumes at each step to improve the performance of the model with a limited training dataset, while inferring automatically using the model trained in the previous step. We assessed the effect of active learning on dental cone-beam computed tomography (CBCT) volumes with our customized 3D nnU-Net in all three steps. The Dice similarity coefficients (DSCs) for air at each step were 0.920 ± 0.17, 0.925 ± 0.16, and 0.930 ± 0.16, respectively. The DSCs for the lesion at each step were 0.770 ± 0.18, 0.750 ± 0.19, and 0.760 ± 0.18, respectively. The time required for convolutional neural network (CNN)-assisted, manually corrected segmentation decreased by approximately 493.2 s for 30 scans in the second step and by approximately 362.7 s for 76 scans in the last step. In conclusion, this study demonstrates that a deep active learning framework can reduce annotation effort and cost by training efficiently on limited CBCT datasets.
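The sketch below illustrates the three-step active learning loop in outline; train_model, segment, and expert_correct are hypothetical stand-ins for the customized 3D nnU-Net training, CNN-assisted inference, and manual correction stages, and only the per-step scan counts (30 and 76) are taken from the reported timings.

```python
from typing import List, Tuple

Volume, Mask = object, object  # placeholders for CBCT volumes and label maps

def train_model(labeled: List[Tuple[Volume, Mask]]):
    ...  # stand-in: train the customized 3D nnU-Net on the currently labeled volumes

def segment(model, volume: Volume) -> Mask:
    ...  # stand-in: infer maxillary bone / air / lesion labels with the trained model

def expert_correct(volume: Volume, prediction: Mask) -> Mask:
    ...  # stand-in: an expert manually refines the CNN-assisted segmentation

def active_learning(labeled, unlabeled, volumes_per_step=(30, 76)):
    model = train_model(labeled)                          # step 1: initial model on limited labels
    for n in volumes_per_step:                            # steps 2 and 3
        batch, unlabeled = unlabeled[:n], unlabeled[n:]
        corrected = [(v, expert_correct(v, segment(model, v))) for v in batch]
        labeled = labeled + corrected                     # grow the training set
        model = train_model(labeled)                      # retrain on the larger set
    return model
```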


2020 ◽  
Vol 64 (4) ◽  
pp. 40411-1-40411-10
Author(s):  
Liu Zhao ◽  
Qiang Li ◽  
Ching-Hsin Wang ◽  
Yu-Cheng Liao

Abstract The accuracy of three-dimensional (3D) brain tumor image segmentation is of great significance to brain tumor diagnosis. To enhance segmentation accuracy, this study proposes an algorithm integrating a cascaded anisotropic fully convolutional neural network (FCNN) and the hybrid level set method. The algorithm first performs bias field correction and gray value normalization on T1, T1C, T2, and fluid-attenuated inversion recovery (FLAIR) magnetic resonance imaging (MRI) images for preprocessing. It then uses a cascading mechanism to perform preliminary segmentation of whole tumors, tumor cores, and enhancing tumors with an anisotropic FCNN, based on the relationships among the locations of the three tumor structures. This simplifies the multiclass brain tumor segmentation problem into three binary classification problems. At the same time, the anisotropic FCNN adopts dense connections and multiscale feature merging to further enhance performance. Model training is conducted separately on the axial, coronal, and sagittal planes, and the segmentation results from the three orthogonal views are combined. Finally, the hybrid level set method is adopted to refine the brain tumor boundaries in the preliminary segmentation results, thereby completing the fine segmentation. The results indicate that the proposed algorithm achieves highly accurate and stable 3D MRI brain tumor segmentation. Comparison of the whole-tumor, tumor-core, and enhancing-tumor segmentation results with the gold standard produced Dice similarity coefficients of 0.9113, 0.8581, and 0.7976, respectively.
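To make the cascading mechanism concrete, here is a minimal sketch in which three binary segmenters run in sequence, each restricted to the region found by the previous stage; the stand-in networks, thresholding, and masking are assumptions, and the anisotropic FCNN design, multi-view fusion, and hybrid level set refinement are not shown.

```python
import torch
import torch.nn as nn

def make_stage() -> nn.Module:
    # Stand-in for one anisotropic FCNN producing a per-voxel foreground probability.
    return nn.Sequential(nn.Conv3d(4, 8, 3, padding=1), nn.ReLU(),
                         nn.Conv3d(8, 1, 1), nn.Sigmoid())

whole_net, core_net, enhancing_net = make_stage(), make_stage(), make_stage()

def cascade(mri: torch.Tensor, thr: float = 0.5):
    """mri: (N, 4, D, H, W) tensor stacking the T1, T1C, T2, and FLAIR channels."""
    whole = (whole_net(mri) > thr).float()                          # whole tumor
    core = (core_net(mri * whole) > thr).float() * whole            # tumor core, inside the whole tumor
    enhancing = (enhancing_net(mri * core) > thr).float() * core    # enhancing tumor, inside the core
    return whole, core, enhancing
```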


2019 ◽  
Vol 24 (3) ◽  
pp. 220-228
Author(s):  
Gusti Alfahmi Anwar ◽  
Desti Riminarsih

Panthera is a genus of the cat family with four well-known species: the tiger, jaguar, leopard, and lion. The lion has a golden coat with no pattern; the tiger has a striped pattern with long stripes; the jaguar has a larger body than the leopard and broader rosettes; while the leopard has a slightly slimmer body than the jaguar and smaller rosettes. In this study, the Panthera genus, i.e., tiger, jaguar, leopard, and lion, was classified using a Convolutional Neural Network. The Convolutional Neural Network model used has 1 input layer, 5 convolutional layers, and 2 fully connected layers. The dataset consists of images of tigers, jaguars, leopards, and lions. The training data comprise 3,840 images, the validation data 960 images, and the testing data 800 images. The model achieved a training accuracy of 92.31% and a validation accuracy of 81.88%; testing the model on the test set yielded an accuracy of 68%. Per-class prediction accuracy, measured by the F1-score on the test set, was 78% for the tiger, 70% for the jaguar, 37% for the leopard, and 74% for the lion. The leopard obtained the lowest accuracy of the four animals, but the result is still better than that of previous research.
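A minimal sketch of a classifier with the stated layout (1 input layer, 5 convolutional layers, 2 fully connected layers, 4 classes) might look as follows; the image size, channel widths, and pooling placement are assumptions, not the configuration used in the study.

```python
import torch
import torch.nn as nn

class PantheraCNN(nn.Module):
    """Sketch: 4-class classifier for tiger, jaguar, leopard, and lion images."""
    def __init__(self, num_classes: int = 4, image_size: int = 128):
        super().__init__()
        widths = [3, 16, 32, 64, 64, 128]
        layers = []
        for c_in, c_out in zip(widths[:-1], widths[1:]):      # 5 convolutional blocks
            layers += [nn.Conv2d(c_in, c_out, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2)]
        self.features = nn.Sequential(*layers)
        flat = widths[-1] * (image_size // 2 ** 5) ** 2
        self.classifier = nn.Sequential(
            nn.Linear(flat, 256), nn.ReLU(),                  # fully connected layer 1
            nn.Linear(256, num_classes),                      # fully connected layer 2
        )

    def forward(self, x):
        return self.classifier(torch.flatten(self.features(x), 1))
```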


Entropy ◽  
2021 ◽  
Vol 23 (1) ◽  
pp. 119
Author(s):  
Tao Wang ◽  
Changhua Lu ◽  
Yining Sun ◽  
Mei Yang ◽  
Chun Liu ◽  
...  

Early detection of arrhythmia and effective treatment can prevent deaths caused by cardiovascular disease (CVD). In clinical practice, the diagnosis is made by checking the electrocardiogram (ECG) beat by beat, which is usually time-consuming and laborious. In this paper, we propose an automatic ECG classification method based on the Continuous Wavelet Transform (CWT) and a Convolutional Neural Network (CNN). The CWT is used to decompose ECG signals into different time-frequency components, and the CNN is used to extract features from the 2D scalogram composed of these time-frequency components. Because the surrounding R-peak interval (also called the RR interval) is also useful for the diagnosis of arrhythmia, four RR-interval features are extracted and concatenated with the CNN features before being fed into a fully connected layer for ECG classification. Tested on the MIT-BIH arrhythmia database, our method achieves an overall performance of 70.75%, 67.47%, 68.76%, and 98.74% for positive predictive value, sensitivity, F1-score, and accuracy, respectively. Compared with existing methods, the overall F1-score of our method is increased by 4.75~16.85%. Because our method is simple and highly accurate, it can potentially be used as a clinical auxiliary diagnostic tool.
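The feature-fusion step can be sketched as below: CNN features from the CWT scalogram are concatenated with the four RR-interval features before the fully connected classifier; the scalogram size, channel widths, and class count are assumptions.

```python
import torch
import torch.nn as nn

class ScalogramRRNet(nn.Module):
    """Sketch: fuses CWT-scalogram CNN features with four RR-interval features."""
    def __init__(self, num_classes: int = 4):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(32 + 4, num_classes)   # CNN features + 4 RR-interval features

    def forward(self, scalogram: torch.Tensor, rr_features: torch.Tensor):
        """scalogram: (N, 1, H, W) CWT image per beat; rr_features: (N, 4)."""
        cnn_feat = torch.flatten(self.cnn(scalogram), 1)            # (N, 32)
        return self.fc(torch.cat([cnn_feat, rr_features], dim=1))   # fused classification
```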


Author(s):  
xu chen ◽  
Shibo Wang ◽  
Houguang Liu ◽  
Jianhua Yang ◽  
Songyong Liu ◽  
...  

Abstract Many data-driven coal gangue recognition (CGR) methods based on the vibration or sound of collapsing coal and gangue have been proposed to achieve automatic CGR, which is important for realizing intelligent top-coal caving. However, the strong background noise and complex environment in underground coal mines make this task challenging in practical applications. Inspired by the fact that workers distinguish coal and gangue from underground noise by listening to the sound at the hydraulic support, we propose an auditory-model-based CGR method that simulates human auditory recognition by combining an auditory spectrogram with a convolutional neural network (CNN). First, we adjust the characteristic frequency (CF) distribution of the auditory peripheral model (APM) based on the spectral characteristics of the collapse sound signals of coal and gangue, and then process the sound signals with the adjusted APM to obtain inferior colliculus auditory signals at multiple CFs. Subsequently, the auditory signals of all CFs are converted into gray images separately and concatenated into a multichannel auditory spectrum along the channel dimension. Finally, we feed the multichannel auditory spectrum as a feature map to a two-dimensional CNN, whose convolutional layers automatically extract features and whose fully connected layer and softmax layer flatten the features and predict the recognition result, respectively. The CNN is optimized for CGR based on a comparison study of four typical CNN structures with different training hyperparameters. The experimental results show that this method affords accurate CGR, with a recognition accuracy of 99.5%. Moreover, it offers excellent noise immunity compared with typically used CGR methods under various noisy conditions.
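A minimal sketch of building the multichannel auditory spectrum follows: the auditory signal at each CF is reshaped into a gray image and the images are stacked along the channel dimension; the framing and normalization choices are assumptions, and the auditory peripheral model itself is not shown.

```python
import numpy as np

def signal_to_gray_image(signal: np.ndarray, frame_len: int = 128) -> np.ndarray:
    """Reshape a 1-D auditory signal into a 2-D frame matrix scaled to [0, 255]."""
    n_frames = len(signal) // frame_len
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len)
    lo, hi = frames.min(), frames.max()
    return ((frames - lo) / (hi - lo + 1e-12) * 255).astype(np.uint8)

def multichannel_auditory_spectrum(cf_signals: list) -> np.ndarray:
    """Stack one gray image per CF into a (num_CFs, H, W) feature map for the 2D CNN."""
    images = [signal_to_gray_image(np.asarray(s)) for s in cf_signals]
    return np.stack(images, axis=0)
```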


Inventions ◽  
2021 ◽  
Vol 6 (4) ◽  
pp. 70
Author(s):  
Elena Solovyeva ◽  
Ali Abdullah

In this paper, a separable convolutional neural network structure consisting of an embedding layer, separable convolutional layers, a convolutional layer, and global average pooling is presented for binary and multiclass text classification. The advantage of the proposed structure is the absence of multiple fully connected layers, which are often used to increase classification accuracy but raise the computational cost. The combination of low-cost separable convolutional layers and a convolutional layer is proposed to achieve high accuracy while reducing the complexity of the neural classifiers. The advantages are demonstrated on binary and multiclass classification of written texts using the proposed networks with sigmoid and softmax activation functions in the convolutional layer. For both binary and multiclass classification, the accuracy obtained by the separable convolutional neural networks is higher than that of several investigated types of recurrent neural networks and fully connected networks.
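A minimal sketch of such a structure, assuming PyTorch and 1-D depthwise separable convolutions over token embeddings, is given below; vocabulary size, embedding width, and channel counts are illustrative assumptions, and the class-probability activation is left to the loss function rather than placed inside the convolutional layer.

```python
import torch
import torch.nn as nn

class SeparableConv1d(nn.Module):
    """Depthwise convolution followed by a pointwise (1x1) convolution."""
    def __init__(self, c_in: int, c_out: int, kernel: int = 3):
        super().__init__()
        self.depthwise = nn.Conv1d(c_in, c_in, kernel, padding=kernel // 2, groups=c_in)
        self.pointwise = nn.Conv1d(c_in, c_out, 1)

    def forward(self, x):
        return self.pointwise(self.depthwise(x))

class SeparableTextCNN(nn.Module):
    """Sketch: embedding -> separable convolutions -> convolution -> global average pooling."""
    def __init__(self, vocab_size: int = 20000, embed_dim: int = 64, num_classes: int = 2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.sep1 = SeparableConv1d(embed_dim, 64)
        self.sep2 = SeparableConv1d(64, 64)
        self.conv = nn.Conv1d(64, num_classes, kernel_size=3, padding=1)

    def forward(self, tokens: torch.Tensor):
        """tokens: (N, L) integer token ids."""
        x = self.embed(tokens).transpose(1, 2)            # (N, embed_dim, L)
        x = torch.relu(self.sep2(torch.relu(self.sep1(x))))
        x = self.conv(x)                                   # one output channel per class
        return x.mean(dim=2)                               # global average pooling -> (N, num_classes)
```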


2018 ◽  
Vol 61 (5) ◽  
pp. 1461-1474 ◽  
Author(s):  
Zhongqi Lin ◽  
Shaomin Mu ◽  
Aiju Shi ◽  
Chao Pang ◽  
Xiaoxiao Sun

Abstract. Traditional methods for detecting maize leaf diseases (such as leaf blight, sooty blotch, brown spot, rust, and purple leaf sheath) are typically labor-intensive and strongly subjective. With the aim of achieving high accuracy and efficiency in the identification of maize leaf diseases from digital imagery, this article proposes a novel multichannel convolutional neural network (MCNN). The MCNN is composed of an input layer, five convolutional layers, three subsampling layers, three fully connected layers, and an output layer. Using a method that imitates human visual behavior in video saliency detection, the first and second subsampling layers are connected directly to the first fully connected layer. In addition, mixed pooling and normalization methods, rectified linear units (ReLU), and dropout are introduced to prevent overfitting and gradient diffusion. The learning process corresponding to the network structure is also illustrated. At present, there is no large-scale image dataset of maize leaf diseases available for use as experimental samples. To test the proposed MCNN, 10,820 RGB images containing five types of disease were collected from maize planting areas in Shandong Province, China. The original images could not be used directly in identification experiments because of noise and irrelevant regions; they were therefore denoised and segmented by homomorphic filtering and region-of-interest (ROI) segmentation to construct a standard database. A series of experiments on 8 GB graphics processing units (GPUs) showed that the MCNN achieved an average accuracy of 92.31% and high efficiency in the identification of maize leaf diseases. The multichannel design and the integration of different innovations proved to be helpful in boosting performance. Keywords: Artificial intelligence, Convolutional neural network, Deep learning, Image classification, Machine learning algorithms, Maize leaf disease.
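The skip-connection idea, where the first and second subsampling outputs feed the first fully connected layer directly alongside the deepest features, can be sketched as follows; channel widths, image size, and layer details are assumptions rather than the exact MCNN configuration.

```python
import torch
import torch.nn as nn

class MCNNSketch(nn.Module):
    """Sketch: 5 conv layers, 3 subsampling layers, 3 FC layers, with early pooled features
    routed directly into the first fully connected layer."""
    def __init__(self, num_classes: int = 5, image_size: int = 64):
        super().__init__()
        self.block1 = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.block2 = nn.Sequential(nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.block3 = nn.Sequential(nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
                                    nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(),
                                    nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        s = image_size
        flat = 16 * (s // 2) ** 2 + 32 * (s // 4) ** 2 + 64 * (s // 8) ** 2
        self.fc = nn.Sequential(nn.Linear(flat, 256), nn.ReLU(), nn.Dropout(0.5),  # FC layer 1
                                nn.Linear(256, 64), nn.ReLU(),                     # FC layer 2
                                nn.Linear(64, num_classes))                        # FC layer 3

    def forward(self, x):
        p1 = self.block1(x)       # first subsampling output
        p2 = self.block2(p1)      # second subsampling output
        p3 = self.block3(p2)      # deepest features after third subsampling
        feats = torch.cat([torch.flatten(p1, 1), torch.flatten(p2, 1), torch.flatten(p3, 1)], dim=1)
        return self.fc(feats)
```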


Algorithms ◽  
2019 ◽  
Vol 12 (1) ◽  
pp. 12 ◽  
Author(s):  
Guangluan Xu ◽  
Xiaoke Wang ◽  
Yang Wang ◽  
Daoyu Lin ◽  
Xian Sun ◽  
...  

Link prediction is the task of predicting whether a link exists between two nodes in a network. Traditional link prediction methods that assume handcrafted features (such as common neighbors) to be the link's formation mechanism are not universal. Other popular methods tend to learn a representation of the link, but they cannot represent the link fully. In this paper, we propose the Edge-Nodes Representation Neural Machine (ENRNM), a novel method that learns abundant topological features from the network as the link's representation to model the formation of the link. The ENRNM learns the link's formation mechanism by combining the representation of the edge with the representations of the nodes on its two sides to form the link's full representation. To predict the link's existence, we train a fully connected neural network that can learn meaningful and abundant patterns. We show that the features of the edge and of the two nodes are equally important in the link's formation. Comprehensive experiments are conducted on eight networks; the results demonstrate that ENRNM not only outperforms many state-of-the-art link prediction methods but also performs well on diverse networks with different structures and characteristics.
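The fusion step can be sketched as follows: the edge representation and the two node representations are concatenated into the link's full representation and scored by a fully connected network; embedding dimensions and hidden sizes are assumptions, and how the edge and node representations themselves are learned is not shown.

```python
import torch
import torch.nn as nn

class LinkScorer(nn.Module):
    """Sketch: scores a candidate link from its edge and endpoint-node representations."""
    def __init__(self, edge_dim: int = 64, node_dim: int = 64, hidden: int = 128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(edge_dim + 2 * node_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, edge_repr, node_u_repr, node_v_repr):
        """Each argument: (N, dim) batch of representations for candidate links."""
        full = torch.cat([edge_repr, node_u_repr, node_v_repr], dim=1)  # the link's full representation
        return torch.sigmoid(self.mlp(full)).squeeze(1)                 # probability that the link exists
```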

