scholarly journals Melanoma Recognition and Lesion Segmentation using Multi-Instance Learning

Author(s):  
Yangling Ma ◽  
Zhouwang Yang

Abstract Melanoma is one of the deadliest forms of skin cancer, but early and accurate identification can significantly improve the survival rate of patients. In this paper, an end-to-end framework based on multi-instance learning is proposed for melanoma recognition and lesion segmentation simultaneously. To make full use from the information of high-resolution images, we take each image block (super-pixel) as an instance in a bag and use multi-instance learning based on a graph convolutional network to recognize melanoma. Moreover, skin lesion segmentation is derived from attention weights and is calibrated by classification probability vectors. As a result, the AUC of our method for melanoma recognition reaches 0.93, which is much higher compared with other related methods. Also, the Jaccard index (JA) of our method for melanoma-related skin lesion segmentation reaches 0.699. In our end-to-end approach, segmentation and recognition are treated as intimately coupled processes, and hence, a high JA is also an indication of the reliability of melanoma recognition. Collectively, these findings confirmed that our method effectively assists melanoma diagnosis.

2020 ◽  
Vol 10 (10) ◽  
pp. 3658
Author(s):  
Karshiev Sanjar ◽  
Olimov Bekhzod ◽  
Jaeil Kim ◽  
Jaesoo Kim ◽  
Anand Paul ◽  
...  

The early and accurate diagnosis of skin cancer is crucial for providing patients with advanced treatment by focusing medical personnel on specific parts of the skin. Networks based on encoder–decoder architectures have been effectively implemented for numerous computer-vision applications. U-Net, one of CNN architectures based on the encoder–decoder network, has achieved successful performance for skin-lesion segmentation. However, this network has several drawbacks caused by its upsampling method and activation function. In this paper, a fully convolutional network and its architecture are proposed with a modified U-Net, in which a bilinear interpolation method is used for upsampling with a block of convolution layers followed by parametric rectified linear-unit non-linearity. To avoid overfitting, a dropout is applied after each convolution block. The results demonstrate that our recommended technique achieves state-of-the-art performance for skin-lesion segmentation with 94% pixel accuracy and a 88% dice coefficient, respectively.


2019 ◽  
Vol 52 ◽  
pp. 226-237 ◽  
Author(s):  
Manoranjan Dash ◽  
Narendra D. Londhe ◽  
Subhojit Ghosh ◽  
Ashish Semwal ◽  
Rajendra S. Sonawane

2019 ◽  
Vol 78 ◽  
pp. 101658 ◽  
Author(s):  
Ebrahim Nasr-Esfahani ◽  
Shima Rafiei ◽  
Mohammad H. Jafari ◽  
Nader Karimi ◽  
James S. Wrobel ◽  
...  

2020 ◽  
Vol 39 (3) ◽  
pp. 169-185
Author(s):  
Omran Salih ◽  
Serestina Viriri

Deep learning techniques such as Deep Convolutional Networks have achieved great success in skin lesion segmentation towards melanoma detection. The performance is however restrained by distinctive and challenging features of skin lesions such as irregular and fuzzy border, noise and artefacts presence and low contrast between lesions. The methods are also restricted with scarcity of annotated lesion images training dataset and limited computing resources. Recent research in convolutional neural network (CNN) has provided a variety of new architectures for deep learning. One interesting new architecture is the local binary convolutional neural network (LBCNN), which can reduce the workload of CNNs and improve the classification accuracy. The proposed framework employs the local binary convolution on U-net architecture instead of the standard convolution in order to reduced-size deep convolutional encoder-decoder network that adopts loss function for robust segmentation. The proposed framework replaced the encoder part in U-net by LBCNN layers. The approach automatically learns and segments complex features of skin lesion images. The encoder stage learns the contextual information by extracting discriminative features while the decoder stage captures the lesion boundaries of the skin images. This addresses the issues with encoder-decoder network producing coarse segmented output with challenging skin lesions appearances such as low contrast between healthy and unhealthy tissues and fine grained variability. It also addresses issues with multi-size, multi-scale and multi-resolution skin lesion images. The deep convolutional network also adopts a reduced-size network with just five levels of encoding-decoding network. This reduces greatly the consumption of computational processing resources. The system was evaluated on publicly available dataset of ISIC and PH2. The proposed system outperforms most of the existing state-of-art.


2021 ◽  
Vol 11 (15) ◽  
pp. 6975
Author(s):  
Tao Zhang ◽  
Lun He ◽  
Xudong Li ◽  
Guoqing Feng

Lipreading aims to recognize sentences being spoken by a talking face. In recent years, the lipreading method has achieved a high level of accuracy on large datasets and made breakthrough progress. However, lipreading is still far from being solved, and existing methods tend to have high error rates on the wild data and have the defects of disappearing training gradient and slow convergence. To overcome these problems, we proposed an efficient end-to-end sentence-level lipreading model, using an encoder based on a 3D convolutional network, ResNet50, Temporal Convolutional Network (TCN), and a CTC objective function as the decoder. More importantly, the proposed architecture incorporates TCN as a feature learner to decode feature. It can partly eliminate the defects of RNN (LSTM, GRU) gradient disappearance and insufficient performance, and this yields notable performance improvement as well as faster convergence. Experiments show that the training and convergence speed are 50% faster than the state-of-the-art method, and improved accuracy by 2.4% on the GRID dataset.


Sign in / Sign up

Export Citation Format

Share Document