Automatic Liver Segmentation in CT Images with Enhanced GAN and Mask Region-Based CNN Architectures

Liver image segmentation has been increasingly employed for key medical purposes, including liver functional assessment, disease diagnosis, and treatment. In this work, we introduce a liver image segmentation method based on generative adversarial networks (GANs) and mask region-based convolutional neural networks (Mask R-CNN). Firstly, since most resulting images have noisy features, we further explored the combination of Mask R-CNN and GANs in order to enhance the pixel-wise classification. Secondly, k -means clustering was used to lock the image aspect ratio, in order to get more essential anchors which can help boost the segmentation performance. Finally, we proposed a GAN Mask R-CNN algorithm which achieved superior performance in comparison with the conventional Mask R-CNN, Mask-CNN, and k -means algorithms in terms of the Dice similarity coefficient (DSC) and the MICCAI metrics. The proposed algorithm also achieved superior performance in comparison with ten state-of-the-art algorithms in terms of six Boolean indicators. We hope that our work can be effectively used to optimize the segmentation and classification of liver anomalies.

Download Full-text

Adversarial Networks for Scale Feature-Attention Spectral Image Reconstruction from a Single RGB

Sensors ◽

10.3390/s20082426 ◽

2020 ◽

Vol 20 (8) ◽

pp. 2426

Author(s):

Pengfei Liu ◽

Huaici Zhao

Keyword(s):

State Of The Art ◽

Accurate Solution ◽

Modern Architecture ◽

Superior Performance ◽

Generative Adversarial Networks ◽

Spectral Image ◽

Adversarial Networks ◽

Feature Pyramid ◽

Feature Attention ◽

Spatial Semantics

Hyperspectral images reconstruction focuses on recovering the spectral information from a single RGBimage. In this paper, we propose two advanced Generative Adversarial Networks (GAN) for the heavily underconstrained inverse problem. We first propose scale attention pyramid UNet (SAPUNet), which uses U-Net with dilated convolution to extract features. We establish the feature pyramid inside the network and use the attention mechanism for feature selection. The superior performance of this model is due to the modern architecture and capturing of spatial semantics. To provide a more accurate solution, we propose another distinct architecture, named W-Net, that builds one more branch compared to U-Net to conduct boundary supervision. SAPUNet and scale attention pyramid WNet (SAPWNet) provide improvements on the Interdisciplinary Computational Vision Lab at Ben Gurion University (ICVL) datasetby 42% and 46.6%, and 45% and 50% in terms of root mean square error (RMSE) and relative RMSE, respectively. The experimental results demonstrate that our proposed models are more accurate than the state-of-the-art hyperspectral recovery methods

Download Full-text

Classification of lung nodule malignancy in computed tomography imaging utilising generative adversarial networks and semi-supervised transfer learning

Journal of Applied Biomedicine ◽

10.1016/j.bbe.2021.08.006 ◽

2021 ◽

Author(s):

Ioannis D. Apostolopoulos ◽

Nikolaos D. Papathanasiou ◽

George S. Panayiotakis

Keyword(s):

Computed Tomography ◽

Transfer Learning ◽

Lung Nodule ◽

Generative Adversarial Networks ◽

Computed Tomography Imaging ◽

Tomography Imaging ◽

Adversarial Networks

Download Full-text

Bidirectional cross-modality unsupervised domain adaptation using generative adversarial networks for cardiac image segmentation

Computers in Biology and Medicine ◽

10.1016/j.compbiomed.2021.104726 ◽

2021 ◽

pp. 104726

Author(s):

Hengfei Cui ◽

Chang Yuwen ◽

Lei Jiang ◽

Yong Xia ◽

Yanning Zhang

Keyword(s):

Image Segmentation ◽

Domain Adaptation ◽

Generative Adversarial Networks ◽

Unsupervised Domain Adaptation ◽

Cardiac Image ◽

Adversarial Networks ◽

Cardiac Image Segmentation

Download Full-text

Improving Skin Lesion Analysis with Generative Adversarial Networks

10.5753/sibgrapi.est.2020.12986 ◽

2020 ◽

Author(s):

Alceu Bissoto ◽

Sandra Avila

Keyword(s):

Skin Lesion ◽

State Of The Art ◽

Synthetic Data ◽

Clinical Information ◽

Analysis Data ◽

Training Dataset ◽

Generative Adversarial Networks ◽

Classification Models ◽

Adversarial Networks ◽

Lesion Analysis

Melanoma is the most lethal type of skin cancer. Early diagnosis is crucial to increase the survival rate of those patients due to the possibility of metastasis. Automated skin lesion analysis can play an essential role by reaching people that do not have access to a specialist. However, since deep learning became the state-of-the-art for skin lesion analysis, data became a decisive factor in pushing the solutions further. The core objective of this M.Sc. dissertation is to tackle the problems that arise by having limited datasets. In the first part, we use generative adversarial networks to generate synthetic data to augment our classification model’s training datasets to boost performance. Our method generates high-resolution clinically-meaningful skin lesion images, that when compound our classification model’s training dataset, consistently improved the performance in different scenarios, for distinct datasets. We also investigate how our classification models perceived the synthetic samples and how they can aid the model’s generalization. Finally, we investigate a problem that usually arises by having few, relatively small datasets that are thoroughly re-used in the literature: bias. For this, we designed experiments to study how our models’ use data, verifying how it exploits correct (based on medical algorithms), and spurious (based on artifacts introduced during image acquisition) correlations. Disturbingly, even in the absence of any clinical information regarding the lesion being diagnosed, our classification models presented much better performance than chance (even competing with specialists benchmarks), highly suggesting inflated performances.

Download Full-text

GANDALF: Generative Adversarial Networks with Discriminator-Adaptive Loss Fine-Tuning for Alzheimer’s Disease Diagnosis from MRI

Medical Image Computing and Computer Assisted Intervention – MICCAI 2020 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-59713-9_66 ◽

2020 ◽

pp. 688-697

Author(s):

Hoo-Chang Shin ◽

◽

Alvin Ihsani ◽

Ziyue Xu ◽

Swetha Mandava ◽

...

Keyword(s):

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Disease Diagnosis ◽

Fine Tuning ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Alzheimer’S Disease Diagnosis

Download Full-text

WTRPNet: An Explainable Graph Feature Convolutional Neural Network for Epileptic EEG Classification

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3460522 ◽

2021 ◽

Vol 17 (3s) ◽

pp. 1-18

Author(s):

Qi Xin ◽

Shaohao Hu ◽

Shuaiqi Liu ◽

Ling Zhao ◽

Shuihua Wang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

State Of The Art ◽

Traumatic Injury ◽

Recurrence Plot ◽

Superior Performance ◽

Eeg Classification ◽

Epilepsy Diagnosis ◽

Electroencephalogram Eeg

As one of the important tools of epilepsy diagnosis, the electroencephalogram (EEG) is noninvasive and presents no traumatic injury to patients. It contains a lot of physiological and pathological information that is easy to obtain. The automatic classification of epileptic EEG is important in the diagnosis and therapeutic efficacy of epileptics. In this article, an explainable graph feature convolutional neural network named WTRPNet is proposed for epileptic EEG classification. Since WTRPNet is constructed by a recurrence plot in the wavelet domain, it can fully obtain the graph feature of the EEG signal, which is established by an explainable graph features extracted layer called WTRP block . The proposed method shows superior performance over state-of-the-art methods. Experimental results show that our algorithm has achieved an accuracy of 99.67% in classification of focal and nonfocal epileptic EEG, which proves the effectiveness of the classification and detection of epileptic EEG.

Download Full-text

Multi-class Generative Adversarial Networks: Improving One-class Classification of Pneumonia Using Limited Labeled Data

10.1109/embc46164.2021.9629980 ◽

2021 ◽

Author(s):

Saman Motamed ◽

Farzad Khalvati

Keyword(s):

Generative Adversarial Networks ◽

Adversarial Networks ◽

One Class Classification

Download Full-text

A Multiscale CNN-CRF Framework for Environmental Microorganism Image Segmentation

BioMed Research International ◽

10.1155/2020/4621403 ◽

2020 ◽

Vol 2020 ◽

pp. 1-27

Author(s):

Jinghua Zhang ◽

Chen Li ◽

Frank Kulwa ◽

Xin Zhao ◽

Changhao Sun ◽

...

Keyword(s):

Image Segmentation ◽

State Of The Art ◽

Conditional Random Field ◽

Segmentation Method ◽

Recall Accuracy ◽

Evaluation Indexes ◽

Segmentation Quality ◽

Segmentation Approach ◽

Overall Evaluation

To assist researchers to identify Environmental Microorganisms (EMs) effectively, a Multiscale CNN-CRF (MSCC) framework for the EM image segmentation is proposed in this paper. There are two parts in this framework: The first is a novel pixel-level segmentation approach, using a newly introduced Convolutional Neural Network (CNN), namely, “mU-Net-B3”, with a dense Conditional Random Field (CRF) postprocessing. The second is a VGG-16 based patch-level segmentation method with a novel “buffer” strategy, which further improves the segmentation quality of the details of the EMs. In the experiment, compared with the state-of-the-art methods on 420 EM images, the proposed MSCC method reduces the memory requirement from 355 MB to 103 MB, improves the overall evaluation indexes (Dice, Jaccard, Recall, Accuracy) from 85.24%, 77.42%, 82.27%, and 96.76% to 87.13%, 79.74%, 87.12%, and 96.91%, respectively, and reduces the volume overlap error from 22.58% to 20.26%. Therefore, the MSCC method shows great potential in the EM segmentation field.

Download Full-text

RoCGAN: Robust Conditional GAN

International Journal of Computer Vision ◽

10.1007/s11263-020-01348-5 ◽

2020 ◽

Vol 128 (10-11) ◽

pp. 2665-2683 ◽

Cited By ~ 1

Author(s):

Grigorios G. Chrysos ◽

Jean Kossaifi ◽

Stefanos Zafeiriou

Keyword(s):

Large Scale ◽

Real Data ◽

Superior Performance ◽

Target Space ◽

Generative Adversarial Networks ◽

Natural Scenes ◽

Adversarial Networks ◽

Target Manifold ◽

The Face ◽

Intense Noise

Abstract Conditional image generation lies at the heart of computer vision and conditional generative adversarial networks (cGAN) have recently become the method of choice for this task, owing to their superior performance. The focus so far has largely been on performance improvement, with little effort in making cGANs more robust to noise. However, the regression (of the generator) might lead to arbitrarily large errors in the output, which makes cGANs unreliable for real-world applications. In this work, we introduce a novel conditional GAN model, called RoCGAN, which leverages structure in the target space of the model to address the issue. Specifically, we augment the generator with an unsupervised pathway, which promotes the outputs of the generator to span the target manifold, even in the presence of intense noise. We prove that RoCGAN share similar theoretical properties as GAN and establish with both synthetic and real data the merits of our model. We perform a thorough experimental validation on large scale datasets for natural scenes and faces and observe that our model outperforms existing cGAN architectures by a large margin. We also empirically demonstrate the performance of our approach in the face of two types of noise (adversarial and Bernoulli).

Download Full-text

Multi-Turn Chatbot Based on Query-Context Attentions and Dual Wasserstein Generative Adversarial Networks

Applied Sciences ◽

10.3390/app9183908 ◽

2019 ◽

Vol 9 (18) ◽

pp. 3908 ◽

Cited By ~ 3

Author(s):

Jintae Kim ◽

Shinhyeok Oh ◽

Oh-Woog Kwon ◽

Harksoo Kim

Keyword(s):

Performance Measures ◽

State Of The Art ◽

Attention Mechanism ◽

Generative Adversarial Networks ◽

Training Method ◽

Adversarial Networks ◽

Proposed Model ◽

Previous State ◽

Vector Representations

To generate proper responses to user queries, multi-turn chatbot models should selectively consider dialogue histories. However, previous chatbot models have simply concatenated or averaged vector representations of all previous utterances without considering contextual importance. To mitigate this problem, we propose a multi-turn chatbot model in which previous utterances participate in response generation using different weights. The proposed model calculates the contextual importance of previous utterances by using an attention mechanism. In addition, we propose a training method that uses two types of Wasserstein generative adversarial networks to improve the quality of responses. In experiments with the DailyDialog dataset, the proposed model outperformed the previous state-of-the-art models based on various performance measures.

Download Full-text