SA-GAN: Stain Acclimation Generative Adversarial Network for Histopathology Image Analysis

2021
Vol 12 (1)
pp. 288
Author(s):  
Tasleem Kausar ◽  
Adeeba Kausar ◽  
Muhammad Adnan Ashraf ◽  
Muhammad Farhan Siddique ◽  
Mingjiang Wang ◽  
...  

Histopathological image analysis is the examination of tissue under a light microscope for cancer diagnosis. Computer-assisted diagnosis (CAD) systems can diagnose cancer from histopathology images effectively. However, stain variability in histopathology images is inevitable due to differences in staining processes, operator ability, and scanner specifications. These stain variations affect the accuracy of CAD systems. Various stain normalization techniques have been developed to cope with this inter-image variability by standardizing the appearance of images. However, these methods rely on a single reference image rather than incorporating the color distribution of the entire dataset. In this paper, we design a novel machine learning-based model that takes advantage of whole-dataset distributions as well as the color statistics of a single target image, instead of relying on a single target image alone. The proposed deep model, called stain acclimation generative adversarial network (SA-GAN), consists of one generator and two discriminators. The generator maps the input images from the source domain to the target domain. The first discriminator forces the generated images to maintain the color patterns of the target domain, while the second discriminator forces the generated images to preserve the structural content of the source domain. The proposed model is trained using a color attribute metric extracted from a selected template image. Therefore, the designed model learns not only dataset-specific staining properties but also image-specific textural content. Evaluation on four different histopathology datasets shows the efficacy of SA-GAN in acclimating stain content and enhancing normalization quality, obtaining the highest values of the performance metrics. Additionally, the proposed method is evaluated on a multiclass cancer-type classification task, showing a 6.9% improvement in accuracy on the ICIAR 2018 hidden test data.
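A minimal PyTorch-style sketch of the dual-discriminator generator objective described above; the abstract does not give the exact loss formulation, conditioning, or weighting, so all module interfaces, names, and the loss weight here are illustrative assumptions rather than the paper's design.

```python
# Hedged sketch of a generator objective with a color discriminator (target domain)
# and a structure discriminator (source domain), in the spirit of SA-GAN.
# All module interfaces and the weighting are assumptions, not the paper's exact losses.
import torch
import torch.nn.functional as F

def generator_loss(G, D_color, D_struct, x_src, color_metric, lambda_struct=1.0):
    """x_src: batch of source-stain images; color_metric: color attributes from the template image."""
    x_gen = G(x_src, color_metric)           # translate source-stain images toward the target stain
    color_logits = D_color(x_gen)            # should match target-domain color patterns
    struct_logits = D_struct(x_gen, x_src)   # should preserve source structural content
    adv_color = F.binary_cross_entropy_with_logits(color_logits, torch.ones_like(color_logits))
    adv_struct = F.binary_cross_entropy_with_logits(struct_logits, torch.ones_like(struct_logits))
    return adv_color + lambda_struct * adv_struct
```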

2020
Vol 34 (03)
pp. 2661-2668
Author(s):  
Chuang Lin ◽  
Sicheng Zhao ◽  
Lei Meng ◽  
Tat-Seng Chua

Existing domain adaptation methods for visual sentiment classification are typically investigated under the single-source scenario, where knowledge learned from a source domain of sufficient labeled data is transferred to a target domain of loosely labeled or unlabeled data. However, in practice, data from a single source domain usually have limited volume and can hardly cover the characteristics of the target domain. In this paper, we propose a novel multi-source domain adaptation (MDA) method, termed Multi-source Sentiment Generative Adversarial Network (MSGAN), for visual sentiment classification. To handle data from multiple source domains, it learns to find a unified sentiment latent space where data from both the source and target domains share a similar distribution. This is achieved via cycle-consistent adversarial learning in an end-to-end manner. Extensive experiments conducted on four benchmark datasets demonstrate that MSGAN significantly outperforms state-of-the-art MDA approaches for visual sentiment classification.
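A hedged sketch of the general idea of adversarially aligning source and target features in a shared latent space, using a domain discriminator that the feature extractor tries to fool; MSGAN's actual formulation is cycle-consistent and multi-source-specific, so the functions and losses below are illustrative assumptions.

```python
# Illustrative adversarial feature alignment toward a unified latent space
# (not MSGAN's exact cycle-consistent objective).
import torch
import torch.nn.functional as F

def alignment_losses(F_net, D_dom, x_sources, x_target):
    """x_sources: list of batches, one per source domain; x_target: target-domain batch."""
    z_src = torch.cat([F_net(x) for x in x_sources], dim=0)   # latent codes of all source domains
    z_tgt = F_net(x_target)                                   # latent codes of the target domain
    d_src, d_tgt = D_dom(z_src), D_dom(z_tgt)
    # Domain discriminator: label source features 1, target features 0.
    d_loss = (F.binary_cross_entropy_with_logits(d_src, torch.ones_like(d_src))
              + F.binary_cross_entropy_with_logits(d_tgt, torch.zeros_like(d_tgt)))
    # Feature extractor: make target features indistinguishable from source features.
    g_loss = F.binary_cross_entropy_with_logits(d_tgt, torch.ones_like(d_tgt))
    return d_loss, g_loss
```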


2019
Vol 11 (22)
pp. 2631
Author(s):  
Bo Fang ◽  
Rong Kou ◽  
Li Pan ◽  
Pengfei Chen

Since manually labeling aerial images for pixel-level classification is expensive and time-consuming, developing strategies for land cover mapping without reference labels is essential and meaningful. As an efficient solution to this issue, domain adaptation has been widely utilized in numerous semantic labeling applications. However, current approaches generally pursue marginal distribution alignment between source and target features and ignore category-level alignment. Therefore, directly applying them to land cover mapping leads to unsatisfactory performance in the target domain. In our research, to address this problem, we embed a geometry-consistent generative adversarial network (GcGAN) into a co-training adversarial learning network (CtALN), and then develop a category-sensitive domain adaptation (CsDA) method for land cover mapping using very-high-resolution (VHR) optical aerial images. The GcGAN aims to eliminate the domain discrepancies between labeled and unlabeled images while retaining their intrinsic land cover information by translating the features of the labeled images from the source domain to the target domain. Meanwhile, the CtALN aims to learn a semantic labeling model in the target domain with the translated features and corresponding reference labels. By training this hybrid framework, our method learns to distill knowledge from the source domain and transfer it to the target domain, preserving not only global domain consistency but also category-level consistency between labeled and unlabeled images in the feature space. The experimental results on two airborne benchmark datasets and the comparison with other state-of-the-art methods verify the robustness and superiority of our proposed CsDA.


2021
Author(s):  
Marlen Runz ◽  
Daniel Rusche ◽  
Martin R Weihrauch ◽  
Jürgen Hesser ◽  
Cleo-Aron Weis

Abstract Background: Histological images show large variability (e.g., illumination, color, staining quality) due to differences in image acquisition, tissue processing, staining, etc. This variability can impede many image analyses such as staining-intensity evaluation or classification. Methods to reduce these variations are gathered under the term image normalization. Methods: We present the application of CycleGAN, a cycle-consistent generative adversarial network, for color normalization in hematoxylin-eosin-stained histological images, using typical clinical data including internal staining variability. The network consists of a generator network GB that learns to map an image X from a source domain A to a target domain B, i.e., GB : XA → XB. In addition, a discriminator network DB is trained to distinguish whether an image from domain B is an original or a generated one. The same process is applied to another generator-discriminator pair (GA, DA) for the inverse mapping GA : XB → XA. Cycle consistency ensures that a generated image is close to the original image when mapped back (GA(GB(XA)) ≈ XA, and vice versa). We validate the CycleGAN approach on a breast cancer challenge dataset and a follicular thyroid carcinoma dataset for various stain variations. We evaluate the quality of the generated images compared to the original images using similarity measures. Results: We present qualitative results of the images generated by our network compared to the original color distributions. Our evaluation shows that by mapping images from a source domain to a target domain, the similarity to original images from the target domain improves by up to 96%. We also achieve high cycle consistency for the inverse mapping, obtaining similarity indices greater than 0.9. Conclusions: CycleGANs have proven to normalize H&E-stained images efficiently. The approach compensates for deviations resulting from image acquisition (e.g., different scanning devices) as well as from tissue staining (e.g., different staining protocols), and thus overcomes the staining variations in images from various institutions. The code is publicly available at https://github.com/m4ln/stainTransfer_CycleGAN_pytorch. The dataset supporting this work is available at https://heidata.uni-heidelberg.de/privateurl.xhtml?token=12493b50-1538-4bdf-aca5-03352a1399a8.
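A minimal PyTorch sketch of the cycle-consistency term defined above, treating the generators as given modules; the L1 reconstruction penalty and the weight value are assumptions, since the abstract only states GA(GB(XA)) ≈ XA and vice versa.

```python
# Sketch of the cycle-consistency loss GA(GB(XA)) ≈ XA and GB(GA(XB)) ≈ XB.
# G_B maps domain A -> B and G_A maps B -> A, as in the abstract; lam is an assumed weight.
import torch.nn as nn

l1 = nn.L1Loss()

def cycle_consistency_loss(G_A, G_B, x_a, x_b, lam=10.0):
    x_ab = G_B(x_a)        # A -> B (e.g., source stain to target stain)
    x_aba = G_A(x_ab)      # back to A: should reconstruct x_a
    x_ba = G_A(x_b)        # B -> A
    x_bab = G_B(x_ba)      # back to B: should reconstruct x_b
    return lam * (l1(x_aba, x_a) + l1(x_bab, x_b))
```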


2020
Vol 39 (7)
pp. 2566-2567
Author(s):  
Tianyang Miller ◽  
Jun Cheng ◽  
Huazhu Fu ◽  
Zaiwang Gu ◽  
Yuting Xiao ◽  
...  

Entropy
2021
Vol 23 (8)
pp. 1089
Author(s):  
Soha B. Sandouka ◽  
Yakoub Bazi ◽  
Haikel Alhichri ◽  
Naif Alajlan

With the rapid growth of fingerprint-based biometric systems, it is essential to ensure the security and reliability of the deployed algorithms. Indeed, the security vulnerability of these systems has been widely recognized. Thus, it is critical to enhance the generalization ability of fingerprint presentation attack detection (PAD) in cross-sensor and cross-material settings. In this work, we propose a novel solution for the case of a single source domain (sensor) with a large set of labeled real/fake fingerprint images and multiple target domains (sensors) with only a few real images obtained from different sensors. Our aim is to build a model that alleviates the limited-sample issue in all target domains by transferring knowledge from the source domain. To this end, we train a unified generative adversarial network (UGAN) for multi-domain conversion to learn several mappings between all domains. This allows us to generate additional synthetic images for the target domains from the source domain to reduce the distribution shift between fingerprint representations. Then, we train a compound-scaled network (EfficientNetV2) coupled with multiple head classifiers (one classifier per domain) using the source-domain and translated images. The outputs of these classifiers are then aggregated using an additional fusion layer with learnable weights. In the experiments, we validate the proposed methodology on the public LivDet2015 dataset. The experimental results show that the proposed method improves the average classification accuracy over twelve classification scenarios from 67.80% to 80.44% after adaptation.
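A hedged PyTorch sketch of per-domain classification heads on a shared backbone, aggregated by a fusion layer with learnable weights; the class name, feature dimension, and softmax normalization of the weights are assumptions, since the abstract does not specify the exact design.

```python
# Sketch of multiple per-domain heads over a shared feature extractor, fused with
# learnable weights (illustrative assumptions; not the paper's exact architecture).
import torch
import torch.nn as nn

class MultiHeadPAD(nn.Module):
    def __init__(self, backbone, feat_dim, num_domains, num_classes=2):
        super().__init__()
        self.backbone = backbone                         # e.g., an EfficientNetV2 feature extractor
        self.heads = nn.ModuleList(
            [nn.Linear(feat_dim, num_classes) for _ in range(num_domains)])
        self.fusion = nn.Parameter(torch.ones(num_domains) / num_domains)  # learnable fusion weights

    def forward(self, x):
        feats = self.backbone(x)                                            # shared features (B, feat_dim)
        logits = torch.stack([head(feats) for head in self.heads], dim=1)   # (B, num_domains, num_classes)
        w = torch.softmax(self.fusion, dim=0)                               # keep weights normalized
        return (w.view(1, -1, 1) * logits).sum(dim=1)                       # fused logits (B, num_classes)
```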


2021
Author(s):  
Shufeng Kong ◽  
Dan Guevarra ◽  
Carla P. Gomes ◽  
John Gregoire

The adoption of machine learning in materials science has rapidly transformed materials property prediction. Hurdles limiting full capitalization on recent advancements in machine learning include the limited development of methods to learn the underlying interactions of multiple elements, as well as the relationships among multiple properties, to facilitate property prediction in new composition spaces. To address these issues, we introduce the Hierarchical Correlation Learning for Multi-property Prediction (H-CLMP) framework, which seamlessly integrates (i) prediction using only a material’s composition, (ii) learning and exploitation of correlations among target properties in multi-target regression, and (iii) leveraging training data from tangential domains via generative transfer learning. The model is demonstrated for prediction of the spectral optical absorption of complex metal oxides spanning 69 three-cation metal oxide composition spaces. H-CLMP accurately predicts non-linear composition-property relationships in composition spaces for which no training data are available, which broadens the purview of machine learning to the discovery of materials with exceptional properties. This achievement results from the principled integration of latent embedding learning, property correlation learning, generative transfer learning, and attention models. The best performance is obtained using H-CLMP with transfer learning (H-CLMP(T)), wherein a generative adversarial network is trained on computational density-of-states data and deployed in the target domain to augment prediction of optical absorption from composition. H-CLMP(T) aggregates multiple knowledge sources within a framework well suited for multi-target regression across the physical sciences.
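A rough sketch of the generative transfer learning idea described above: auxiliary features produced by a generator pretrained on computational density-of-states data augment a composition-only multi-target regressor. All dimensions, module choices, and the simple concatenation are assumptions; H-CLMP(T) additionally uses latent embedding learning, property-correlation learning, and attention, which are omitted here.

```python
# Illustrative generative transfer learning for composition-based multi-target regression
# (assumed design, not the H-CLMP(T) architecture).
import torch
import torch.nn as nn

class TransferAugmentedRegressor(nn.Module):
    def __init__(self, n_elements, dos_generator, dos_dim, n_targets, hidden=256):
        super().__init__()
        self.dos_generator = dos_generator            # pretrained on DOS data, frozen here
        for p in self.dos_generator.parameters():
            p.requires_grad = False
        self.embed = nn.Sequential(nn.Linear(n_elements, hidden), nn.ReLU())
        self.head = nn.Sequential(
            nn.Linear(hidden + dos_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_targets))             # multi-target output, e.g., an absorption spectrum

    def forward(self, composition):
        # composition: (B, n_elements) fractional composition vector
        aux = self.dos_generator(composition)         # generated DOS-like auxiliary features (B, dos_dim)
        z = self.embed(composition)
        return self.head(torch.cat([z, aux], dim=1))
```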


2020
Vol 34 (07)
pp. 11490-11498
Author(s):  
Che-Tsung Lin ◽  
Yen-Yi Wu ◽  
Po-Hao Hsu ◽  
Shang-Hong Lai

Unpaired image-to-image translation has proven quite effective in boosting a CNN-based object detector for a different domain by means of data augmentation that preserves the image objects in the translated images. Recently, multimodal GAN (generative adversarial network) models have been proposed and were expected to further boost detector accuracy by generating a diverse collection of images in the target domain, given only a single labelled image in the source domain. However, images generated by multimodal GANs can yield even worse detection accuracy than those generated by a unimodal GAN with better object preservation. In this work, we introduce cycle-structure consistency for generating diverse and structure-preserving translated images across complex domains, such as between day and night, for object detector training. Qualitative results show that our model, Multimodal AugGAN, can generate diverse and realistic images for the target domain. For quantitative comparison, we evaluate competing methods and ours by using the generated images to train YOLO, Faster R-CNN, and FCN models, and show that our model achieves significant improvement and outperforms the other methods in detection accuracy and FCN scores. We also demonstrate that our model provides more diverse object appearances in the target domain through comparison on a perceptual distance metric.


Electronics
2020
Vol 9 (11)
pp. 1969
Author(s):  
Hongrui Liu ◽  
Shuoshi Li ◽  
Hongquan Wang ◽  
Xinshan Zhu

Existing face image completion approaches cannot rationally complete damaged face images whose identity information is completely lost because it is obscured by center masks. Hence, in this paper, a reference-guided double-pipeline face image completion network (RG-DP-FICN) is designed within the generative adversarial network (GAN) framework, completing the identity information of damaged images by utilizing reference images with the same identity. To reasonably integrate the identity information of the reference images into the completed images, the reference image is decoupled into identity features (e.g., the contours of the eyes, eyebrows, and nose) and pose features (e.g., the orientation of the face and the positions of the facial features), and the resulting identity features are fused with the pose features of the damaged images. Specifically, a lightweight identity predictor is used to extract the pose features; an identity extraction module is designed to compress and globally extract the identity features of the reference images; and an identity transfer module is proposed to effectively fuse identity and pose features by performing identity rendering on different receptive fields. Furthermore, quantitative and qualitative evaluations are conducted on the public CelebA-HQ dataset. Compared to state-of-the-art methods, the evaluation metrics peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), and L1 loss are improved by 2.22 dB, 0.033, and 0.79%, respectively. The results indicate that RG-DP-FICN can generate completed images with reasonable identity and a completion effect superior to that of existing approaches.
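A hypothetical sketch of one way a compressed identity code from a reference image could be fused with pose features of a damaged image via feature-wise modulation. The abstract does not specify the identity transfer module's internals, and the paper performs identity rendering on different receptive fields rather than the single-scale modulation shown here, so everything below is an illustrative assumption.

```python
# Illustrative identity/pose fusion by channel-wise modulation (assumed design,
# not the paper's identity transfer module).
import torch
import torch.nn as nn

class IdentityTransferSketch(nn.Module):
    def __init__(self, id_dim, feat_channels):
        super().__init__()
        # Predict per-channel scale and shift from the compressed identity code.
        self.to_scale = nn.Linear(id_dim, feat_channels)
        self.to_shift = nn.Linear(id_dim, feat_channels)

    def forward(self, pose_feat, id_code):
        # pose_feat: (B, C, H, W) features of the damaged image; id_code: (B, id_dim)
        scale = self.to_scale(id_code).unsqueeze(-1).unsqueeze(-1)
        shift = self.to_shift(id_code).unsqueeze(-1).unsqueeze(-1)
        return pose_feat * (1 + scale) + shift   # render identity information onto pose features
```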

