Domain Generalization Using a Mixture of Multiple Latent Domains

Toshihiko Matsuura; Tatsuya Harada

doi:10.1609/aaai.v34i07.6846

Domain Generalization Using a Mixture of Multiple Latent Domains

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6846 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11749-11756 ◽

Cited By ~ 2

Author(s):

Toshihiko Matsuura ◽

Tatsuya Harada

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Multiple Source ◽

Web Crawling ◽

Not Given ◽

Adversarial Learning ◽

Generalized Model ◽

Generalization Performance ◽

Invariant Feature ◽

Feature Extractor

When domains, which represent underlying data distributions, vary during training and testing processes, deep neural networks suffer a drop in their performance. Domain generalization allows improvements in the generalization performance for unseen target domains by using multiple source domains. Conventional methods assume that the domain to which each sample belongs is known in training. However, many datasets, such as those collected via web crawling, contain a mixture of multiple latent domains, in which the domain of each sample is unknown. This paper introduces domain generalization using a mixture of multiple latent domains as a novel and more realistic scenario, where we try to train a domain-generalized model without using domain labels. To address this scenario, we propose a method that iteratively divides samples into latent domains via clustering, and which trains the domain-invariant feature extractor shared among the divided latent domains via adversarial learning. We assume that the latent domain of images is reflected in their style, and thus, utilize style features for clustering. By using these features, our proposed method successfully discovers latent domains and achieves domain generalization even if the domain labels are not given. Experiments show that our proposed method can train a domain-generalized model without using domain labels. Moreover, it outperforms conventional domain generalization methods, including those that utilize domain labels.

Download Full-text

Multiple Source Localization in a Shallow Water Waveguide Exploiting Subarray Beamforming and Deep Neural Networks

Sensors ◽

10.3390/s19214768 ◽

2019 ◽

Vol 19 (21) ◽

pp. 4768 ◽

Cited By ~ 2

Author(s):

Zhaoqiong Huang ◽

Ji Xu ◽

Zaixiao Gong ◽

Haibin Wang ◽

Yonghong Yan

Keyword(s):

Neural Network ◽

Neural Networks ◽

Shallow Water ◽

Source Localization ◽

Deep Neural Networks ◽

Short Term Memory ◽

Direction Finding ◽

Multiple Source ◽

Feed Forward Neural Network ◽

Subarray Beamforming

Deep neural networks (DNNs) have been shown to be effective for single sound source localization in shallow water environments. However, multiple source localization is a more challenging task because of the interactions among multiple acoustic signals. This paper proposes a framework for multiple source localization on underwater horizontal arrays using deep neural networks. The two-stage DNNs are adopted to determine both the directions and ranges of multiple sources successively. A feed-forward neural network is trained for direction finding, while the long short term memory recurrent neural network is used for source ranging. Particularly, in the source ranging stage, we perform subarray beamforming to extract features of sources that are detected by the direction finding stage, because subarray beamforming can enhance the mixed signal to the desired direction while preserving the horizontal-longitudinal correlations of the acoustic field. In this way, a universal model trained in the single-source scenario can be applied to multi-source scenarios with arbitrary numbers of sources. Both simulations and experiments in a range-independent shallow water environment of SWellEx-96 Event S5 are given to demonstrate the effectiveness of the proposed method.

Download Full-text

VECA: A Method for Detecting Overfitting in Neural Networks (Student Abstract)

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i10.7167 ◽

2020 ◽

Vol 34 (10) ◽

pp. 13791-13792

Author(s):

Liangzhu Ge ◽

Yuexian Hou ◽

Yaju Jiang ◽

Shuai Yao ◽

Chao Yang

Keyword(s):

Neural Networks ◽

Strong Correlation ◽

Good Predictor ◽

Deep Neural Networks ◽

Training Data ◽

Training Process ◽

Generalization Performance ◽

Validation Set ◽

Fully Connected ◽

Fully Connected Networks

Despite their widespread applications, deep neural networks often tend to overfit the training data. Here, we propose a measure called VECA (Variance of Eigenvalues of Covariance matrix of Activation matrix) and demonstrate that VECA is a good predictor of networks' generalization performance during the training process. Experiments performed on fully-connected networks and convolutional neural networks trained on benchmark image datasets show a strong correlation between test loss and VECA, which suggest that we can calculate the VECA to estimate generalization performance without sacrificing training data to be used as a validation set.

Download Full-text

Comparative analysis of identification of dynamic objects by scale-invariant feature transform and deep neural networks

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1029/1/012085 ◽

2021 ◽

Vol 1029 ◽

pp. 012085

Author(s):

E A Ivliev ◽

P S Obukhov

Keyword(s):

Neural Networks ◽

Comparative Analysis ◽

Deep Neural Networks ◽

Scale Invariant Feature Transform ◽

Scale Invariant ◽

Invariant Feature ◽

Feature Transform ◽

Dynamic Objects ◽

Scale Invariant Feature

Download Full-text

Affine Equivariant Autoencoder

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/335 ◽

2019 ◽

Author(s):

Xifeng Guo ◽

En Zhu ◽

Xinwang Liu ◽

Jianping Yin

Keyword(s):

Neural Networks ◽

Affine Transformation ◽

Deep Neural Networks ◽

General Purpose ◽

Transformation Function ◽

The Self ◽

Discriminative Ability ◽

Invariant Features ◽

Feature Extractor

Existing deep neural networks mainly focus on learning transformation invariant features. However, it is the equivariant features that are more adequate for general purpose tasks. Unfortunately, few work has been devoted to learning equivariant features. To fill this gap, in this paper, we propose an affine equivariant autoencoder to learn features that are equivariant to the affine transformation in an unsupervised manner. The objective consists of the self-reconstruction of the original example and affine transformed example, and the approximation of the affine transformation function, where the reconstruction makes the encoder a valid feature extractor and the approximation encourages the equivariance. Extensive experiments are conducted to validate the equivariance and discriminative ability of the features learned by our affine equivariant autoencoder.

Download Full-text

Voice Conversion from Arbitrary Speakers Based on Deep Neural Networks with Adversarial Learning

Advances in Intelligent Information Hiding and Multimedia Signal Processing - Smart Innovation, Systems and Technologies ◽

10.1007/978-3-319-63859-1_13 ◽

2017 ◽

pp. 97-103

Author(s):

Sou Miyamoto ◽

Takashi Nose ◽

Suzunosuke Ito ◽

Harunori Koike ◽

Yuya Chiba ◽

...

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Voice Conversion ◽

Adversarial Learning

Download Full-text

Generalization Error Bounds of Gradient Descent for Learning Over-Parameterized Deep ReLU Networks

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5736 ◽

2020 ◽

Vol 34 (04) ◽

pp. 3349-3356

Author(s):

Yuan Cao ◽

Quanquan Gu

Keyword(s):

Neural Networks ◽

Error Bounds ◽

Gradient Descent ◽

Deep Neural Networks ◽

Empirical Studies ◽

Training Data ◽

Generalization Error ◽

Generalization Performance ◽

Gradient Based ◽

Random Initialization

Empirical studies show that gradient-based methods can learn deep neural networks (DNNs) with very good generalization performance in the over-parameterization regime, where DNNs can easily fit a random labeling of the training data. Very recently, a line of work explains in theory that with over-parameterization and proper random initialization, gradient-based methods can find the global minima of the training loss for DNNs. However, existing generalization error bounds are unable to explain the good generalization performance of over-parameterized DNNs. The major limitation of most existing generalization bounds is that they are based on uniform convergence and are independent of the training algorithm. In this work, we derive an algorithm-dependent generalization error bound for deep ReLU networks, and show that under certain assumptions on the data distribution, gradient descent (GD) with proper random initialization is able to train a sufficiently over-parameterized DNN to achieve arbitrarily small generalization error. Our work sheds light on explaining the good generalization performance of over-parameterized deep neural networks.

Download Full-text

Scalable quorum-based deep neural networks with adversarial learning for automated lung lobe segmentation in fast helical free-breathing CTs

International Journal of Computer Assisted Radiology and Surgery ◽

10.1007/s11548-021-02454-6 ◽

2021 ◽

Author(s):

Bradley Stiehl ◽

Michael Lauria ◽

Kamal Singhrao ◽

Jonathan Goldin ◽

Igor Barjaktarevic ◽

...

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Free Breathing ◽

Adversarial Learning ◽

Lung Lobe ◽

Lobe Segmentation

Download Full-text

Product Categorization by Title Using Deep Neural Networks as Feature Extractor

2020 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn48605.2020.9207093 ◽

2020 ◽

Author(s):

Leonardo S. Paulucio ◽

Thiago M. Paixao ◽

Rodrigo F. Berriel ◽

Alberto F. De Souza ◽

Claudine Badue ◽

...

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Feature Extractor

Download Full-text

Robust Neural Networks are More Interpretable for Genomics

10.1101/657437 ◽

2019 ◽

Cited By ~ 5

Author(s):

Peter K. Koo ◽

Sharon Qian ◽

Gal Kaplun ◽

Verena Volf ◽

Dimitris Kalimeris

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

State Of The Art ◽

Random Noise ◽

Genomic Data ◽

Training Methods ◽

Generalization Performance ◽

Regulatory Genomics ◽

Adversarial Training

AbstractDeep neural networks (DNNs) have been applied to a variety of regulatory genomics tasks. For interpretability, attribution methods are employed to provide importance scores for each nucleotide in a given sequence. However, even with state-of-the-art DNNs, there is no guarantee that these methods can recover interpretable, biological representations. Here we perform systematic experiments on synthetic genomic data to raise awareness of this issue. We find that deeper networks have better generalization performance, but attribution methods recover less interpretable representations. Then, we show training methods promoting robustness – including regularization, injecting random noise into the data, and adversarial training – significantly improve interpretability of DNNs, especially for smaller datasets.

Download Full-text

An intelligent diagnosis scheme based on generative adversarial learning deep neural networks and its application to planetary gearbox fault pattern recognition

Neurocomputing ◽

10.1016/j.neucom.2018.05.024 ◽

2018 ◽

Vol 310 ◽

pp. 213-222 ◽

Cited By ~ 73

Author(s):

Zirui Wang ◽

Jun Wang ◽

Youren Wang

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Deep Neural Networks ◽

Adversarial Learning ◽

Intelligent Diagnosis ◽

Planetary Gearbox ◽

Fault Pattern

Download Full-text