Mixture density network estimation of continuous variable maximum likelihood using discrete training samples

AbstractMixture density networks (MDNs) can be used to generate posterior density functions of model parameters $$\varvec{\theta }$$ θ given a set of observables $${\mathbf {x}}$$ x . In some applications, training data are available only for discrete values of a continuous parameter $$\varvec{\theta }$$ θ . In such situations, a number of performance-limiting issues arise which can result in biased estimates. We demonstrate the usage of MDNs for parameter estimation, discuss the origins of the biases, and propose a corrective method for each issue.

Download Full-text

Constructing and model-fitting receiver operator characteristics using continuous data.

10.31234/osf.io/qdgjc ◽

2018 ◽

Author(s):

Josephine Ann Urquhart ◽

Akira O'Connor

Keyword(s):

Signal Detection ◽

Least Squares Method ◽

Model Fitting ◽

Simulated Data ◽

Continuous Variable ◽

Continuous Data ◽

Model Parameters ◽

Operating Characteristics ◽

Unequal Variance ◽

Discrete Discrimination

Receiver operating characteristics (ROCs) are plots which provide a visual summary of a classifier’s decision response accuracy at varying discrimination thresholds. Typical practice, particularly within psychological studies, involves plotting an ROC from a limited number of discrete thresholds before fitting signal detection parameters to the plot. We propose that additional insight into decision-making could be gained through increasing ROC resolution, using trial-by-trial measurements derived from a continuous variable, in place of discrete discrimination thresholds. Such continuous ROCs are not yet routinely used in behavioural research, which we attribute to issues of practicality (i.e. the difficulty of applying standard ROC model-fitting methodologies to continuous data). Consequently, the purpose of the current article is to provide a documented method of fitting signal detection parameters to continuous ROCs. This method reliably produces model fits equivalent to the unequal variance least squares method of model-fitting (Yonelinas et al., 1998), irrespective of the number of data points used in ROC construction. We present the suggested method in three main stages: I) building continuous ROCs, II) model-fitting to continuous ROCs and III) extracting model parameters from continuous ROCs. Throughout the article, procedures are demonstrated in Microsoft Excel, using an example continuous variable: reaction time, taken from a single-item recognition memory. Supplementary MATLAB code used for automating our procedures is also presented in Appendix B, with a validation of the procedure using simulated data shown in Appendix C.

Download Full-text

Siamese Reconstruction Network: Accurate Image Reconstruction from Human Brain Activity by Learning to Compare

Applied Sciences ◽

10.3390/app9224749 ◽

2019 ◽

Vol 9 (22) ◽

pp. 4749

Author(s):

Lingyun Jiang ◽

Kai Qiao ◽

Linyuan Wang ◽

Chi Zhang ◽

Jian Chen ◽

...

Keyword(s):

Deep Learning ◽

Human Brain ◽

Brain Activity ◽

Feature Space ◽

Training Data ◽

Reconstruction Method ◽

Learning Method ◽

Training Samples ◽

Visual Reconstruction ◽

Relationship Of

Decoding human brain activities, especially reconstructing human visual stimuli via functional magnetic resonance imaging (fMRI), has gained increasing attention in recent years. However, the high dimensionality and small quantity of fMRI data impose restrictions on satisfactory reconstruction, especially for the reconstruction method with deep learning requiring huge amounts of labelled samples. When compared with the deep learning method, humans can recognize a new image because our human visual system is naturally capable of extracting features from any object and comparing them. Inspired by this visual mechanism, we introduced the mechanism of comparison into deep learning method to realize better visual reconstruction by making full use of each sample and the relationship of the sample pair by learning to compare. In this way, we proposed a Siamese reconstruction network (SRN) method. By using the SRN, we improved upon the satisfying results on two fMRI recording datasets, providing 72.5% accuracy on the digit dataset and 44.6% accuracy on the character dataset. Essentially, this manner can increase the training data about from n samples to 2n sample pairs, which takes full advantage of the limited quantity of training samples. The SRN learns to converge sample pairs of the same class or disperse sample pairs of different class in feature space.

Download Full-text

A Critique and Improvement of the CL Common Language Effect Size Statistics of McGraw and Wong

Journal of Educational and Behavioral Statistics ◽

10.3102/10769986025002101 ◽

2000 ◽

Vol 25 (2) ◽

pp. 101-132 ◽

Cited By ~ 44

Author(s):

András Vargha ◽

Harold D. Delaney

Keyword(s):

Effect Size ◽

Continuous Distribution ◽

Interval Estimation ◽

Continuous Variable ◽

Common Language ◽

Exact Methods ◽

Normal Distributions ◽

The Difference ◽

Discrete Values ◽

Two Populations

McGraw and Wong (1992) described an appealing index of effect size, called CL, which measures the difference between two populations in terms of the probability that a score sampled at random from the first population will be greater than a score sampled at random from the second. McGraw and Wong introduced this "common language effect size statistic" for normal distributions and then proposed an approximate estimation for any continuous distribution. In addition, they generalized CL to the n-group case, the correlated samples case, and the discrete values case. In the current paper a different generalization of CL, called the A measure of stochastic superiority, is proposed, which may be directly applied for any discrete or continuous variable that is at least ordinally scaled. Exact methods for point and interval estimation as well as the significance tests of the A = .5 hypothesis are provided. New generalizations ofCL are provided for the multi-group and correlated samples cases.

Download Full-text

GaitSense: Towards Ubiquitous Gait-Based Human Identification with Wi-Fi

ACM Transactions on Sensor Networks ◽

10.1145/3466638 ◽

2022 ◽

Vol 18 (1) ◽

pp. 1-24

Author(s):

Yi Zhang ◽

Yue Zheng ◽

Guidong Zhang ◽

Kun Qian ◽

Chen Qian ◽

...

Keyword(s):

Data Augmentation ◽

Gait Recognition ◽

Wearable Sensors ◽

Human Identification ◽

Training Data ◽

Identification Accuracy ◽

Identification System ◽

Gait Patterns ◽

Training Samples ◽

Augmentation Techniques

Gait, the walking manner of a person, has been perceived as a physical and behavioral trait for human identification. Compared with cameras and wearable sensors, Wi-Fi-based gait recognition is more attractive because Wi-Fi infrastructure is almost available everywhere and is able to sense passively without the requirement of on-body devices. However, existing Wi-Fi sensing approaches impose strong assumptions of fixed user walking trajectories, sufficient training data, and identification of already known users. In this article, we present GaitSense , a Wi-Fi-based human identification system, to overcome the above unrealistic assumptions. To deal with various walking trajectories and speeds, GaitSense first extracts target specific features that best characterize gait patterns and applies novel normalization algorithms to eliminate gait irrelevant perturbation in signals. On this basis, GaitSense reduces the training efforts in new deployment scenarios by transfer learning and data augmentation techniques. GaitSense also enables a distinct feature of illegal user identification by anomaly detection, making the system readily available for real-world deployment. Our implementation and evaluation with commodity Wi-Fi devices demonstrate a consistent identification accuracy across various deployment scenarios with little training samples, pushing the limit of gait recognition with Wi-Fi signals.

Download Full-text

New active learning algorithms for near-infrared spectroscopy in agricultural applications

at - Automatisierungstechnik ◽

10.1515/auto-2020-0143 ◽

2021 ◽

Vol 69 (4) ◽

pp. 297-306

Author(s):

Julius Krause ◽

Maurice Günder ◽

Daniel Schulz ◽

Robin Gruna

Keyword(s):

Active Learning ◽

Near Infrared ◽

Agricultural Products ◽

Training Data ◽

Calibration Model ◽

Learning Approaches ◽

Training Samples ◽

Agricultural Applications ◽

Selection Of

Abstract The selection of training data determines the quality of a chemometric calibration model. In order to cover the entire parameter space of known influencing parameters, an experimental design is usually created. Nevertheless, even with a carefully prepared Design of Experiment (DoE), redundant reference analyses are often performed during the analysis of agricultural products. Because the number of possible reference analyses is usually very limited, the presented active learning approaches are intended to provide a tool for better selection of training samples.

Download Full-text

Flight crew diagnostic using aviation simulator training data

Experimental Psychology (Russia) ◽

10.17759/exppsy.2016090310 ◽

2016 ◽

Vol 9 (3) ◽

pp. 118-137

Author(s):

L.S. Kuravsky ◽

P.A. Marmalyuk ◽

G.A. Yuryev ◽

O.B. Belyaeva ◽

O.Yu. Prokopieva

Keyword(s):

Continuous Time ◽

Goodness Of Fit ◽

Training Data ◽

Simulator Training ◽

Model Parameters ◽

Flight Crew ◽

Advantages And Disadvantages ◽

Goodness Of Fit Tests ◽

Flight Simulators ◽

Discrete States

This paper describes a new concept of flight crew assessment based on flight simulators training result. It is based on representation of pilot gaze movement with the aid of continuous-time Markov processes with discrete states. Considered are both the procedure of model parameters identification provided with goodness-of-fit tests in use and the classifier-building technique, which makes it possible to estimate degree of correspondence between the observed gaze motion distribution under study and reference distributions identified for different diagnosed groups. The final assessing criterion is formed on the basis of integrated diagnostic parameters, which are determined by the parameters of the identified models. The article provides a description of the experiment, illustrations, and results of studies aimed at assessing the reliability of the developed models and criteria, as well as conclusions about the applicability of the approach, its advantages and disadvantages.

Download Full-text

Unsupervised Deep Learning via Affinity Diffusion

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6757 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11029-11036

Author(s):

Jiabo Huang ◽

Qi Dong ◽

Shaogang Gong ◽

Xiatian Zhu

Keyword(s):

Deep Learning ◽

State Of The Art ◽

General Purpose ◽

Training Data ◽

Learning Approach ◽

Model Learning ◽

Feature Representations ◽

Discriminative Feature ◽

Training Samples ◽

Unsupervised Deep Learning

Convolutional neural networks (CNNs) have achieved unprecedented success in a variety of computer vision tasks. However, they usually rely on supervised model learning with the need for massive labelled training data, limiting dramatically their usability and deployability in real-world scenarios without any labelling budget. In this work, we introduce a general-purpose unsupervised deep learning approach to deriving discriminative feature representations. It is based on self-discovering semantically consistent groups of unlabelled training samples with the same class concepts through a progressive affinity diffusion process. Extensive experiments on object image classification and clustering show the performance superiority of the proposed method over the state-of-the-art unsupervised learning models using six common image recognition benchmarks including MNIST, SVHN, STL10, CIFAR10, CIFAR100 and ImageNet.

Download Full-text

Uncertainty quantification in fault detection using convolutional neural networks

Geophysics ◽

10.1190/geo2020-0424.1 ◽

2021 ◽

pp. 1-45

Author(s):

Runhai Feng ◽

Dario Grana ◽

Niels Balling

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Reservoir Characterization ◽

Epistemic Uncertainty ◽

Confidence Regions ◽

Training Data ◽

Model Parameters ◽

Sigmoid Function ◽

Learning Tools

Segmentation of faults based on seismic images is an important step in reservoir characterization. With the recent developments of deep-learning methods and the availability of massive computing power, automatic interpretation of seismic faults has become possible. The likelihood of occurrence for a fault can be quantified using a sigmoid function. Our goal is to quantify the fault model uncertainty that is generally not captured by deep-learning tools. We propose to use the dropout approach, a regularization technique to prevent overfitting and co-adaptation in hidden units, to approximate the Bayesian inference and estimate the principled uncertainty over functions. Particularly, the variance of the learned model has been decomposed into aleatoric and epistemic parts. The proposed method is applied to a real dataset from the Netherlands F3 block with two different dropout ratios in convolutional neural networks. The aleatoric uncertainty is irreducible since it relates to the stochastic dependency within the input observations. As the number of Monte-Carlo realizations increases, the epistemic uncertainty asymptotically converges and the model standard deviation decreases, because the variability of model parameters is better simulated or explained with a larger sample size. This analysis can quantify the confidence to use fault predictions with less uncertainty. Additionally, the analysis suggests where more training data are needed to reduce the uncertainty in low confidence regions.

Download Full-text

Learning to Transfer: Unsupervised Domain Translation via Meta-Learning

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6816 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11507-11514

Author(s):

Jianxin Lin ◽

Yijun Wang ◽

Zhibo Chen ◽

Tianyu He

Keyword(s):

Learning Experiences ◽

Training Data ◽

Training Procedure ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Training Samples ◽

Meta Learning ◽

Translation Methods ◽

Meta Optimization ◽

Dual Translation

Unsupervised domain translation has recently achieved impressive performance with Generative Adversarial Network (GAN) and sufficient (unpaired) training data. However, existing domain translation frameworks form in a disposable way where the learning experiences are ignored and the obtained model cannot be adapted to a new coming domain. In this work, we take on unsupervised domain translation problems from a meta-learning perspective. We propose a model called Meta-Translation GAN (MT-GAN) to find good initialization of translation models. In the meta-training procedure, MT-GAN is explicitly trained with a primary translation task and a synthesized dual translation task. A cycle-consistency meta-optimization objective is designed to ensure the generalization ability. We demonstrate effectiveness of our model on ten diverse two-domain translation tasks and multiple face identity translation tasks. We show that our proposed approach significantly outperforms the existing domain translation methods when each domain contains no more than ten training samples.

Download Full-text

Simple Degree-of-Freedom Modeling of the Random Fluctuation Arising in Human–Bicycle Balance

Applied Sciences ◽

10.3390/app9102154 ◽

2019 ◽

Vol 9 (10) ◽

pp. 2154 ◽

Cited By ~ 1

Author(s):

Katsutoshi Yoshida ◽

Keishi Sato ◽

Yoshikazu Yamanaka

Keyword(s):

Angular Displacement ◽

Degree Of Freedom ◽

Training Data ◽

Statistical Hypothesis ◽

Random Fluctuation ◽

Model Parameters ◽

Statistical Hypothesis Testing ◽

Measured Time ◽

Kolmogorov Smirnov ◽

Fluctuation Model

In this study, we propose a new simple degree-of-freedom fluctuation model that accurately reproduces the probability density functions (PDFs) of human–bicycle balance motions as simply as possible. First, we measure the time series of the roll angular displacement and velocity of human–bicycle balance motions and construct their PDFs. Next, using these PDFs as training data, we identify the model parameters by means of particle swarm optimization; in particular, we minimize the Kolmogorov–Smirnov distance between the human PDFs from the participants and the PDFs simulated by our model. The resulting PDF fitnesses were over 98.7 % for all participants, indicating that our simulated PDFs were in close agreement with human PDFs. Furthermore, the Kolmogorov–Smirnov statistical hypothesis testing was applied to the resulting human–bicycle fluctuation model, showing that the measured time responses were much better supported by our model than the Gaussian distribution.

Download Full-text