probability density estimation
Recently Published Documents


TOTAL DOCUMENTS: 202 (FIVE YEARS: 23)

H-INDEX: 25 (FIVE YEARS: 1)

PLoS ONE ◽  
2021 ◽  
Vol 16 (11) ◽  
pp. e0259111
Author(s):  
Frank Kwasniok

A comprehensive methodology for semiparametric probability density estimation is introduced and explored. The probability density is modelled by sequences of mostly regular or steep exponential families generated by flexible sets of basis functions, possibly including boundary terms. Parameters are estimated by global maximum likelihood without any roughness penalty. A statistically orthogonal formulation of the inference problem and a numerically stable and fast convex optimization algorithm for its solution are presented. Automatic model selection over the type and number of basis functions is performed with the Bayesian information criterion. The methodology can naturally be applied to densities supported on bounded, infinite or semi-infinite domains without boundary bias. Relationships to the truncated moment problem and the moment-constrained maximum entropy principle are discussed and a new theorem on the existence of solutions is contributed. The new technique compares very favourably to kernel density estimation, the diffusion estimator, finite mixture models and local likelihood density estimation across a diverse range of simulation and observation data sets. The semiparametric estimator combines a very small mean integrated squared error with a high degree of smoothness which allows for a robust and reliable detection of the modality of the probability density in terms of the number of modes and bumps.
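A minimal sketch of the core idea, exponential-family density estimation by global maximum likelihood, can clarify the abstract. The polynomial basis, the bounded support [0, 1] and the sample data below are illustrative assumptions, not the paper's actual basis sets or optimization algorithm:

```python
import numpy as np
from scipy.optimize import minimize
from scipy.integrate import trapezoid

rng = np.random.default_rng(0)
x = rng.beta(2.0, 5.0, size=500)          # samples supported on [0, 1]

def basis(t, degree=3):
    # simple polynomial basis t, t^2, ..., t^degree (a stand-in for the
    # paper's flexible basis sets, possibly including boundary terms)
    return np.vstack([t**k for k in range(1, degree + 1)])

grid = np.linspace(0.0, 1.0, 2001)

def neg_log_likelihood(theta, x):
    # exponential-family density p(t) = exp(theta . phi(t)) / Z(theta)
    log_unnorm = theta @ basis(grid)
    Z = trapezoid(np.exp(log_unnorm), grid)   # numerical normalizer
    return -(theta @ basis(x)).mean() + np.log(Z)

# global maximum likelihood, no roughness penalty
res = minimize(neg_log_likelihood, np.zeros(3), args=(x,), method="BFGS")
theta_hat = res.x
log_unnorm = theta_hat @ basis(grid)
p_hat = np.exp(log_unnorm) / trapezoid(np.exp(log_unnorm), grid)
```

Model selection in the spirit of the paper would then compare BIC values, d * log(n) + 2 * n * NLL, across candidate basis types and degrees d.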


2021 ◽  
Vol 2125 (1) ◽  
pp. 012032
Author(s):  
Ning Li ◽  
Junfeng Duan ◽  
Jun Ma ◽  
Wei Qiu ◽  
Wei Zhang ◽  
...  

Abstract Electric energy metering equipment (EEME) operating in extreme environments can fail earlier than designed. A multi-kernel Gaussian process regression model that uses measurement error data to predict the remaining useful life (RUL) of EEME is proposed. First, the Gaussian kernel and the periodic kernel are used to match the health index trend of EEME under a variety of typical environmental stresses. Then, the Bayesian method and the Markov chain Monte Carlo method are used to solve the model, and the Weibull distribution is used to fit the posterior trajectory to obtain a probability density estimate of the RUL.
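A hedged sketch of the multi-kernel ingredient: a Gaussian (RBF) kernel plus a periodic kernel fitted to a synthetic drifting health index. The kernel settings, time scale and data are illustrative; the paper's Bayesian/MCMC solution and the Weibull RUL fit are not reproduced here:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, ExpSineSquared, WhiteKernel

rng = np.random.default_rng(1)
t = np.linspace(0, 10, 80)[:, None]   # time in service (arbitrary units)
# drifting health index: trend + seasonal component + measurement noise
y = 0.5 * t.ravel() + 0.1 * np.sin(2 * np.pi * t.ravel()) \
    + 0.02 * rng.standard_normal(80)

# multi-kernel covariance: Gaussian kernel + periodic kernel + noise term
kernel = RBF(length_scale=3.0) \
    + ExpSineSquared(length_scale=1.0, periodicity=1.0) \
    + WhiteKernel(noise_level=1e-3)
gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True).fit(t, y)

# extrapolate the health index beyond the observed horizon
t_future = np.linspace(10, 15, 50)[:, None]
mean, std = gp.predict(t_future, return_std=True)
```

From posterior trajectories like these, RUL samples could be obtained by recording when each trajectory crosses a failure threshold and then fitting a Weibull distribution (e.g. with `scipy.stats.weibull_min.fit`) to those crossing times.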


2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Ruihan Cheng ◽  
Longfei Zhang ◽  
Shiqi Wu ◽  
Sen Xu ◽  
Shang Gao ◽  
...  

Class imbalance learning (CIL) is an important branch of machine learning because, in general, classification models struggle to learn from imbalanced data, while skewed data distributions frequently arise in real-world applications. In this paper, we introduce a novel CIL solution called the Probability Density Machine (PDM). First, in the context of the Gaussian Naive Bayes (GNB) predictive model, we analyze theoretically why an imbalanced data distribution degrades predictive performance, and conclude that the impact of class imbalance is associated only with the prior probabilities, not with the class-conditional probabilities of the training data. In this context, we then show the rationality of several traditional CIL techniques, and point out the drawback of combining GNB with them. Next, borrowing the idea of K-nearest-neighbors probability density estimation (KNN-PDE), we propose the PDM, an improved GNB-based CIL algorithm. Finally, we conduct experiments on numerous class-imbalanced data sets, and the proposed PDM algorithm shows promising results.
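The KNN-PDE ingredient that PDM borrows is easy to sketch: the density at a point is estimated as k over n times the volume of the smallest ball containing the k nearest neighbors. The 1-D Gaussian data below are an illustrative assumption, not the paper's experimental setup:

```python
import numpy as np

def knn_density(query, sample, k=10):
    # KNN density estimate: p(x) ~ k / (n * V), where V is the volume
    # (in 1-D, the length 2*r) of the ball reaching the k-th nearest
    # neighbour of x
    sample = np.asarray(sample)
    n = len(sample)
    dens = []
    for x in np.atleast_1d(query):
        r_k = np.sort(np.abs(sample - x))[k - 1]  # distance to k-th NN
        dens.append(k / (n * 2.0 * r_k))
    return np.array(dens)

rng = np.random.default_rng(2)
data = rng.normal(0.0, 1.0, 2000)
p_hat = knn_density([0.0, 2.0], data, k=50)
# the estimate should be larger at the mode (x=0) than in the tail (x=2)
```

In a CIL setting, such per-class density estimates are useful precisely because they bypass the prior probabilities that, as the paper argues, carry the damage of class imbalance in GNB.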


Author(s):  
Hien Duy Nguyen ◽  
TrungTin Nguyen ◽  
Faicel Chamroukhi ◽  
Geoffrey John McLachlan

Abstract Mixture of experts (MoE) models are widely applied to conditional probability density estimation problems. We demonstrate the richness of the class of MoE models by proving denseness results in Lebesgue spaces when the input and output variables are both compactly supported. We further prove an almost uniform convergence result when the input is univariate. Auxiliary lemmas are proved regarding the richness of the soft-max gating function class and its relationship to the class of Gaussian gating functions.
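A small numeric sketch of the model class studied here: the conditional density of a soft-max-gated mixture of Gaussian experts. The two experts and their parameters below are arbitrary illustrations, not constructions from the paper's proofs:

```python
import numpy as np

def moe_conditional_density(y, x, gate_w, gate_b, mean_w, mean_b, sigmas):
    # soft-max gating weights pi_k(x), numerically stabilized
    logits = gate_w * x + gate_b
    pi = np.exp(logits - logits.max())
    pi /= pi.sum()
    # Gaussian experts with input-dependent means mu_k(x)
    mu = mean_w * x + mean_b
    norm = np.exp(-0.5 * ((y - mu) / sigmas) ** 2) / (sigmas * np.sqrt(2 * np.pi))
    # conditional density p(y | x) = sum_k pi_k(x) N(y; mu_k(x), sigma_k^2)
    return float(pi @ norm)

# a two-expert example
p = moe_conditional_density(
    y=0.5, x=1.0,
    gate_w=np.array([1.0, -1.0]), gate_b=np.array([0.0, 0.0]),
    mean_w=np.array([0.5, -0.5]), mean_b=np.array([0.0, 0.0]),
    sigmas=np.array([0.3, 0.3]),
)
```

For any fixed x this is a proper density in y; the denseness results say that, with enough such experts, densities of this form can approximate a broad class of compactly supported conditional densities.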


2021 ◽  
Author(s):  
Antoni Torres-Signes ◽  
M. Pilar Frías ◽  
Maria Dolores Ruiz-Medina

Abstract A multiple-objective space-time forecasting approach is presented, involving cyclical curve log-regression and multivariate time series spatial residual correlation analysis. Specifically, the mean quadratic loss function is minimized in the framework of trigonometric regression, while in the subsequent spatial residual correlation analysis, maximization of the likelihood allows us to compute the posterior mode in a Bayesian multivariate time series soft-data framework. The presented approach is applied to the analysis of COVID-19 mortality in the first wave affecting the Spanish Communities, from March 8, 2020 until May 13, 2020. An empirical comparative study with Machine Learning (ML) regression, based on random k-fold cross-validation, bootstrapped confidence intervals and probability density estimation, is carried out. This empirical analysis also investigates the performance of ML regression models in hard- and soft-data frameworks. The results could be extrapolated to other counts, countries and subsequent COVID-19 waves.
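The trigonometric-regression step, minimizing the mean quadratic loss over sine/cosine terms, can be sketched by ordinary least squares. The weekly period, the trend and the synthetic series below are illustrative assumptions standing in for the paper's mortality curves:

```python
import numpy as np

rng = np.random.default_rng(3)
t = np.arange(60, dtype=float)   # days since the start of the wave
period = 7.0                     # assumed weekly reporting cycle
y = 50 + 0.5 * t + 10 * np.sin(2 * np.pi * t / period) \
    + rng.normal(0.0, 1.0, 60)

# design matrix: intercept, linear trend, first harmonic (sin and cos)
X = np.column_stack([
    np.ones_like(t),
    t,
    np.sin(2 * np.pi * t / period),
    np.cos(2 * np.pi * t / period),
])
# least squares = minimizing the mean quadratic loss
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
fitted = X @ beta
```

Cyclical curve log-regression would apply the same design to log counts; the paper's Bayesian spatial residual analysis then operates on the residuals y - fitted across regions.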


Sensors ◽  
2021 ◽  
Vol 21 (4) ◽  
pp. 1080
Author(s):  
Namuk Park ◽  
Songkuk Kim

Efficient and accurate estimation of the probability distribution of a data stream is an important problem in many sensor systems. It is especially challenging when the data stream is non-stationary, i.e., when its probability distribution changes over time. Statistical models for non-stationary data streams demand agile adaptation to concept drift while tolerating temporal fluctuations; to this end, a statistical model needs to forget old data samples and to detect concept drift swiftly. In this paper, we propose FlexSketch, an online probability density estimation algorithm for data streams. Our algorithm uses an ensemble of histograms, each of which represents a different length of data history. FlexSketch updates each histogram for every new data sample and generates a probability distribution by combining the ensemble of histograms, while periodically monitoring the discrepancy between recent data and the existing models. When it detects concept drift, a new histogram is added to the ensemble and the oldest histogram is removed. This allows the probability density function to be estimated with high update speed and high accuracy using only limited memory. Experimental results demonstrate that our algorithm improves on existing methods in both speed and accuracy for stationary and non-stationary data streams.
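A much-simplified sketch of the histogram-ensemble idea: keep several histograms over different history lengths and average them into one density estimate. Drift detection, histogram replacement and the paper's actual combination rule are omitted; the bin count, value range and data stream are illustrative assumptions:

```python
import numpy as np
from collections import deque

class HistogramEnsemble:
    """Ensemble of fixed-bin histograms over different history lengths."""

    def __init__(self, n_hists=3, bins=20, lo=0.0, hi=1.0, window=200):
        self.edges = np.linspace(lo, hi, bins + 1)
        # each member remembers a progressively longer slice of the stream
        self.windows = [deque(maxlen=window * (i + 1)) for i in range(n_hists)]

    def update(self, x):
        # append the new sample to every member's sliding window
        for w in self.windows:
            w.append(x)

    def pdf(self, x):
        # combine the members' normalized histograms with equal weights
        dens = []
        for w in self.windows:
            counts, _ = np.histogram(list(w), bins=self.edges, density=True)
            idx = np.clip(np.searchsorted(self.edges, x, side="right") - 1,
                          0, len(counts) - 1)
            dens.append(counts[idx])
        return float(np.mean(dens))

est = HistogramEnsemble()
rng = np.random.default_rng(4)
for v in rng.uniform(0.2, 0.4, 500):   # stream concentrated on [0.2, 0.4]
    est.update(v)
```

Because the members span different history lengths, short windows react quickly after a drift while long windows keep estimates smooth when the stream is stationary, which is the trade-off FlexSketch manages explicitly.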

