nonstationary data Latest Research Papers

Abstract Catastrophic forgetting is the notorious vulnerability of neural networks to the changes in the data distribution during learning. This phenomenon has long been considered a major obstacle for using learning agents in realistic continual learning settings. A large body of continual learning research assumes that task boundaries are known during training. However, only a few works consider scenarios in which task boundaries are unknown or not well defined: task-agnostic scenarios. The optimal Bayesian solution for this requires an intractable online Bayes update to the weights posterior. We aim to approximate the online Bayes update as accurately as possible. To do so, we derive novel fixed-point equations for the online variational Bayes optimization problem for multivariate gaussian parametric distributions. By iterating the posterior through these fixed-point equations, we obtain an algorithm (FOO-VB) for continual learning that can handle nonstationary data distribution using a fixed architecture and without using external memory (i.e., without access to previous data). We demonstrate that our method (FOO-VB) outperforms existing methods in task-agnostic scenarios. FOO-VB Pytorch implementation is available at https://github.com/chenzeno/FOO-VB.

Download Full-text

Kernel Online System for Fast Principal Component Analysis and its Adaptive Learning

International Journal of Computing ◽

10.47839/ijc.20.2.2164 ◽

2021 ◽

pp. 175-180

Author(s):

Yevgeniy Bodyanskiy ◽

Anastasiia Deineko ◽

Antonina Bondarchuk ◽

Maksym Shalamov

Keyword(s):

Initial Data ◽

Adaptive Learning ◽

Data Stream ◽

High Speed ◽

Optimal Algorithm ◽

Principal Component ◽

Neural System ◽

Process Data ◽

Ability To Work ◽

Nonstationary Data

An artificial neural system for data compression that sequentially processes linearly nonseparable classes is proposed. The main elements of this system include adjustable radial-basis functions (Epanechnikov’s kernels), an adaptive linear associator learned by a multistep optimal algorithm, and Hebb-Sanger neural network whose nodes are formed by Oja’s neurons. For tuning the modified Oja’s algorithm, additional filtering (in case of noisy data) and tracking (in case of nonstationary data) properties were introduced. The main feature of the proposed system is the ability to work in conditions of significant nonlinearity of the initial data that are sequentially fed to the system and have a non-stationary nature. The effectiveness of the developed approach was confirmed by the experimental results. The proposed kernel online neural system is designed to solve compression and visualization tasks when initial data form linearly nonseparable classes in general problem of Data Stream Mining and Dynamic Data Mining. The main benefit of the proposed approach is high speed and ability to process data whose characteristics are changed in time.

Download Full-text

Optimum wavelet selection for nonparametric analysis toward structural health monitoring for processing big data from sensor network: A comparative study

Structural Health Monitoring ◽

10.1177/14759217211010261 ◽

2021 ◽

pp. 147592172110102

Author(s):

Ahmed Silik ◽

Mohammad Noori ◽

Wael A Altabey ◽

Ji Dang ◽

Ramin Ghiasi ◽

...

Keyword(s):

Wavelet Transform ◽

Structural Health Monitoring ◽

Health Monitoring ◽

Civil Engineering ◽

Engineering Structures ◽

Structural Health ◽

Nonstationary Data ◽

Civil Engineering Structures ◽

Wavelet Selection ◽

The Right

A critical problem encountered in structural health monitoring of civil engineering structures, and other structures such as mechanical or aircraft structures, is how to convincingly analyze the nonstationary data that is coming online, how to reduce the high-dimensional features, and how to extract informative features associated with damage to infer structural conditions. Wavelet transform among other techniques has proven to be an effective technique for processing and analyzing nonstationary data due to its unique characteristics. However, the biggest challenge frequently encountered in assuring the effectiveness of wavelet transform in analyzing massive nonstationary data from civil engineering structures, and in structural health diagnosis, is how to select the right wavelet. The question of which wavelet function is appropriate for processing and analyzing the nonstationary data in civil engineering structures has not been clearly addressed, and no clear guidelines or rules have been reported in the literature to show how the right wavelet is chosen. Therefore, this study aims to address an important question in this regard by proposing a new framework for choosing a proper wavelet that can be customized for massive nonstationary data analysis, disturbances separation, and extraction of informative features associated with damage. The proposed method takes into account data type, data and wavelet characteristics, similarity, sharing information, and data recovery accuracy. The novelty of this study lies in integrating multi-criteria which are associated directly with features that correlated well with change in structures due to damage, including common criteria such as energy, entropy, linear correlation index, and variance. Also, it introduces and considers new proposed measures, such as wavelet-based nonlinear correlation such as cosh spectral distance and mutual information, wavelet-based energy fluctuation, measures-based recovery accuracy, such as sensitive feature extraction, noise reduction, and others to evaluate various base wavelets’ function capabilities for appropriate decomposition and reconstruction of structural dynamic responses. The proposed method is verified by experimental and simulated data. The results revealed that the proposed method has a satisfactory performance for base wavelet selection and the small order of Daubechies and Symlet provide the best results, especially order 3. The idea behind our proposed framework can be applied to other structural applications.

Download Full-text

Spurious Factor Analysis

Econometrica ◽

10.3982/ecta16703 ◽

2021 ◽

Vol 89 (2) ◽

pp. 591-614

Author(s):

Alexei Onatski ◽

Chen Wang

Keyword(s):

Factor Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Unit Root ◽

Information Criteria ◽

High Dimensional ◽

Spurious Regression ◽

Nonstationary Data ◽

Components Analysis ◽

Data Variation

This paper draws parallels between the principal components analysis of factorless high‐dimensional nonstationary data and the classical spurious regression. We show that a few of the principal components of such data absorb nearly all the data variation. The corresponding scree plot suggests that the data contain a few factors, which is corroborated by the standard panel information criteria. Furthermore, the Dickey–Fuller tests of the unit root hypothesis applied to the estimated “idiosyncratic terms” often reject, creating an impression that a few factors are responsible for most of the nonstationarity in the data. We warn empirical researchers of these peculiar effects and suggest to always compare the analysis in levels with that in differences.

Download Full-text

OSNN: An Online Semisupervised Neural Network for Nonstationary Data Streams

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2021.3132584 ◽

2021 ◽

pp. 1-13

Author(s):

Rodrigo G. F. Soares ◽

Leandro L. Minku

Keyword(s):

Neural Network ◽

Data Streams ◽

Nonstationary Data

Download Full-text

A Novel Approach to Maximize G-mean in Nonstationary Data with Recurrent Imbalance Shifts

The International Arab Journal of Information Technology ◽

10.34028/iajit/18/1/12 ◽

2020 ◽

Vol 18 (1) ◽

pp. 103-113

Keyword(s):

State Of The Art ◽

Class Imbalance ◽

Imbalanced Data ◽

Streaming Data ◽

Novel Approach ◽

Benchmark Datasets ◽

Boosting Algorithms ◽

Nonstationary Data ◽

Test Outcomes

One of the noteworthy difficulties in the classification of nonstationary data is handling data with class imbalance. Imbalanced data possess the characteristics of having a lot of samples of one class than the other. It, thusly, results in the biased accuracy of a classifier in favour of a majority class. Streaming data may have inherent imbalance resulting from the nature of dataspace or extrinsic imbalance due to its nonstationary environment. In streaming data, timely varying class priors may lead to a shift in imbalance ratio. The researchers have contemplated ensemble learning, online learning, issue of class imbalance and cost-sensitive algorithms autonomously. They have scarcely ever tended to every one of these issues mutually to deal with imbalance shift in nonstationary data. This correspondence shows a novel methodology joining these perspectives to augment G-mean in no stationary data with Recurrent Imbalance Shifts (RIS). This research modifies the state-of-the-art boosting algorithms,1) AdaC2 to get G-mean based Online AdaC2 for Recurrent Imbalance Shifts (GOA-RIS) and AGOA-RIS (Ageing and G-mean based Online AdaC2 for Recurrent Imbalance Shifts), and 2) CSB2 to get G-mean based Online CSB2 for Recurrent Imbalance Shifts (GOC-RIS) and Ageing and G-mean based Online CSB2 for Recurrent Imbalance Shifts (AGOC-RIS). The study has empirically and statistically analysed the performances of the proposed algorithms and Online AdaC2 (OA) and Online CSB2 (OC) algorithms using benchmark datasets. The test outcomes demonstrate that the proposed algorithms globally beat the performances of OA and OC

Download Full-text

Time-varying wavelet estimation and deconvolution for nonstationary data based on a FWE function

Journal of Applied Geophysics ◽

10.1016/j.jappgeo.2020.104198 ◽

2020 ◽

Vol 183 ◽

pp. 104198

Author(s):

Yumeng Jiang ◽

Siyuan Cao ◽

Siyuan Chen ◽

Hang Wang ◽

Hengchang Dai ◽

...

Keyword(s):

Time Varying ◽

Wavelet Estimation ◽

Nonstationary Data

Download Full-text

Counterfactual Analysis and Inference With Nonstationary Data

Journal of Business and Economic Statistics ◽

10.1080/07350015.2020.1799814 ◽

2020 ◽

pp. 1-13

Author(s):

Ricardo Masini ◽

Marcelo C. Medeiros

Keyword(s):

Counterfactual Analysis ◽

Nonstationary Data

Download Full-text

Microcluster-Based Incremental Ensemble Learning for Noisy, Nonstationary Data Streams

Complexity ◽

10.1155/2020/6147378 ◽

2020 ◽

Vol 2020 ◽

pp. 1-12

Author(s):

Sanmin Liu ◽

Shan Xue ◽

Fanzhen Liu ◽

Jieren Cheng ◽

Xiulai Li ◽

...

Keyword(s):

Ensemble Learning ◽

Data Streams ◽

Data Stream ◽

Concept Drift ◽

Majority Vote ◽

Stream Classification ◽

Model Stability ◽

Data Stream Classification ◽

Nonstationary Data ◽

Synthetic Datasets

Data stream classification becomes a promising prediction work with relevance to many practical environments. However, under the environment of concept drift and noise, the research of data stream classification faces lots of challenges. Hence, a new incremental ensemble model is presented for classifying nonstationary data streams with noise. Our approach integrates three strategies: incremental learning to monitor and adapt to concept drift; ensemble learning to improve model stability; and a microclustering procedure that distinguishes drift from noise and predicts the labels of incoming instances via majority vote. Experiments with two synthetic datasets designed to test for both gradual and abrupt drift show that our method provides more accurate classification in nonstationary data streams with noise than the two popular baselines.

Download Full-text

nonstationary data
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Spectral density estimation for nonstationary data with nonzero mean function

Task-Agnostic Continual Learning Using Online Variational Bayes with Fixed-Point Updates

Kernel Online System for Fast Principal Component Analysis and its Adaptive Learning

Optimum wavelet selection for nonparametric analysis toward structural health monitoring for processing big data from sensor network: A comparative study

Spurious Factor Analysis

OSNN: An Online Semisupervised Neural Network for Nonstationary Data Streams

A Novel Approach to Maximize G-mean in Nonstationary Data with Recurrent Imbalance Shifts

Time-varying wavelet estimation and deconvolution for nonstationary data based on a FWE function

Counterfactual Analysis and Inference With Nonstationary Data

Microcluster-Based Incremental Ensemble Learning for Noisy, Nonstationary Data Streams

Export Citation Format

nonstationary dataRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Spectral density estimation for nonstationary data with nonzero mean function

Task-Agnostic Continual Learning Using Online Variational Bayes with Fixed-Point Updates

Kernel Online System for Fast Principal Component Analysis and its Adaptive Learning

Optimum wavelet selection for nonparametric analysis toward structural health monitoring for processing big data from sensor network: A comparative study

Spurious Factor Analysis

OSNN: An Online Semisupervised Neural Network for Nonstationary Data Streams

A Novel Approach to Maximize G-mean in Nonstationary Data with Recurrent Imbalance Shifts

Time-varying wavelet estimation and deconvolution for nonstationary data based on a FWE function

Counterfactual Analysis and Inference With Nonstationary Data

Microcluster-Based Incremental Ensemble Learning for Noisy, Nonstationary Data Streams

nonstationary data
Recently Published Documents