Evolving Fuzzy Systems from Data Streams in Real-Time

Now-a-days data streams or information streams are gigantic and quick changing. The usage of information streams can fluctuate from basic logical, scientific applications to vital business and money related ones. The useful information is abstracted from the stream and represented in the form of micro-clusters in the online phase. In offline phase micro-clusters are merged to form the macro clusters. DBSTREAM technique captures the density between micro-clusters by means of a shared density graph in the online phase. The density data in this graph is then used in reclustering for improving the formation of clusters but DBSTREAM takes more time in handling the corrupted data points In this paper an early pruning algorithm is used before pre-processing of information and a bloom filter is used for recognizing the corrupted information. Our experiments on real time datasets shows that using this approach improves the efficiency of macro-clusters by 90% and increases the generation of more number of micro-clusters within in a short time.

Download Full-text

Real-Time Compression for Tactile Internet Data Streams

Sensors ◽

10.3390/s21051924 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1924

Author(s):

Patrick Seeling ◽

Martin Reisslein ◽

Frank H. P. Fitzek

Keyword(s):

Real Time ◽

Data Streams ◽

Control Loop ◽

Real World Data ◽

Perceptual Coding ◽

Great Flexibility ◽

Broad Application ◽

Time Compression ◽

Perceptual Threshold ◽

Selection Of

The Tactile Internet will require ultra-low latencies for combining machines and humans in systems where humans are in the control loop. Real-time and perceptual coding in these systems commonly require content-specific approaches. We present a generic approach based on deliberately reduced number accuracy and evaluate the trade-off between savings achieved and errors introduced with real-world data for kinesthetic movement and tele-surgery. Our combination of bitplane-level accuracy adaptability with perceptual threshold-based limits allows for great flexibility in broad application scenarios. Combining the attainable savings with the relatively small introduced errors enables the optimal selection of a working point for the method in actual implementations.

Download Full-text

Online sequential ensembling of predictive fuzzy systems

Evolving Systems ◽

10.1007/s12530-021-09398-x ◽

2021 ◽

Author(s):

Edwin Lughofer ◽

Mahardhika Pratama

Keyword(s):

Data Streams ◽

Fuzzy Systems ◽

Large Scale ◽

Fuzzy Model ◽

System Delay ◽

Actual System ◽

Processing Times ◽

Target Values ◽

Improved Performance ◽

Prediction Techniques

AbstractEvolving fuzzy systems (EFS) have enjoyed a wide attraction in the community to handle learning from data streams in an incremental, single-pass and transparent manner. The main concentration so far lied in the development of approaches for single EFS models, basically used for prediction purposes. Forgetting mechanisms have been used to increase their flexibility, especially for the purpose to adapt quickly to changing situations such as drifting data distributions. These require forgetting factors steering the degree of timely out-weighing older learned concepts, whose adequate setting in advance or in adaptive fashion is not an easy and not a fully resolved task. In this paper, we propose a new concept of learning fuzzy systems from data streams, which we call online sequential ensembling of fuzzy systems (OS-FS). It is able to model the recent dependencies in streams on a chunk-wise basis: for each new incoming chunk, a new fuzzy model is trained from scratch and added to the ensemble (of fuzzy systems trained before). This induces (i) maximal flexibility in terms of being able to apply variable chunk sizes according to the actual system delay in receiving target values and (ii) fast reaction possibilities in the case of arising drifts. The latter are realized with specific prediction techniques on new data chunks based on the sequential ensemble members trained so far over time. We propose four different prediction variants including various weighting concepts in order to put higher weights on the members with higher inference certainty during the amalgamation of predictions of single members to a final prediction. In this sense, older members, which keep in mind knowledge about past states, may get dynamically reactivated in the case of cyclic drifts, which induce dynamic changes in the process behavior which are re-occurring from time to time later. Furthermore, we integrate a concept for properly resolving possible contradictions among members with similar inference certainties. The reaction onto drifts is thus autonomously handled on demand and on the fly during the prediction stage (and not during model adaptation/evolution stage as conventionally done in single EFS models), which yields enormous flexibility. Finally, in order to cope with large-scale and (theoretically) infinite data streams within a reasonable amount of prediction time, we demonstrate two concepts for pruning past ensemble members, one based on atypical high error trends of single members and one based on the non-diversity of ensemble members. The results based on two data streams showed significantly improved performance compared to single EFS models in terms of a better convergence of the accumulated chunk-wise ahead prediction error trends, especially in the case of regular and cyclic drifts. Moreover, the more advanced prediction schemes could significantly outperform standard averaging over all members’ outputs. Furthermore, resolving contradictory outputs among members helped to improve the performance of the sequential ensemble further. Results on a wider range of data streams from different application scenarios showed (i) improved error trend lines over single EFS models, as well as over related AI methods OS-ELM and MLPs neural networks retrained on data chunks, and (ii) slightly worse trend lines than on-line bagged EFS (as specific EFS ensembles), but with around 100 times faster processing times (achieving low processing times way below requiring milli-seconds for single samples updates).

Download Full-text

Multimodal analysis of body sensor network data streams for real-time healthcare

Proceedings of the international conference on Multimedia information retrieval - MIR '10 ◽

10.1145/1743384.1743467 ◽

2010 ◽

Cited By ~ 22

Author(s):

Manoj K. Garg ◽

Duk-Jin Kim ◽

Deepak S. Turaga ◽

Balakrishnan Prabhakaran

Keyword(s):

Real Time ◽

Sensor Network ◽

Data Streams ◽

Network Data ◽

Body Sensor Network ◽

Multimodal Analysis

Download Full-text

A novel energy-based online sequential extreme learning machine to detect anomalies over real-time data streams

Neural Computing and Applications ◽

10.1007/s00521-021-05731-2 ◽

2021 ◽

Author(s):

Xiaoping Wang ◽

Shanshan Tu ◽

Wei Zhao ◽

Chengjie Shi

Keyword(s):

Real Time ◽

Extreme Learning Machine ◽

Data Streams ◽

Time Data ◽

Real Time Data ◽

Learning Machine

Download Full-text

Measuring the Effectiveness of Adaptive Random Forest for Handling Concept Drift in Big Data Streams

Entropy ◽

10.3390/e23070859 ◽

2021 ◽

Vol 23 (7) ◽

pp. 859

Author(s):

Abdulaziz O. AlQabbany ◽

Aqil M. Azmi

Keyword(s):

Big Data ◽

Random Forest ◽

Real Time ◽

Data Streams ◽

Learning Algorithm ◽

Concept Drift ◽

The United States ◽

Careful Consideration ◽

Data Sets ◽

Stream Data

We are living in the age of big data, a majority of which is stream data. The real-time processing of this data requires careful consideration from different perspectives. Concept drift is a change in the data’s underlying distribution, a significant issue, especially when learning from data streams. It requires learners to be adaptive to dynamic changes. Random forest is an ensemble approach that is widely used in classical non-streaming settings of machine learning applications. At the same time, the Adaptive Random Forest (ARF) is a stream learning algorithm that showed promising results in terms of its accuracy and ability to deal with various types of drift. The incoming instances’ continuity allows for their binomial distribution to be approximated to a Poisson(1) distribution. In this study, we propose a mechanism to increase such streaming algorithms’ efficiency by focusing on resampling. Our measure, resampling effectiveness (ρ), fuses the two most essential aspects in online learning; accuracy and execution time. We use six different synthetic data sets, each having a different type of drift, to empirically select the parameter λ of the Poisson distribution that yields the best value for ρ. By comparing the standard ARF with its tuned variations, we show that ARF performance can be enhanced by tackling this important aspect. Finally, we present three case studies from different contexts to test our proposed enhancement method and demonstrate its effectiveness in processing large data sets: (a) Amazon customer reviews (written in English), (b) hotel reviews (in Arabic), and (c) real-time aspect-based sentiment analysis of COVID-19-related tweets in the United States during April 2020. Results indicate that our proposed method of enhancement exhibited considerable improvement in most of the situations.

Download Full-text

A comparison of select trigger algorithms for automated global seismic phase and event detection

Bulletin of the Seismological Society of America ◽

10.1785/bssa0880010095 ◽

1998 ◽

Vol 88 (1) ◽

pp. 95-106 ◽

Cited By ~ 12

Author(s):

Mitchell Withers ◽

Richard Aster ◽

Christopher Young ◽

Judy Beiriger ◽

Mark Harris ◽

...

Keyword(s):

Real Time ◽

Data Streams ◽

Event Detection ◽

Spectral Characteristics ◽

Digital Data ◽

Robust Detection ◽

Location System ◽

Global Correlation ◽

Seismic Coda ◽

Wide Range

Abstract Digital algorithms for robust detection of phase arrivals in the presence of stationary and nonstationary noise have a long history in seismology and have been exploited primarily to reduce the amount of data recorded by data logging systems to manageable levels. In the present era of inexpensive digital storage, however, such algorithms are increasingly being used to flag signal segments in continuously recorded digital data streams for subsequent processing by automatic and/or expert interpretation systems. In the course of our development of an automated, near-real-time, waveform correlation event-detection and location system (WCEDS), we have surveyed the abilities of such algorithms to enhance seismic phase arrivals in teleseismic data streams. Specifically, we have considered envelopes generated by energy transient (STA/LTA), Z-statistic, frequency transient, and polarization algorithms. The WCEDS system requires a set of input data streams that have a smooth, low-amplitude response to background noise and seismic coda and that contain peaks at times corresponding to phase arrivals. The algorithm used to generate these input streams from raw seismograms must perform well under a wide range of source, path, receiver, and noise scenarios. Present computational capabilities allow the application of considerably more robust algorithms than have been historically used in real time. However, highly complex calculations can still be computationally prohibitive for current workstations when the number of data streams become large. While no algorithm was clearly optimal under all source, receiver, path, and noise conditions tested, an STA/LTA algorithm incorporating adaptive window lengths controlled by nonstationary seismogram spectral characteristics was found to provide an output that best met the requirements of a global correlation-based event-detection and location system.

Download Full-text

Evolving Fuzzy Systems from Data Streams in Real-Time

Real-time human activity recognition from wireless sensors using evolving fuzzy systems

Evolving fuzzy systems for data streams: a survey

Handling drifts and shifts in on-line data streams with evolving fuzzy systems

Improved Macro-clusters generation using Top-k shared Micro-clusters in Data Streams

Real-Time Compression for Tactile Internet Data Streams

Online sequential ensembling of predictive fuzzy systems

Multimodal analysis of body sensor network data streams for real-time healthcare

A novel energy-based online sequential extreme learning machine to detect anomalies over real-time data streams

Measuring the Effectiveness of Adaptive Random Forest for Handling Concept Drift in Big Data Streams

A comparison of select trigger algorithms for automated global seismic phase and event detection

Export Citation Format