Methods on Detecting Closely Related Topics and Spatial Events

Topic detection is a hot issue that many researchers are interested in. The previous researches focused on the single data stream, they did not consider the topic detection from different data streams in a harmonious way, so they cannot detect closely related topics from different data streams. Recently, Twitter, along with other SNS such as Weibo, and Yelp, began backing position services in their texts. Previous approaches are either complex to be conducted or oversimplified that cannot achieve better performance on detecting spatial topics. In our paper, we introduce a probabilistic method which can precisely detect closely related bursty topics and their bursty periods across different data streams in a unified way. We also introduce a probabilistic method called Latent Spatial Events Model (LSEM) that can find areas as well as to detect the spatial events, it can also predict positions of the texts. We evaluate LSEM on different datasets and reflect that our approach outperforms other baseline approaches in different indexes such as perplexity, entropy of topic and KL-divergence, range error. Evaluation of our first proposed approach on different datasets shows that it can detect closely related topics and meaningful bursty time periods from different datasets.

Download Full-text

STRUCTURE DISCOVERY IN SEQUENTIALLY-CONNECTED DATA STREAMS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213006003041 ◽

2006 ◽

Vol 15 (06) ◽

pp. 917-944 ◽

Cited By ~ 8

Author(s):

JEFFREY COBLE ◽

DIANE J. COOK ◽

LAWRENCE B. HOLDER

Keyword(s):

Data Mining ◽

Change Detection ◽

Data Streams ◽

Data Stream ◽

Relational Data ◽

Intelligence Analysis ◽

Central Tendency ◽

Continuous Stream ◽

Single Data ◽

Complex Relationships

Historically, data mining research has been focused on discovering sets of attributes that discriminate data entities into classes or association rules between attributes. In contrast, we are working to develop data mining techniques to discover patterns consisting of complex relationships between entities. Our research is particularly applicable to domains in which the data is event driven, such as counter-terrorism intelligence analysis. In this paper we describe an algorithm designed to operate over relational data received from a continuous stream. Our approach includes a mechanism for summarizing discoveries from previous data increments so that the globally best patterns can be computed by examining only the new data increment. We then describe a method by which relational dependencies that span across temporal increment boundaries can be efficiently resolved so that additional pattern instances, which do not reside entirely in a single data increment, can be discovered. We also describe a method for change detection using a measure of central tendency designed for graph data. We contrast two formulations of the change detection process and demonstrate the ability to identify salient changes along meaningful dimensions and recognize trends in a relational data stream.

Download Full-text

Recurring concept memory management in data streams: exploiting data stream concept evolution to improve performance and transparency

Data Mining and Knowledge Discovery ◽

10.1007/s10618-021-00736-w ◽

2021 ◽

Author(s):

Ben Halstead ◽

Yun Sing Koh ◽

Patricia Riddle ◽

Russel Pears ◽

Mykola Pechenizkiy ◽

...

Keyword(s):

Data Streams ◽

Data Stream ◽

Memory Management ◽

Improve Performance ◽

Concept Evolution

Download Full-text

Analysis of Data Stream Processing At Edge Layer for Internet of Things

Journal of ISMAC - June 2019 ◽

10.36548/jismac.2020.1.003 ◽

2020 ◽

Vol 2 (1) ◽

pp. 26-37

Author(s):

Dr. Pasumponpandian

Keyword(s):

Internet Of Things ◽

Data Streams ◽

Data Stream ◽

Smart Cities ◽

Stream Processing ◽

Middle Layer ◽

Cloud Services ◽

Decentralized Systems ◽

Data Stream Processing ◽

Edge Layer

The progress of internet of things at a rapid pace and simultaneous development of the technologies and the processing capabilities has paved way for the development of decentralized systems that are relying on cloud services. Though the decentralized systems are founded on cloud complexities still prevail in transferring all the information’s that are been sensed through the IOT devices to the cloud. This because of the huge streams of information’s gathered by certain applications and the expectation to have a timely response, incurring minimized delay, computing energy and enhanced reliability. So this kind of decentralization has led to the development of middle layer between the cloud and the IOT, and was termed as the Edge layer, meaning bringing down the service of the cloud to the user edge. The paper puts forth the analysis of the data stream processing in the edge layer taking in the complexities involved in the computing the data streams of IOT in an edge layer and puts forth the real time analytics in the edge layer to examine the data streams of the internet of things offering a data- driven insight for parking system in the smart cities.

Download Full-text

The study of the structure processor performance in the computer system with multiple-instruction streams and single-data stream

Engineering Journal Science and Innovation ◽

10.18698/2308-6033-2013-11-1048 ◽

2013 ◽

Cited By ~ 1

Author(s):

А.Ю. Попов ◽

Keyword(s):

Computer System ◽

Data Stream ◽

Processor Performance ◽

Single Data

Download Full-text

Knowledge Discovery From Evolving Data Streams

Advances in Business Information Systems and Analytics - Machine Learning Techniques for Improved Business Analytics ◽

10.4018/978-1-5225-3534-8.ch002 ◽

2019 ◽

pp. 19-39

Author(s):

Prasanna Lakshmi Kompalli

Keyword(s):

Real Time ◽

Data Streams ◽

Data Stream ◽

Concept Drift ◽

Data Stream Mining ◽

Time Data ◽

Stream Mining ◽

New Challenges ◽

Mining Data Streams ◽

Different Sources

Data coming from different sources is referred to as data streams. Data stream mining is an online learning technique where each data point must be processed as the data arrives and discarded as the processing is completed. Progress of technologies has resulted in the monitoring these data streams in real time. Data streams has created many new challenges to the researchers in real time. The main features of this type of data are they are fast flowing, large amounts of data which are continuous and growing in nature, and characteristics of data might change in course of time which is termed as concept drift. This chapter addresses the problems in mining data streams with concept drift. Due to which, isolating the correct literature would be a grueling task for researchers and practitioners. This chapter tries to provide a solution as it would be an amalgamation of all techniques used for data stream mining with concept drift.

Download Full-text

Exploring Calendar-Based Pattern Mining in Data Streams

Complex Data Warehousing and Knowledge Discovery for Advanced Retrieval Development ◽

10.4018/978-1-60566-748-5.ch016 ◽

2010 ◽

pp. 342-360

Author(s):

Rodrigo Salvador Monteiro ◽

Geraldo Zimbrão ◽

Holger Schwarz ◽

Bernhard Mitschang ◽

Jano Moreira de Souza

Keyword(s):

Data Warehouse ◽

Data Streams ◽

Data Stream ◽

Pattern Mining ◽

A Priori ◽

Frequent Itemsets ◽

Detailed Data ◽

Series Of Experiments ◽

Working Day

Calendar-based pattern mining aims at identifying patterns on specific calendar partitions. Potential calendar partitions are for example: every Monday, every first working day of each month, every holiday. Providing flexible mining capabilities for calendar-based partitions is especially challenging in a data stream scenario. The calendar partitions of interest are not known a priori and at each point in time only a subset of the detailed data is available. The authors show how a data warehouse approach can be applied to this problem. The data warehouse that keeps track of frequent itemsets holding on different partitions of the original stream has low storage requirements. Nevertheless, it allows to derive sets of patterns that are complete and precise. Furthermore, the authors demonstrate the effectiveness of their approach by a series of experiments.

Download Full-text

Research on Sequence Query Processing Techniques over Data Streams

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.284-287.3507 ◽

2013 ◽

Vol 284-287 ◽

pp. 3507-3511 ◽

Cited By ~ 5

Author(s):

Edgar Chia Han Lin

Keyword(s):

Query Processing ◽

Data Streams ◽

Management System ◽

Data Stream ◽

Sensor Data ◽

Data Management System ◽

Content Filtering ◽

Data Stream Management ◽

Stream Management ◽

Great Progress

Due to the great progress of computer technology and mature development of network, more and more data are generated and distributed through the network, which is called data streams. During the last couple of years, a number of researchers have paid their attention to data stream management, which is different from the conventional database management. At present, the new type of data management system, called data stream management system (DSMS), has become one of the most popular research areas in data engineering field. Lots of research projects have made great progress in this area. Since the current DSMS does not support queries on sequence data, this project will study the issues related to two types of data. First, we will focus on the content filtering on single-attribute streams, such as sensor data. Second, we will focus on multi-attribute streams, such as video films. We will discuss the related issues such as how to build an efficient index for all queries of different streams and the corresponding query processing mechanisms.

Download Full-text

An Efficient Frequent Patterns Mining Algorithm over Data Streams Based on FPD-Graph

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.433-440.4457 ◽

2012 ◽

Vol 433-440 ◽

pp. 4457-4462 ◽

Cited By ~ 1

Author(s):

Jun Shan Tan ◽

Zhu Fang Kuang ◽

Guo Gui Yang

Keyword(s):

Data Streams ◽

Data Stream ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Frequent Patterns ◽

Data Generation ◽

Experiment Data ◽

Mining Algorithm ◽

Head Node

The design of synopses structure is an important issue of frequent patterns mining over data stream. A data stream synopses structure FPD-Graph which is based on directed graph is proposed in this paper. The FPD-Graph contains list head node FPDG-Head and list node FPDG-Node. The operations of FPD-Graph consist of insert operation and deletion operation. A frequent pattern mining algorithm DGFPM based on sliding window over data stream is proposed in this paper. The IBM synthesizes data generation which output customers shopping a data are adopted as experiment data. The DGFPM algorithm not only has high precision for mining frequent patterns, but also has low processing time.

Download Full-text

A Team’s Neurodynamic Organization is More than the Sum of its Members

Proceedings of the Human Factors and Ergonomics Society Annual Meeting ◽

10.1177/1541931213601997 ◽

2017 ◽

Vol 61 (1) ◽

pp. 2010-2014 ◽

Cited By ~ 2

Author(s):

Ronald Stevens ◽

Trysha Galloway ◽

Ann Willemson-Dunlap

Keyword(s):

Decision Making ◽

Data Streams ◽

Data Stream ◽

Team Member ◽

Specific Information ◽

Naturalistic Decision Making ◽

Team Members ◽

Total Information

The information within the neurodynamic data streams of teams engaged in naturalistic decision making was separated into information unique to each team member, the information shared by two or more team members, and team-specific information related to interactions with the task and team members. Most of the team information consisted of the information contained in an individual’s neurodynamic data stream. The information in an individual’s data stream that was shared with another team member was highly variable being 1-60% of the total information in another person’s data stream. From the shared, individual, and team information it becomes possible to assign quantitative values to both the neurodynamics of each team member during the task, as well as the interactions among the members of the team.

Download Full-text