Technology of Continuous Query Optimization over Data Streams

HIFUN is a high-level query language for expressing analytic queries of big datasets, offering a clear separation between the conceptual layer, where analytic queries are defined independently of the nature and location of data, and the physical layer, where queries are evaluated. In this paper, we present a methodology based on the HIFUN language, and the corresponding algorithms for the incremental evaluation of continuous queries. In essence, our approach is able to process the most recent data batch by exploiting already computed information, without requiring the evaluation of the query over the complete dataset. We present the generic algorithm which we translated to both SQL and MapReduce using SPARK; it implements various query rewriting methods. We demonstrate the effectiveness of our approach in temrs of query answering efficiency. Finally, we show that by exploiting the formal query rewriting methods of HIFUN, we can further reduce the computational cost, adding another layer of query optimization to our implementation.

Download Full-text

Composite Event Processing for Data Streams and Domain Knowledge

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.219-220.927 ◽

2011 ◽

Vol 219-220 ◽

pp. 927-931

Author(s):

Jun Qiang Liu ◽

Xiao Ling Guan

Keyword(s):

Query Optimization ◽

Data Streams ◽

Domain Knowledge ◽

Semantic Information ◽

Query Language ◽

Processing System ◽

Optimization Techniques ◽

Research Attention ◽

Composite Event ◽

Solid Foundation

In recent years the processing of composite event queries over data streams has attracted a lot of research attention. Traditional database techniques were not designed for stream processing system. Furthermore, example continuous queries are often formulated in declarative query language without specifying the semantics. To overcome these deficiencies, this article presents the design, implementation, and evaluation of a system that executes data streams with semantic information. Then, a set of optimization techniques are proposed for handling query. So, our approach not only makes it possible to express queries with a sound semantics, but also provides a solid foundation for query optimization. Experiment results show that our approach is effective and efficient for data streams and domain knowledge.

Download Full-text

Quality-Driven Continuous Query Execution over Out-of-Order Data Streams

Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data - SIGMOD '15 ◽

10.1145/2723372.2735371 ◽

2015 ◽

Cited By ~ 15

Author(s):

Yuanzhen Ji ◽

Hongjin Zhou ◽

Zbigniew Jerzak ◽

Anisoara Nica ◽

Gregor Hackenbroich ◽

...

Keyword(s):

Data Streams ◽

Continuous Query ◽

Query Execution

Download Full-text

Continuous query processing in data streams using duality of data and queries

Proceedings of the 2006 ACM SIGMOD international conference on Management of data - SIGMOD '06 ◽

10.1145/1142473.1142509 ◽

2006 ◽

Cited By ~ 21

Author(s):

Hyo-Sang Lim ◽

Jae-Gil Lee ◽

Min-Jae Lee ◽

Kyu-Young Whang ◽

Il-Yeol Song

Keyword(s):

Query Processing ◽

Data Streams ◽

Continuous Query ◽

Continuous Query Processing

Download Full-text

Logical Foundations of Continuous Query Languages for Data Streams

Lecture Notes in Computer Science - Datalog in Academia and Industry ◽

10.1007/978-3-642-32925-8_18 ◽

2012 ◽

pp. 177-189 ◽

Cited By ~ 12

Author(s):

Carlo Zaniolo

Keyword(s):

Data Streams ◽

Query Languages ◽

Continuous Query ◽

Logical Foundations

Download Full-text

Adaptive Continuous Query Reoptimization over Data Streams

IEICE Transactions on Information and Systems ◽

10.1587/transinf.e92.d.1421 ◽

2009 ◽

Vol E92-D (7) ◽

pp. 1421-1428 ◽

Cited By ~ 1

Author(s):

Hong Kyu PARK ◽

Won Suk LEE

Keyword(s):

Data Streams ◽

Continuous Query

Download Full-text

RSP-QL Semantics

International Journal on Semantic Web and Information Systems ◽

10.4018/ijswis.2014100102 ◽

2014 ◽

Vol 10 (4) ◽

pp. 17-44 ◽

Cited By ~ 38

Author(s):

Daniele Dell'Aglio ◽

Emanuele Della Valle ◽

Jean-Paul Calbimonte ◽

Oscar Corcho

Keyword(s):

Data Streams ◽

Operational Semantics ◽

Stream Processing ◽

Query Languages ◽

Continuous Query ◽

Comprehensive Model ◽

Continuous Processing ◽

Data Interchange ◽

The Web

RDF and SPARQL are established standards for data interchange and querying on the Web. While they have been shown to be useful and applicable in many scenarios, they are not sufficiently adequate for dealing with streams of data and their intrinsic continuous nature. In the last years data and query languages have been proposed to extend both RDF and SPARQL for streams and continuous processing, under the name of RDF Stream Processing – RSP. These efforts resulted in several models and implementations that, at a first look, appear to propose alternative syntaxes but equivalent semantics. However, when asked to continuously answer the same queries on the same data streams, they provide different answers at disparate moments due to the heterogeneity of their operational semantics. These discrepancies render the process of understanding and comparing continuous query results complex and misleading. In this work, the authors propose RSP-QL, a comprehensive model that formally defines the semantics of an RSP system. RSP-QL makes explicit the hidden assumptions of currently available RSP systems, allows defining a formal notion of correctness for RSP query results and, thus, explains why available implementations provide different answers at disparate moments.

Download Full-text