Query Rewriting for Incremental Continuous Query Evaluation in HIFUN

Petros Zervoudakis; Haridimos Kondylakis; Nicolas Spyratos; Dimitris Plexousakis

doi:10.3390/a14050149

Query Rewriting for Incremental Continuous Query Evaluation in HIFUN

Algorithms ◽

10.3390/a14050149 ◽

2021 ◽

Vol 14 (5) ◽

pp. 149

Author(s):

Petros Zervoudakis ◽

Haridimos Kondylakis ◽

Nicolas Spyratos ◽

Dimitris Plexousakis

Keyword(s):

Query Optimization ◽

Query Language ◽

Computational Cost ◽

Continuous Queries ◽

Continuous Query ◽

Query Rewriting ◽

Query Evaluation ◽

Clear Separation ◽

Complete Dataset ◽

High Level

HIFUN is a high-level query language for expressing analytic queries of big datasets, offering a clear separation between the conceptual layer, where analytic queries are defined independently of the nature and location of data, and the physical layer, where queries are evaluated. In this paper, we present a methodology based on the HIFUN language, and the corresponding algorithms for the incremental evaluation of continuous queries. In essence, our approach is able to process the most recent data batch by exploiting already computed information, without requiring the evaluation of the query over the complete dataset. We present the generic algorithm which we translated to both SQL and MapReduce using SPARK; it implements various query rewriting methods. We demonstrate the effectiveness of our approach in temrs of query answering efficiency. Finally, we show that by exploiting the formal query rewriting methods of HIFUN, we can further reduce the computational cost, adding another layer of query optimization to our implementation.

Get full-text (via PubEx)

C-SPARQL: A CONTINUOUS QUERY LANGUAGE FOR RDF DATA STREAMS

International Journal of Semantic Computing ◽

10.1142/s1793351x10000936 ◽

2010 ◽

Vol 04 (01) ◽

pp. 3-25 ◽

Cited By ~ 136

Author(s):

DAVIDE FRANCESCO BARBIERI ◽

DANIELE BRAGA ◽

STEFANO CERI ◽

EMANUELE DELLA VALLE ◽

MICHAEL GROSSNIKLAUS

Keyword(s):

Data Streams ◽

Query Language ◽

Formal Semantics ◽

Streaming Data ◽

Continuous Queries ◽

Continuous Query ◽

Urban Computing ◽

Rdf Data ◽

Over Time

This article defines C-SPARQL, an extension of SPARQL whose distinguishing feature is the support of continuous queries, i.e. queries registered over RDF data streams and then continuously executed. Queries consider windows, i.e. the most recent triples of such streams, observed while data is continuously flowing. Supporting streams in RDF format guarantees interoperability and opens up important applications, in which reasoners can deal with evolving knowledge over time. C-SPARQL is presented by means of a full specification of the syntax, a formal semantics, and a comprehensive set of examples, relative to urban computing applications, that systematically cover the SPARQL extensions. The expression of meaningful queries over streaming data is strictly connected to the availability of aggregation primitives, thus C-SPARQL also includes extensions in this respect.

Get full-text (via PubEx)

Ontology-Assisted Enterprise Information Systems Integration in Manufacturing Supply Chain

Handbook of Research on Developments and Trends in Industrial and Materials Engineering - Advances in Chemical and Materials Engineering ◽

10.4018/978-1-7998-1831-1.ch005 ◽

2020 ◽

pp. 96-122

Author(s):

Kamalendu Pal

Keyword(s):

Query Language ◽

Query Rewriting ◽

Enterprise Information ◽

Web Based ◽

Information Systems Integration ◽

Manufacturing Supply Chain ◽

Recent Developments ◽

Description Framework ◽

High Level ◽

Representation Techniques

Manufacturing communities around the globe are eagerly witnessing the recent developments in semantic web technology (SWT). This technology combines a set of new mechanisms with grounded knowledge representation techniques to address the needs of formal information modelling and reasoning for web-based services. This chapter provides a high-level summary of SWT to help better understand the impact that this technology will have on wider enterprise information architectures. In many cases it also reuses familiar concepts with a new twist. For example, “ontologies” for “data dictionaries” and “semantic models” for “data models.” This chapter presents the usefulness of a proposed architecture by applying a theory to integrating data from multiple heterogeneous sources which entails dealing with semantic mapping between source schema and a resource description framework (RDF) ontology described declaratively using specific query language (i.e. SPARQL) queries. Finally, the semantic of query rewriting is further discussed and a query rewriting algorithm is presented.

Get full-text (via PubEx)

BAND NN: A Deep Learning Framework For Energy Prediction and Geometry Optimization of Organic Small Molecules

10.26434/chemrxiv.9763094 ◽

2019 ◽

Author(s):

Siddhartha Laghuvarapu ◽

Yashaswi Pathak ◽

U. Deva Priyakumar

Keyword(s):

Machine Learning ◽

Density Functional ◽

Computational Cost ◽

Geometry Optimization ◽

Dft Methods ◽

Energy Prediction ◽

Machine Learning Model ◽

Equilibrium Structures ◽

High Level ◽

Non Equilibrium

Recent advances in artificial intelligence along with development of large datasets of energies calculated using quantum mechanical (QM)/density functional theory (DFT) methods have enabled prediction of accurate molecular energies at reasonably low computational cost. However, machine learning models that have been reported so far requires the atomic positions obtained from geometry optimizations using high level QM/DFT methods as input in order to predict the energies, and do not allow for geometry optimization. In this paper, a transferable and molecule-size independent machine learning model (BAND NN) based on a chemically intuitive representation inspired by molecular mechanics force fields is presented. The model predicts the atomization energies of equilibrium and non-equilibrium structures as sum of energy contributions from bonds (B), angles (A), nonbonds (N) and dihedrals (D) at remarkable accuracy. The robustness of the proposed model is further validated by calculations that span over the conformational, configurational and reaction space. The transferability of this model on systems larger than the ones in the dataset is demonstrated by performing calculations on select large molecules. Importantly, employing the BAND NN model, it is possible to perform geometry optimizations starting from non-equilibrium structures along with predicting their energies.

Get full-text (via PubEx)

Numerical simulation and experimental validation of cavitating flow in a multi-hole diesel fuel injector

International Journal of Engine Research ◽

10.1177/1468087421998631 ◽

2021 ◽

pp. 146808742199863

Author(s):

Aishvarya Kumar ◽

Ali Ghobadian ◽

Jamshid Nouri

Keyword(s):

Computational Cost ◽

Cavitating Flow ◽

Field Analysis ◽

Navier Stokes ◽

Fuel Injector ◽

Fluid Behavior ◽

Flow Field Analysis ◽

Charged Couple Device ◽

Biodiesel Fuels ◽

High Level

This study assesses the predictive capability of the ZGB (Zwart-Gerber-Belamri) cavitation model with the RANS (Reynolds Averaged Navier-Stokes), the realizable k-epsilon turbulence model, and compressibility of gas/liquid models for cavitation simulation in a multi-hole fuel injector at different cavitation numbers (CN) for diesel and biodiesel fuels. The prediction results were assessed quantitatively by comparison of predicted velocity profiles with those of measured LDV (Laser Doppler Velocimetry) data. Subsequently, predictions were assessed qualitatively by visual comparison of the predicted void fraction with experimental CCD (Charged Couple Device) recorded images. Both comparisons showed that the model could predict fluid behavior in such a condition with a high level of confidence. Additionally, flow field analysis of numerical results showed the formation of vortices in the injector sac volume. The analysis showed two main types of vortex structures formed. The first kind appeared connecting two adjacent holes and is known as “hole-to-hole” connecting vortices. The second type structure appeared as double “counter-rotating” vortices emerging from the needle wall and entering the injector hole facing it. The use of RANS proved to save significant computational cost and time in predicting the cavitating flow with good accuracy.

Get full-text (via PubEx)

A Review of Algorithms and Hardware Implementations for Spiking Neural Networks

Journal of Low Power Electronics and Applications ◽

10.3390/jlpea11020023 ◽

2021 ◽

Vol 11 (2) ◽

pp. 23

Author(s):

Duy-Anh Nguyen ◽

Xuan-Tu Tran ◽

Francesca Iacopi

Keyword(s):

Neural Networks ◽

Computational Cost ◽

Superior Performance ◽

Training Algorithms ◽

Current State ◽

Hardware Implementations ◽

Neuromorphic Hardware ◽

High Level ◽

Event Based ◽

Hardware Platforms

Deep Learning (DL) has contributed to the success of many applications in recent years. The applications range from simple ones such as recognizing tiny images or simple speech patterns to ones with a high level of complexity such as playing the game of Go. However, this superior performance comes at a high computational cost, which made porting DL applications to conventional hardware platforms a challenging task. Many approaches have been investigated, and Spiking Neural Network (SNN) is one of the promising candidates. SNN is the third generation of Artificial Neural Networks (ANNs), where each neuron in the network uses discrete spikes to communicate in an event-based manner. SNNs have the potential advantage of achieving better energy efficiency than their ANN counterparts. While generally there will be a loss of accuracy on SNN models, new algorithms have helped to close the accuracy gap. For hardware implementations, SNNs have attracted much attention in the neuromorphic hardware research community. In this work, we review the basic background of SNNs, the current state and challenges of the training algorithms for SNNs and the current implementations of SNNs on various hardware platforms.

Get full-text (via PubEx)

The S66 Non-Covalent Interactions Benchmark Reconsidered Using Explicitly Correlated Methods Near the Basis Set Limit

Australian Journal of Chemistry ◽

10.1071/ch17588 ◽

2018 ◽

Vol 71 (4) ◽

pp. 238 ◽

Cited By ~ 13

Author(s):

Manoj K. Kesharwani ◽

Amir Karton ◽

Nitai Sylvetsky ◽

Jan M. L. Martin

Keyword(s):

Computational Cost ◽

The Other ◽

Basis Set ◽

Basis Sets ◽

Explicitly Correlated ◽

Non Covalent Interactions ◽

Explicitly Correlated Methods ◽

The One ◽

High Level ◽

Covalent Interactions

The S66 benchmark for non-covalent interactions has been re-evaluated using explicitly correlated methods with basis sets near the one-particle basis set limit. It is found that post-MP2 ‘high-level corrections’ are treated adequately well using a combination of CCSD(F12*) with (aug-)cc-pVTZ-F12 basis sets on the one hand, and (T) extrapolated from conventional CCSD(T)/heavy-aug-cc-pV{D,T}Z on the other hand. Implications for earlier benchmarks on the larger S66×8 problem set in particular, and for accurate calculations on non-covalent interactions in general, are discussed. At a slight cost in accuracy, (T) can be considerably accelerated by using sano-V{D,T}Z+ basis sets, whereas half-counterpoise CCSD(F12*)(T)/cc-pVDZ-F12 offers the best compromise between accuracy and computational cost.

Get full-text (via PubEx)

Technology of Continuous Query Optimization over Data Streams

2008 International Symposium on Information Science and Engineering ◽

10.1109/isise.2008.36 ◽

2008 ◽

Author(s):

Feng Weibing ◽

Li Zhanhuai

Keyword(s):

Query Optimization ◽

Data Streams ◽

Continuous Query

Get full-text (via PubEx)

A Fast and Effective Method to Identify Relevant Sets of Variables in Complex Systems

Mathematics ◽

10.3390/math9091022 ◽

2021 ◽

Vol 9 (9) ◽

pp. 1022

Author(s):

Gianluca D’Addese ◽

Martina Casari ◽

Roberto Serra ◽

Marco Villani

Keyword(s):

Complex Systems ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Computational Cost ◽

Graph Analysis ◽

The Past ◽

Medium Level ◽

Micro Level ◽

Gene Regulatory ◽

High Level

In many complex systems one observes the formation of medium-level structures, whose detection could allow a high-level description of the dynamical organization of the system itself, and thus to its better understanding. We have developed in the past a powerful method to achieve this goal, which however requires a heavy computational cost in several real-world cases. In this work we introduce a modified version of our approach, which reduces the computational burden. The design of the new algorithm allowed the realization of an original suite of methods able to work simultaneously at the micro level (that of the binary relationships of the single variables) and at meso level (the identification of dynamically relevant groups). We apply this suite to a particularly relevant case, in which we look for the dynamic organization of a gene regulatory network when it is subject to knock-outs. The approach combines information theory, graph analysis, and an iterated sieving algorithm in order to describe rather complex situations. Its application allowed to derive some general observations on the dynamical organization of gene regulatory networks, and to observe interesting characteristics in an experimental case.

Get full-text (via PubEx)

Composite Event Processing for Data Streams and Domain Knowledge

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.219-220.927 ◽

2011 ◽

Vol 219-220 ◽

pp. 927-931

Author(s):

Jun Qiang Liu ◽

Xiao Ling Guan

Keyword(s):

Query Optimization ◽

Data Streams ◽

Domain Knowledge ◽

Semantic Information ◽

Query Language ◽

Processing System ◽

Optimization Techniques ◽

Research Attention ◽

Composite Event ◽

Solid Foundation

In recent years the processing of composite event queries over data streams has attracted a lot of research attention. Traditional database techniques were not designed for stream processing system. Furthermore, example continuous queries are often formulated in declarative query language without specifying the semantics. To overcome these deficiencies, this article presents the design, implementation, and evaluation of a system that executes data streams with semantic information. Then, a set of optimization techniques are proposed for handling query. So, our approach not only makes it possible to express queries with a sound semantics, but also provides a solid foundation for query optimization. Experiment results show that our approach is effective and efficient for data streams and domain knowledge.

Get full-text (via PubEx)

A Query Language for Workflow Logs

ACM Transactions on Management Information Systems ◽

10.1145/3482968 ◽

2022 ◽

Vol 13 (2) ◽

pp. 1-28

Author(s):

Yan Tang ◽

Weilong Cui ◽

Jianwen Su

Keyword(s):

Business Process ◽

Evaluation Method ◽

Ad Hoc ◽

Query Language ◽

Cost Model ◽

Formal Semantics ◽

Control Flow ◽

Query Evaluation ◽

Evaluation Algorithm ◽

Laws And Policies

A business process (workflow) is an assembly of tasks to accomplish a business goal. Real-world workflow models often demanded to change due to new laws and policies, changes in the environment, and so on. To understand the inner workings of a business process to facilitate changes, workflow logs have the potential to enable inspecting, monitoring, diagnosing, analyzing, and improving the design of a complex workflow. Querying workflow logs, however, is still mostly an ad hoc practice by workflow managers. In this article, we focus on the problem of querying workflow log concerning both control flow and dataflow properties. We develop a query language based on “incident patterns” to allow the user to directly query workflow logs instead of having to transform such queries into database operations. We provide the formal semantics and a query evaluation algorithm of our language. By deriving an accurate cost model, we develop an optimization mechanism to accelerate query evaluation. Our experiment results demonstrate the effectiveness of the optimization and achieves up to 50× speedup over an adaption of existing evaluation method.

Get full-text (via PubEx)