scholarly journals A flexible stand-off data model with query language for multi-level annotation

2005 ◽  
Author(s):  
Christoph Müller
Author(s):  
Andreas Jordan ◽  
Matt Selway ◽  
Georg Grossmann ◽  
Wolfgang Mayer ◽  
Markus Stumptner
Keyword(s):  

2016 ◽  
Vol 27 (2) ◽  
pp. 27-48
Author(s):  
András Benczúr ◽  
Gyula I. Szabó

This paper introduces a generalized data base concept that unites relational and semi structured data models. As an important theoretical result we could find a quadratic decision algorithm for the implication problem of functional and join dependencies defined on the united data model. As practical contribution we presented a normal form for the new data model as a tool for data base design. With our novel representations of regular expressions, a more effective searching method could be developed. XML elements are described by XML schema languages such as a DTD or an XML Schema definition. The instances of these elements are semi-structured tuples. A semi-structured tuple is an ordered list of (attribute: value) pairs. We may think of a semi-structured tuple as a sentence of a formal language, where the values are the terminal symbols and the attribute names are the non-terminal symbols. In the authors' former work (Szabó and Benczúr, 2015) they introduced the notion of the extended tuple as a sentence from a regular language generated by a grammar where the non-terminal symbols of the grammar are the attribute names of the tuple. Sets of extended tuples are the extended relations. The authors then introduced the dual language, which generates the tuple types allowed to occur in extended relations. They defined functional dependencies (regular FD - RFD) over extended relations. In this paper they rephrase the RFD concept by directly using regular expressions over attribute names to define extended tuples. By the help of a special vertex labeled graph associated to regular expressions the specification of substring selection for the projection operation can be defined. The normalization for regular schemas is more complex than it is in the relational model, because the schema of an extended relation can contain an infinite number of tuple types. However, the authors can define selection, projection and join operations on extended relations too, so a lossless-join decomposition can be performed. They extended their previous model to deal with XML schema indicators too, e.g., with numerical constraints. They added line and set constructors too, in order to extend their model with more general projection and selection operators. This model establishes a query language with table join functionality for collected XML element data.


Data ◽  
2019 ◽  
Vol 4 (2) ◽  
pp. 83 ◽  
Author(s):  
Timm Fitschen ◽  
Alexander Schlemmer ◽  
Daniel Hornung ◽  
Henrik tom Wörden ◽  
Ulrich Parlitz ◽  
...  

We present CaosDB, a Research Data Management System (RDMS) designed to ensure seamless integration of inhomogeneous data sources and repositories of legacy data in a FAIR way. Its primary purpose is the management of data from biomedical sciences, both from simulations and experiments during the complete research data lifecycle. An RDMS for this domain faces particular challenges: research data arise in huge amounts, from a wide variety of sources, and traverse a highly branched path of further processing. To be accepted by its users, an RDMS must be built around workflows of the scientists and practices and thus support changes in workflow and data structure. Nevertheless, it should encourage and support the development and observation of standards and furthermore facilitate the automation of data acquisition and processing with specialized software. The storage data model of an RDMS must reflect these complexities with appropriate semantics and ontologies while offering simple methods for finding, retrieving, and understanding relevant data. We show how CaosDB responds to these challenges and give an overview of its data model, the CaosDB Server and its easy-to-learn CaosDB Query Language. We briefly discuss the status of the implementation, how we currently use CaosDB, and how we plan to use and extend it.


Author(s):  
Bo Huang ◽  
Christophe Claramunt

Management of spatiotemporal information requires a more generic and consolidated data model to facilitate applications such as tracking land use parcel changes. This paper presents such a spatiotemporal data model in the context of object databases by extending the Object Data Management Group (ODMG) standard and examines its feasibility in a land use application. This model extends the ODMG object model with a parameterized type, TimeSeries<T>, which allows the shifting of spatial types into spatiotemporal types to support the representation of a series of states (i.e., the history) of an object. An object query language (OQL), spatiotemporal OQL (STOQL), which adds spatial and temporal dimensions to ODMG's OQL, is also designed. A case study demonstrates that STOQL supports the formulation of various spatiotemporal queries pertaining to historical states of spatial objects as well as spatial changes, including spatial type substitution. The model and query language have been implemented by using an object-oriented language in a geographic information system environment.


Author(s):  
Frederic Pereyrol ◽  
Remy Dupas ◽  
Thecle Alix ◽  
Jean Paul Bourrieres

Sign in / Sign up

Export Citation Format

Share Document