The KnowledgeStore

Author(s):  
Francesco Corcoglioniti ◽  
Marco Rospocher ◽  
Roldano Cattoni ◽  
Bernardo Magnini ◽  
Luciano Serafini

Although the quantity of structured information on the Web and within organizations is increasing, the majority of information remains available only in unstructured form. While different in form, both unstructured and structured information sources provide information about entities in the world and their properties and relations; still, frameworks for their seamless integration have not been deeply investigated. In this paper the authors describe the KnowledgeStore, a scalable, fault-tolerant, and Semantic Web grounded open-source storage system for interlinking structured and unstructured data. They present the concept, design, function and implementation of the system, and report on its concrete usage in three application scenarios within the NewsReader EU project, where it stores and supports the querying of millions of news articles interlinked with millions of RDF triples extracted from text and imported from Linked Open Data sources. The authors report on data population and data retrieval performances of the system measured through a number of experiments, and they also discuss the practical issues and lessons learned from these experiences.


Author(s):  
Francesco Corcoglioniti ◽  
Marco Rospocher ◽  
Roldano Cattoni ◽  
Bernardo Magnini ◽  
Luciano Serafini

Although the quantity of structured information on the Web and within organizations is increasing, the majority of information remains available only in unstructured form. While different in form, both unstructured and structured information sources provide information about entities in the world and their properties and relations; still, frameworks for their seamless integration have not been deeply investigated. In this paper the authors describe the KnowledgeStore, a scalable, fault-tolerant, and Semantic Web grounded open-source storage system for interlinking structured and unstructured data. They present the concept, design, function and implementation of the system, and report on its concrete usage in three application scenarios within the NewsReader EU project, where it stores and supports the querying of millions of news articles interlinked with millions of RDF triples extracted from text and imported from Linked Open Data sources. The authors report on data population and data retrieval performances of the system measured through a number of experiments, and they also discuss the practical issues and lessons learned from these experiences.



Author(s):  
Francesco Corcoglioniti ◽  
Marco Rospocher ◽  
Roldano Cattoni ◽  
Bernardo Magnini ◽  
Luciano Serafini

This chapter describes the KnowledgeStore, a scalable, fault-tolerant, and Semantic Web grounded open-source storage system to jointly store, manage, retrieve, and query interlinked structured and unstructured data, especially designed to manage all the data involved in Knowledge Extraction applications. The chapter presents the concept, design, function and implementation of the KnowledgeStore, and reports on its concrete usage in four application scenarios within the NewsReader EU project, where it has been successfully used to store and support the querying of millions of news articles interlinked with billions of RDF triples, both extracted from text and imported from Linked Open Data sources.



Energies ◽  
2021 ◽  
Vol 14 (15) ◽  
pp. 4565
Author(s):  
Marcin Szott ◽  
Marcin Jarnut ◽  
Jacek Kaniewski ◽  
Łukasz Pilimon ◽  
Szymon Wermiński

This paper introduces the concept of fault-tolerant control (FTC) of a multi-string battery energy storage system (BESS) in the dynamic reduction system of a traction substation load (DROPT). The major task of such a system is to reduce the maximum demand for contracted peak power, averaged for 15 min. The proposed concept, based on a multi-task control algorithm, takes into account: a three-threshold power limitation of the traction substation, two-level reduction of available power of a BESS and a multi-string structure of a BESS. It ensures the continuity of the maximum peak power demand at the contracted level even in the case of damage or disconnection of at least one chain of cells of the battery energy storage (BES) or at least one converter of the power conversion system (PCS). The proposed control strategy has been tested in a model of the system for dynamic reduction of traction substation load with a rated power of 5.5 MW. Two different BESS implementations have been proposed and several possible cases of failure of operations have been investigated. The simulation results have shown that the implementation of a multi-string BESS and an appropriate control algorithm (FTC) may allow for maintenance of the major assumption of DROPT, which is demanded power reduction (from 3.1 MW to 0.75 MW), even with a reduction of the BESS available power by at least 25% and more in the even in fault cases.



Energies ◽  
2018 ◽  
Vol 11 (12) ◽  
pp. 3367 ◽  
Author(s):  
Karl Stein ◽  
Moe Tun ◽  
Keith Musser ◽  
Richard Rocheleau

Battery energy storage systems (BESSs) are being deployed on electrical grids in significant numbers to provide fast-response services. These systems are normally procured by the end user, such as a utility grid owner or independent power producer. This paper introduces a novel research project in which a research institution has purchased a 1 MW BESS and turned ownership over to a utility company under an agreement that allowed the institution to perform experimentation and data collection on the grid for a multi-year period. This arrangement, along with protocols governing experimentation, has created a unique research opportunity to actively and systematically test the impact of a BESS on a live island grid. The 2012 installation and commissioning of the BESS was facilitated by a partnership between the Hawaii Natural Energy Institute (HNEI) and the utility owner, the Hawaiian Electric and Light Company (HELCO). After the test period ended, HELCO continued to allow data collection (including health testing). In 2018, after 8500 equivalent cycles, the BESS continues to operate within specifications. HNEI continues to provide HELCO with expertise to aid with diagnostics as needed. Details about the BESS design, installation, experimental protocols, initial results, and lessons learned are presented in this paper.



Author(s):  
Jaeho Jeong ◽  
Seong-Joon Park ◽  
Jae-Won Kim ◽  
Jong-Seon No ◽  
Ha Hyeon Jeon ◽  
...  

Abstract Motivation In DNA storage systems, there are tradeoffs between writing and reading costs. Increasing the code rate of error-correcting codes may save writing cost, but it will need more sequence reads for data retrieval. There is potentially a way to improve sequencing and decoding processes in such a way that the reading cost induced by this tradeoff is reduced without increasing the writing cost. In past researches, clustering, alignment, and decoding processes were considered as separate stages but we believe that using the information from all these processes together may improve decoding performance. Actual experiments of DNA synthesis and sequencing should be performed because simulations cannot be relied on to cover all error possibilities in practical circumstances. Results For DNA storage systems using fountain code and Reed-Solomon (RS) code, we introduce several techniques to improve the decoding performance. We designed the decoding process focusing on the cooperation of key components: Hamming-distance based clustering, discarding of abnormal sequence reads, RS error correction as well as detection, and quality score-based ordering of sequences. We synthesized 513.6KB data into DNA oligo pools and sequenced this data successfully with Illumina MiSeq instrument. Compared to Erlich’s research, the proposed decoding method additionally incorporates sequence reads with minor errors which had been discarded before, and thuswas able to make use of 10.6–11.9% more sequence reads from the same sequencing environment, this resulted in 6.5–8.9% reduction in the reading cost. Channel characteristics including sequence coverage and read-length distributions are provided as well. Availability The raw data files and the source codes of our experiments are available at: https://github.com/jhjeong0702/dna-storage.





2018 ◽  
Vol 33 (4) ◽  
Author(s):  
Murari Kumar ◽  
Samir Farooqi ◽  
K. K. Chaturvedi ◽  
Chandan Kumar Deb ◽  
Pankaj Das

Bibliographic data contains necessary information about literature to help users to recognize and retrieve that resource. These data are used quantitatively by a “Bibliometrician” for analysis and dissemination purpose but with the increasing rate of literature publication in open access journals such as Nucleic Acids Research (NAR), Springer, Oxford Journals etc., it has become difficult to retrieve structured bibliographic information in desired format. A digital bibliographic database contains necessary and structured information about published literature. Bibliographic records of different articles are scattered and resides on different web pages. This thesis presents the retrieval system for bibliographic data of NAR at a single place. For this purpose, parser agents have been developed which access the web pages of NAR and parse the scattered bibliographic data and finally store it into a local bibliographic database. Based on the bibliographic database, “three-tier architecture” has been utilized to display the bibliographic information in systematized format. Using this system, it would be possible to build the network between different authors and affiliations and also other analytical reports can be generated.



2015 ◽  
Vol 10 (1) ◽  
pp. 249-259 ◽  
Author(s):  
Graham A Parton ◽  
Steven Donegan ◽  
Stephen Pascoe ◽  
Ag Stephens ◽  
Spiros Ventouras ◽  
...  

ISO19156 Observations and Measurements (O&M) provides a standardised framework for organising information about the collection of information about the environment.  Here we describe the implementation of a specialisation of O&M for environmental data, the Metadata Objects for Linking Environmental Sciences (MOLES3).MOLES3 provides support for organising information about data, and for user navigation around data holdings. The implementation described here, “CEDA-MOLES”, also supports data management functions for the Centre for Environmental Data Archival, CEDA. The previous iteration of MOLES (MOLES2) saw active use over five years, being replaced by CEDA-MOLES in late 2014. During that period important lessons were learnt both about the information needed, as well as how to design and maintain the necessary information systems. In this paper we review the problems encountered in MOLES2; how and why CEDA-MOLES was developed and engineered; the migration of information holdings from MOLES2 to CEDA-MOLES; and, finally, provide an early assessment of MOLES3 (as implemented in CEDA-MOLES) and its limitations.Key drivers for the MOLES3 development included the necessity for improved data provenance, for further structured information to support ISO19115 discovery metadata  export (for EU INSPIRE compliance), and to provide appropriate fixed landing pages for Digital Object Identifiers (DOIs) in the presence of evolving datasets. Key lessons learned included the importance of minimising information structure in free text fields, and the necessity to support as much agility in the information infrastructure as possible without compromising on maintainability both by those using the systems internally and externally (e.g. citing in to the information infrastructure), and those responsible for the systems themselves. The migration itself needed to ensure continuity of service and traceability of archived assets.



Sign in / Sign up

Export Citation Format

Share Document