Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format

A Spatial Database Infrastructure (SDI) is a framework of geospatial data, metadata, users and tools intended to provide the most efficient and flexible way to use spatial information. One of the key software component of a SDI is the catalogue service, needed to discover, query and manage the metadata. Catalogue services in a SDI are typically based on the Open Geospatial Consortium (OGC) Catalogue Service for the Web (CSW) standard, that defines common interfaces to access the metadata information. A search engine is a software system able to perform very fast and reliable search, with features such as full text search, natural language processing, weighted results, fuzzy tolerance results, faceting, hit highlighting and many others. The Centre of Geographic Analysis (CGA) at Harvard University is trying to integrate within its public domain SDI (named WorldMap), the benefits of both worlds (OGC catalogs and search engines). Harvard Hypermap (HHypermap) is a component that will be part of WorldMap, totally built on an open source stack, implementing an OGC catalog, based on pycsw, to provide access to metadata in a standard way, and a search engine, based on Solr/Lucene, to provide the advanced search features typically found in search engines.

Download Full-text

The Popgen Pipeline Platform: A Software Platform for Facilitating Population Genomic Analyses

10.1101/785774 ◽

2019 ◽

Author(s):

Andrew Webb ◽

Jared Knoblauch ◽

Nitesh Sabankar ◽

Apeksha Sukesh Kallur ◽

Jody Hey ◽

...

Keyword(s):

Open Source ◽

Development Time ◽

End Users ◽

File Format ◽

Software Platform ◽

Format Conversion ◽

Link Type ◽

Population Genomic ◽

Genomic Analyses ◽

File Format Conversion

AbstractHere we present the Pop-Gen Pipeline Platform (PPP), a software platform with the goal of reducing the computational expertise required for conducting population genomic analyses. The PPP was designed as a collection of scripts that facilitate common population genomic workflows in a consistent and standardized Python environment. Functions were developed to encompass entire workflows, including: input preparation, file format conversion, various population genomic analyses, output generation, and visualization. By facilitating entire workflows, the PPP offers several benefits to prospective end users - it reduces the need of redundant in-house software and scripts that would require development time and may be error-prone, or incorrect. The platform has also been developed with reproducibility and extensibility of analyses in mind. The PPP is an open-source package that is available for download and use at https://ppp.readthedocs.io/en/latest/PPP_pages/install.html

Download Full-text

Implementing an open source spatio-temporal search platform for Spatial Data Infrastructures

10.7287/peerj.preprints.2238v3 ◽

2016 ◽

Author(s):

Paolo Corti ◽

Benjamin G Lewis ◽

Tom Kralidis ◽

Jude Mwenda

Keyword(s):

Open Source ◽

Search Engine ◽

Language Processing ◽

Spatial Data ◽

Search Engines ◽

Spatial Information ◽

Data Infrastructure ◽

Advanced Search ◽

Data Infrastructures ◽

Spatio Temporal

A Spatial Data Infrastructure (SDI) is a framework of geospatial data, metadata, users and tools intended to provide the most efficient and flexible way to use spatial information. One of the key software components of a SDI is the catalogue service, needed to discover, query and manage the metadata. Catalogue services in a SDI are typically based on the Open Geospatial Consortium (OGC) Catalogue Service for the Web (CSW) standard, that defines common interfaces to access the metadata information. A search engine is a software system able to perform very fast and reliable search, with features such as full text search, natural language processing, weighted results, fuzzy tolerance results, faceting, hit highlighting and many others. The Centre of Geographic Analysis (CGA) at Harvard University is trying to integrate within its public domain SDI (named WorldMap), the benefits of both worlds (OGC catalogues and search engines). Harvard Hypermap (HHypermap) is a component that will be part of WorldMap, totally built on an open source stack, implementing an OGC catalogue, based on pycsw, to provide access to metadata in a standard way, and a search engine, based on Solr/Lucene, to provide the advanced search features typically found in search engines.

Download Full-text

Implementing an open source spatio-temporal search platform for Spatial Data Infrastructures

10.7287/peerj.preprints.2238v4 ◽

2016 ◽

Author(s):

Paolo Corti ◽

Benjamin G Lewis ◽

Tom Kralidis ◽

Jude Mwenda

Keyword(s):

Open Source ◽

Search Engine ◽

Language Processing ◽

Spatial Data ◽

Search Engines ◽

Spatial Information ◽

Data Infrastructure ◽

Advanced Search ◽

Data Infrastructures ◽

Spatio Temporal

A Spatial Data Infrastructure (SDI) is a framework of geospatial data, metadata, users and tools intended to provide the most efficient and flexible way to use spatial information. One of the key software components of a SDI is the catalogue service, needed to discover, query and manage the metadata. Catalogue services in a SDI are typically based on the Open Geospatial Consortium (OGC) Catalogue Service for the Web (CSW) standard, that defines common interfaces to access the metadata information. A search engine is a software system able to perform very fast and reliable search, with features such as full text search, natural language processing, weighted results, fuzzy tolerance results, faceting, hit highlighting and many others. The Centre of Geographic Analysis (CGA) at Harvard University is trying to integrate within its public domain SDI (named WorldMap), the benefits of both worlds (OGC catalogues and search engines). Harvard Hypermap (HHypermap) is a component that will be part of WorldMap, totally built on an open source stack, implementing an OGC catalogue, based on pycsw, to provide access to metadata in a standard way, and a search engine, based on Solr/Lucene, to provide the advanced search features typically found in search engines.

Download Full-text

Digital File Formats for Digital Preservation

Digital Curation ◽

10.4018/978-1-5225-6921-3.ch010 ◽

2018 ◽

pp. 218-233

Author(s):

Mayank Yuvaraj

Keyword(s):

Digital Libraries ◽

Digital Preservation ◽

Institutional Repository ◽

File Format ◽

Institutional Repositories ◽

File Formats ◽

Digital File ◽

The Common ◽

Common Understanding

During the course of planning an institutional repository, digital library collections or digital preservation service it is inevitable to draft file format policies in order to ensure long term digital preservation, its accessibility and compatibility. Sincere efforts have been made to encourage the adoption of standard formats yet the digital preservation policies vary from library to library. The present paper is based against this background to present the digital preservation community with a common understanding of the common file formats used in the digital libraries or institutional repositories. The paper discusses both open and proprietary file formats for several media.

Download Full-text

An Overview of Learning Management Systems

Learning Management System Technologies and Software Solutions for Online Teaching ◽

10.4018/978-1-61520-853-1.ch001 ◽

2010 ◽

pp. 1-19 ◽

Cited By ~ 9

Author(s):

Anthony A. Piña

Keyword(s):

Open Source ◽

Learning Management Systems ◽

Macro Level ◽

Management Systems ◽

Common Features ◽

Advantages And Disadvantages ◽

The Past ◽

The Common ◽

Learning Management

In this chapter, the reader is taken through a macro level view of learning management systems, with a particular emphasis on systems offered by commercial vendors. Included is a consideration of the growth of learning management systems during the past decade, the common features and tools contained within these systems, and a look at the advantages and disadvantages that learning management systems provide to institutions. In addition, the reader is presented with specific resources and options for evaluating, selecting and deploying learning management systems. A section highlighting the possible advantages and disadvantages of selecting a commercial versus an open source system is followed by a series of brief profiles of the leading vendors of commercial and open source learning management systems.

Download Full-text

Critical Analysis of Major Search Engines

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.8239 ◽

2019 ◽

Vol 16 (9) ◽

pp. 3712-3716

Author(s):

Kailash Kumar ◽

Abdulaziz Al-Besher

Keyword(s):

Search Engine ◽

Search Engines ◽

Critical Analysis ◽

Research Paper ◽

Web Pages ◽

Ranking Algorithm ◽

The Common Optimization INterface for Operations Research: Promoting open-source software in the operations research community

IBM Journal of Research and Development ◽

10.1147/rd.471.0057 ◽

2003 ◽

Vol 47 (1) ◽

pp. 57-66 ◽

Cited By ~ 198

Author(s):

R. Lougee-Heimer

Keyword(s):

Open Source ◽

Open Source Software ◽

Operations Research ◽

Research Community ◽

The Common

Download Full-text

The case for free and open source software in research and scholarship

Philosophical Transactions of The Royal Society A Mathematical Physical and Engineering Sciences ◽

10.1098/rsta.2020.0079 ◽

2021 ◽

Vol 379 (2197) ◽

Cited By ~ 1

Author(s):

Laura Fortunato ◽

Mark Galassi

Keyword(s):

Open Source ◽

Open Source Software ◽

Research Process ◽

Open Science ◽

Support Staff ◽

Research Quality ◽

Open Research ◽

The Common

Free and open source software (FOSS) is any computer program released under a licence that grants users rights to run the program for any purpose, to study it, to modify it, and to redistribute it in original or modified form. Our aim is to explore the intersection between FOSS and computational reproducibility. We begin by situating FOSS in relation to other ‘open’ initiatives, and specifically open science, open research, and open scholarship. In this context, we argue that anyone who actively contributes to the research process today is a computational researcher, in that they use computers to manage and store information. We then provide a primer to FOSS suitable for anyone concerned with research quality and sustainability—including researchers in any field, as well as support staff, administrators, publishers, funders, and so on. Next, we illustrate how the notions introduced in the primer apply to resources for scientific computing, with reference to the GNU Scientific Library as a case study. We conclude by discussing why the common interpretation of ‘open source’ as ‘open code’ is misplaced, and we use this example to articulate the role of FOSS in research and scholarship today. This article is part of the theme issue ‘Reliability and reproducibility in computational science: implementing verification, validation and uncertainty quantification in silico ’.

Download Full-text

Scientific and technical information's restoration in the open access environment through smart search engines

ARID International Journal of Informetrics ◽

10.36772/arid.aijisc.2022.345 ◽

2022 ◽

pp. 134-159

Author(s):

عبد الرزاق بوسمينة ◽

كمال بطوش

Keyword(s):

Open Access ◽

Search Engines ◽

Analytical Approach ◽

Turning Point ◽

Web Applications ◽

Technical Information ◽

Search Results ◽

Scientific And Technical Information ◽

The Common ◽

Specific Search

Open access is one of the topics that attracted the researchers interest as it is a turning point for the recovery of technical and scientific information’s recovery which requires a set of tools and technical skills. This study aims to discover the main problems of information’s recovery within open access in addition to the inventory of the most important smart search engines and to know the strategies of information’s recovery. The study adopted the descriptive analytical approach, and came out with a number of important conclusions, the most important are : Searching for scientific and technical information in the open access environment has become a very difficult and the researcher does not know which of them is more useful. Relying on the common ranking of sites, smart search engines in its work, depends on semantic web applications, most notably XML, RDF and ontology, users can quickly find specific search results through smart search engines without having to become experts in search engines or have a well-defined strategies for searching within the open access environment. The study also showed that the semantic scholar search engine deals with open sources more efficiently than traditional search engines through its ability to discover these sources and display them to the beneficiary in a distinctive way.

Download Full-text