Searching Legal Information in Multiple Asian Languages

2012 ◽  
Vol 12 (3) ◽  
pp. 173-184 ◽  
Author(s):  
Philip Chung ◽  
Andrew Mowbray ◽  
Graham Greenleaf

In this article Philip Chung, Andrew Mowbray, and Graham Greenleaf, the Co-Directors of the Australasian Legal Information Institute (AustLII), explain the need for an open source search engine which can search simultaneously over legal materials in European languages and in Asian languages, particularly those that require a ‘double byte’ representation, and the difficulties this task presents. A solution is proposed: the ‘u16a’ modifications to AustLII's open source search engine (Sino), which is used by many legal information institutes. Two implementations of the Sino u16a approach are described: one on the Hong Kong Legal Information Institute (HKLII), for English and Chinese, and one on the Asian Legal Information Institute (AsianLII), for multiple Asian languages. The implementations have been successful, though many challenges (discussed briefly) remain before this approach will provide a full multi-lingual search facility.
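As a rough illustration of the difficulty described in this abstract, the sketch below indexes mixed English and Chinese text in a single inverted index. It is not Sino's u16a code, which the abstract does not detail. The function names (`tokenize`, `build_index`, `search`) are hypothetical, the character range covers only the basic CJK Unified Ideographs block, and treating each Chinese character as its own token is an assumed simplification.

```python
import re

# One token per CJK ideograph (basic block only, an assumed simplification),
# or one token per run of ASCII letters and digits.
_TOKEN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9]+")


def tokenize(text):
    """Split mixed-script text into lowercase word and single-character tokens."""
    return [tok.lower() for tok in _TOKEN.findall(text)]


def build_index(docs):
    """Map each token to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for tok in tokenize(text):
            index.setdefault(tok, set()).add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every token in the query (boolean AND)."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for tok in tokens[1:]:
        result &= index.get(tok, set())
    return result


docs = {
    1: "Basic Law 基本法",
    2: "Companies Ordinance 公司條例",
    3: "基本 rights",
}
index = build_index(docs)
print(search(index, "基本"))  # documents containing both characters
print(search(index, "law"))
```

Because Chinese text has no spaces between words, the sketch cannot rely on whitespace to find word boundaries. Matching on single characters with boolean AND finds documents that contain all the characters but does not check that they are adjacent, so a real phrase search would also need position information.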

2009 ◽  
Vol 2 (1) ◽  
pp. 48-68 ◽  
Author(s):  
Angela Ralli

This paper deals with [V V] dvandva compounds, which are frequently used in East and Southeast Asian languages but also in Greek and its dialects; Greek is in this respect uncommon among Indo-European languages. It examines the appearance of this type of compounding in Greek by tracing its development in the late Medieval period, and detects a high rate of productivity in most Modern Greek dialects. It argues that the emergence of the [V V] dvandva pattern is not due to areal pressure or to a language-contact situation, but is induced by a language-internal change. It associates this change with the rise of productivity of compounding in general, and the expansion of verbal compounds in particular. It also suggests that the change contributes to making the compound-formation patterns of the language more uniform and systematic. Claims and proposals are illustrated with data from Standard Modern Greek and its dialects. It is shown that dialectal evidence is crucial for the study of the rise and productivity of [V V] dvandva compounds, since changes are not usually portrayed in the standard language.


Author(s):  
W. Buntine ◽  
J. Lofstrom ◽  
J. Perkio ◽  
S. Perttu ◽  
V. Poroshin ◽  
...  

Author(s):  
Paolo Corti ◽  
Benjamin G Lewis ◽  
Athanasios Tom Kralidis ◽  
Ntabathia Jude Mwenda

A Spatial Data Infrastructure (SDI) is a framework of geospatial data, metadata, users and tools intended to provide an efficient and flexible way to use spatial information. One of the key software components of an SDI is the catalogue service which is needed to discover, query, and manage the metadata. Catalogue services in an SDI are typically based on the Open Geospatial Consortium (OGC) Catalogue Service for the Web (CSW) standard which defines common interfaces for accessing the metadata information. A search engine is a software system capable of supporting fast and reliable search, which may use “any means necessary” to get users to the resources they need quickly and efficiently. These techniques may include features such as full text search, natural language processing, weighted results, fuzzy tolerance results, faceting, hit highlighting, recommendations, feedback mechanisms based on log mining, usage statistic gathering, and many others. In this paper we will be focusing on improving geospatial search with a search engine platform that uses Lucene, a Java-based search library, at its core. In work funded by the National Endowment for the Humanities, the Centre for Geographic Analysis (CGA) at Harvard University is in the process of re-engineering the search component of its public domain SDI (WorldMap http://worldmap.harvard.edu ) which is based on the GeoNode platform. In the process the CGA has developed Harvard Hypermap (HHypermap), a map services registry and search platform independent from WorldMap. The goal of HHypermap is to provide a framework for building and maintaining a comprehensive registry of web map services, and because such a registry is expected to be large, the system supports the development of clients with modern search capabilities such as spatial and temporal faceting and instant previews via an open API. 
Behind the scenes, HHypermap harvests OGC and Esri service metadata at scale from distributed servers, organizes that information, and pushes it to a search engine. The system monitors services for reliability and uses that information to improve search. End users will be able to search the SDI metadata using standard interfaces provided by the internal CSW catalogue, and will benefit from the enhanced search possibilities provided by an advanced search engine. HHypermap is built on an open source software stack.
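The idea of using monitored reliability to improve search can be sketched as follows. The abstract does not say how HHypermap measures reliability or combines it with relevance, so everything here is an assumption: reliability is taken as the fraction of successful health checks, it is multiplied into a text relevance score, and the names `reliability` and `rank` are hypothetical.

```python
def reliability(checks):
    """Fraction of successful health checks; 0.0 when there is no history."""
    return sum(checks) / len(checks) if checks else 0.0


def rank(services, text_scores):
    """Order service ids by text relevance scaled by observed reliability."""
    scored = {
        sid: text_scores.get(sid, 0.0) * reliability(checks)
        for sid, checks in services.items()
    }
    return sorted(scored, key=lambda sid: scored[sid], reverse=True)


# Hypothetical check history: True means the service responded correctly.
services = {
    "a": [True, True, False, True],
    "b": [True, True, True, True],
    "c": [],
}
text_scores = {"a": 1.0, "b": 0.8, "c": 2.0}
print(rank(services, text_scores))
```

With this weighting, a service that matches the query well but has no successful checks sinks to the bottom of the results, which is one plausible way a registry of web map services could avoid surfacing dead endpoints.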


2016 ◽  
Author(s):  
Paolo Corti ◽  
Benjamin G Lewis ◽  
Tom Kralidis ◽  
Jude Mwenda

A Spatial Data Infrastructure (SDI) is a framework of geospatial data, metadata, users and tools intended to provide the most efficient and flexible way to use spatial information. One of the key software components of an SDI is the catalogue service, needed to discover, query and manage the metadata. Catalogue services in an SDI are typically based on the Open Geospatial Consortium (OGC) Catalogue Service for the Web (CSW) standard, which defines common interfaces to access the metadata information. A search engine is a software system able to perform very fast and reliable search, with features such as full text search, natural language processing, weighted results, fuzzy tolerance results, faceting, hit highlighting and many others. The Centre for Geographic Analysis (CGA) at Harvard University is trying to integrate within its public domain SDI (named WorldMap) the benefits of both worlds (OGC catalogs and search engines). Harvard Hypermap (HHypermap) is a component that will be part of WorldMap, built entirely on an open source stack. It implements an OGC catalog, based on pycsw, to provide access to metadata in a standard way, and a search engine, based on Solr/Lucene, to provide the advanced search features typically found in search engines.


2016 ◽  
Author(s):  
Paolo Corti ◽  
Benjamin G Lewis ◽  
Tom Kralidis ◽  
Jude Mwenda

A Spatial Data Infrastructure (SDI) is a framework of geospatial data, metadata, users and tools intended to provide the most efficient and flexible way to use spatial information. One of the key software components of an SDI is the catalogue service, needed to discover, query and manage the metadata. Catalogue services in an SDI are typically based on the Open Geospatial Consortium (OGC) Catalogue Service for the Web (CSW) standard, which defines common interfaces to access the metadata information. A search engine is a software system able to perform very fast and reliable search, with features such as full text search, natural language processing, weighted results, fuzzy tolerance results, faceting, hit highlighting and many others. The Centre for Geographic Analysis (CGA) at Harvard University is trying to integrate within its public domain SDI (named WorldMap) the benefits of both worlds (OGC catalogues and search engines). Harvard Hypermap (HHypermap) is a component that will be part of WorldMap, built entirely on an open source stack. It implements an OGC catalogue, based on pycsw, to provide access to metadata in a standard way, and a search engine, based on Solr/Lucene, to provide the advanced search features typically found in search engines.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Chenggui Duan ◽  
Tracy K. Lee

Purpose: Free and open-source software (FOSS) has been used worldwide because of the advantages of user control, cost-saving, flexibility, openness, freedom, more security and better stability. The purpose of this study is to explore the status quo of the educational application of FOSS, the trends from international perspectives and the implications for higher education in Hong Kong. Design/methodology/approach: The method of cluster analysis was used in this study. The Web of Science database was used as the data source and all relevant literature for the years 2010–2020 on the theme of “FOSS” was collected for analysis. The information visualization software CiteSpace was used for citation visualization analysis, revealing the research results of FOSS worldwide, including hot spots and development trends. Findings: This paper found that FOSS has become an important research area and is playing an important role in the reform and development of education. Meanwhile, the development and application of FOSS show regional imbalances and strong differentiation, including in the educational sector. The paper also found that although FOSS has entered the stage of interdisciplinary development, the research and development of FOSS in the field of education is insufficient, which poses a huge challenge to decision-makers, teachers and students. Originality/value: Implications for higher education in Hong Kong include the following: FOSS research and practice should be given importance and vigorously promoted to benefit more teachers and students; teachers and students need training to acquire the awareness and skills of FOSS applications and to formulate different strategies; the government should provide greater support to formulate and implement a short and middle-term development plan to facilitate the application of FOSS; and Hong Kong higher education institutions may strengthen exchanges and cooperation with counterparts around the world to jointly promote the development of FOSS. It is hoped that the findings will provide a reference for the study and application of FOSS in higher education in Hong Kong.


Author(s):  
Demian Katz ◽  
Andrew Nagy

Apache Solr, an open source Java-based search engine, forms the core of many Library 2.0 products. The use of an index in place of a relational database allows faster data retrieval along with key features like faceting and similarity analysis that were not practical in the previous generation of library software. The popular VuFind discovery tool was built to provide a library-friendly front-end for Solr’s powerful searching capabilities, and its development provides an informative case study on the use of Solr in a library setting. VuFind is just one of many library packages using Solr, and examples like Blacklight, Summon, and the eXtensible Catalog project show other possible approaches to its use.
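Faceting, mentioned above as a key feature, amounts to counting how the documents in a result set are distributed over the values of a field. The sketch below shows the concept in plain Python. It is not how Solr or VuFind implement it (Solr computes facets from its own index structures), and the record layout and the name `facet_counts` are hypothetical.

```python
from collections import Counter


def facet_counts(records, hits, field):
    """Count the values of one field across the documents in a result set."""
    counts = Counter()
    for doc_id in hits:
        value = records[doc_id].get(field)
        if value is not None:  # skip records that lack the field
            counts[value] += 1
    return counts


# Hypothetical catalogue records keyed by document id.
records = {
    1: {"format": "Book", "language": "en"},
    2: {"format": "Book"},
    3: {"format": "Map", "language": "en"},
}
hits = [1, 2, 3]  # ids returned by some earlier search step
print(dict(facet_counts(records, hits, "format")))
```

A discovery interface would display these counts next to each value so that a user can narrow the result set, for example to maps only, with one click.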

