scholarly journals Finite-State Back-Transliteration for Marathi

2017 ◽  
Vol 108 (1) ◽  
pp. 319-329 ◽  
Author(s):  
Vinit Ravishankar

AbstractIn this paper, we describe the creation of an open-source, finite-state based system for back-transliteration of Latin text in the Indian language Marathi. We outline the advantages of our system and compare it to other existing systems, evaluate its recall, and evaluate the coverage of an open-source morphological analyser on our back-transliterated corpus.

Agronomy ◽  
2021 ◽  
Vol 11 (5) ◽  
pp. 952
Author(s):  
Lia Duarte ◽  
Ana Cláudia Teodoro ◽  
Joaquim J. Sousa ◽  
Luís Pádua

In a precision agriculture context, the amount of geospatial data available can be difficult to interpret in order to understand the crop variability within a given terrain parcel, raising the need for specific tools for data processing and analysis. This is the case for data acquired from Unmanned Aerial Vehicles (UAV), in which the high spatial resolution along with data from several spectral wavelengths makes data interpretation a complex process regarding vegetation monitoring. Vegetation Indices (VIs) are usually computed, helping in the vegetation monitoring process. However, a crop plot is generally composed of several non-crop elements, which can bias the data analysis and interpretation. By discarding non-crop data, it is possible to compute the vigour distribution for a specific crop within the area under analysis. This article presents QVigourMaps, a new open source application developed to generate useful outputs for precision agriculture purposes. The application was developed in the form of a QGIS plugin, allowing the creation of vigour maps, vegetation distribution maps and prescription maps based on the combination of different VIs and height information. Multi-temporal data from a vineyard plot and a maize field were used as case studies in order to demonstrate the potential and effectiveness of the QVigourMaps tool. The presented application can contribute to making the right management decisions by providing indicators of crop variability, and the outcomes can be used in the field to apply site-specific treatments according to the levels of vigour.


2017 ◽  
Author(s):  
Mickael Silva ◽  
Miguel Machado ◽  
Diogo N. Silva ◽  
Mirko Rossi ◽  
Jacob Moran-Gilad ◽  
...  

ABSTRACTGene-by-gene approaches are becoming increasingly popular in bacterial genomic epidemiology and outbreak detection. However, there is a lack of open-source scalable software for schema definition and allele calling for these methodologies. The chewBBACA suite was designed to assist users in the creation and evaluation of novel whole-genome or core-genome gene-by-gene typing schemas and subsequent allele calling in bacterial strains of interest. The software can run in a laptop or in high performance clusters making it useful for both small laboratories and large reference centers. ChewBBACA is available athttps://github.com/B-UMMI/chewBBACAor as a docker image athttps://hub.docker.com/r/ummidock/chewbbaca/.DATA SUMMARYAssembled genomes used for the tutorial were downloaded from NCBI in August 2016 by selecting those submitted asStreptococcus agalactiaetaxon or sub-taxa. All the assemblies have been deposited as a zip file in FigShare (https://figshare.com/s/9cbe1d422805db54cd52), where a file with the original ftp link for each NCBI directory is also available.Code for the chewBBACA suite is available athttps://github.com/B-UMMI/chewBBACAwhile the tutorial example is found athttps://github.com/B-UMMI/chewBBACA_tutorial.I/We confirm all supporting data, code and protocols have been provided within the article or through supplementary data files. ⊠IMPACT STATEMENTThe chewBBACA software offers a computational solution for the creation, evaluation and use of whole genome (wg) and core genome (cg) multilocus sequence typing (MLST) schemas. It allows researchers to develop wg/cgMLST schemes for any bacterial species from a set of genomes of interest. The alleles identified by chewBBACA correspond to potential coding sequences, possibly offering insights into the correspondence between the genetic variability identified and phenotypic variability. The software performs allele calling in a matter of seconds to minutes per strain in a laptop but is easily scalable for the analysis of large datasets of hundreds of thousands of strains using multiprocessing options. The chewBBACA software thus provides an efficient and freely available open source solution for gene-by-gene methods. Moreover, the ability to perform these tasks locally is desirable when the submission of raw data to a central repository or web services is hindered by data protection policies or ethical or legal concerns.


2012 ◽  
Vol 1 (3) ◽  
pp. 16-25 ◽  
Author(s):  
Martin Christen ◽  
Stephan Nebiker ◽  
Benjamin Loesch

In this paper, the authors present the OpenWebGlobe project (http://www.openwebglobe.org). The authors also discuss the OpenWebGlobe SDK. OpenWebGlobe SDK is an open source framework for creating massive 3D virtual globe environments and interactively exploiting them in web browsers using HTML5 and WebGL, allowing for the creation of large scale virtual 3D globes with detailed contents and their interactive visualization directly within a broad spectrum of Web browsers.


First Monday ◽  
2006 ◽  
Author(s):  
David M. Berry ◽  
Giles Moss

The project of ‘free culture’ is committed to the creation of a cultural space, rather like the ‘public domain’, seeking to complement/replace that of proprietary cultural commodities and privatized meaning. This has been given a new impetus with the birth of the Creative Commons. This organization has sought to introduce cultural producers across the world to the possibilities of sharing, co–operation and commons–based peer–production by creating a set of interwoven licenses for creators to append to their artwork, music and text. In this paper, we chart the connections between this movement and the early Free Software and Open Source movements and question whether underlying assumptions that are ignored or de–politicized are a threat to the very free culture that the project purports to save. We then move to suggest a new discursive project linked to notions of radical democracy.


2020 ◽  
Author(s):  
William Hannas ◽  
Huey-Meei Chang ◽  
Catherine Aiken ◽  
Daniel Chou

Since 2016, China has engaged in a nationwide effort to "merge" AI and neuroscience research as a major part of its next-generation AI development program. This report explores China’s AI-brain program — identifying key players and organizations and recommending the creation of an open source S&T monitoring capability within the U.S. government.


2013 ◽  
Vol 10 (1) ◽  
Author(s):  
I Wayan Sutaya

Tujuan dari penelitian ini adalah untuk membuat perangkat generator framework sistem embedded. Framework sistem embedded yang dihasilkan oleh perangkat generator ini berupa kode-kode C dengan menggunakan model FSM (Finite State Machine). Perangkat generator ini dibuat dengan menkustomisasi aplikasi opersource ArgoUML. Perangkat generator yang dibuat telah diuji dengan membuat studi kasus aplikasi sistem embedded kalkulator. Dari studi kasus ini disimpulkan bahwa penggunaan perangkat generator ini bisa menghemat waktu pengerjaan proyek sebesar 70%.   Kata-kata kunci: framework, sistem embedded, FSM, ArgoUML.


Author(s):  
I Wayan Sutaya

Tujuan dari penelitian ini adalah untuk membuat perangkat generator framework sistem embedded. Framework sistem embedded yang dihasilkan oleh perangkat generator ini berupa kode-kode C dengan menggunakan model FSM (Finite State Machine). Perangkat generator ini dibuat dengan menkustomisasi aplikasi opersource ArgoUML. Perangkat generator yang dibuat telah diuji dengan membuat studi kasus aplikasi sistem embedded kalkulator. Dari studi kasus ini disimpulkan bahwa penggunaan perangkat generator ini bisa menghemat waktu pengerjaan proyek sebesar 70%.   Kata-kata kunci: framework, sistem embedded, FSM, ArgoUML.


2018 ◽  
Vol 2 (1) ◽  
pp. 39-46
Author(s):  
Kremena Tsankova Markova ◽  
Tihomir Atanassov Dovramadjiev ◽  
Ginka Velikova Jecheva

The creation of 3D paper unfolded models with complex geometry requires precision in the design process. Achieving this is through the use of accessible resources and modern technological means which possess the necessary technical and functional capabilities. The successful realization of the models is through the use of a structured methodology containing design stages that are defined in the exact sequence. The present study aims to create the unfoldings of complex three-dimensional models by covering the stages of providing accessible resurses with a personal and commercial license that are transformed in sequence until the ready-made variants.


2018 ◽  
Vol 11 (3) ◽  
pp. 1-25
Author(s):  
Leonel Figueiredo de Alencar ◽  
Bruno Cuconato ◽  
Alexandre Rademaker

ABSTRACT: One of the prerequisites for many natural language processing technologies is the availability of large lexical resources. This paper reports on MorphoBr, an ongoing project aiming at building a comprehensive full-form lexicon for morphological analysis of Portuguese. A first version of the resource is already freely available online under an open source, free software license. MorphoBr combines analogous free resources, correcting several thousand errors and gaps, and systematically adding new entries. In comparison to the integrated resources, lexical entries in MorphoBr follow a more user-friendly format, which can be straightforwardly compiled into finite-state transducers for morphological analysis, e.g. in the context of syntactic parsing with a grammar in the LFG formalism using the XLE system. MorphoBr results from a combination of computational techniques. Errors and the more obvious gaps in the integrated resources were automatically corrected with scripts. However, MorphoBr's main contribution is the expansion in the inventory of nouns and adjectives. This was carried out by systematically modeling diminutive formation in the paradigm of finite-state morphology. This allowed MorphoBr to significantly outperform analogous resources in the coverage of diminutives. The first evaluation results show MorphoBr to be a promising initiative which will directly contribute to the development of more robust natural language processing tools and applications which depend on wide-coverage morphological analysis.KEYWORDS: computational linguistics; natural language processing; morphological analysis; full-form lexicon; diminutive formation. RESUMO: Um dos pré-requisitos para muitas tecnologias de processamento de linguagem natural é a disponibilidade de vastos recursos lexicais. Este artigo trata do MorphoBr, um projeto em desenvolvimento voltado para a construção de um léxico de formas plenas abrangente para a análise morfológica do português. Uma primeira versão do recurso já está disponível gratuitamente on-line sob uma licença de software livre e de código aberto. MorphoBr combina recursos livres análogos, corrigindo vários milhares de erros e lacunas. Em comparação com os recursos integrados, as entradas lexicais do MorphoBr seguem um formato mais amigável, o qual pode ser compilado diretamente em transdutores de estados finitos para análise morfológica, por exemplo, no contexto do parsing sintático com uma gramática no formalismo da LFG usando o sistema XLE. MorphoBr resulta de uma combinação de técnicas computacionais. Erros e lacunas mais óbvias nos recursos integrados foram automaticamente corrigidos com scripts. No entanto, a principal contribuição de MorphoBr é a expansão no inventário de substantivos e adjetivos. Isso foi alcançado pela modelação sistemática da formação de diminutivos no paradigma da morfologia de estados finitos. Isso possibilitou a MorphoBr superar de forma significativa recursos análogos na cobertura de diminutivos. Os primeiros resultados de avaliação mostram que o MorphoBr constitui uma iniciativa promissora que contribuirá de forma direta para conferir robustez a ferramentas e aplicações de processamento de linguagem natural que dependem de análise morfológica de ampla cobertura.PALAVRAS-CHAVE: linguística computacional; processamento de linguagem natural; análise morfológica; léxico de formas plenas; formação de diminutivos.


Sign in / Sign up

Export Citation Format

Share Document