scholarly journals A SEMI-AUTOMATIC PROCEDURE FOR A DEMOGRAPHIC ANALYSIS OF THE FOSS4G DEVELOPERS’ COMMUNITY

Author(s):  
D. Oxoli ◽  
H.-K. Kang ◽  
M. A. Brovelli

<p><strong>Abstract.</strong> The open and direct collaboration at the creation, improvement, and documentation of source code and software applications &amp;ndash; enabled by the web &amp;ndash; is recognized as a peculiarity of the Free and Open Source Software for Geospatial (FOSS4G) projects representing, at the same time, one of their main strengths. With this in mind, it turns out to be interesting to perform an extensive monitoring of both the evolution and the geographical arrangement of the developers’ communities in order to investigate their actual extension, evolution and degree of activity. In this work, a semi-automatic procedure to perform this particular analysis is described. The procedure is mainly based on the use of the GitHub Search Application Programming Interface by means of JavaScript custom modules to perform a census of the users registered with a collaborator role to the repositories of the most popular FOSS4G projects, hosted on the GitHub platform. The collected data is processed and analysed using Python and QGIS. The results &amp;ndash; presented through tables, charts, and thematic maps &amp;ndash; allow describing both dimensions as well as the geographical heterogeneity of the contributing community of each individual project, while enabling to identify the most active countries &amp;ndash; in terms of the number of contributors &amp;ndash; in the development of the most popular FOSS4G. The limits of the analysis, including technical constraints and considerations on the significance of the developers' census, are finally highlighted and discussed.</p>

2018 ◽  
Author(s):  
Soohyun Lee ◽  
Jeremy Johnson ◽  
Carl Vitzthum ◽  
Koray Kırlı ◽  
Burak H. Alver ◽  
...  

AbstractSummaryWe introduce Tibanna, an open-source software tool for automated execution of bioinformatics pipelines on Amazon Web Services (AWS). Tibanna accepts reproducible and portable pipeline standards including Common Workflow Language (CWL), Workflow Description Language (WDL) and Docker. It adopts a strategy of isolation and optimization of individual executions, combined with a serverless scheduling approach. Pipelines are executed and monitored using local commands or the Python Application Programming Interface (API) and cloud configuration is automatically handled. Tibanna is well suited for projects with a range of computational requirements, including those with large and widely fluctuating loads. Notably, it has been used to process terabytes of data for the 4D Nucleome (4DN) Network.AvailabilitySource code is available on GitHub at https://github.com/4dn-dcic/tibanna.


Database ◽  
2019 ◽  
Vol 2019 ◽  
Author(s):  
Leo William Norval ◽  
Stefan Daniel Krämer ◽  
Mingjie Gao ◽  
Tobias Herz ◽  
Jianyu Li ◽  
...  

Abstract The kinetics of featured interactions (KOFFI) database is a novel tool and resource for binding kinetics data from biomolecular interactions. While binding kinetics data are abundant in literature, finding valuable information is a laborious task. We used text extraction methods to store binding rates (association, dissociation) as well as corresponding meta-information (e.g. methods, devices) in a novel database. To date, over 270 articles were manually curated and binding data on over 1705 interactions was collected and stored in the (KOFFI) database. Moreover, the KOFFI database application programming interface was implemented in Anabel (open-source software for the analysis of binding interactions), enabling users to directly compare their own binding data analyses with related experiments described in the database.


2019 ◽  
Vol 35 (21) ◽  
pp. 4424-4426 ◽  
Author(s):  
Soohyun Lee ◽  
Jeremy Johnson ◽  
Carl Vitzthum ◽  
Koray Kırlı ◽  
Burak H Alver ◽  
...  

Abstract Summary We introduce Tibanna, an open-source software tool for automated execution of bioinformatics pipelines on Amazon Web Services (AWS). Tibanna accepts reproducible and portable pipeline standards including Common Workflow Language (CWL), Workflow Description Language (WDL) and Docker. It adopts a strategy of isolation and optimization of individual executions, combined with a serverless scheduling approach. Pipelines are executed and monitored using local commands or the Python Application Programming Interface (API) and cloud configuration is automatically handled. Tibanna is well suited for projects with a range of computational requirements, including those with large and widely fluctuating loads. Notably, it has been used to process terabytes of data for the 4D Nucleome (4DN) Network. Availability and implementation Source code is available on GitHub at https://github.com/4dn-dcic/tibanna. Supplementary information Supplementary data are available at Bioinformatics online.


2021 ◽  
Vol 40 (1) ◽  
pp. 35-44
Author(s):  
Whitney Trainor-Guitton ◽  
Leo Turon ◽  
Dominique Dubucq

The Python Earth Engine application programming interface (API) provides a new open-source ecosphere for testing hydrocarbon detection algorithms on large volumes of images curated with the Google Earth Engine. We specifically demonstrate the Python Earth Engine API by calculating three hydrocarbon indices: fluorescence, rotation absorption, and normalized fluorescence. The Python Earth Engine API provides an ideal environment for testing these indices with varied oil seeps and spills by (1) removing barriers of proprietary software formats and (2) providing an extensive library of data analysis tools (e.g., Pandas and Seaborn) and classification algorithms (e.g., Scikit-learn and TensorFlow). Our results demonstrate end-member cases in which fluorescence and normalized fluorescence indices of seawater and oil are statistically similar and different. As expected, predictive classification is more effective and the calculated probability of oil is more accurate for scenarios in which seawater and oil are well separated in the fluorescence space.


Author(s):  
Santo Wijaya ◽  
Marta H.R.S.R. Sari ◽  
Adian Wihariono Putera

Pendidikan sebagai industri produk dan jasa berbasis ilmu pengetahuan dan keterampilan menghadapi persaingan yang semakin kompetitif dengan banyaknya institusi baik dalam dan luar negeri yang operasional di Indonesia. Untuk meningkatkan daya saing, maka utilisasi teknologi informasi khususnya di era revolusi industri 4.0 menjadi kunci penting. Penelitian ini bertujuan untuk mengembangkan Sistem Informasi Registrasi Mahasiswa Baru (SIRMB) menggunakan kerangka open-source web-based application serta integrasinya dengan teknologi Application Programming Interface (API) Bank BNI menjadikan layanan administrasi yang terotomasi. Proses identifikasi masalah sampai perancangan solusi SIRMB menggunakan analisis gugus kendali mutu (QCC) dengan pendekatan metode Plan-Do-Check-Action (PDCA) sehingga menjamin perbaikan yang berkesinambungan. Penelitian ini berkontribusi terhadap perbaikan 76.9% terhadap proses kerja dengan eliminasi proses kerja manual registrasi mahasiswa baru, sehingga memberikan peningkatan kualitas layanan dan peningkatan produktivitas secara keseluruhan.


Data Science ◽  
2021 ◽  
pp. 1-15
Author(s):  
Jörg Schad ◽  
Rajiv Sambasivan ◽  
Christopher Woodward

Experimenting with different models, documenting results and findings, and repeating these tasks are day-to-day activities for machine learning engineers and data scientists. There is a need to keep control of the machine-learning pipeline and its metadata. This allows users to iterate quickly through experiments and retrieve key findings and observations from historical activity. This is the need that Arangopipe serves. Arangopipe is an open-source tool that provides a data model that captures the essential components of any machine learning life cycle. Arangopipe provides an application programming interface that permits machine-learning engineers to record the details of the salient steps in building their machine learning models. The components of the data model and an overview of the application programming interface is provided. Illustrative examples of basic and advanced machine learning workflows are provided. Arangopipe is not only useful for users involved in developing machine learning models but also useful for users deploying and maintaining them.


Author(s):  
Amit Sharma

The paper portrays the utilization of tools for data gathering and extraction that permits researchers to fare data in standard document groups from various areas of the facebook long range informal communication benefit. Kinship networks, gatherings, and pages can subsequently be breaking down quantitatively and subjectively with respect to demographical, post-demographical, and social qualities. The paper gives a review over expository headings opened up by the data made accessible, talks about stage particular parts of data extraction through the official Application Programming Interface, and quickly connects with the troublesome moral contemplations connected to this sort of research.


2021 ◽  
Vol 1 (4) ◽  
pp. 27-31
Author(s):  
Bhuvan Agarwal ◽  
Soumyajeet Bhattacharjee ◽  
Sima Kar ◽  
Madhurima Saha ◽  
Vijay Kumar ◽  
...  

Abstract – Based on the concept of Application programming interface (API).This project comprises of a package named "algokit" which contains several algorithms based on the category of searching, sorting, dynamic programming, tree traversals and swapping. Keeping in mind that different algorithms from the same category have its own benefit in time and space complexity, This project covers almost all the algorithms known and available from each category. This would give the user several options to choose the right algorithm for its code.An user just requires to import the package named AlgoKit and call the functions inside it for a smooth programming experience. One of the prime objectives of this project is to build a kit that serves the purpose of reducing the number of lines of code and also reduce the time taken to run the same code elsewhere. It is platform independent and can be used in any open source Java development environment.


2007 ◽  
Vol 345-346 ◽  
pp. 1593-1596 ◽  
Author(s):  
Doo Man Chun ◽  
Hyung Jung Kim ◽  
Jae Chul Lee ◽  
Sung Hoon Ahn

Material selection became one of the important activities for engineers in design and manufacturing processes. A selected material at the beginning of design stage affects functionality of the designed part as well as manufacturability and cost of the final product. Unfortunately there are not many accessible material databases that can be properly used for design. In this research, a web-based material database was constructed. In order to assist designers to compare different materials, two-dimensional and three-dimensional graphs for comparison of material properties were provided via the web browser. Using these graphical tools, multi-dimensional comparison was available in more intuitive manner. In addition, this web site is open to the public, and one may add new material properties to the database in order to compare the data with existing materials. To provide the database in a commercial CAD (Computer Aided Design) environment, API (Application Programming Interface) modules were developed using Web services. The web-based material database can be accessible from http://fab.snu.ac.kr/matdb.


2021 ◽  
Vol 23 (06) ◽  
pp. 1672-1681
Author(s):  
Vinay Balamurali ◽  
◽  
Prof. Venkatesh S ◽  

Servers are required to monitor the health of the various I/O cards connected to it to alert the required personnel to service these cards. The Data Collection Unit (DCU) is responsible for detecting the I/O cards, sending their inventory as well as monitoring their health. Currently, the keys required to detect these I/O cards are manually coded into the source code. Such a task is highly laborious and time-consuming. To eliminate this manual work, a Software Pluggable Module was devised which would read the I/O card-related information from the I/O component list. This software design aims at using Data Science and OOPS concepts to automate certain tasks on server systems. The proposed methodology is implemented on a Linux system. The software design is modular in nature and extensible to accommodate future requirements. Such an automation framework can be used to track information maintained in Excel Spreadsheets and access them using an Application Programming Interface (API).


Sign in / Sign up

Export Citation Format

Share Document