software toolkit Latest Research Papers

Data science and machine learning are buzzwords of the early 21st century. Now pervasive through human civilization, how do these concepts translate to use by researchers and clinicians in the life-science and medical field? Here, we describe a software toolkit, just large enough in scale, so that it can be maintained and extended by a small team, optimised for problems that arise in small/medium laboratories. In particular, this system may be managed from data ingestion statistics preparation predictions by a single person. At the system’s core is a graph type database, so that it is flexible in terms of irregular, constantly changing data types, as such data types are common during explorative research. At the system’s outermost shell, the concept of ’user stories’ is introduced to help the end-user researchers perform various tasks separated by their expertise: these range from simple data input, data curation, statistics, and finally to predictions via machine learning algorithms. We compiled a sizable list of already existing, modular Python platform libraries usable for data analysis that may be used as a reference in the field and may be incorporated into this software. We also provide an insight into basic concepts, such as labelled-unlabelled data, supervised vs. unsupervised learning, regression vs. classification, evaluation by different error metrics, and an advanced concept of cross-validation. Finally, we show some examples from our laboratory using our blood sample and blood clot data from thrombosis patients (sufferers from stroke, heart and peripheral thrombosis disease) and how such tools can help to set up realistic expectations and show caveats.

Download Full-text

Improved N-Best Extraction with an Evaluation on Language Data

Computational Linguistics ◽

10.1162/coli_a_00427 ◽

2021 ◽

pp. 1-35

Author(s):

Johanna Björklund ◽

Frank Drewes ◽

Anna Jonsson

Keyword(s):

Language Processing ◽

State Of The Art ◽

Search Space ◽

Data Sets ◽

Weighted Tree ◽

Original Algorithm ◽

Software Toolkit ◽

Minimal Weight ◽

Language Data ◽

Memory Efficient

Abstract We show that a previously proposed algorithm for the N-best trees problem can be made more efficient by changing how it arranges and explores the search space. Given an integer N and a weighted tree automaton (wta) M over the tropical semiring, the algorithm computes N trees of minimal weight with respect to M. Compared to the original algorithm, the modifications increase the laziness of the evaluation strategy, which makes the new algorithm asymptotically more efficient than its predecessor. The algorithm is implemented in the software Betty, and compared to the state-of-the-art algorithm for extracting the N best runs, implemented in the software toolkit Tiburon. The data sets used in the experiments are wtas resulting from real-world natural language processing tasks, as well as artificially created wtas with varying degrees of nondeterminism. We find that Betty outperforms Tiburon on all tested data sets with respect to running time, while Tiburon seems to be the more memory-efficient choice.

Download Full-text

HNU-EBL: A Software Toolkit for Electron Beam Lithography Simulation and Optimization

10.1109/iwaps54037.2021.9671243 ◽

2021 ◽

Author(s):

Wei Liu ◽

Wenze Yao ◽

Chengyang Hou ◽

Hongcheng Xu ◽

Haojie Zhao ◽

...

Keyword(s):

Electron Beam ◽

Electron Beam Lithography ◽

Simulation And Optimization ◽

Software Toolkit ◽

Lithography Simulation

Download Full-text

TChem v3.0 - A Software Toolkit for the Analysis of Complex Kinetic Models.

10.2172/1829197 ◽

2021 ◽

Author(s):

Cosmin Safta ◽

Habib Najm ◽

Oscar Diaz-Ibarra ◽

Kyungjoo Kim

Keyword(s):

Kinetic Models ◽

Software Toolkit

Download Full-text

BleTIES: Annotation of natural genome editing in ciliates using long read sequencing

Bioinformatics ◽

10.1093/bioinformatics/btab613 ◽

2021 ◽

Author(s):

Brandon K B Seah ◽

Estienne C Swart

Keyword(s):

Dna Sequences ◽

Sequence Data ◽

Low Complexity ◽

Supplementary Information ◽

Software Toolkit ◽

Assembly Strategy ◽

Sequencing Technologies ◽

Long Reads ◽

Oxford Nanopore ◽

Long Read

Abstract Summary Ciliates are single-celled eukaryotes that eliminate specific, interspersed DNA sequences (internally eliminated sequences, IESs) from their genomes during development. These are challenging to annotate and assemble because IES-containing sequences are typically much less abundant in the cell than those without, and IES sequences themselves often contain repetitive and low-complexity sequences. Long read sequencing technologies from Pacific Biosciences and Oxford Nanopore have the potential to reconstruct longer IESs than has been possible with short reads, but require a different assembly strategy. Here we present BleTIES, a software toolkit for detecting, assembling, and analyzing IESs using mapped long reads. Availability and implementation BleTIES is implemented in Python 3. Source code is available at https://github.com/Swart-lab/bleties (MIT license), and also distributed via Bioconda. Supplementary information Benchmarking of BleTIES with published sequence data.

Download Full-text

A software toolkit for modeling human sentence parsing: An approach using continuous-time, discrete-state stochastic dynamical systems

10.31234/osf.io/dtazq ◽

2021 ◽

Author(s):

Garrett Smith ◽

Shravan Vasishth

Keyword(s):

Sentence Processing ◽

Continuous Time ◽

Broad Class ◽

Comprehension Question ◽

Stochastic Dynamical Systems ◽

Discrete State ◽

Software Toolkit ◽

Ambiguity Advantage ◽

Python Package ◽

Quantitative Evaluations

We present a new software toolkit for implementing a broad class oftheories of sentence processing. In this framework, processing a word ina sentence is viewed as a continuous-time random walk through a set ofdiscrete states that encode information about the emerging structure of thesentence so far. The state space includes one or more special absorbingstates, which, when reached, indicate the decision to move on to the nextword of the sentence. This setup allows us to ask how how long it takesto reach an absorbing state and what the probability of reaching this stateis. We summarize a number of important statistics that can be directlyrelated to human reading times and comprehension question performance.To illustrate the use of the toolkit, we model two types of garden paths,local coherence effects, and the ambiguity advantage using three qualitativelydifferent theories of sentence processing. While the modeler must still makedefensible theoretical and implementation choices, this framework representsan improvement over the descriptive, paper-pencil modeling that is thenorm in psycholinguistics by facilitating quantitative evaluations of modelperformance and laying the groundwork for Bayesian fitting of free parametersin a model. An open-source Python package is provided.

Download Full-text

USING ASSOCIATIVE RULE CONSTRUCTION METHODS TO IDENTIFY RISK GROUPS IN PATIENTS’ DIAGNOSTIC FINDINGS

Automation and modeling in design and management of ◽

10.30987/2658-6436-2021-2-14-18 ◽

2021 ◽

Vol 2021 (2) ◽

pp. 14-18

Author(s):

Oleg Vdovichenko ◽

Andrey Averchenkov

Keyword(s):

Thyroid Gland ◽

Organizational Support ◽

Association Rule ◽

Ultrasound Examination ◽

Specific Problem ◽

Risk Groups ◽

Software Toolkit ◽

Construction Algorithm ◽

Diagnostic Problems ◽

Construction Methods

The article considers the application of the Apriori association rule construction algorithm to analyze the results of the thyroid gland ultrasound examination. The algorithm is applied to solve a specific problem of organizational support of the thyroid gland examination. A software toolkit has been developed that allows physicians to apply the specified algorithm to carry out the necessary research in the process of solving diagnostic problems.

Download Full-text

GROOPS: A software toolkit for gravity field recovery and GNSS processing

Computers & Geosciences ◽

10.1016/j.cageo.2021.104864 ◽

2021 ◽

pp. 104864

Author(s):

Torsten Mayer-Gürr ◽

Saniya Behzadpour ◽

Annette Eicker ◽

Matthias Ellmer ◽

Beate Koch ◽

...

Keyword(s):

Gravity Field ◽

Software Toolkit ◽

Gravity Field Recovery ◽

Gnss Processing

Download Full-text

Technical Note: SpekPy v2.0—a software toolkit for modelling x‐ray tube spectra

Medical Physics ◽

10.1002/mp.14945 ◽

2021 ◽

Author(s):

Gavin Poludniowski ◽

Artur Omar ◽

Robert Bujila ◽

Pedro Andreo

Keyword(s):

Technical Note ◽

X Ray ◽

Software Toolkit

Download Full-text

CSPlib - A Software Toolkit for the Analysis of Dynamical Systems and Chemical Kinetic Models.

10.2172/1810242 ◽

2021 ◽

Author(s):

Oscar Diaz-Ibarra ◽

Kyungjoo Kim ◽

Cosmin Safta ◽

Habib Najm

Keyword(s):

Dynamical Systems ◽

Kinetic Models ◽

Chemical Kinetic ◽

Software Toolkit

Download Full-text

software toolkit
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Anatomy of a Data Science Software Toolkit That Uses Machine Learning to Aid ‘Bench-to-Bedside’ Medical Research—With Essential Concepts of Data Mining and Analysis Explained

Improved N-Best Extraction with an Evaluation on Language Data

HNU-EBL: A Software Toolkit for Electron Beam Lithography Simulation and Optimization

TChem v3.0 - A Software Toolkit for the Analysis of Complex Kinetic Models.

BleTIES: Annotation of natural genome editing in ciliates using long read sequencing

A software toolkit for modeling human sentence parsing: An approach using continuous-time, discrete-state stochastic dynamical systems

USING ASSOCIATIVE RULE CONSTRUCTION METHODS TO IDENTIFY RISK GROUPS IN PATIENTS’ DIAGNOSTIC FINDINGS

GROOPS: A software toolkit for gravity field recovery and GNSS processing

Technical Note: SpekPy v2.0—a software toolkit for modelling x‐ray tube spectra

CSPlib - A Software Toolkit for the Analysis of Dynamical Systems and Chemical Kinetic Models.

Export Citation Format

software toolkitRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Anatomy of a Data Science Software Toolkit That Uses Machine Learning to Aid ‘Bench-to-Bedside’ Medical Research—With Essential Concepts of Data Mining and Analysis Explained

Improved N-Best Extraction with an Evaluation on Language Data

HNU-EBL: A Software Toolkit for Electron Beam Lithography Simulation and Optimization

TChem v3.0 - A Software Toolkit for the Analysis of Complex Kinetic Models.

BleTIES: Annotation of natural genome editing in ciliates using long read sequencing

A software toolkit for modeling human sentence parsing: An approach using continuous-time, discrete-state stochastic dynamical systems

USING ASSOCIATIVE RULE CONSTRUCTION METHODS TO IDENTIFY RISK GROUPS IN PATIENTS’ DIAGNOSTIC FINDINGS

GROOPS: A software toolkit for gravity field recovery and GNSS processing

Technical Note: SpekPy v2.0—a software toolkit for modelling x‐ray tube spectra

CSPlib - A Software Toolkit for the Analysis of Dynamical Systems and Chemical Kinetic Models.

software toolkit
Recently Published Documents