benchmark sets
Recently Published Documents


TOTAL DOCUMENTS

25
(FIVE YEARS 13)

H-INDEX

9
(FIVE YEARS 2)

2021 ◽  
Author(s):  
Usman Ghani ◽  
Israel Desta ◽  
Akhil Jindal ◽  
Omeir Khan ◽  
George Jones ◽  
...  

AbstractIt has been demonstrated earlier that the neural network based program AlphaFold2 can be used to dock proteins given the two sequences separated by a gap as the input. The protocol presented here combines AlphaFold2 with the physics based docking program ClusPro. The monomers of the model generated by AlphaFold2 are separated, re-docked using ClusPro, and the resulting 10 models are refined by AlphaFold2. Finally, the five original AlphaFold2 models are added to the 10 AlphaFold2 refined ClusPro models, and the 15 models are ranked by their predicted aligned error (PAE) values obtained by AlphaFold2. The protocol is applied to two benchmark sets of complexes, the first based on the established protein-protein docking benchmark, and the second consisting of only structures released after May 2018, the cut-off date for training AlphaFold2. It is shown that the quality of the initial AlphaFold2 models improves with each additional step of the protocol. In particular, adding the AlphaFold2 refined ClusPro models to the AlphaFold2 models increases the success rate by 23% in the top 5 predictions, whereas considering the 10 models obtained by the combined protocol increases the success rate to close to 40%. The improvement is similar for the second benchmark that includes only complexes distinct from the proteins used for training the neural network.


Author(s):  
Jingyang Zhao ◽  
Mingyu Xiao

The Traveling Tournament Problem is a well-known benchmark problem in tournament timetabling, which asks us to design a schedule of home/away games of n teams (n is even) under some feasibility requirements such that the total traveling distance of all the n teams is minimized. In this paper, we study TTP-2, the traveling tournament problem where at most two consecutive home games or away games are allowed, and give an effective algorithm for n/2 being odd. Experiments on the well-known benchmark sets show that we can beat previously known solutions for all instances with n/2 being odd by an average improvement of 2.66%. Furthermore, we improve the theoretical approximation ratio from 3/2+O(1/n) to 1+O(1/n) for n/2 being odd, answering a challenging open problem in this area.


Mathematics ◽  
2021 ◽  
Vol 9 (13) ◽  
pp. 1515
Author(s):  
Bojan Nikolic ◽  
Aleksandar Kartelj ◽  
Marko Djukanovic ◽  
Milana Grbic ◽  
Christian Blum ◽  
...  

The longest common subsequence (LCS) problem is a prominent NP–hard optimization problem where, given an arbitrary set of input strings, the aim is to find a longest subsequence, which is common to all input strings. This problem has a variety of applications in bioinformatics, molecular biology and file plagiarism checking, among others. All previous approaches from the literature are dedicated to solving LCS instances sampled from uniform or near-to-uniform probability distributions of letters in the input strings. In this paper, we introduce an approach that is able to effectively deal with more general cases, where the occurrence of letters in the input strings follows a non-uniform distribution such as a multinomial distribution. The proposed approach makes use of a time-restricted beam search, guided by a novel heuristic named Gmpsum. This heuristic combines two complementary scoring functions in the form of a convex combination. Furthermore, apart from the close-to-uniform benchmark sets from the related literature, we introduce three new benchmark sets that differ in terms of their statistical properties. One of these sets concerns a case study in the context of text analysis. We provide a comprehensive empirical evaluation in two distinctive settings: (1) short-time execution with fixed beam size in order to evaluate the guidance abilities of the compared search heuristics; and (2) long-time executions with fixed target duration times in order to obtain high-quality solutions. In both settings, the newly proposed approach performs comparably to state-of-the-art techniques in the context of close-to-uniform instances and outperforms state-of-the-art approaches for non-uniform instances.


2021 ◽  
Author(s):  
Marcos Casanova Paez ◽  
Lars Goerigk

<div> <div> <div> <p>Following the work on spin-component and spin-opposite scaled (SCS/SOS) global double hybrids for singlet-singlet excitations by Schwabe and Goerigk [J. Chem. Theory Comput. 2017, 13, 4307-4323] and our own works on new long-range corrected (LC) double hybrids for singlet-singlet and singlet-triplet excitations [J. Chem. Theory Comput. 2019, 15, 4735- 4744; J. Chem. Phys. 2020, 153, 064106], we present new LC double hybrids with SCS/SOS that demonstrate further improvement over previously published results and methods. We introduce new unscaled and scaled versions of different global and LC double hybrids based on Becke88 or PBE exchange combined with LYP, PBE or P86 correlation. For singlet-singlet excitations, we cross-validate them on six benchmark sets that cover small to medium-sized chromophores with different excitation types (local valence, Rydberg, and charge transfer). For singlet-triplet excitations, we perform the cross-validation on three different benchmark sets following the same analysis as in our previous work in 2020. In total, 203 unique excitations are analyzed. Our results confirm and extend those of Schwabe and Goerigk regarding the superior performance of SCS and SOS variants compared to their unscaled parents by decreasing mean absolute deviations, root-mean-square deviations or error spans by more than half and bringing absolute mean deviations closer to zero. Our SCS/SOS variants show to be highly efficient and robust for the computation of vertical excitation energies, which even outperform specialized double hybrids that also contain an LC in their perturbative part. In particular, our new SCS/SOS-ωPBEPP86 and SCS/SOS-ωB88PP86 functional are four of the most accurate and robust methods tested in this work and we fully recommend them for future applications. However, if the relevant SCS and SOS algorithms are not available to the user, we suggest ωB88PP86 as the best unscaled method in this work. </p> </div> </div> </div>


2021 ◽  
Author(s):  
Marcos Casanova Paez ◽  
Lars Goerigk

<div> <div> <div> <p>Following the work on spin-component and spin-opposite scaled (SCS/SOS) global double hybrids for singlet-singlet excitations by Schwabe and Goerigk [J. Chem. Theory Comput. 2017, 13, 4307-4323] and our own works on new long-range corrected (LC) double hybrids for singlet-singlet and singlet-triplet excitations [J. Chem. Theory Comput. 2019, 15, 4735- 4744; J. Chem. Phys. 2020, 153, 064106], we present new LC double hybrids with SCS/SOS that demonstrate further improvement over previously published results and methods. We introduce new unscaled and scaled versions of different global and LC double hybrids based on Becke88 or PBE exchange combined with LYP, PBE or P86 correlation. For singlet-singlet excitations, we cross-validate them on six benchmark sets that cover small to medium-sized chromophores with different excitation types (local valence, Rydberg, and charge transfer). For singlet-triplet excitations, we perform the cross-validation on three different benchmark sets following the same analysis as in our previous work in 2020. In total, 203 unique excitations are analyzed. Our results confirm and extend those of Schwabe and Goerigk regarding the superior performance of SCS and SOS variants compared to their unscaled parents by decreasing mean absolute deviations, root-mean-square deviations or error spans by more than half and bringing absolute mean deviations closer to zero. Our SCS/SOS variants show to be highly efficient and robust for the computation of vertical excitation energies, which even outperform specialized double hybrids that also contain an LC in their perturbative part. In particular, our new SCS/SOS-ωPBEPP86 and SCS/SOS-ωB88PP86 functional are four of the most accurate and robust methods tested in this work and we fully recommend them for future applications. However, if the relevant SCS and SOS algorithms are not available to the user, we suggest ωB88PP86 as the best unscaled method in this work. </p> </div> </div> </div>


Author(s):  
Cameron Shand ◽  
Richard Allmendinger ◽  
Julia Handl ◽  
Andrew Webb ◽  
John Keane

2020 ◽  
Vol 6 (351) ◽  
pp. 97-106
Author(s):  
Jerzy Korzeniewski

The measures of the semantic relatedness of concepts can be categorised into two types: knowledge‑based methods and corpus‑based methods. Knowledge‑based techniques make use of man‑created dictionaries, thesauruses and other artefacts as a source of knowledge. Corpus‑based techniques assess the semantic similarity of two concepts making use of large corpora of text documents. Some researchers claim that knowledge‑based measures outperform corpus‑based ones, but it is much more important to observe that the latter ones are heavily corpus dependent. In this article, we propose to modify the best WordNet‑based method of assessing semantic relatedness, i.e. the Leacock‑Chodorow measure. This measure has proven to be the best in several studies and has a very simple formula. We asses our proposal on the basis of two popular benchmark sets of pairs of concepts, i.e. the Ruben‑Goodenough set of 65 pairs of concepts and the Fickelstein set of 353 pairs of terms. The results prove that our proposal outperforms the traditional Leacock‑Chodorow measure.


2020 ◽  
Vol 60 (12) ◽  
pp. 6612-6623
Author(s):  
Amanda E. Wakefield ◽  
Christine Yueh ◽  
Dmitri Beglov ◽  
Marcelo S. Castilho ◽  
Dima Kozakov ◽  
...  

2020 ◽  
Vol 10 (21) ◽  
pp. 7700
Author(s):  
Wojciech Wieczorek ◽  
Tomasz Jastrzab ◽  
Olgierd Unold

We propose an approach to non-deterministic finite automaton (NFA) inductive synthesis that is based on answer set programming (ASP) solvers. To that end, we explain how an NFA and its response to input samples can be encoded as rules in a logic program. We then ask an ASP solver to find an answer set for the program, which we use to extract the automaton of the required size. We conduct a series of experiments on some benchmark sets, using the implementation of our approach. The results show that our method outperforms, in terms of CPU time, a SAT approach and other exact algorithms on all benchmarks.


GigaScience ◽  
2020 ◽  
Vol 9 (4) ◽  
Author(s):  
Aleksandr Morgulis ◽  
Richa Agarwala

Abstract Background Alignment of sequence reads generated by next-generation sequencing is an integral part of most pipelines analyzing next-generation sequencing data. A number of tools designed to quickly align a large volume of sequences are already available. However, most existing tools lack explicit guarantees about their output. They also do not support searching genome assemblies, such as the human genome assembly GRCh38, that include primary and alternate sequences and placement information for alternate sequences to primary sequences in the assembly. Findings This paper describes SRPRISM (Single Read Paired Read Indel Substitution Minimizer), an alignment tool for aligning reads without splices. SRPRISM has features not available in most tools, such as (i) support for searching genome assemblies with alternate sequences, (ii) partial alignment of reads with a specified region of reads to be included in the alignment, (iii) choice of ranking schemes for alignments, and (iv) explicit criteria for search sensitivity. We compare the performance of SRPRISM to GEM, Kart, STAR, BWA-MEM, Bowtie2, Hobbes, and Yara using benchmark sets for paired and single reads of lengths 100 and 250 bp generated using DWGSIM. SRPRISM found the best results for most benchmark sets with error rate of up to ∼2.5% and GEM performed best for higher error rates. SRPRISM was also more sensitive than other tools even when sensitivity was reduced to improve run time performance. Conclusions We present SRPRISM as a flexible read mapping tool that provides explicit guarantees on results.


Sign in / Sign up

Export Citation Format

Share Document