Structural effects in algorithm performance: A framework and a case study on graph coloring

Author(s):  
Tania Turrubiates Lopez ◽  
Elisa Schaeffer ◽  
Dalia Domiguez-Diaz ◽  
German Dominguez-Carrillo
2015 ◽  
Vol 54 (7) ◽  
pp. 1637-1662 ◽  
Author(s):  
Jason M. Apke ◽  
Daniel Nietfeld ◽  
Mark R. Anderson

AbstractEnhanced temporal and spatial resolution of the Geostationary Operational Environmental Satellite–R Series (GOES-R) will allow for the use of cloud-top-cooling-based convection-initiation (CI) forecasting algorithms. Two such algorithms have been created on the current generation of GOES: the University of Wisconsin cloud-top-cooling algorithm (UWCTC) and the University of Alabama in Huntsville’s satellite convection analysis and tracking algorithm (SATCAST). Preliminary analyses of algorithm products have led to speculation over preconvective environmental influences on algorithm performance. An objective validation approach is developed to separate algorithm products into positive and false indications. Seventeen preconvective environmental variables are examined for the positive and false indications to improve algorithm output. The total dataset consists of two time periods in the late convective season of 2012 and the early convective season of 2013. Data are examined for environmental relationships using principal component analysis (PCA) and quadratic discriminant analysis (QDA). Data fusion by QDA is tested for SATCAST and UWCTC on five separate case-study days to determine whether application of environmental variables improves satellite-based CI forecasting. PCA and significance testing revealed that positive indications favored environments with greater vertically integrated instability (CAPE), less stability (CIN), and more low-level convergence. QDA improved both algorithms on all five case studies using significantly different variables. This study provides an examination of environmental influences on the performance of GOES-R Proving Ground CI forecasting algorithms and shows that integration of QDA in the cloud-top-cooling-based algorithms using environmental variables will ultimately generate a more skillful product.


2018 ◽  
Vol 52 (3) ◽  
pp. 807-818
Author(s):  
Assia Gueham ◽  
Anass Nagih ◽  
Hacene Ait Haddadene ◽  
Malek Masmoudi

In this paper, we focus on the coloration approach and estimation of chromatic number. First, we propose an upper bound of the chromatic number based on the orientation algorithm described in previous studies. This upper bound is further improved by developing a novel coloration algorithm. Second, we make a theoretical and empirical comparison of our bounds with Brooks’s bound and Reed’s conjecture for class of triangle-free graphs. Third, we propose an adaptation of our algorithm to deal with the team building problem respecting several hard and soft constraints. Finally, a real case study from healthcare domain is considered for illustration.


2002 ◽  
Vol 34 (3) ◽  
pp. 297-312 ◽  
Author(s):  
MARNE C. CARIO ◽  
JOHN J. CLIFFORD ◽  
RAYMOND R. HILL ◽  
IAEHWAN YANG ◽  
KEJIAN YANG ◽  
...  

Molecules ◽  
2021 ◽  
Vol 26 (16) ◽  
pp. 4757
Author(s):  
William E. Hackett ◽  
Joseph Zaia

Protein glycosylation that mediates interactions among viral proteins, host receptors, and immune molecules is an important consideration for predicting viral antigenicity. Viral spike proteins, the proteins responsible for host cell invasion, are especially important to be examined. However, there is a lack of consensus within the field of glycoproteomics regarding identification strategy and false discovery rate (FDR) calculation that impedes our examinations. As a case study in the overlap between software, here as a case study, we examine recently published SARS-CoV-2 glycoprotein datasets with four glycoproteomics identification software with their recommended protocols: GlycReSoft, Byonic, pGlyco2, and MSFragger-Glyco. These software use different Target-Decoy Analysis (TDA) forms to estimate FDR and have different database-oriented search methods with varying degrees of quantification capabilities. Instead of an ideal overlap between software, we observed different sets of identifications with the intersection. When clustering by glycopeptide identifications, we see higher degrees of relatedness within software than within glycosites. Taking the consensus between results yields a conservative and non-informative conclusion as we lose identifications in the desire for caution; these non-consensus identifications are often lower abundance and, therefore, more susceptible to nuanced changes. We conclude that present glycoproteomics softwares are not directly comparable, and that methods are needed to assess their overall results and FDR estimation performance. Once such tools are developed, it will be possible to improve FDR methods and quantify complex glycoproteomes with acceptable confidence, rather than potentially misleading broad strokes.


2020 ◽  
Vol 2020 ◽  
pp. 1-9
Author(s):  
Yasin Kirelli ◽  
Seher Arslankaya

As the usage of social media has increased, the size of shared data has instantly surged and this has been an important source of research for environmental issues as it has been with popular topics. Sentiment analysis has been used to determine people's sensitivity and behavior in environmental issues. However, the analysis of Turkish texts has not been investigated much in literature. In this article, sentiment analysis of Turkish tweets about global warming and climate change is determined by machine learning methods. In this regard, by using algorithms that are determined by supervised methods (linear classifiers and probabilistic classifiers) with trained thirty thousand randomly selected Turkish tweets, sentiment intensity (positive, negative, and neutral) has been detected and algorithm performance ratios have been compared. This study also provides benchmarking results for future sentiment analysis studies on Turkish texts.


Sign in / Sign up

Export Citation Format

Share Document