A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction

Abstract Whereas accelerated attention beclouded the early stages of the coronavirus spread, knowledge of the actual pathogenicity and origin of possible sub-strains remained unclear. A total of 8864 complete human SARS-CoV-2 genome sequences, harvested from the Global Initiative on Sharing All Influenza Data (GISAID) database (https://www.gisaid.org/) between December 2019 and January 15, 2021 and processed by gender across 6 continents (88 countries) of the world, Antarctica excepted, were analyzed. We hypothesized that the data speak for themselves and can reveal true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis, performed using a hybrid of biotechnology and machine learning methods, corroborates the emergence of inter- and intra- SARS-CoV-2 sub-strains. Interestingly, some viral sub-strain patterns progressively transformed into new sub-strain clusters, indicating varying amino acid and strong nucleotide association derived from the same lineage. A novel cognitive approach to knowledge mining from enriched genome datasets and output classification targets helped intelligent prediction of emerging or new viral sub-strains. Classification results outperformed state-of-the-art methods and sustained an increase in sub-strains within the various continents, with nucleotide mutations dynamically varying between individuals in close association with the virus's adaptability to its host/environment. They also offer explanations for the growing concerns and next wave(s) of the virus. Defuzzifying confusable pattern clusters for comparative performance with the proposed cognitive solution is a possible future research direction of this paper.


A Hybrid Computational Framework for Intelligent Inter-continent SARS-CoV-2 Sub-strains Characterization and Prediction

10.21203/rs.3.rs-88429/v1 ◽

2020 ◽

Author(s):

Moses Ekpenyong ◽

Mercy Edoho ◽

Udoinyang Inyang ◽

Faith-Michael Uzoka ◽

Itemobong Ekaidem ◽

...

Keyword(s):

Machine Learning ◽

Amino Acid ◽

Genome Diversity ◽

Genome Sequences ◽

Computational Framework ◽

Machine Learning Methods ◽

Novel Approach ◽

The World ◽

Global Initiative ◽

Cognitive Knowledge

Abstract Whereas accelerated attention beclouded the early stages of the coronavirus spread, knowledge of the actual pathogenicity and origin of possible sub-strains remained unclear. A total of 157 complete human SARS-CoV-2 genome sequences, harvested from the Global Initiative on Sharing All Influenza Data (GISAID) database (https://www.gisaid.org/) between December 2019 and August 20, 2020 and processed by gender across 6 continents of the world, were analyzed. We hypothesized that the data speak for themselves and can reveal true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis, performed using a hybrid of biotechnology and machine learning methods, corroborates multiple emergence of SARS-CoV-2 sub-strains and explains the diversity of SARS-CoV-2. Interestingly, some viral sub-strains progressively transformed into new sub-strain clusters, indicating varying amino acid and strong nucleotide association derived from the same origin. A novel approach to cognitive knowledge mining from enriched genome datasets and output target labeling helped intelligent prediction of emerging or new viral sub-strains.


A Hybrid Computational Framework for Intelligent Inter-continent SARS-CoV-2 Sub-strains Characterization and Prediction

10.21203/rs.3.rs-88429/v2 ◽

2020 ◽

Author(s):

Moses Ekpenyong ◽

Mercy Edoho ◽

Udoinyang Inyang ◽

Faith-Michael Uzoka ◽

Itemobong Ekaidem ◽

...

Keyword(s):

Machine Learning ◽

Amino Acid ◽

Genome Diversity ◽

Genome Sequences ◽

Computational Framework ◽

Machine Learning Methods ◽

Novel Approach ◽

The World ◽

Global Initiative ◽

Cognitive Knowledge

Abstract Whereas accelerated attention beclouded the early stages of the coronavirus spread, knowledge of the actual pathogenicity and origin of possible sub-strains remained unclear. A total of 157 complete human SARS-CoV-2 genome sequences, harvested from the Global Initiative on Sharing All Influenza Data (GISAID) database (https://www.gisaid.org/) between December 2019 and August 20, 2020 and processed by gender across 6 continents of the world, were analyzed. We hypothesized that the data speak for themselves and can reveal true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis, performed using a hybrid of biotechnology and machine learning methods, corroborates multiple emergence of SARS-CoV-2 sub-strains and explains the diversity of SARS-CoV-2. Interestingly, some viral sub-strains progressively transformed into new sub-strain clusters, indicating varying amino acid and strong nucleotide association derived from the same origin. A novel approach to cognitive knowledge mining from enriched genome datasets and output target labeling helped intelligent prediction of emerging or new viral sub-strains.


A new approach based on graph matching and evolutionary approach for sport scheduling problem

Intelligent Decision Technologies ◽

10.3233/idt-190114 ◽

2020 ◽

pp. 1-16

Author(s):

Meriem Khelifa ◽

Dalila Boughaci ◽

Esma Aïmeur

Keyword(s):

Graph Matching ◽

State Of The Art ◽

Travel Cost ◽

Round Robin ◽

New Approach ◽

Traveling Tournament Problem ◽

Significant Interest ◽

National League ◽

Better Than

The Traveling Tournament Problem (TTP) is concerned with finding a double round-robin tournament schedule that minimizes the total distance traveled by the teams. It has attracted significant interest recently, since a favorable TTP schedule can result in significant savings for the league. This paper proposes an original evolutionary algorithm for TTP. We first propose a quick and effective constructive algorithm to build a Double Round Robin Tournament (DRRT) schedule with low travel cost. We then describe an enhanced genetic algorithm with a new crossover operator to improve the travel cost of the generated schedules. A new heuristic for efficiently ordering the scheduled rounds is also proposed; it leads to a significant enhancement in the quality of the schedules. The overall method is evaluated on publicly available standard benchmarks and compared with other techniques for TTP and UTTP (Unconstrained Traveling Tournament Problem). The computational experiments show that the proposed approach builds very good solutions, comparable to other state-of-the-art approaches or better than the current best solutions on UTTP. Further, our method provides new valuable solutions to some unsolved UTTP instances and outperforms prior methods on all US National League (NL) instances.
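To make the objective concrete, the following is a minimal sketch of a double round-robin schedule and its travel cost. It is not the constructive algorithm or the genetic algorithm from the paper: it uses the classical circle method as a stand-in, ignores any limits on consecutive home or away games, and the function names `double_round_robin` and `total_travel` are hypothetical.

```python
def double_round_robin(n):
    """Build a double round-robin schedule for n teams (n even) with the circle method.

    Returns a list of rounds; each round is a list of (home, away) pairs.
    Assumption: this is a generic textbook construction, not the paper's method.
    """
    if n < 2 or n % 2 != 0:
        raise ValueError("n must be an even number >= 2")
    teams = list(range(n))
    first_half = []
    for r in range(n - 1):
        pairs = []
        for i in range(n // 2):
            a, b = teams[i], teams[n - 1 - i]
            # Flip the venue on odd rounds (a simple, unoptimized choice).
            pairs.append((a, b) if r % 2 == 0 else (b, a))
        first_half.append(pairs)
        # Keep the first team fixed and rotate the others by one position.
        teams = [teams[0], teams[-1]] + teams[1:-1]
    # Second half mirrors the first with home and away swapped.
    second_half = [[(away, home) for home, away in rnd] for rnd in first_half]
    return first_half + second_half


def total_travel(schedule, dist):
    """Sum the distance traveled by all teams.

    Each team starts at its own venue, moves to the venue of each game in
    round order, and returns home after the last round.
    """
    total = 0
    for team in range(len(dist)):
        location = team
        for rnd in schedule:
            for home, away in rnd:
                if team in (home, away):
                    total += dist[location][home]  # games are played at the home venue
                    location = home
        total += dist[location][team]  # trip back home at the end
    return total
```

A search method such as the evolutionary algorithm described above would then explore alternative schedules and keep those with a lower `total_travel` value.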


Fighting Together against the Pandemic: Learning Multiple Models on Tomography Images for COVID-19 Diagnosis

AI ◽

10.3390/ai2020016 ◽

2021 ◽

Vol 2 (2) ◽

pp. 261-273

Author(s):

Mario Manzo ◽

Simone Pellino

Keyword(s):

Network Architecture ◽

State Of The Art ◽

Ensemble Classification ◽

Effective Vaccine ◽

Rt Pcr ◽

Neural Network Architecture ◽

Experimental Phase ◽

Different Types ◽

Polymerase Chain ◽

Better Than

COVID-19 has been a great challenge for humanity since the year 2020. The whole world has made a huge effort to find an effective vaccine in order to save those not yet infected. The alternative solution is early diagnosis, carried out through real-time polymerase chain reaction (RT-PCR) tests or thoracic computed tomography (CT) scan images. Deep learning algorithms, specifically convolutional neural networks, represent a methodology for image analysis. They optimize the classification design task, which is essential for an automatic approach with different types of images, including medical ones. In this paper, we adopt a pretrained deep convolutional neural network architecture in order to diagnose COVID-19 disease from CT images. Our idea is inspired by what the whole of humanity is achieving, as the set of multiple contributions is better than any single one in the fight against the pandemic. First, we adapt, and subsequently retrain for our assumption, some neural architectures that have been adopted in other application domains. Second, we combine the knowledge extracted from images by the neural architectures in an ensemble classification context. Our experimental phase is performed on a CT image dataset, and the results obtained show the effectiveness of the proposed approach with respect to the state-of-the-art competitors.
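The abstract does not say how the outputs of the individual networks are combined. As one common possibility, the sketch below averages per-class probabilities across models (soft voting); both the combination rule and the function name `soft_vote` are assumptions made for illustration, not the authors' stated method.

```python
def soft_vote(prob_lists):
    """Combine per-class probabilities from several models by simple averaging.

    prob_lists: one list of class probabilities per model, all of equal length.
    Returns (predicted_class_index, averaged_probabilities).
    Assumption: soft voting is used here only as an illustrative ensemble rule.
    """
    if not prob_lists:
        raise ValueError("at least one model output is required")
    n_models = len(prob_lists)
    n_classes = len(prob_lists[0])
    if any(len(p) != n_classes for p in prob_lists):
        raise ValueError("all models must output the same number of classes")
    # Average the probability assigned to each class across models.
    avg = [sum(p[c] for p in prob_lists) / n_models for c in range(n_classes)]
    # Pick the class with the highest averaged probability.
    label = max(range(n_classes), key=avg.__getitem__)
    return label, avg
```

For example, with three hypothetical models that output `[0.6, 0.4]`, `[0.3, 0.7]` and `[0.2, 0.8]` for one CT image, the averaged vote favors the second class even though the first model alone would not.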


Cache-efficient sweeping-based interval joins for extended Allen relation predicates

The VLDB Journal ◽

10.1007/s00778-020-00650-5 ◽

2021 ◽

Author(s):

Danila Piatov ◽

Sven Helmer ◽

Anton Dignös ◽

Fabio Persia

Keyword(s):

Data Structure ◽

Experimental Evaluation ◽

State Of The Art ◽

Temporal Databases ◽

Access Method ◽

Wide Range ◽

Interval Relation ◽

Cache Efficient ◽

Join Algorithms ◽

Better Than

Abstract We develop a family of efficient plane-sweeping interval join algorithms for evaluating a wide range of interval predicates such as Allen’s relationships and parameterized relationships. Our technique is based on a framework, components of which can be flexibly combined in different manners to support the required interval relation. In temporal databases, our algorithms can exploit a well-known and flexible access method, the Timeline Index, thus expanding the set of operations it supports even further. Additionally, employing a compact data structure, the gapless hash map, we utilize the CPU cache efficiently. In an experimental evaluation, we show that our approach is several times faster and scales better than state-of-the-art techniques, while being much better suited for real-time event processing.
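The general idea of a sweeping-based interval join can be sketched as follows for the plain overlap predicate. This is a generic endpoint sweep, not the authors' framework: it uses ordinary Python sets for the active tuples rather than the Timeline Index or the gapless hash map, and the function name `overlap_join` is hypothetical.

```python
def overlap_join(r, s):
    """Return index pairs (i, j) such that r[i] and s[j] overlap.

    Intervals are half-open (start, end) tuples with start < end (assumed).
    Assumption: a textbook endpoint sweep, shown only to illustrate the idea.
    """
    events = []
    for i, (start, end) in enumerate(r):
        events.append((start, 1, "r", i))  # 1 marks a start event
        events.append((end, 0, "r", i))    # 0 marks an end event
    for j, (start, end) in enumerate(s):
        events.append((start, 1, "s", j))
        events.append((end, 0, "s", j))
    # Sort by time; at equal times, end events come first so that
    # [1, 3) and [3, 5) are not reported as overlapping.
    events.sort()

    active_r, active_s = set(), set()
    result = []
    for _, is_start, side, idx in events:
        if side == "r":
            if is_start:
                # A starting r-interval overlaps every currently active s-interval.
                result.extend((idx, j) for j in active_s)
                active_r.add(idx)
            else:
                active_r.discard(idx)
        else:
            if is_start:
                result.extend((i, idx) for i in active_r)
                active_s.add(idx)
            else:
                active_s.discard(idx)
    return result
```

Other Allen relations would need different conditions on the start and end events; the sketch covers only generic overlap.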


Domain Adaptation Using a Three-Way Decision Improves the Identification of Autism Patients from Multisite fMRI Data

Brain Sciences ◽

10.3390/brainsci11050603 ◽

2021 ◽

Vol 11 (5) ◽

pp. 603

Author(s):

Chunlei Shi ◽

Xianwei Xin ◽

Jiacai Zhang

Keyword(s):

Machine Learning ◽

Domain Adaptation ◽

Recognition Accuracy ◽

State Of The Art ◽

Autism Spectrum ◽

Fmri Data ◽

Target Domain ◽

Sample Distribution ◽

Machine Learning Methods ◽

First Time

Machine learning methods are widely used in autism spectrum disorder (ASD) diagnosis. Due to the lack of labelled ASD data, multisite data are often pooled together to expand the sample size. However, the heterogeneity that exists among different sites leads to the degeneration of machine learning models. Herein, the three-way decision theory was introduced into unsupervised domain adaptation for the first time, and applied to optimize the pseudolabel of the target domain/site from functional magnetic resonance imaging (fMRI) features related to ASD patients. The experimental results using multisite fMRI data show that our method not only narrows the gap of the sample distribution among domains but is also superior to the state-of-the-art domain adaptation methods in ASD recognition. Specifically, the ASD recognition accuracy of the proposed method is improved on all six tasks, by 70.80%, 75.41%, 69.91%, 72.13%, 71.01% and 68.85%, respectively, compared with the existing methods.


Video Frame Interpolation via Deformable Separable Convolution

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6634 ◽

2020 ◽

Vol 34 (07) ◽

pp. 10607-10614 ◽

Cited By ~ 2

Author(s):

Xianhang Cheng ◽

Zhenzhong Chen

Keyword(s):

State Of The Art ◽

Video Frame ◽

Kernel Size ◽

Frame Interpolation ◽

Interpolation Methods ◽

Video Frames ◽

Convolution Process ◽

Strong Performance ◽

Existing Frames ◽

Better Than

Learning to synthesize non-existing frames from the original consecutive video frames is a challenging task. Recent kernel-based interpolation methods predict pixels with a single convolution process to replace the dependency on optical flow. However, when scene motion is larger than the pre-defined kernel size, these methods yield poor results even though they take thousands of neighboring pixels into account. To solve this problem, we propose to use deformable separable convolution (DSepConv) to adaptively estimate kernels, offsets and masks, allowing the network to obtain information from far fewer but more relevant pixels. In addition, we show that the kernel-based methods and conventional flow-based methods are specific instances of the proposed DSepConv. Experimental results demonstrate that our method significantly outperforms the other kernel-based interpolation methods and shows strong performance, on par with or even better than the state-of-the-art algorithms, both qualitatively and quantitatively.


The State-of-the Art of Solar Photovoltaics and its Future Direction

IEEJ Transactions on Power and Energy ◽

10.1541/ieejpes.128.904 ◽

2008 ◽

Vol 128 (7) ◽

pp. 904-907 ◽

Cited By ~ 13

Author(s):

Kosuke Kurokawa

Keyword(s):

State Of The Art ◽

The State ◽

Solar Photovoltaics ◽

Future Direction


Capsule-LPI: a LncRNA–protein interaction predicting tool based on a capsule network

BMC Bioinformatics ◽

10.1186/s12859-021-04171-y ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Ying Li ◽

Hang Sun ◽

Shiyao Feng ◽

Qi Zhang ◽

Siyu Han ◽

...

Keyword(s):

Protein Interactions ◽

State Of The Art ◽

Recognition Performance ◽

Feature Learning ◽

Biological Processes ◽

Multimodal Features ◽

Learning Architectures ◽

Motif Information ◽

Experimental Comparisons ◽

Better Than

Abstract Background: Long noncoding RNAs (lncRNAs) play important roles in multiple biological processes. Identifying LncRNA–protein interactions (LPIs) is key to understanding lncRNA functions. Although some computational methods for LPIs have been developed, the LPI prediction problem remains challenging. How to integrate multimodal features from more perspectives and build deep learning architectures with better recognition performance has always been the focus of research on LPIs. Results: We present Capsule-LPI, a novel multichannel capsule network framework that integrates multimodal features for LPI prediction. Capsule-LPI integrates four groups of multimodal features, including sequence features, motif information, physicochemical properties and secondary structure features. Capsule-LPI is composed of four feature-learning subnetworks and one capsule subnetwork. Through comprehensive experimental comparisons and evaluations, we demonstrate that both multimodal features and the architecture of the multichannel capsule network can significantly improve the performance of LPI prediction. The experimental results show that Capsule-LPI performs better than the existing state-of-the-art tools. The precision of Capsule-LPI is 87.3%, which represents a 1.7% improvement. The F-value of Capsule-LPI is 92.2%, which represents a 1.4% improvement. Conclusions: This study provides a novel and feasible LPI prediction tool based on the integration of multimodal features and a capsule network. A webserver (http://csbg-jlu.site/lpc/predict) has been developed for the convenience of users.
