Matrix Computation with Sparse Matrices

2019 ◽  
pp. 127-160
Author(s):  
S. Ratnajeevan H. Hoole ◽  
Yovahn Yesuraiyan R. Hoole

Author(s):  
I Misztal ◽  
I Aguilar ◽  
D Lourenco ◽  
L Ma ◽  
J Steibel ◽  
...  

Abstract Genomic selection is now practiced successfully across many species. However, many questions remain, such as long-term effects, estimation of genomic parameters, robustness of GWAS with small and large datasets, and stability of genomic predictions. This study summarizes presentations at the 2020 ASAS symposium. Until now, most studies have focused on linkage disequilibrium (LD) between two loci; ignoring higher-level equilibria may lead to phantom dominance and epistasis. The Bulmer effect leads to a reduction of the additive variance; however, selection for an increased recombination rate can release genetic variance anew. With genomic information, estimates of genetic parameters may be biased by genomic preselection, but costs of estimation can increase drastically due to the dense form of the genomic information. To make computation of estimates feasible, genotypes could be retained only for the most important animals, and methods of estimation should use algorithms that can recognize dense blocks in sparse matrices. GWAS using small genomic datasets frequently find many marker-trait associations, whereas studies using much bigger datasets find only a few. Most current tools use very simple models for GWAS, possibly causing artifacts. These models are adequate for large datasets where pseudo-phenotypes such as deregressed proofs indirectly account for important effects for traits of interest. Artifacts arising in GWAS with small datasets can be minimized by using data from all animals (whether genotyped or not), realistic models, and methods that account for population structure. Recent developments permit the computation of p-values both for GBLUP, where models can be arbitrarily complex but are restricted to genotyped animals only, and for single-step GBLUP, which also uses phenotypes from ungenotyped animals.
Stability was an important part of nongenomic evaluations, where genetic predictions were stable in the absence of new data even with low prediction accuracies. Unfortunately, genomic evaluations for such animals change because all animals with genotypes are connected. A top-ranked animal can easily drop in the next evaluation, causing a crisis of confidence in genomic evaluations. While correlations between consecutive genomic evaluations are high, outliers can have differences as high as one SD. A solution to fluctuating genomic evaluations is to base selection decisions on groups of animals. While many issues in genomic selection have been solved, many new issues that require additional research continue to surface.
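The abstract above mentions algorithms that "recognize dense blocks in sparse matrices" to keep estimation feasible. A minimal illustrative sketch of that idea (not the authors' algorithm; the function name and threshold are assumptions) is to scan a row-wise sparse representation and group consecutive rows whose fill-in exceeds a threshold, so a solver can store those rows as a dense submatrix:

```python
# Illustrative sketch only, not the method from the study above.
# A sparse matrix is held row-wise as {column: value} dicts; rows with
# many nonzeros (e.g., from genotyped animals) form a dense block that
# a solver could store and factor as a dense submatrix.

def find_dense_row_blocks(rows, n_cols, threshold=0.5):
    """Return (start, stop) ranges of consecutive rows whose
    nonzero fraction exceeds `threshold` (stop is exclusive)."""
    blocks = []
    start = None
    for i, row in enumerate(rows):
        dense = len(row) / n_cols > threshold
        if dense and start is None:
            start = i                 # a dense run begins
        elif not dense and start is not None:
            blocks.append((start, i))  # a dense run ends
            start = None
    if start is not None:
        blocks.append((start, len(rows)))
    return blocks

# Toy 6x6 pattern: rows 2-4 are fully dense, the rest are diagonal-only.
n = 6
rows = [{i: 1.0} for i in range(n)]
for i in range(2, 5):
    rows[i] = {j: 1.0 for j in range(n)}

blocks = find_dense_row_blocks(rows, n)   # -> [(2, 5)]
```

Real implementations would also exploit the block inside the factorization itself; this sketch only shows the structure-recognition step.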


2016 ◽  
Vol 78 (8-2) ◽  
Author(s):  
Norma Alias ◽  
Nadia Nofri Yeni Suhari ◽  
Hafizah Farhah Saipan Saipol ◽  
Abdullah Aysh Dahawi ◽  
Masyitah Mohd Saidi ◽  
...  

This paper proposes several real-life applications of big data analytics using parallel computing software. The parallel computing software under consideration includes Parallel Virtual Machine (PVM), MATLAB Distributed Computing Server, and Compute Unified Device Architecture (CUDA), used to simulate the big data problems. Parallel computing is able to overcome the poor runtime performance, speedup, and efficiency of sequential computing. The mathematical models for the big data analytics are based on partial differential equations; large sparse matrices are obtained from the discretization and assembly of the linear equation system. Iterative numerical schemes are used to solve the problems, and the computational process is summarized in a parallel algorithm. The parallel algorithm development is therefore based on domain decomposition of the problems and on the architecture of the different parallel computing software. The parallel performance for distributed- and shared-memory architectures is evaluated in terms of speedup, efficiency, effectiveness, and temporal performance.
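The pipeline described above (PDE → discretization → large sparse linear system → iterative scheme) can be sketched on a tiny example. The following is an assumption-laden illustration, not code from the paper: a 1-D Poisson problem is discretized into a tridiagonal sparse system and solved with Jacobi iteration, the simplest of the iterative schemes the abstract refers to (and one that parallelizes naturally, since each row update is independent):

```python
# Illustrative sketch (not from the paper): the sparse system from a
# 1-D Poisson discretization, -u'' = f on (0,1) with u(0) = u(1) = 0,
# stored row-wise as {column: value} dicts and solved by Jacobi iteration.

def poisson_1d(n):
    """Tridiagonal matrix (2, -1, -1)/h^2 with n interior grid points."""
    h2 = 1.0 / (n + 1) ** 2
    rows = []
    for i in range(n):
        row = {i: 2.0 / h2}
        if i > 0:
            row[i - 1] = -1.0 / h2
        if i < n - 1:
            row[i + 1] = -1.0 / h2
        rows.append(row)
    return rows

def jacobi(rows, b, iters=5000):
    """Jacobi sweep: x_i <- (b_i - sum_{j != i} a_ij x_j) / a_ii.
    Each component uses only the previous iterate, so the updates
    within one sweep could run in parallel."""
    n = len(b)
    x = [0.0] * n
    for _ in range(iters):
        x = [(b[i] - sum(v * x[j] for j, v in rows[i].items() if j != i))
             / rows[i][i]
             for i in range(n)]
    return x

# f = 1 everywhere; the exact solution is u(x) = x(1 - x)/2.
n = 15
rows = poisson_1d(n)
b = [1.0] * n
x = jacobi(rows, b)
mid = x[n // 2]   # grid midpoint x = 0.5, where u(0.5) = 0.125
```

In a domain-decomposition setting of the kind the paper describes, each process would own a contiguous slice of rows and exchange only the boundary values of `x` with its neighbors between sweeps.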


2010 ◽  
Vol 98 (6) ◽  
pp. 937-947 ◽  
Author(s):  
Anna Gilbert ◽  
Piotr Indyk

2012 ◽  
Vol 20 (3) ◽  
pp. 241-255 ◽  
Author(s):  
Eric Bavier ◽  
Mark Hoemmen ◽  
Sivasankaran Rajamanickam ◽  
Heidi Thornquist

Solvers for large sparse linear systems come in two categories: direct and iterative. Amesos2, a package in the Trilinos software project, provides direct methods, and Belos, another Trilinos package, provides iterative methods. Amesos2 offers a common interface to many different sparse matrix factorization codes, and can handle any implementation of sparse matrices and vectors, via an easy-to-extend C++ traits interface. It can also factor matrices whose entries have arbitrary “Scalar” type, enabling extended-precision and mixed-precision algorithms. Belos includes many different iterative methods for solving large sparse linear systems and least-squares problems. Unlike competing iterative solver libraries, Belos completely decouples the algorithms from the implementations of the underlying linear algebra objects. This lets Belos exploit the latest hardware without changes to the code. Belos favors algorithms that solve higher-level problems, such as multiple simultaneous linear systems and sequences of related linear systems, faster than standard algorithms. The package also supports extended-precision and mixed-precision algorithms. Together, Amesos2 and Belos form a complete suite of sparse linear solvers.
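The direct/iterative split that Amesos2 and Belos embody can be made concrete on a toy system. The sketch below is not Trilinos code (Trilinos is C++; all names here are illustrative): the same small SPD system is solved once by Gaussian elimination, standing in for the factorization route Amesos2 wraps, and once by the conjugate gradient method, a Krylov iteration of the kind Belos provides:

```python
# Hedged illustration of the direct-vs-iterative distinction; this is
# NOT the Amesos2 or Belos API, just the two solver families in miniature.

def solve_direct(A, b):
    """Direct route: Gaussian elimination with back-substitution."""
    n = len(b)
    A = [row[:] for row in A]   # work on copies
    b = b[:]
    for k in range(n):
        for i in range(k + 1, n):
            m = A[i][k] / A[k][k]
            for j in range(k, n):
                A[i][j] -= m * A[k][j]
            b[i] -= m * b[k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(A[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (b[i] - s) / A[i][i]
    return x

def solve_cg(A, b, iters=50):
    """Iterative route: conjugate gradients for a symmetric
    positive-definite A, touching A only through mat-vec products."""
    n = len(b)
    x = [0.0] * n
    r = b[:]          # residual b - A x for x = 0
    p = r[:]
    rs = sum(ri * ri for ri in r)
    for _ in range(iters):
        Ap = [sum(A[i][j] * p[j] for j in range(n)) for i in range(n)]
        alpha = rs / sum(p[i] * Ap[i] for i in range(n))
        x = [x[i] + alpha * p[i] for i in range(n)]
        r = [r[i] - alpha * Ap[i] for i in range(n)]
        rs_new = sum(ri * ri for ri in r)
        if rs_new < 1e-20:
            break
        p = [r[i] + (rs_new / rs) * p[i] for i in range(n)]
        rs = rs_new
    return x

A = [[4.0, -1.0, 0.0],
     [-1.0, 4.0, -1.0],
     [0.0, -1.0, 4.0]]
b = [1.0, 2.0, 3.0]
xd = solve_direct(A, b)
xc = solve_cg(A, b)    # agrees with xd to rounding error
```

The key design point the abstract highlights is visible even here: the iterative solver touches `A` only through matrix-vector products, which is why Belos can decouple its algorithms from the underlying linear algebra objects, while a direct solver must manipulate the matrix entries themselves.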

