Large Matrix Multiplication on a Novel Heterogeneous Parallel DSP Architecture

Author(s):  
Joar Sohl ◽  
Jian Wang ◽  
Dake Liu


Author(s):  
D.T.V. Dharmajee Rao ◽  
K.V. Ramana

Deep Neural Network training algorithms consume long training times, especially when the number of hidden layers and nodes is large. Matrix multiplication is the key operation carried out at every node of each layer, several hundred thousand times, during the training of a Deep Neural Network. Blocking is a well-proven optimization technique for improving the performance of matrix multiplication, and blocked matrix multiplication algorithms can easily be parallelized to accelerate performance further. This paper proposes a novel approach of implementing parallel blocked matrix multiplication algorithms to reduce the long training time. The proposed approach was implemented using the OpenMP parallel programming model with the collapse() clause for the multiplication of the input and weight matrices of the Backpropagation and Boltzmann Machine algorithms for training Deep Neural Networks, and was tested on a multi-core processor system. Experimental results showed that the proposed approach achieved approximately a two-times speedup over the classic algorithms.
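The blocking (tiling) strategy the abstract describes can be sketched compactly. The following is a minimal illustrative Python version, not the paper's implementation (the paper parallelizes the outer block loops in C with OpenMP's collapse() clause); the block size of 2 is an arbitrary choice for the example:

```python
def blocked_matmul(A, B, block=2):
    """Blocked (tiled) matrix multiplication: C = A * B.

    Iterating over sub-blocks improves cache locality; in an OpenMP
    version, the outer block loops would be parallelized with a
    `#pragma omp parallel for collapse(2)` directive.
    """
    n, k, m = len(A), len(B), len(B[0])
    C = [[0.0] * m for _ in range(n)]
    for ii in range(0, n, block):          # block row of C
        for jj in range(0, m, block):      # block column of C
            for kk in range(0, k, block):  # block of the shared dimension
                for i in range(ii, min(ii + block, n)):
                    for j in range(jj, min(jj + block, m)):
                        s = 0.0
                        for p in range(kk, min(kk + block, k)):
                            s += A[i][p] * B[p][j]
                        C[i][j] += s
    return C
```

Each (ii, jj, kk) iteration touches only small tiles of A, B, and C, which is what makes the blocked loop order cache-friendly and easy to parallelize.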


2013 ◽  
Vol 33 (12) ◽  
pp. 3339-3344 ◽  
Author(s):  
Yuanshuai SUN ◽  
Yao CHEN ◽  
Xinjun GUAN ◽  
Chen LIN

PLoS ONE ◽  
2021 ◽  
Vol 16 (8) ◽  
pp. e0256584
Author(s):  
Sam Pimentel ◽  
Youssef Qranfal

The process of integrating observations into a numerical model of an evolving dynamical system, known as data assimilation, has become an essential tool in computational science. These methods, however, are computationally expensive as they typically involve large matrix multiplication and inversion. Furthermore, it is challenging to incorporate a constraint into the procedure, such as requiring a positive state vector. Here we introduce an entirely new approach to data assimilation, one that satisfies an information measure and uses the unnormalized Kullback-Leibler divergence, rather than the standard choice of Euclidean distance. Two sequential data assimilation algorithms are presented within this framework and are demonstrated numerically. These new methods are solved iteratively and do not require an adjoint. We find them to be computationally more efficient than Optimal Interpolation (3D-Var solution) and the Kalman filter whilst maintaining similar accuracy. Furthermore, these Kullback-Leibler data assimilation (KL-DA) methods naturally embed constraints, unlike Kalman filter approaches. They are ideally suited to systems that require positive valued solutions as the KL-DA guarantees this without need of transformations, projections, or any additional steps. This Kullback-Leibler framework presents an interesting new direction of development in data assimilation theory. The new techniques introduced here could be developed further and may hold potential for applications in the many disciplines that utilize data assimilation, especially where there is a need to evolve variables of large-scale systems that must obey physical constraints.
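The unnormalized (generalized) Kullback-Leibler divergence underlying the KL-DA framework, D(p‖q) = Σᵢ (pᵢ log(pᵢ/qᵢ) − pᵢ + qᵢ), is simple to evaluate; the following sketch is a generic illustration of the measure itself, not the authors' assimilation code:

```python
import math

def unnormalized_kl(p, q):
    """Generalized KL divergence for nonnegative vectors p, q:

        D(p || q) = sum_i p_i * log(p_i / q_i) - p_i + q_i.

    Unlike Euclidean distance, it is defined only for nonnegative
    entries, which is why a KL-based objective naturally keeps
    state-vector estimates positive.
    """
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0.0:
            total += pi * math.log(pi / qi) - pi + qi
        else:  # limit of p*log(p/q) - p as p -> 0 is 0
            total += qi
    return total
```

D(p‖q) is zero exactly when p = q and positive otherwise, so it behaves like a distance while remaining tied to the positive orthant.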


Information ◽  
2019 ◽  
Vol 10 (3) ◽  
pp. 115 ◽  
Author(s):  
Fan Lin ◽  
Yingpin Chen ◽  
Lingzhi Wang ◽  
Yuqun Chen ◽  
Wei Zhu ◽  
...  

The total variation (TV) regularization-based methods are proven to be effective in removing random noise. However, these solutions usually suffer from staircase effects. This paper proposes a new image reconstruction method based on TV regularization with an Lp-quasinorm and group gradient sparsity. In this method, the group gradient sparsity regularization term retrieves the neighborhood information of the image gradient, while the Lp-quasinorm constraint characterizes the sparsity of the image gradient. The method can effectively deblur images and remove impulse noise while preserving image edge information and reducing the staircase effect. To improve image recovery efficiency, a Fast Fourier Transform (FFT) is introduced to avoid large matrix multiplication operations. Moreover, an accelerated alternating direction method of multipliers (ADMM), which allows a fast restart of the optimization process, makes the method run faster. In numerical experiments on standard test images sourced from the Emory University and CVG-UGR (Computer Vision Group, University of Granada) image databases, the advantage of the new method is verified by comparison with existing advanced TV-based methods in terms of peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and running time.
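The FFT trick mentioned in the abstract rests on a standard fact: multiplying by a circulant matrix (e.g., a blur operator under periodic boundary conditions) is equivalent to pointwise multiplication in the Fourier domain. A small self-contained Python sketch of that equivalence, using a naive O(n²) DFT for clarity rather than an actual FFT library:

```python
import cmath

def dft(x):
    """Naive DFT (an FFT computes the same values in O(n log n))."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(X):
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]

def circulant_matvec_direct(c, x):
    """y = C x, where C is the circulant matrix with first column c.
    This is exactly circular convolution: y_i = sum_j c[(i-j) mod n] x_j."""
    n = len(x)
    return [sum(c[(i - j) % n] * x[j] for j in range(n)) for i in range(n)]

def circulant_matvec_fourier(c, x):
    """Same product via the convolution theorem:
    DFT(y) = DFT(c) * DFT(x) pointwise, so no n-by-n matrix is formed."""
    Y = [ck * xk for ck, xk in zip(dft(c), dft(x))]
    return [y.real for y in idft(Y)]
```

The Fourier route replaces an O(n²) matrix-vector product with transforms and an elementwise product, which is the saving the paper exploits at image scale.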


Author(s):  
John T. Armstrong

One of the most cited papers in the geological sciences has been that of Albee and Bence on the use of empirical "α-factors" to correct quantitative electron microprobe data. During the past 25 years this method has remained the most commonly used correction for geological samples, despite the facts that few investigators have actually determined empirical α-factors, instead employing tables of α-factors calculated using one of the conventional "ZAF" correction programs; that a number of investigators have shown the assumption of a constant α-factor to be incorrect in binary systems with large matrix corrections (e.g., 2-3); and that the procedure's desirability in terms of program size and computational speed is much less important today because of developments in computing capabilities. The question thus exists whether it is time to honorably retire the Bence-Albee procedure and turn to more modern, robust correction methods. This paper proposes that, although it is perhaps time to retire the original Bence-Albee procedure, it should be replaced by a similar method based on composition-dependent polynomial α-factor expressions.


Author(s):  
Yaniv Aspis ◽  
Krysia Broda ◽  
Alessandra Russo ◽  
Jorge Lobo

We introduce a novel approach for the computation of stable and supported models of normal logic programs in continuous vector spaces by a gradient-based search method. Specifically, the application of the immediate consequence operator of a program reduct can be computed in a vector space. To do this, Herbrand interpretations of a propositional program are embedded as 0-1 vectors in $\mathbb{R}^N$ and program reducts are represented as matrices in $\mathbb{R}^{N \times N}$. Using these representations we prove that the underlying semantics of a normal logic program is captured through matrix multiplication and a differentiable operation. As supported and stable models of a normal logic program can now be seen as fixed points in a continuous space, non-monotonic deduction can be performed using an optimisation process such as Newton's method. We report the results of several experiments using synthetically generated programs that demonstrate the feasibility of the approach and highlight how different parameter values can affect the behaviour of the system.

