A parallel algorithm for stiffness matrix assembling in a shared memory environment

Abstract This paper proposes a method that uses a bintree structure to express the states of the packing space in rectangular packing. The packing space is decomposed sequentially: at each step, the best piece that satisfies specific conditions is placed at the upper-left corner of the current packing space, which yields an optimal packing scheme for rectangles of various sizes. Different optimal packing schemes that satisfy different demands can be obtained by adjusting the values of the ordering factors KA and KB. By analyzing the parallelism of the bintree expression, a parallel algorithm is designed for an SIMD-CREW shared-memory computer. The whole packing process is clearly expressed by the bintree. The computational complexity of the algorithm is shown to be O(n^2 log n). Both the experimental results and a comparison with other sequential packing algorithms show that the parallel packing algorithm is efficient, and that it nearly doubles the problem-solving speed.

A Shared Memory Parallel Algorithm for Logic Synthesis

The Sixth International Conference on VLSI Design ◽

10.1109/icvd.1993.669703 ◽

2005 ◽

Cited By ~ 1

Author(s):

Chieng-Fai Lim ◽

P. Banerjee ◽

K. De ◽

S. Muroga

Keyword(s):

Parallel Algorithm ◽

Shared Memory ◽

Logic Synthesis

RNA Secondary Structure Prediction Parallel Algorithm on Shared Memory Multicore Architecture

Algorithms for Intelligent Systems - Proceedings of Integrated Intelligence Enable Networks and Computing ◽

10.1007/978-981-33-6307-6_34 ◽

2021 ◽

pp. 327-337

Author(s):

Pradnya S. Borkar ◽

Vijaya P. Balpande ◽

Anjali R. Mahajan

Keyword(s):

Secondary Structure ◽

Parallel Algorithm ◽

Shared Memory ◽

Structure Prediction ◽

RNA Secondary Structure ◽

Secondary Structure Prediction ◽

Multicore Architecture ◽

RNA Secondary Structure Prediction

Asynchronous parallel algorithm for mining association rules on a shared-memory multi-processors

Proceedings of the tenth annual ACM symposium on Parallel algorithms and architectures - SPAA '98 ◽

10.1145/277651.277694 ◽

1998 ◽

Cited By ~ 18

Author(s):

David W. Cheung ◽

Kan Hu ◽

Shaowei Xia

Keyword(s):

Parallel Algorithm ◽

Shared Memory ◽

Association Rules ◽

Mining Association Rules ◽

Asynchronous Parallel

Area Optimization of Slicing Floorplans in Parallel

VLSI Design ◽

10.1155/1994/63707 ◽

1994 ◽

Vol 2 (2) ◽

pp. 143-156

Author(s):

Cheng-Hsi Chen ◽

Ioannis G. Tollis

Keyword(s):

Parallel Algorithms ◽

Parallel Algorithm ◽

Shared Memory ◽

Distributed System ◽

Sequential Algorithm ◽

Area Optimization

We first present a parallel algorithm for finding the optimal implementations for the modules of a slicing floorplan that respects a given slicing tree. The algorithm runs in O(n) time and requires O(n) processors, where n is the number of modules. It is based on a new O(n^2) sequential algorithm for solving the above problem. We then present a parallel algorithm for finding a set of optimal implementations for a slicing floorplan whose corresponding slicing tree has height O(log n). This algorithm runs in O(n) time using O(log n) processors. Our parallel algorithms do not need shared memory and can be implemented in a distributed system.
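
To make the underlying problem concrete, the following is a minimal sequential sketch of area optimization over a slicing tree. It is not the O(n^2) sequential algorithm or the parallel algorithms from the abstract. The tree encoding (a leaf is a list of candidate `(width, height)` pairs, an internal node is a `(cut, left, right)` tuple) and the names `prune`, `combine`, and `implementations` are assumptions made for illustration.

```python
def prune(shapes):
    """Keep only non-dominated (width, height) pairs, sorted by width."""
    best = []
    for w, h in sorted(set(shapes)):
        # A wider shape is worth keeping only if it is strictly shorter.
        if not best or h < best[-1][1]:
            best.append((w, h))
    return best


def combine(left, right, cut):
    """Merge the candidate shapes of two sub-floorplans.

    cut == "V": children sit side by side (widths add, height is the max).
    cut == "H": children are stacked (heights add, width is the max).
    """
    out = []
    for wl, hl in left:
        for wr, hr in right:
            if cut == "V":
                out.append((wl + wr, max(hl, hr)))
            else:
                out.append((max(wl, wr), hl + hr))
    return prune(out)


def implementations(node):
    """Return the non-dominated shapes of the floorplan rooted at node."""
    if isinstance(node, tuple):  # internal node: (cut, left, right)
        cut, left, right = node
        return combine(implementations(left), implementations(right), cut)
    return prune(node)  # leaf: list of (width, height) candidates


# Two modules, each usable as 1x2 or 2x1, placed side by side.
module = [(1, 2), (2, 1)]
shapes = implementations(("V", module, module))
smallest = min(shapes, key=lambda s: s[0] * s[1])
```

In this sketch the two subtrees of any internal node are evaluated independently of each other, which is the kind of independence a parallel version could exploit.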

A Parallel Algorithm for Stiffness Matrix Decomposition Using Threadpool Method

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.594-597.2880 ◽

2012 ◽

Vol 594-597 ◽

pp. 2880-2885

Author(s):

Jun Tao Chen ◽

Ming Xiao ◽

Hui Bo Liu

Keyword(s):

Finite Element ◽

Parallel Algorithm ◽

Stiffness Matrix ◽

Decomposition Method ◽

High Efficiency ◽

Matrix Decomposition ◽

One Dimensional ◽

Numerical Tests ◽

Element Simulation ◽

Nested Loops

To shorten computation time in finite element simulation on multithreaded computers, a parallel algorithm for stiffness matrix decomposition based on the thread pool method is proposed. First, a decomposition method suited to parallel computation is derived by transforming Cholesky's LL^T method. Then, a thread pool is employed to create threads that are reused, and the work is optimized for load balance across threads. Finally, numerical tests applying the proposed algorithm to the decomposition of stiffness matrices stored as one-dimensional arrays are carried out on different multiprocessor platforms. The results show that the parallel algorithm overcomes the limitations of OpenMP in nested loops and decomposes stiffness matrices efficiently with low platform demands. The algorithm is conceptually clear, easy to program, and particularly suited to problems caused by the limitations of OpenMP.
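
As a rough illustration of the idea, the sketch below runs a column-oriented Cholesky LL^T factorization in which the entries below the diagonal of each column are computed by a reusable thread pool. It is a minimal sketch under stated assumptions, not the authors' method: the matrix is a dense list of lists rather than a one-dimensional stored array, no load balancing is attempted, and the name `cholesky_threadpool` is hypothetical. In CPython the global interpreter lock limits real speedup, so the code shows the dependency structure rather than the performance.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def cholesky_threadpool(a, workers=4):
    """Return lower-triangular L with L * L^T == a.

    a must be a symmetric positive-definite matrix given as a list of rows.
    """
    n = len(a)
    L = [[0.0] * n for _ in range(n)]

    def fill_entry(i, j):
        # L[i][j] depends only on finished columns 0..j-1 and on L[j][j].
        s = sum(L[i][k] * L[j][k] for k in range(j))
        L[i][j] = (a[i][j] - s) / L[j][j]

    # One pool is created once and reused for every column.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for j in range(n):
            d = a[j][j] - sum(L[j][k] ** 2 for k in range(j))
            if d <= 0.0:
                raise ValueError("matrix is not positive definite")
            L[j][j] = math.sqrt(d)
            # Entries below the diagonal in column j are mutually
            # independent, so they can be handed to the pool together.
            # list() waits for all of them before the next column starts.
            list(pool.map(lambda i: fill_entry(i, j), range(j + 1, n)))
    return L


stiffness = [[4.0, 12.0, -16.0],
             [12.0, 37.0, -43.0],
             [-16.0, -43.0, 98.0]]
factor = cholesky_threadpool(stiffness)
```

The columns themselves must still be processed in order, because column j reads every earlier column. Only the work inside a column is distributed.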

An implementation of ray tracing algorithm for the multiprocessor machines

Yugoslav journal of operations research ◽

10.2298/yjor0601125s ◽

2006 ◽

Vol 16 (1) ◽

pp. 125-135 ◽

Cited By ~ 1

Author(s):

Aleksandar Samardzic ◽

Dusan Starcevic ◽

Milan Tuba

Keyword(s):

Parallel Algorithm ◽

Ray Tracing ◽

Shared Memory ◽

Parallel Machines ◽

Lighting Condition ◽

POSIX Threads ◽

Scene Description ◽

Tracing Algorithm

Ray tracing is an algorithm for generating photo-realistic images of 3D scenes, given a scene description, lighting conditions, and viewing parameters as inputs. The algorithm lends itself naturally to parallelization, and the simplest parallelization scheme targets shared-memory parallel machines (multiprocessors). This paper presents two implementations of the algorithm developed by the authors for such machines, one using the POSIX threads API and the other using the OpenMP API. The paper also presents the results of rendering several test scenes with these implementations and discusses the efficiency of the parallel version.

The MPI and OpenMP Implementation of Parallel Algorithm for Generating Mandelbrot Set

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.571-572.26 ◽

2014 ◽

Vol 571-572 ◽

pp. 26-29

Author(s):

Xiang Wei Duan ◽

Wei Chang Shen ◽

Jun Guo

Keyword(s):

Parallel Algorithm ◽

Shared Memory ◽

Message Passing ◽

Message Passing Interface ◽

Algorithm Design ◽

Performance Testing ◽

Mandelbrot Set ◽

The Difference ◽

And Performance

The paper introduces the Mandelbrot set, the Message Passing Interface (MPI), and the shared-memory model (OpenMP), and analyzes the characteristics of algorithm design in the MPI and OpenMP environments. It describes the implementation of a parallel algorithm for the Mandelbrot set in each environment, reports a series of evaluations and performance tests carried out during execution, and compares the two implementations.
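
The Mandelbrot set is easy to parallelize because every pixel is computed independently. The sketch below illustrates a shared-memory, row-wise decomposition with a Python thread pool. It is only an analogy to a parallel loop over rows, not the paper's MPI or OpenMP code. The viewport bounds, the default iteration limit, and the names `escape_count`, `render_row`, and `mandelbrot` are assumptions chosen for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def escape_count(c, max_iter):
    """Number of iterations of z -> z*z + c before |z| exceeds 2."""
    z = 0j
    for n in range(max_iter):
        if abs(z) > 2.0:
            return n
        z = z * z + c
    return max_iter  # treated as inside the set


def render_row(y, width, height, max_iter):
    """Compute one scanline of the region [-2, 1] x [-1.5, 1.5]."""
    im = -1.5 + 3.0 * y / (height - 1)
    return [
        escape_count(complex(-2.0 + 3.0 * x / (width - 1), im), max_iter)
        for x in range(width)
    ]


def mandelbrot(width, height, max_iter=50, workers=4):
    """Render the grid with rows distributed over a thread pool.

    width and height must both be at least 2.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Rows share no state, so they can be computed in any order;
        # pool.map still returns them in row order.
        return list(
            pool.map(lambda y: render_row(y, width, height, max_iter),
                     range(height))
        )


grid = mandelbrot(9, 7, max_iter=30)
```

Rows near the set cost more iterations than rows far from it, so a static split of rows can leave workers unevenly loaded. Handing out rows one at a time, as the pool does here, is one simple way to even this out.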
