parallel processors Latest Research Papers

This paper focuses on the maximization of the minimum completion time on identical parallel processors. The objective of this maximization is to ensure fair distribution. Let a set of jobs to be assigned to several identical parallel processors. This problem is shown as NP-hard. The research work of this paper is based essentially on the comparison of the proposed heuristics with others cited in the literature review. Our heuristics are developed using essentially the randomization method and the iterative utilization of the knapsack problem to solve the studied problem. Heuristics are assessed by several instances represented in the experimental results. The results show that the knapsack based heuristic gives almost a similar performance than heuristic in a literature review but in better running time.

Download Full-text

CPU AND GPU PERFORMANCE ANALYSIS ON 2D MATRIX OPERATION

Proxies : Jurnal Informatika ◽

10.24167/proxies.v2i1.3194 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1

Author(s):

Kwek Benny Kurniawan ◽

YB Dwi Setianto

Keyword(s):

Graphic Processing Unit ◽

Computing Time ◽

General Purpose ◽

Parallel Processors ◽

Processing Unit ◽

Matrix Operation ◽

Industry Standard ◽

Programming Interfaces ◽

Complete Matrix ◽

Matrix Calculation

GPU or Graphic Processing Unit can be used on many platforms in general GPUs are used for rendering graphics but now GPUs are general purpose parallel processors with support for easily accessible programming interfaces and industry standard languages such as C, Python and Fortran. In this study, the authors will compare CPU and GPU for completing some matrix calculation. To compare between CPU and GPU, the authors have done some testing to observe the use of Processing Unit, memory and computing time to complete matrix calculations by changing matrix sizes and dimensions. The results of tests that have been done shows asynchronous GPU is faster than sequential. Furthermore, thread for GPU needs to be adjusted to achieve efficiency in GPU load.

Download Full-text

Towards the Improving Branch Instructions Identification in High- Performance Processors: Issues, Challenges and Techniques

Recent Advances in Computer Science and Communications ◽

10.2174/2666255814666210210164146 ◽

2021 ◽

Vol 14 ◽

Author(s):

Sweety Nain ◽

Prachi Chaudhary

Keyword(s):

Continuous Flow ◽

High Performance ◽

Branch Prediction ◽

Parallel Processors ◽

Processor Performance ◽

Branch Predictor ◽

Prediction Technique ◽

Execution Speed ◽

Conditional Branch

Introduction: Accurate branch prediction technique has become compulsory in the superscalar and deep pipeline processors. The conditional instructions can break the continuous flow of execution in the pipeline stages, thereby decreasing processor performance. Discussion: This paper highlights the concept of branch prediction, some issues and challenges, and techniques for improving processor performance. Further, this paper also presents the role of branch prediction in different processors and their features. Conclusion: The concept of the branch prediction used in parallel processors to enhance the execution speed of the conditional branch instructions and improve the processor's performance is highlighted in this paper. Further, this paper highlights the branch predictor techniques with their features and presents the challenges, issues, and future techniques related to the branch prediction.

Download Full-text

Two Deadline Reduction Algorithms for Scheduling Dependent Tasks on Parallel Processors

Integration of Constraint Programming, Artificial Intelligence, and Operations Research - Lecture Notes in Computer Science ◽

10.1007/978-3-030-78230-6_14 ◽

2021 ◽

pp. 214-230

Author(s):

Claire Hanen ◽

Alix Munier Kordon ◽

Theo Pedersen

Keyword(s):

Parallel Processors ◽

Dependent Tasks

Download Full-text

Applications of highly parallel processors

Parallel Computing ◽

10.1201/9781003069522-29 ◽

2020 ◽

pp. 269-279

Author(s):

Heather M Liddell

Keyword(s):

Parallel Processors

Download Full-text

Scheduling with deterioration effects and maintenance activities under parallel processors

Engineering Optimization ◽

10.1080/0305215x.2020.1844194 ◽

2020 ◽

pp. 1-18

Author(s):

Hongyu He ◽

Yang Hu ◽

Wei-Wei Liu

Keyword(s):

Parallel Processors ◽

Maintenance Activities

Download Full-text

A17 Amacrine Cells and Olfactory Granule Cells: Parallel Processors of Early Sensory Information

Frontiers in Cellular Neuroscience ◽

10.3389/fncel.2020.600537 ◽

2020 ◽

Vol 14 ◽

Author(s):

Veronica Egger ◽

Jeffrey S. Diamond

Keyword(s):

Granule Cells ◽

Sensory Information ◽

Amacrine Cells ◽

Parallel Processors

Download Full-text

Cyclic Scheduling for Parallel Processors with Precedence Constrains

Journal of Physics Conference Series ◽

10.1088/1742-6596/1658/1/012019 ◽

2020 ◽

Vol 1658 ◽

pp. 012019

Author(s):

N Grigoreva

Keyword(s):

Cyclic Scheduling ◽

Parallel Processors

Download Full-text

Concurrent Binary Trees (with application to longest edge bisection)

Proceedings of the ACM on Computer Graphics and Interactive Techniques ◽

10.1145/3406186 ◽

2020 ◽

Vol 3 (2) ◽

pp. 1-20

Author(s):

Jonathan Dupuy

Keyword(s):

Processing Speed ◽

Binary Tree ◽

Large Scale ◽

Binary Search ◽

Parallel Processors ◽

Leaf Node ◽

Binary Trees ◽

Bitwise Operations

We introduce the concurrent binary tree (CBT), a novel concurrent representation to build and update arbitrary binary trees in parallel. Fundamentally, our representation consists of a binary heap, i.e., a 1D array, that explicitly stores the sum-reduction tree of a bitfield. In this bitfield, each one-valued bit represents a leaf node of the binary tree encoded by the CBT, which we locate algorithmically using a binary-search over the sum-reduction. We show that this construction allows to dispatch down to one thread per leaf node and that, in turn, these threads can safely split and/or remove nodes concurrently via simple bitwise operations over the bitfield. The practical benefit of CBTs lies in their ability to accelerate binary-tree-based algorithms with parallel processors. To support this claim, we leverage our representation to accelerate a longest-edge-bisection-based algorithm that computes and renders adaptive geometry for large-scale terrains entirely on the GPU. For this specific algorithm, the CBT accelerates processing speed linearly with the number of processors.

Download Full-text

parallel processors
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Power Quality Parameters Calculation Using FPGA Embedded Parallel Processors in Compliance with the IEC 61000-4-30 Standard

Max-Min Processors Scheduling

CPU AND GPU PERFORMANCE ANALYSIS ON 2D MATRIX OPERATION

Towards the Improving Branch Instructions Identification in High- Performance Processors: Issues, Challenges and Techniques

Two Deadline Reduction Algorithms for Scheduling Dependent Tasks on Parallel Processors

Applications of highly parallel processors

Scheduling with deterioration effects and maintenance activities under parallel processors

A17 Amacrine Cells and Olfactory Granule Cells: Parallel Processors of Early Sensory Information

Cyclic Scheduling for Parallel Processors with Precedence Constrains

Concurrent Binary Trees (with application to longest edge bisection)

Export Citation Format

parallel processorsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Power Quality Parameters Calculation Using FPGA Embedded Parallel Processors in Compliance with the IEC 61000-4-30 Standard

Max-Min Processors Scheduling

CPU AND GPU PERFORMANCE ANALYSIS ON 2D MATRIX OPERATION

Towards the Improving Branch Instructions Identification in High- Performance Processors: Issues, Challenges and Techniques

Two Deadline Reduction Algorithms for Scheduling Dependent Tasks on Parallel Processors

Applications of highly parallel processors

Scheduling with deterioration effects and maintenance activities under parallel processors

A17 Amacrine Cells and Olfactory Granule Cells: Parallel Processors of Early Sensory Information

Cyclic Scheduling for Parallel Processors with Precedence Constrains

Concurrent Binary Trees (with application to longest edge bisection)

parallel processors
Recently Published Documents