parallel random access machine Latest Research Papers

Graph Connectivity in Log Steps Using Label Propagation

Parallel Processing Letters ◽

10.1142/s0129626421500213 ◽

2021 ◽

Author(s):

Paul Burkhardt

Keyword(s):

Computational Models ◽

Undirected Graph ◽

Random Access ◽

Memory Systems ◽

Label Propagation ◽

Graph Connectivity ◽

Connected Components ◽

Original Graph ◽

Path Graph ◽

Parallel Random Access Machine

The fastest deterministic algorithms for connected components take logarithmic time and perform superlinear work on a Parallel Random Access Machine (PRAM). These algorithms maintain a spanning forest by merging and compressing trees, which requires pointer-chasing operations that increase memory access latency and are limited to shared-memory systems. Many of these PRAM algorithms are also very complicated to implement. Another popular method is “leader-contraction” where the challenge is to select a constant fraction of leaders that are adjacent to a constant fraction of non-leaders with high probability, but this can require adding more edges than were in the original graph. Instead we investigate label propagation because it is deterministic, easy to implement, and does not rely on pointer-chasing. Label propagation exchanges representative labels within a component using simple graph traversal, but it is inherently difficult to complete in a sublinear number of steps. We are able to overcome the problems with label propagation for graph connectivity. We introduce a surprisingly simple framework for deterministic, undirected graph connectivity using label propagation that is easily adaptable to many computational models. It achieves logarithmic convergence independently of the number of processors and without increasing the edge count. We employ a novel method of propagating directed edges in alternating direction while performing minimum reduction on vertex labels. We present new algorithms in PRAM, Stream, and MapReduce. Given a simple, undirected graph [Formula: see text] with [Formula: see text] vertices, [Formula: see text] edges, our approach takes O(m) work each step, but we can only prove logarithmic convergence on a path graph. It was conjectured by Liu and Tarjan (2019) to take [Formula: see text] steps or possibly [Formula: see text] steps. Our experiments on a range of difficult graphs also suggest logarithmic convergence. We leave the proof of convergence as an open problem.

Download Full-text

Parallel Computing Enabled Cloudd-Based IOT Applications

10.3233/apc210149 ◽

2021 ◽

Author(s):

Abiraami T V ◽

Maithili K ◽

Nivetha J E

Keyword(s):

Parallel Computing ◽

Virtual Machines ◽

Computational Cost ◽

Random Access ◽

Iot Applications ◽

Delay Sensitive ◽

Proposed Model ◽

Sensor Devices ◽

Parallel Random Access Machine ◽

Cloud Servers

In delay-sensitive IoT applications, the acquisition and processing of data from the sensor devices to the cloud-based computing environment result in higher computational cost and inefficient system. The cloud servers dedicated to performing larger tasks are found quite difficult as IoT applications collect data frequently. Hence there is a constant retrieval and updating leads to the synchronization of data in the cloud. The potential of cloud servers and virtual machines tend to lose the ability of computing larger tasks. Hence the scope of parallel computing in cloud-based IoT applications are proposed with certain parallel shared models of computation. The Parallel Random Access Machine is introduced in cloud-based IoT applications. The parallel algorithms are designed to eliminate the conflicts encountered in the proposed model. Hence Conflicts of Concurrent Read Concurrent Write PRAM and Conflicts of Concurrent Read Exclusive Write PRAM algorithms are introduced which promotes the efficiency of cloud-based IoT applications.

Download Full-text

Optimal Algebraic Breadth-First Search for Sparse Graphs

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3446216 ◽

2021 ◽

Vol 15 (5) ◽

pp. 1-19

Author(s):

Paul Burkhardt

Keyword(s):

Adjacency Matrix ◽

Sparse Matrix ◽

Large Diameter ◽

Random Access ◽

Sequential Algorithm ◽

Breadth First Search ◽

Matrix Methods ◽

Sparse Graphs ◽

The Many ◽

Parallel Random Access Machine

There has been a rise in the popularity of algebraic methods for graph algorithms given the development of the GraphBLAS library and other sparse matrix methods. An exemplar for these approaches is Breadth-First Search (BFS). The algebraic BFS algorithm is simply a recurrence of matrix-vector multiplications with the n × n adjacency matrix, but the many redundant operations over nonzeros ultimately lead to suboptimal performance. Therefore an optimal algebraic BFS should be of keen interest especially if it is easily integrated with existing matrix methods. Current methods, notably in the GraphBLAS, use a Sparse Matrix masked-Sparse Vector multiplication in which the input vector is kept in a sparse representation in each step of the BFS, and nonzeros in the vector are masked in subsequent steps. This has been an area of recent research in GraphBLAS and other libraries. While in theory, these masking methods are asymptotically optimal on sparse graphs, many add work that leads to suboptimal runtime. We give a new optimal, algebraic BFS for sparse graphs, thus closing a gap in the literature. Our method multiplies progressively smaller submatrices of the adjacency matrix at each step. Let n and m refer to the number of vertices and edges, respectively. On a sparse graph, our method takes O ( n ) algebraic operations as opposed to O ( m ) operations needed by theoretically optimal sparse matrix approaches. Thus, for sparse graphs, it matches the bounds of the best-known sequential algorithm, and on a Parallel Random Access Machine, it is work-optimal. Our result holds for both directed and undirected graphs. Compared to a leading GraphBLAS library, our method achieves up to 24x faster sequential time, and for parallel computation, it can be 17x faster on large graphs and 12x faster on large-diameter graphs.

Download Full-text

A Parallel Random Access Machine (PRAM) Model for English Language Recognizer (PRAM-ELR)

International Journal of Computer Applications ◽

10.5120/20749-3139 ◽

2015 ◽

Vol 118 (6) ◽

pp. 12-18

Author(s):

Madiha KhurramPasha ◽

Maryam Feroze ◽

Khurram Ahmad Pasha

Keyword(s):

English Language ◽

Random Access ◽

Random Access Machine ◽

Pram Model ◽

Parallel Random Access Machine

Download Full-text

ON THE PERFORMANCE AND COST OF SOME PRAM MODELS ON CMP HARDWARE

International Journal of Foundations of Computer Science ◽

10.1142/s0129054110007325 ◽

2010 ◽

Vol 21 (03) ◽

pp. 387-404 ◽

Cited By ~ 4

Author(s):

MARTTI FORSELL

Keyword(s):

Random Access ◽

Chip Multiprocessor ◽

Pram Model ◽

Vlsi Technology ◽

Cost Efficient ◽

Performance Area ◽

And Performance ◽

On Chip ◽

Parallel Random Access Machine ◽

The Cost

The Parallel Random Access Machine is a very strong model of parallel computing that has resisted cost-efficient implementation attempts for decades. Recently, the development of VLSI technology has provided means for indirect on-chip implementation, but there are different variants of the PRAM model that provide different performance, area and power figures and it is not known how their implementations compare to each others. In this paper we measure the performance and estimate the cost of practical implementations of four PRAM models including EREW, Limited Arbitrary CRCW, Full Arbitrary CRCW, Full Arbitrary Multioperation CRCW on our Eclipse chip multiprocessor framework. Interestingly, the most powerful model shows the lowest simulation cost and highest performance/area and performance/power figures.

Download Full-text

Total Eclipse - an Efficient Architectural Realization of the Parallel Random Access Machine

Parallel and Distributed Computing ◽

10.5772/9446 ◽

2010 ◽

Cited By ~ 11

Author(s):

Martti Forsell

Keyword(s):

Random Access ◽

Random Access Machine ◽

Total Eclipse ◽

Parallel Random Access Machine

Download Full-text

The bulk-synchronous parallel random access machine

Theoretical Computer Science ◽

10.1016/s0304-3975(97)00197-7 ◽

1998 ◽

Vol 196 (1-2) ◽

pp. 109-130 ◽

Cited By ~ 18

Author(s):

Alexandre Tiskin

Keyword(s):

Random Access ◽

Bulk Synchronous Parallel ◽

Random Access Machine ◽

Parallel Random Access Machine

Download Full-text

Transforming comparison model lower bounds to the parallel-random-access-machine

Information Processing Letters ◽

10.1016/s0020-0190(97)00032-x ◽

1997 ◽

Vol 62 (2) ◽

pp. 103-110 ◽

Cited By ~ 3

Author(s):

Dany Breslauer ◽

Artur Czumaj ◽

Devdatt P. Dubhashi ◽

Friedhelm Meyer auf der Heide

Keyword(s):

Lower Bounds ◽

Random Access ◽

Comparison Model ◽

Random Access Machine ◽

Parallel Random Access Machine

Download Full-text

A Simple Optimal Parallel Algorithm for Reporting Paths in a Tree

Parallel Processing Letters ◽

10.1142/s0129626497000036 ◽

1997 ◽

Vol 07 (01) ◽

pp. 3-11 ◽

Cited By ~ 1

Author(s):

Andrzej Lingas ◽

Anil Maheshwari

Keyword(s):

Parallel Algorithm ◽

Random Access ◽

Input Tree ◽

Single Pair ◽

Random Access Machine ◽

Erew Pram ◽

Parallel Random Access Machine ◽

Path Queries

We present optimal parallel solutions to reporting paths between pairs of nodes in an n-node tree. Our algorithms are deterministic and designed to run on an exclusive read exclusive write parallel random-access machine (EREW PRAM). In particular, we provide a simple optimal parallel algorithm for preprocessing the input tree such that the path queries can be answered efficiently. Our algorithm for preprocessing runs in O( log n) time using O(n/ log n) processors. Using the preprocessing, we can report paths between k node pairs in O( log n + log k) time using O(k + (n + S)/ log n) processors on an EREW PRAM, where S is the size of the output. In particular, we can report the path between a single pair of distinct nodes in O( log n) time using O(L/ log n) processors, where L denotes the length of the path.

Download Full-text