single instruction multiple data Latest Research Papers

Block aligner: fast and flexible pairwise sequence alignment with SIMD-accelerated adaptive blocks

10.1101/2021.11.08.467651 ◽

2021 ◽

Author(s):

Daniel Liu ◽

Martin Steinegger

Keyword(s):

Protein Sequences ◽

Single Instruction Multiple Data ◽

Alignment Algorithm ◽

Biological Sequences ◽

Pairwise Sequence Alignment ◽

Popular Method ◽

Multiple Data ◽

Long Reads ◽

Speed Up ◽

Scoring Schemes

Background: The Smith-Waterman-Gotoh alignment algorithm is the most popular method for comparing biological sequences. Recently, Single Instruction Multiple Data methods have been used to speed up alignment. However, these algorithms have limitations like being optimized for specific scoring schemes, cannot handle large gaps, or require quadratic time computation. Results: We propose a new algorithm called block aligner for aligning nucleotide and protein sequences. It greedily shifts and grows a block of computed scores to span large gaps within the aligned sequences. This greedy approach is able to only compute a fraction of the DP matrix. In exchange for these features, there is no guarantee that the computed scores are accurate compared to full DP. However, in our experiments, we show that block aligner performs accurately on various realistic datasets, and it is up to 9 times faster than the popular Farrar's algorithm for protein global alignments. Conclusions: Our algorithm has applications in computing global alignments and X-drop alignments on proteins and long reads. It is available as a Rust library at https://github.com/Daniel-Liu-c0deb0t/block-aligner.

Get full-text (via PubEx)

Parallel Privacy-Preserving Shortest Path Algorithms

Cryptography ◽

10.3390/cryptography5040027 ◽

2021 ◽

Vol 5 (4) ◽

pp. 27

Author(s):

Mohammad Anagreh ◽

Peeter Laud ◽

Eero Vainikko

Keyword(s):

Network Performance ◽

Transitive Closure ◽

Control Flow ◽

Single Instruction Multiple Data ◽

Multiple Data ◽

Shortest Distance ◽

Dense Graphs ◽

Performance Profiles ◽

Shortest Path Algorithms ◽

Memory Accesses

In this paper, we propose and present secure multiparty computation (SMC) protocols for single-source shortest distance (SSSD) and all-pairs shortest distance (APSD) in sparse and dense graphs. Our protocols follow the structure of classical algorithms—Bellman–Ford and Dijkstra for SSSD; Johnson, Floyd–Warshall, and transitive closure for APSD. As the computational platforms offered by SMC protocol sets have performance profiles that differ from typical processors, we had to perform extensive changes to the structure (including their control flow and memory accesses) and the details of these algorithms in order to obtain good performance. We implemented our protocols on top of the secret sharing based protocol set offered by the Sharemind SMC platform, using single-instruction-multiple-data (SIMD) operations as much as possible to reduce the round complexity. We benchmarked our protocols under several different parameters for network performance and compared our performance figures against each other and with ones reported previously.

Get full-text (via PubEx)

Not in Name Alone: A Memristive Memory Processing Unit for Real In-Memory Processing

10.36227/techrxiv.12894941.v1 ◽

2020 ◽

Author(s):

Ameer Haj-Ali ◽

Nimrod Wald ◽

Ronny Ronen ◽

Shahar Kvatinsky ◽

Rotem Ben-Hur

Keyword(s):

Data Transfer ◽

Single Instruction Multiple Data ◽

Processing Unit ◽

Von Neumann ◽

Data Movement ◽

Memory Processing ◽

Data Intensive ◽

Root Cause ◽

Multiple Data ◽

Data Intensive Applications

<div>Data movement between processing and memory is</div><div>the root cause of the limited performance and energy</div><div>efficiency in modern von Neumann systems. To</div><div>overcome the data-movement bottleneck, we present</div><div>the memristive Memory Processing Unit (mMPU)—a</div><div>real processing-in-memory system in which the computation is done directly in the</div><div>memory cells, thus eliminating the necessity for data transfer. Furthermore, with its</div><div>enormous inner parallelism, this system is ideal for data-intensive applications that are</div><div>based on single instruction, multiple data (SIMD)—providing high throughput and</div><div>energy-efficiency.</div>

Get full-text (via PubEx)

Not in Name Alone: A Memristive Memory Processing Unit for Real In-Memory Processing

10.36227/techrxiv.12894941 ◽

2020 ◽

Author(s):

Ameer Haj-Ali ◽

Nimrod Wald ◽

Ronny Ronen ◽

Shahar Kvatinsky ◽

Rotem Ben-Hur

Keyword(s):

Data Transfer ◽

Single Instruction Multiple Data ◽

Processing Unit ◽

Von Neumann ◽

Data Movement ◽

Memory Processing ◽

Data Intensive ◽

Root Cause ◽

Multiple Data ◽

Data Intensive Applications

<div>Data movement between processing and memory is</div><div>the root cause of the limited performance and energy</div><div>efficiency in modern von Neumann systems. To</div><div>overcome the data-movement bottleneck, we present</div><div>the memristive Memory Processing Unit (mMPU)—a</div><div>real processing-in-memory system in which the computation is done directly in the</div><div>memory cells, thus eliminating the necessity for data transfer. Furthermore, with its</div><div>enormous inner parallelism, this system is ideal for data-intensive applications that are</div><div>based on single instruction, multiple data (SIMD)—providing high throughput and</div><div>energy-efficiency.</div>

Get full-text (via PubEx)

abPOA: an SIMD-based C library for fast partial order alignment using adaptive band

10.1101/2020.05.07.083196 ◽

2020 ◽

Author(s):

Yan Gao ◽

Yongzhuang Liu ◽

Yanmei Ma ◽

Bo Liu ◽

Yadong Wang ◽

...

Keyword(s):

Error Correction ◽

Partial Order ◽

Directed Acyclic Graph ◽

State Of The Art ◽

Single Instruction Multiple Data ◽

Multiple Sequence ◽

Software Interface ◽

Multiple Data ◽

Long Read ◽

Read Error Correction

AbstractSummaryPartial order alignment, which aligns a sequence to a directed acyclic graph, is now frequently used as a key component in long-read error correction and assembly. We present abPOA (adaptive banded Partial Order Alignment), a Single Instruction Multiple Data (SIMD) based C library for fast partial order alignment using adaptive banded dynamic programming. It can work as a stand-alone multiple sequence alignment and consensus calling tool or be easily integrated into any long-read error correction and assembly workflow. Compared to a state-of-the-art tool (SPOA), abPOA is up to 15 times faster with a comparable alignment accuracy.Availability and implementationabPOA is implemented in C. A stand-alone tool and a C/Python software interface are freely available at https://github.com/yangao07/[email protected] or [email protected]

Get full-text (via PubEx)

Fast global motion estimation on single instruction multiple data processors for real-time devices

Optical Engineering ◽

10.1117/1.oe.58.11.113105 ◽

2019 ◽

Vol 58 (11) ◽

pp. 1

Author(s):

Yun Gu Lee

Keyword(s):

Motion Estimation ◽

Real Time ◽

Single Instruction Multiple Data ◽

Global Motion ◽

Global Motion Estimation ◽

Multiple Data

Get full-text (via PubEx)

PNS302 IMPROVING THE PERFORMANCE OF PATIENT-LEVEL SIMULATION MODELS USING MULTI-THREADING AND SINGLE INSTRUCTION MULTIPLE DATA (SIMD) OPERATIONS

Value in Health ◽

10.1016/j.jval.2019.09.2202 ◽

2019 ◽

Vol 22 ◽

pp. S814

Author(s):

R.F. Pollock

Keyword(s):

Simulation Models ◽

Single Instruction Multiple Data ◽

Patient Level ◽

Multiple Data

Get full-text (via PubEx)

Design of Programmable Pixel Shader Computing Unit

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.i3049.0789s319 ◽

2019 ◽

Vol 8 (9S3) ◽

pp. 269-272

Keyword(s):

Mobile Devices ◽

Single Instruction Multiple Data ◽

Verilog Hdl ◽

Visual Effects ◽

Computing Unit ◽

Major Constraint ◽

Multiple Data ◽

Unit Design ◽

Computational Resources

With increased focus on improved Graphics, the gaming industries for computer and mobile devices requires more complex graphical algorithms, which in need of better computational resources. In general graphical computation requires more computational units, which will be major constraint in mobile devices as it demand for more power. In order to develop such efficient computational units, one need to focus on the Graphical pipeline to have better visual effects. In this paper design of programmable Pixel shader computing unit is discussed. In GPU, the Pixel shaders are used to process and manipulate the each pixel. Also the Pixel Shaders are used to provide final coloring to the processed data. Here the programmable Pixel shader computing unit design is discussed, which help the programmer to add his own code for achieving better visual effects or results. The possible Instructions of Pixel shader are designed using Verilog HDL. The SIMD(Single Instruction Multiple Data) concept is adopted for the designing of each instruction.

Get full-text (via PubEx)

SIMD||DNA: Single Instruction, Multiple Data Computation with DNA Strand Displacement Cascades

Lecture Notes in Computer Science - DNA Computing and Molecular Programming ◽

10.1007/978-3-030-26807-7_12 ◽

2019 ◽

pp. 219-235 ◽

Cited By ~ 1

Author(s):

Boya Wang ◽

Cameron Chalk ◽

David Soloveichik

Keyword(s):

Single Instruction Multiple Data ◽

Strand Displacement ◽

Displacement Cascades ◽

Multiple Data ◽

Dna Strand Displacement ◽

Dna Strand

Get full-text (via PubEx)

A Fast, Single-Instruction–Multiple-Data, Scalable Priority Queue

IEEE Transactions on Very Large Scale Integration (VLSI) Systems ◽

10.1109/tvlsi.2018.2838044 ◽

2018 ◽

Vol 26 (10) ◽

pp. 1939-1952 ◽

Cited By ~ 4

Author(s):

Imad Benacer ◽

Francois-Raymond Boyer ◽

Yvon Savaria

Keyword(s):

Priority Queue ◽

Single Instruction Multiple Data ◽

Multiple Data

Get full-text (via PubEx)

single instruction multiple data
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Block aligner: fast and flexible pairwise sequence alignment with SIMD-accelerated adaptive blocks

Parallel Privacy-Preserving Shortest Path Algorithms

Not in Name Alone: A Memristive Memory Processing Unit for Real In-Memory Processing

Not in Name Alone: A Memristive Memory Processing Unit for Real In-Memory Processing

abPOA: an SIMD-based C library for fast partial order alignment using adaptive band

Fast global motion estimation on single instruction multiple data processors for real-time devices

PNS302 IMPROVING THE PERFORMANCE OF PATIENT-LEVEL SIMULATION MODELS USING MULTI-THREADING AND SINGLE INSTRUCTION MULTIPLE DATA (SIMD) OPERATIONS

Design of Programmable Pixel Shader Computing Unit

SIMD||DNA: Single Instruction, Multiple Data Computation with DNA Strand Displacement Cascades

A Fast, Single-Instruction–Multiple-Data, Scalable Priority Queue

Export Citation Format

single instruction multiple dataRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Block aligner: fast and flexible pairwise sequence alignment with SIMD-accelerated adaptive blocks

Parallel Privacy-Preserving Shortest Path Algorithms

Not in Name Alone: A Memristive Memory Processing Unit for Real In-Memory Processing

Not in Name Alone: A Memristive Memory Processing Unit for Real In-Memory Processing

abPOA: an SIMD-based C library for fast partial order alignment using adaptive band

Fast global motion estimation on single instruction multiple data processors for real-time devices

PNS302 IMPROVING THE PERFORMANCE OF PATIENT-LEVEL SIMULATION MODELS USING MULTI-THREADING AND SINGLE INSTRUCTION MULTIPLE DATA (SIMD) OPERATIONS

Design of Programmable Pixel Shader Computing Unit

SIMD||DNA: Single Instruction, Multiple Data Computation with DNA Strand Displacement Cascades

A Fast, Single-Instruction–Multiple-Data, Scalable Priority Queue

single instruction multiple data
Recently Published Documents