processor architecture Latest Research Papers

Ultra-large-scale molecular docking can improve the accuracy of lead compounds in drug discovery. In this study, we developed a molecular docking piece of software, Vina@QNLM, which can use more than 4,80,000 parallel processes to search for potential lead compounds from hundreds of millions of compounds. We proposed a task scheduling mechanism for large-scale parallelism based on Vinardo and Sunway supercomputer architecture. Then, we readopted the core docking algorithm to incorporate the full advantage of the heterogeneous multicore processor architecture in intensive computing. We successfully expanded it to 10, 465, 065 cores (1,61,001 management process elements and 0, 465, 065 computing process elements), with a strong scalability of 55.92%. To the best of our knowledge, this is the first time that 10 million cores are used for molecular docking on Sunway. The introduction of the heterogeneous multicore processor architecture achieved the best speedup, which is 11x more than that of the management process element of Sunway. The performance of Vina@QNLM was comprehensively evaluated using the CASF-2013 and CASF-2016 protein–ligand benchmarks, and the screening power was the highest out of the 27 pieces of software tested in the CASF-2013 benchmark. In some existing applications, we used Vina@QNLM to dock more than 10 million molecules to nine rigid proteins related to SARS-CoV-2 within 8.5 h on 10 million cores. We also developed a platform for the general public to use the software.

Download Full-text

Processor Architecture Optimization for Spatially Dynamic Neural Networks

10.1109/vlsi-soc53125.2021.9607013 ◽

2021 ◽

Author(s):

Steven Colleman ◽

Thomas Verelst ◽

Linyan Mei ◽

Tinne Tuytelaars ◽

Marian Verhelst

Keyword(s):

Neural Networks ◽

Processor Architecture ◽

Dynamic Neural Networks ◽

Architecture Optimization

Download Full-text

Regular Expression Matching Processor Architecture Supporting Restraint and Nested Repetitive Operations

The Journal of Korean Institute of Communications and Information Sciences ◽

10.7840/kics.2021.46.9.1515 ◽

2021 ◽

Vol 46 (9) ◽

pp. 1515-1520

Author(s):

Byung-suk Seo

Keyword(s):

Regular Expression ◽

Processor Architecture ◽

Regular Expression Matching

Download Full-text

Design and Assertion Based Verification of RISC-V Processor Subsystems

Journal of VLSI Design and Signal Processing ◽

10.46610/jovdsp.2021.v07i03.002 ◽

2021 ◽

Vol 7 (3) ◽

Author(s):

Shruthi . ◽

Jamuna S

Keyword(s):

High Speed ◽

Branch Prediction ◽

Floating Point ◽

Processor Architecture ◽

Pipeline Architecture ◽

Description Language ◽

Instruction Fetch ◽

Hardware Description ◽

Floating Point Unit ◽

Back Stage

RISC-V is an open, free standard architecture. As its open-source architecture, it can be used in multiple applications like embedded processors, IoT, artificial intelligence, machine learning, military and defense applications. The parameters like throughput, performance, high speed etc., become essential in designing processor architecture. Pipelining is one such unique feature supported by RISC-V ISA, which basically involves the execution of multiple instructions in single cycle. This feature helps in improving the performance of the processor architecture. RISC-V ISA supports five stages of pipelining they are instruction fetch, instruction decode, execute, memory and write-back stage. The work covered in this paper involves the design and implementation of the subsystems of the RISC-V ISA which are present in different stages of pipeline architecture. The subsystems included in this work are Floating Point Unit (FPU) of Execute stage, Branch Prediction Unit (BPU) of instruction fetch stage, Forwarding Unit of execution stage, Operand Logic of decode stage and Floating-Point register file of Write-back stage. These subsystems are designed using Verilog Hardware Description Language in Xilinx ISE. Followed by the implementation the verification of the floating-point unit and the forwarding unit is performed using System Verilog Assertions in QuestaSim. The Assertion coverage report for the same is extracted.

Download Full-text

HiPReP: High-Performance Reconfigurable Processor - Architecture and Compiler

10.1109/fpl53798.2021.00074 ◽

2021 ◽

Author(s):

Philipp Kasgen ◽

Mohamed Messelka ◽

Markus Weinhardt

Keyword(s):

High Performance ◽

Processor Architecture ◽

Reconfigurable Processor

Download Full-text

Tensor-Centric Processor Architecture for Applications in Advanced Driver Assistance Systems

2021 International Symposium on VLSI Design, Automation and Test (VLSI-DAT) ◽

10.1109/vlsi-dat52063.2021.9427310 ◽

2021 ◽

Author(s):

Yu-Sheng Lin ◽

Wei-Chao Chen ◽

Trista Pei-Chun Chen

Keyword(s):

Processor Architecture ◽

Driver Assistance ◽

Driver Assistance Systems ◽

Advanced Driver Assistance Systems ◽

Assistance Systems

Download Full-text

Superconductor Processor Architecture

TEION KOGAKU (Journal of Cryogenics and Superconductivity Society of Japan) ◽

10.2221/jcsj.56.87 ◽

2021 ◽

Vol 56 (2) ◽

pp. 87-93

Author(s):

Koji INOUE ◽

Masamitsu TANAKA ◽

Koki ISHIDA

Keyword(s):

Processor Architecture

Download Full-text

Vectorizing posit operations on RISC-V for faster deep neural networks: experiments and comparison with ARM SVE

Neural Computing and Applications ◽

10.1007/s00521-021-05814-0 ◽

2021 ◽

Author(s):

Marco Cococcioni ◽

Federico Rossi ◽

Emanuele Ruffaldi ◽

Sergio Saponara

Keyword(s):

Neural Networks ◽

Open Source ◽

Deep Neural Networks ◽

Number System ◽

Information Representation ◽

Processor Architecture ◽

Speed Up ◽

Catch Up ◽

Logic Operations ◽

Matrix Vector

AbstractWith the arrival of the open-source RISC-V processor architecture, there is the chance to rethink Deep Neural Networks (DNNs) and information representation and processing. In this work, we will exploit the following ideas: i) reduce the number of bits needed to represent the weights of the DNNs using our recent findings and implementation of the posit number system, ii) exploit RISC-V vectorization as much as possible to speed up the format encoding/decoding, the evaluation of activations functions (using only arithmetic and logic operations, exploiting approximated formulas) and the computation of core DNNs matrix-vector operations. The comparison with the well-established architecture ARM Scalable Vector Extension is natural and challenging due to its closedness and mature nature. The results show how it is possible to vectorize posit operations on RISC-V, gaining a substantial speed-up on all the operations involved. Furthermore, the experimental outcomes highlight how the new architecture can catch up, in terms of performance, with the more mature ARM architecture. Towards this end, the present study is important because it anticipates the results that we expect to achieve when we will have an open RISC-V hardware co-processor capable to operate natively with posits.

Download Full-text

Realization of advantages of Russian processor architecture KOMDIV64 in technical means of complexes of civil purpose automation

Issues of radio electronics ◽

10.21778/2218-5453-2020-12-6-16 ◽

2021 ◽

pp. 6-16

Author(s):

D. A. Dolgov ◽

K. S. Nozdrin

Keyword(s):

Life Cycle ◽

Influencing Factors ◽

High Performance ◽

Processor Architecture ◽

Technical Parameters ◽

Automation Systems ◽

Cluster Type ◽

Technical Solutions ◽

Life Cycle Failure ◽

Switching Equipment

The paper discusses the practical issues of creation cluster-type computing equipment and switching equipment, which are based on Russian technologies and components developed for the segment of high-performance teraflop-class servers. A number of technical solutions are proposed, aimed at forming in the shortest possible time with minimal costs using a limited set of components (clusters), a model range of cluster-type computing equipment. The latter should ensure the creation of technical means of automation systems that have a performance parity with technical means of foreign production, as well as surpass it in a number of important operational and technical parameters, including the duration of the products and technical means life cycle, failure stability and external influencing factors durability.

Download Full-text

Real-time range gate control of a satellite laser ranging system based the on heterogeneous processor architecture

Applied Optics ◽

10.1364/ao.408434 ◽

2021 ◽

Vol 60 (2) ◽

pp. 296

Author(s):

Wenbo Yang ◽

Yan Zhao ◽

Cunbo Fan ◽

Zhe Kang ◽

Peiyu Liu

Keyword(s):

Real Time ◽

Satellite Laser Ranging ◽

Time Range ◽

Laser Ranging ◽

Processor Architecture ◽

Gate Control ◽

Heterogeneous Processor

Download Full-text

processor architecture
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Redesigning Vina@QNLM for Ultra-Large-Scale Molecular Docking and Screening on a Sunway Supercomputer

Processor Architecture Optimization for Spatially Dynamic Neural Networks

Regular Expression Matching Processor Architecture Supporting Restraint and Nested Repetitive Operations

Design and Assertion Based Verification of RISC-V Processor Subsystems

HiPReP: High-Performance Reconfigurable Processor - Architecture and Compiler

Tensor-Centric Processor Architecture for Applications in Advanced Driver Assistance Systems

Superconductor Processor Architecture

Vectorizing posit operations on RISC-V for faster deep neural networks: experiments and comparison with ARM SVE

Realization of advantages of Russian processor architecture KOMDIV64 in technical means of complexes of civil purpose automation

Real-time range gate control of a satellite laser ranging system based the on heterogeneous processor architecture

Export Citation Format

processor architectureRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Redesigning Vina@QNLM for Ultra-Large-Scale Molecular Docking and Screening on a Sunway Supercomputer

Processor Architecture Optimization for Spatially Dynamic Neural Networks

Regular Expression Matching Processor Architecture Supporting Restraint and Nested Repetitive Operations

Design and Assertion Based Verification of RISC-V Processor Subsystems

HiPReP: High-Performance Reconfigurable Processor - Architecture and Compiler

Tensor-Centric Processor Architecture for Applications in Advanced Driver Assistance Systems

Superconductor Processor Architecture

Vectorizing posit operations on RISC-V for faster deep neural networks: experiments and comparison with ARM SVE

Realization of advantages of Russian processor architecture KOMDIV64 in technical means of complexes of civil purpose automation

Real-time range gate control of a satellite laser ranging system based the on heterogeneous processor architecture

processor architecture
Recently Published Documents