scholarly journals Research on Parallel Deep Learning for Heterogeneous Computing Architecture

2020 ◽  
Vol 18 (2) ◽  
pp. 177-179
Author(s):  
Kaijian Xia ◽  
Tao Hu ◽  
Wen Si
2020 ◽  
Vol 245 ◽  
pp. 09014
Author(s):  
Chao Jiang ◽  
David Ojika ◽  
Sofia Vallecorsa ◽  
Thorsten Kurth ◽  
Prabhat ◽  
...  

AI and deep learning are experiencing explosive growth in almost every domain involving analysis of big data. Deep learning using Deep Neural Networks (DNNs) has shown great promise for such scientific data analysis applications. However, traditional CPU-based sequential computing without special instructions can no longer meet the requirements of mission-critical applications, which are compute-intensive and require low latency and high throughput. Heterogeneous computing (HGC), with CPUs integrated with GPUs, FPGAs, and other science-targeted accelerators, offers unique capabilities to accelerate DNNs. Collaborating researchers at SHREC1at the University of Florida, CERN Openlab, NERSC2at Lawrence Berkeley National Lab, Dell EMC, and Intel are studying the application of heterogeneous computing (HGC) to scientific problems using DNN models. This paper focuses on the use of FPGAs to accelerate the inferencing stage of the HGC workflow. We present case studies and results in inferencing state-of-the-art DNN models for scientific data analysis, using Intel distribution of OpenVINO, running on an Intel Programmable Acceleration Card (PAC) equipped with an Arria 10 GX FPGA. Using the Intel Deep Learning Acceleration (DLA) development suite to optimize existing FPGA primitives and develop new ones, we were able accelerate the scientific DNN models under study with a speedup from 2.46x to 9.59x for a single Arria 10 FPGA against a single core (single thread) of a server-class Skylake CPU.


Author(s):  
Jiaqi Song ◽  
Jing Li ◽  
Di Wu ◽  
Guangye Li ◽  
Jiaxin Zhang ◽  
...  

Power line corridor inspection plays a vital role in power system safe operation, traditional human inspection’s low efficiency makes the novel inspection method requiring high precision and high efficiency. Combined with the current deep learning target detection algorithm based on high accuracy and strong real-time performance, this paper proposes a YOLOV4-Tiny based drone real-time power line inspection method. The 5G and edge computing technology are combined properly forming a complete edge computing architecture. The UAV is treated as an edge device with a YOLOV4-Tiny deep- learning-based object detection model and AI chip on board. Extensive experiments on real data demonstrate the 5G and Edge computing architecture could satisfy the demands of real-time power inspection, and the intelligence of the whole inspection improved significantly.


2014 ◽  
Vol 15 (1) ◽  
pp. 216 ◽  
Author(s):  
Davor Sluga ◽  
Tomaz Curk ◽  
Blaz Zupan ◽  
Uros Lotric

IEEE Micro ◽  
2020 ◽  
pp. 1-1
Author(s):  
Kenji Tanaka ◽  
Yuki Arikawa ◽  
Tsiyoshi Ito ◽  
Kazutaka Morita ◽  
Naru Nemoto ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document