GPU Parallelism
Recently Published Documents


TOTAL DOCUMENTS: 19 (five years: 2)
H-INDEX: 5 (five years: 0)

2021
Author(s): Zizheng Guo, Tsung-Wei Huang, Yibo Lin




2019, Vol 496, pp. 326-342
Author(s): Youcef Djenouri, Djamel Djenouri, Asma Belhadi, Philippe Fournier-Viger, Jerry Chun-Wei Lin, et al.


2019, Vol 11 (9), pp. 185
Author(s): Ming Gao, Qifeng Xiao, Shaochun Wu, Kun Deng

Named Entity Recognition (NER) on Clinical Electronic Medical Records (CEMR) is a fundamental step in extracting disease knowledge, identifying specific entity terms such as diseases and symptoms. However, state-of-the-art NER methods based on Long Short-Term Memory (LSTM) fail to fully exploit GPU parallelism on massive medical records. Although a novel NER method based on Iterated Dilated CNNs (ID-CNNs) can accelerate network computation, it tends to ignore the word-order feature and the semantic information of the current word. To enhance the performance of ID-CNNs-based models on NER tasks, an attention-based ID-CNNs-CRF model, which combines the word-order feature with local context, is proposed. First, position embedding is used to fuse word-order information. Second, the ID-CNNs architecture extracts global semantic information rapidly, while an attention mechanism attends to the local context. Finally, a CRF layer is applied to obtain the optimal tag sequence. Experiments on two CEMR datasets show that the model outperforms traditional ones, obtaining F1-scores of 94.55% and 91.17% respectively, both better than LSTM-based models.
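To make the dilated-convolution idea behind ID-CNNs concrete, here is a minimal NumPy sketch of a 1D dilated convolution and of how the receptive field grows when dilated layers are stacked; the function names and tensor shapes are illustrative, not taken from the paper:

```python
import numpy as np

def dilated_conv1d(x, w, dilation):
    """1D dilated convolution (no padding) over a sequence of feature vectors.

    x: (seq_len, in_dim) input features, w: (kernel, in_dim, out_dim) weights.
    With dilation d, kernel tap i reads position t + i*d, so one layer
    covers (kernel - 1) * d + 1 input positions.
    """
    k, in_dim, out_dim = w.shape
    span = (k - 1) * dilation + 1          # receptive field of one layer
    out_len = x.shape[0] - span + 1
    y = np.zeros((out_len, out_dim))
    for t in range(out_len):
        for i in range(k):
            y[t] += x[t + i * dilation] @ w[i]
    return y

def receptive_field(kernel, dilations):
    """Receptive field of a stack of dilated conv layers."""
    rf = 1
    for d in dilations:
        rf += (kernel - 1) * d
    return rf
```

With kernel size 3 and dilations 1, 2, 4, the stack already sees 15 positions, which is why iterated dilations reach long context in few layers while every output position is computed independently (and hence in parallel).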



Author(s): Tao Gui, Ruotian Ma, Qi Zhang, Lujun Zhao, Yu-Gang Jiang, et al.

Character-level Chinese named entity recognition (NER) that applies long short-term memory (LSTM) to incorporate lexicons has achieved great success. However, this method fails to fully exploit GPU parallelism, and the candidate lexicon words can conflict. In this work, we propose a faster alternative for Chinese NER: a convolutional neural network (CNN)-based method that incorporates lexicons through a rethinking mechanism. The proposed method can model all the characters and all the potential words that match the sentence in parallel. In addition, the rethinking mechanism can resolve word conflicts by feeding high-level features back to refine the network. Experimental results on four datasets show that the proposed method achieves better performance than both word-level and character-level baseline methods, while running up to 3.21 times faster than state-of-the-art methods.
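The key parallelism claim is that every candidate lexicon span is independent of the others, so all character/word matches can be enumerated (and then scored by the CNN) without sequential state. A small sketch of that span enumeration, with our own illustrative names and an ASCII example standing in for Chinese characters:

```python
def lexicon_matches(sentence, lexicon, max_len=4):
    """All (start, end, word) spans of `sentence` found in `lexicon`.

    Each candidate span is checked independently of the others, so in a
    CNN-based model these matches can be scored in parallel instead of
    being threaded through a recurrent state as in lattice LSTMs.
    """
    matches = []
    n = len(sentence)
    for i in range(n):
        for j in range(i + 1, min(i + max_len, n) + 1):
            word = sentence[i:j]
            if word in lexicon:
                matches.append((i, j, word))
    return matches
```

Note the overlapping spans in the output; deciding between such conflicting matches is exactly what the paper's rethinking (high-level feedback) mechanism addresses.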



Author(s): Hui Chen, Zijia Lin, Guiguang Ding, Jianguang Lou, Yusen Zhang, et al.

The dominant approaches for named entity recognition (NER) mostly adopt complex recurrent neural networks (RNN), e.g., long short-term memory (LSTM). However, RNNs are limited by their recurrent nature in terms of computational efficiency. In contrast, convolutional neural networks (CNN) can fully exploit GPU parallelism with their feedforward architectures. However, little attention has been paid to performing NER with CNNs, mainly owing to their difficulty in capturing long-term context information in a sequence. In this paper, we propose a simple but effective CNN-based network for NER, i.e., the gated relation network (GRN), which is more capable than common CNNs of capturing long-term context. Specifically, in GRN we first employ CNNs to extract the local context features of each word. Then we model the relations between words and use them as gates to fuse local context features into global ones for predicting labels. Without using recurrent layers that process a sentence sequentially, GRN allows computations to be performed in parallel across the entire sentence. Experiments on two benchmark NER datasets (CoNLL2003 and OntoNotes 5.0) show that the proposed GRN can achieve state-of-the-art performance with or without external knowledge, while also enjoying lower training and testing time costs.
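The fusion step described above, pairwise relation scores used as gates to mix local features into global ones, can be sketched with a bilinear relation score and a row-wise softmax. This is a minimal NumPy illustration under our own assumed parameterization, not the paper's exact formulation:

```python
import numpy as np

def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def gated_relation_fusion(local, W_rel):
    """Fuse per-word local features into global ones via pairwise gates.

    local: (n, d) local context features (e.g. CNN outputs per word).
    W_rel: (d, d) bilinear relation weights (a hypothetical choice).
    Relation score r[i, j] = local[i] @ W_rel @ local[j]; a softmax over j
    turns row i into gates that mix every word's features into word i's
    global feature. All n words are handled in one matrix product,
    i.e. in parallel across the whole sentence.
    """
    scores = local @ W_rel @ local.T        # (n, n) pairwise relations
    gates = softmax(scores, axis=1)         # each row sums to 1
    return gates @ local                    # (n, d) global features
```

Because no step depends on the previous word's output, the whole sentence is processed in a few dense matrix operations, which is the source of the GPU-friendliness the abstract contrasts with RNNs.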



Author(s): Xiao Song, Yan Xu, Gary Tan, Fuwang Zhao


2018
Author(s): Nasir Ahmad, James B. Isbister, Toby St. Clere Smithe, Simon M. Stringer

Spiking Neural Network (SNN) simulations require internal variables, such as the membrane voltages of individual neurons and their synaptic inputs, to be updated at sub-millisecond resolution. As a result, a single second of simulation time requires many thousands of update calculations per neuron. Furthermore, increases in the scale of SNN models have led to corresponding manyfold increases in the runtime of SNN simulations. Existing solutions to this problem of scale include high-performance CPU-based simulators capable of multithreaded execution ("CPU parallelism"). More recently, GPU-based simulators have emerged, which aim to utilise GPU parallelism for SNN execution. We have identified several key speedups that give GPU-based simulators up to an order of magnitude performance increase over CPU-based simulators on several benchmarks. We present the Spike simulator with three key optimisations: timestep grouping, active synapse grouping, and delay insensitivity. Combined, these optimisations massively increase the speed of executing an SNN simulation and produce a simulator which is, on a single machine, faster than currently available simulators.
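To illustrate the per-timestep update workload the abstract describes, here is a minimal vectorised leaky integrate-and-fire (LIF) loop in NumPy. It is a generic textbook LIF sketch with assumed parameter names, not code from the Spike simulator; the point is that every neuron in the population is updated by one array operation per timestep, the data parallelism a GPU simulator exploits:

```python
import numpy as np

def simulate_lif(I, dt=1e-4, tau=0.02, v_thresh=1.0, v_reset=0.0):
    """Vectorised leaky integrate-and-fire update for a neuron population.

    I: (steps, n) input current per timestep and neuron; dt is the
    sub-millisecond timestep (here 0.1 ms) and tau the membrane time
    constant. Returns a boolean spike raster of shape (steps, n).
    """
    steps, n = I.shape
    v = np.zeros(n)                        # membrane voltages
    spikes = np.zeros((steps, n), dtype=bool)
    for t in range(steps):
        v += dt / tau * (-v + I[t])        # leaky integration, all neurons at once
        fired = v >= v_thresh
        spikes[t] = fired
        v[fired] = v_reset                 # reset neurons that spiked
    return spikes
```

With a 0.1 ms timestep, one simulated second is 10,000 such updates per neuron, which is why optimisations like timestep grouping (batching several timesteps within the minimum synaptic delay) pay off so heavily at scale.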


