forward computation Latest Research Papers

Scalability of distributed deep learning (DL) training with parameter server architecture is often communication constrained in large clusters. There are recent efforts that use a layer by layer strategy to overlap gradient communication with backward computation so as to reduce the impact of communication constraint on the scalability. However, the approaches cannot be effectively applied to the overlap between parameter communication and forward computation. In this paper, we propose and design iBatch, a novel communication approach that batches parameter communication and forward computation to overlap them with each other. We formulate the batching decision as an optimization problem and solve it based on greedy algorithm to derive communication and computation batches. We implement iBatch in the open-source DL framework BigDL and perform evaluations with various DL workloads. Experimental results show that iBatch improves the scalability of a cluster of 72 nodes by up to 73% over the default PS and 41% over the layer by layer strategy.

Download Full-text

Optimizing Forward Computation in Adjoint Method via Multi-level Blocking

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region - HPC Asia 2018 ◽

10.1145/3149457.3149458 ◽

2018 ◽

Author(s):

Tomoya Ikeda ◽

Shin-ichi Ito ◽

Hiromichi Nagao ◽

Takahiro Katagiri ◽

Toru Nagai ◽

...

Keyword(s):

Adjoint Method ◽

Forward Computation ◽

Multi Level

Download Full-text

Privacy-Preserving HMM Forward Computation

Proceedings of the Seventh ACM on Conference on Data and Application Security and Privacy - CODASPY '17 ◽

10.1145/3029806.3029816 ◽

2017 ◽

Cited By ~ 2

Author(s):

Jan Henrik Ziegeldorf ◽

Jan Metzke ◽

Jan Rüth ◽

Martin Henze ◽

Klaus Wehrle

Keyword(s):

Privacy Preserving ◽

Forward Computation

Download Full-text

The forward computation and inversion of magnetotelluric fields in two-dimensional nonisotropic medium

10.1190/gem2015-119 ◽

2015 ◽

Author(s):

Miaoxin Yang ◽

Handong Tan ◽

Xiaohong Meng ◽

Changhong Lin

Keyword(s):

Two Dimensional ◽

Forward Computation ◽

And Inversion

Download Full-text

Concrete pavement forward computation of GPR with FDTD

2011 International Conference on Remote Sensing, Environment and Transportation Engineering ◽

10.1109/rsete.2011.5964951 ◽

2011 ◽

Cited By ~ 1

Author(s):

Sheng Zhang ◽

Juncai Xu

Keyword(s):

Concrete Pavement ◽

Forward Computation

Download Full-text

Cerebro-cerebellar Interactions Underlying Temporal Information Processing

Journal of Cognitive Neuroscience ◽

10.1162/jocn.2010.21429 ◽

2010 ◽

Vol 22 (12) ◽

pp. 2913-2925 ◽

Cited By ~ 43

Author(s):

Kenji Aso ◽

Takashi Hanakawa ◽

Toshihiko Aso ◽

Hidenao Fukuyama

Keyword(s):

Information Processing ◽

Discrimination Task ◽

Temporal Information ◽

Generation Task ◽

Feed Forward ◽

Time Processing ◽

Temporal Information Processing ◽

Time Discrimination ◽

Forward Computation ◽

The Right

The neural basis of temporal information processing remains unclear, but it is proposed that the cerebellum plays an important role through its internal clock or feed-forward computation functions. In this study, fMRI was used to investigate the brain networks engaged in perceptual and motor aspects of subsecond temporal processing without accompanying coprocessing of spatial information. Direct comparison between perceptual and motor aspects of time processing was made with a categorical-design analysis. The right lateral cerebellum (lobule VI) was active during a time discrimination task, whereas the left cerebellar lobule VI was activated during a timed movement generation task. These findings were consistent with the idea that the cerebellum contributed to subsecond time processing in both perceptual and motor aspects. The feed-forward computational theory of the cerebellum predicted increased cerebro-cerebellar interactions during time information processing. In fact, a psychophysiological interaction analysis identified the supplementary motor and dorsal premotor areas, which had a significant functional connectivity with the right cerebellar region during a time discrimination task and with the left lateral cerebellum during a timed movement generation task. The involvement of cerebro-cerebellar interactions may provide supportive evidence that temporal information processing relies on the simulation of timing information through feed-forward computation in the cerebellum.

Download Full-text

A Improved Algorithm for Forward Computation of Dynamic Program Slice

2009 WRI World Congress on Software Engineering ◽

10.1109/wcse.2009.247 ◽

2009 ◽

Cited By ~ 2

Author(s):

Ma Jianhong ◽

Xu Min ◽

Wu Hongtao ◽

Cui Xiangling

Keyword(s):

Dynamic Program ◽

Program Slice ◽

Forward Computation ◽

Improved Algorithm

Download Full-text

Exploiting the empirical characteristics of program dependences for improved forward computation of dynamic slices

Empirical Software Engineering ◽

10.1007/s10664-008-9071-y ◽

2008 ◽

Vol 13 (4) ◽

pp. 369-399 ◽

Cited By ~ 16

Author(s):

Wes Masri

Keyword(s):

Forward Computation

Download Full-text

Fluorescent molecular tomographic image reconstruction based on parallel forward computation

10.1117/12.743149 ◽

2007 ◽

Author(s):

Wei Zou ◽

Jiajun Wang ◽

David Dagan Feng

Keyword(s):

Image Reconstruction ◽

Tomographic Image ◽

Tomographic Image Reconstruction ◽

Forward Computation

Download Full-text

Reverse time migration with optimal checkpointing

Geophysics ◽

10.1190/1.2742686 ◽

2007 ◽

Vol 72 (5) ◽

pp. SM213-SM221 ◽

Cited By ~ 226

Author(s):

William W. Symes

Keyword(s):

Wave Equation ◽

Reverse Order ◽

Reverse Time ◽

Reverse Time Migration ◽

Time Migration ◽

Adjoint State ◽

Forward Computation ◽

State Method ◽

Adjoint State Method

Reverse time migration (RTM) requires that fields computed in forward time be accessed in reverse order. Such out-of-order access, to recursively computed fields, requires that some part of the recursion history be stored (checkpointed), with the remainder computed by repeating parts of the forward computation. Optimal checkpointing algorithms choose checkpoints in such a way that the total storage is minimized for a prescribed level of excess computation, or vice versa. Optimal checkpointing dramatically reduces the storage required by RTM, compared to that needed for nonoptimal implementations, at the price of a small increase in computation. This paper describes optimal checkpointing in a form which applies both to RTM and other applications of the adjoint state method, such as construction of velocity updates from prestack wave equation migration.

Download Full-text

forward computation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Scalable Distributed DL Training: Batching Communication and Computation

Optimizing Forward Computation in Adjoint Method via Multi-level Blocking

Privacy-Preserving HMM Forward Computation

The forward computation and inversion of magnetotelluric fields in two-dimensional nonisotropic medium

Concrete pavement forward computation of GPR with FDTD

Cerebro-cerebellar Interactions Underlying Temporal Information Processing

A Improved Algorithm for Forward Computation of Dynamic Program Slice

Exploiting the empirical characteristics of program dependences for improved forward computation of dynamic slices

Fluorescent molecular tomographic image reconstruction based on parallel forward computation

Reverse time migration with optimal checkpointing

Export Citation Format

forward computationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Scalable Distributed DL Training: Batching Communication and Computation

Optimizing Forward Computation in Adjoint Method via Multi-level Blocking

Privacy-Preserving HMM Forward Computation

The forward computation and inversion of magnetotelluric fields in two-dimensional nonisotropic medium

Concrete pavement forward computation of GPR with FDTD

Cerebro-cerebellar Interactions Underlying Temporal Information Processing

A Improved Algorithm for Forward Computation of Dynamic Program Slice

Exploiting the empirical characteristics of program dependences for improved forward computation of dynamic slices

Fluorescent molecular tomographic image reconstruction based on parallel forward computation

Reverse time migration with optimal checkpointing

forward computation
Recently Published Documents