block circulant matrix
Recently Published Documents


TOTAL DOCUMENTS

9
(FIVE YEARS 2)

H-INDEX

2
(FIVE YEARS 1)

Electronics ◽  
2019 ◽  
Vol 8 (1) ◽  
pp. 78 ◽  
Author(s):  
Zidi Qin ◽  
Di Zhu ◽  
Xingwei Zhu ◽  
Xuan Chen ◽  
Yinghuan Shi ◽  
...  

As a key ingredient of deep neural networks (DNNs), fully-connected (FC) layers are widely used in various artificial intelligence applications. However, there are many parameters in FC layers, so the efficient process of FC layers is restricted by memory bandwidth. In this paper, we propose a compression approach combining block-circulant matrix-based weight representation and power-of-two quantization. Applying block-circulant matrices in FC layers can reduce the storage complexity from O ( k 2 ) to O ( k ) . By quantizing the weights into integer powers of two, the multiplications in the reference can be replaced by shift and add operations. The memory usages of models for MNIST, CIFAR-10 and ImageNet can be compressed by 171 × , 2731 × and 128 × with minimal accuracy loss, respectively. A configurable parallel hardware architecture is then proposed for processing the compressed FC layers efficiently. Without multipliers, a block matrix-vector multiplication module (B-MV) is used as the computing kernel. The architecture is flexible to support FC layers of various compression ratios with small footprint. Simultaneously, the memory access can be significantly reduced by using the configurable architecture. Measurement results show that the accelerator has a processing power of 409.6 GOPS, and achieves 5.3 TOPS/W energy efficiency at 800 MHz.


Author(s):  
Eloy Romero ◽  
Andrés Tomás ◽  
Antonio Soriano ◽  
Ignacio Blanquer

Geophysics ◽  
1992 ◽  
Vol 57 (7) ◽  
pp. 933-943 ◽  
Author(s):  
Christof Stork

The symmetries of a block circulant matrix significantly reduce the computational expense of the singular value decomposition (SVD) of the variable velocity inverse problem for a generic reflection seismology model. As a result, the decomposition does not suffer from edge effects or parameterization artifacts that are associated with small model spaces. Using this approach, we study the eigenvector and eigenvalue characteristics for a generic model of a size as large as is used with a variety of iterative inversion techniques (>100 000 parameters). Singular value decomposition of the raypath inverse problem of a discretized generic seismic model having one reflector indicates that the eigenvalue distribution for the inverse problem is nonuniform, with a large concentration near 0 and a gap near 0.4. All but the long horizontal wavelength reflector‐depth variations cannot be uniquely resolved from velocity variations. Lateral velocity variations serve to significantly reduce the ability of seismic data to resolve reflector depth for most of the horizontal wavelength components shorter than twice the cable length. As a result, automatic velocity analysis methods may not be able to resolve reflector variations when the velocity field is allowed to take on an arbitrary structure.


1983 ◽  
Vol 31 (5) ◽  
pp. 808-810 ◽  
Author(s):  
T. De Mazancourt ◽  
D. Gerlic

Sign in / Sign up

Export Citation Format

Share Document