Quantization of Deep Neural Network Models Considering Per-Layer Computation Complexity for Efficient Execution in Multi-Precision Accelerators
Keyword(s):
2020 ◽
Vol 1662
◽
pp. 012010
2020 ◽
Vol 35
(5)
◽
pp. 999-1015