1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs
Keyword(s):
2019 ◽ Vol 7 (4) ◽ pp. 360-363