C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
Keyword(s):
We report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conducting inference in C++ production environment, discuss the technical details and examples of the generated code, and demonstrates its development status with a preliminary benchmark against popular tools.
2020 ◽
2020 ◽
2019 ◽
2020 ◽
Keyword(s):
2020 ◽
Vol 16
(3)
◽
pp. 199-205
Keyword(s):