Speed up PyTorch Deep Learning Inference on GPUs using TensorRT – by Hemant Ranvir
TensorRT is a high-speed inference library developed by NVIDIA. It speeds up inference for already-trained deep learning models by applying various optimizations to them. This article gives a simple overview of these optimizations, along with a small demo showing the speed-up achieved. The first part gives an overview listing out the advantages...