Optimize TensorFlow Models For Deployment with TensorRT

Coursera Course · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Optimize TensorFlow Models For Deployment with TensorRT

Coursera · Beginner ·📐 ML Fundamentals ·2h ago
This is a hands-on, guided project on optimizing your TensorFlow models for inference with NVIDIA's TensorRT. By the end of this 1.5 hour long project, you will be able to optimize Tensorflow models using the TensorFlow integration of NVIDIA's TensorRT (TF-TRT), use TF-TRT to optimize several deep learning models at FP32, FP16, and INT8 precision, and observe how tuning TF-TRT parameters affects performance and inference throughput. Prerequisites: In order to successfully complete this project, you should be competent in Python programming, understand deep learning and what inference is, and…
Watch on Coursera ↗ (saves to browser)
The NEW wave of engineering 🤔
Next Up
The NEW wave of engineering 🤔
Sajjaad Khader