Optimize TensorFlow Models For Deployment with TensorRT

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Optimize TensorFlow Models For Deployment with TensorRT

Coursera · Beginner ·📐 ML Fundamentals ·1mo ago
This is a hands-on, guided project on optimizing your TensorFlow models for inference with NVIDIA's TensorRT. By the end of this 1.5 hour long project, you will be able to optimize Tensorflow models using the TensorFlow integration of NVIDIA's TensorRT (TF-TRT), use TF-TRT to optimize several deep learning models at FP32, FP16, and INT8 precision, and observe how tuning TF-TRT parameters affects performance and inference throughput. Prerequisites: In order to successfully complete this project, you should be competent in Python programming, understand deep learning and what inference is, and have experience building deep learning models in TensorFlow and its Keras API. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

7 Common Java Streams Mistakes and How to Avoid Them
Learn to avoid common Java Streams mistakes and improve your coding skills
Medium · Programming
Implementing an Item-Based Recommendation System from Scratch in Python
Learn to implement an item-based recommendation system from scratch in Python for personalized suggestions
Medium · Machine Learning
Implementing an Item-Based Recommendation System from Scratch in Python
Learn to build an item-based recommendation system from scratch in Python for personalized user experiences
Medium · Data Science
The Threshold Is a Business Decision, Not a Statistical One
Learn how to build a production-grade fraud detection system and why statistical thresholds are business decisions, not just statistical ones
Medium · Machine Learning
Up next
Capstone Assignment
Coursera
Watch →