Process Images, Create Captioning AI Models

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Process Images, Create Captioning AI Models

Coursera · Intermediate ·👁️ Computer Vision ·1mo ago
Master the essential preprocessing techniques that transform raw visual data into model-ready inputs for computer vision systems. This course empowers you to systematically prepare image data through normalization and color-space conversions, then advance to extracting meaningful motion information from video sequences. You'll apply pixel value normalization, execute color transformations between RGB, grayscale, HSV, and BGR formats, then implement optical flow algorithms and frame differencing to capture temporal dynamics. By completing this course, you'll be able to: • Apply normalization and color-space conversions to preprocess image data • Apply optical flow and frame differencing techniques to extract motion features from video This course is unique because it combines fundamental preprocessing with advanced motion analysis in practical, hands-on implementations. To be successful in this project, you should have a background in Python programming, basic computer vision concepts, and familiarity with NumPy arrays.e.g. This is primarily aimed at first- and second-year undergraduates interested in engineering or science, along with high school students and professionals with an interest in programming.
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Inside SAM 3D: how Meta turns a single image into 3D
Learn how Meta's SAM 3D technology turns a single image into 3D, revolutionizing the field of computer vision
Medium · Machine Learning
Inside SAM 3D: how Meta turns a single image into 3D
Learn how Meta's SAM 3D technology generates 3D models from single images, revolutionizing the field of computer vision
Medium · Deep Learning
Demystifying CNNs: How Convolutional Filters and Max-Pooling Actually Work
Learn how Convolutional Neural Networks (CNNs) use convolutional filters and max-pooling to recognize images
Medium · Data Science
Your "Biometric Age Check" Isn't Verifying Identity — And Defense Lawyers Know It
Biometric age checks don't verify identity, a crucial distinction for developers in computer vision and biometrics
Dev.to AI
Up next
Best Mac Mini Alternatives for Running OpenClaw 24/7 in 2026
Tin Rovic
Watch →