The Computer Vision Process: Fireside

Roboflow · Beginner ·📐 ML Fundamentals ·5y ago

Skills: ML Pipelines80%Supervised Learning70%ML Maths Basics60%

Key Takeaways

The video discusses the computer vision process, covering key steps such as defining the problem, collecting images, labeling data, pre-processing, training a model, and deploying it, with a focus on the importance of data quality and representation, and introduces tools like Roboflow and Rebelflow for streamlining the process.

Full Transcript

hey everyone thanks for joining us again today we're here to talk about the computer vision pipeline and some of the steps that are included to take uh a batch of images and result in a trained model um so joseph go ahead and start us off like what what the first step might be yeah i mean so when we talk about the computer vision pipeline uh you got to think about each of the component parts um you have to think about first and foremost what problem you want to solve which is pretty easy to skip that step and just think about oh i have a bunch of images or a bunch of videos but what problem specifically is it you want to solve do you want to monitor the presence of something do you want to look for package theft do you want to count the presence of something like counting the number of cells that ran an experiment or identifying weeds from like aerial imagery so you can more targeted deploy herbicides and things like this um but think very specifically not necessarily narrowly but really specifically in defining your problem and then from there i mean the steps that you follow are pretty common collecting your images getting them identifying which ones should be sent for labeling um labeling your data uh pre-processing your images so standard things that you need to do like resizing or maybe increasing contrast augmenting your data does it increase your data size representation and improve your model's ability to generalize train a model which also involves a little bit of model selection and then check out the inference and see how it does in a given test set deploy that model and then continue to monitor its its performance over time and especially key at that step is continuing to collect image or video data from your inference conditions so that you can continue the process all over again yeah that that seems like a pretty good summary i i definitely think it's you know each one of those steps is uh kind of complex and there's there's a lot involved so maybe um now that we've kind of summarized the whole pipeline maybe we can dive in a little bit into each component part um which do you think is the hardest like if you you're like you're going about building your model which step do you think causes the biggest hurdle yeah so so that's actually a good question and um i i think you know anyone who goes through this process is going to feel like one of those steps is harder for them than you know this is just my unique point of view but i would say the hardest part for me is actually the deployment of a model so that's the step after after you've determined you know through tests and through through different metrics that the model is actually going to be good enough to do what you want it to do but then you actually need to bring it onto a device or onto a server to be able to make those inferences continuously so that's the hardest part from my point of view but what do you think i know but that necessarily that it's the hardest but i think like the most overlooked or at least like the biggest area for improvement and growth in vision and probably machine learning more generally is that computer vision is you know it's only so much about actually the code that you write or only so much about the model that you select and it's all about the data that you use and it's kind of remarkable right when you're debugging model performance understanding why you're getting the inferences you are or the quality performance that you are is entirely about the data that you're training on and so it's amazing that we have such little emphasis on i think even today in our tooling and our understanding of what data are we collecting what data are we sending for labeling do we have balanced classes do we have augmentations that represent the inference conditions uh and basically making this be an ongoing iterative improved process so one of my biggest tips as a result of that is you know you're never going to have a perfect model that's you know it's a trope at this point but people don't internalize that that means that like usually the highest leverage thing you can do is get an initial model to production that solves one part of the problem doesn't have to be perfect but identifies one class that you ultimately want to do many many more classes and then make it really easy to collect additional data and create safeguards in failsafes i should say for when the model doesn't produce an inference that you want if you do those things get a model quickly to production and put in fail-safes for if the model is incorrect and collect additional data your performance is going to cascade and improve on a much faster iterative basis than thinking that you need to have every class or perfect model or a thousand inferences a second or something like this definitely the scoping in the very beginning is extremely important to be able to actually get these things to work and actually deploy them uh all the way through to production so i definitely i definitely agree with that yeah what have been some of the like i don't know maybe unintuitive or tips and tricks or hacks that you've picked up along the way as you've taken a problem and then broken it down into these steps like what are things to watch out for what's like a quick tip that your fifth time self would have told your first time self or your 100th self would tell your first time self well i mean certainly the thing you've already been talking about um with uh narrowing the scope to something that is definitely going to be achievable but i think uh another just sort of general fact about the whole pipeline that i think often ends up being the case is people will spend a lot of time gathering a data set that they believe is entirely representative of the problem that they're going to tackle they do a lot of work to improve their model on this data set choosing models making data augmentations and going through that part of the pipeline um that you were that you were discussing to to get to a really good model but then they take it out and put it into reality and realize that uh the reality of the deployment situation wasn't exactly what they had um when they initially gathered the data set so being very careful and very real with the end state that you wanted to be in i think is very important because otherwise you'll spend a lot of work optimizing something that isn't exactly uh what you wanted in the beginning um so you can use other data sets to kind of get quick uh you know like some quick progress and start to make a model out of things but um there's no no sacrifice for directly collecting images from the exact state um that you want to be ending up in yeah makes a lot of sense makes a lot of sense um why robofall why does roboflow make this process easier or wow that tool yeah i mean so i think one of the i mean so we've we've kind of been talking about these different pieces of this pipeline in uh abstraction you know but at the end of the day each one of these pieces has a variety of different implementations and there's a variety of different uh services that are out there to be able to handle each one of these things for you um the best part about rebelflow is that it kind of sits there in the middle and you're able to actually integrate all these pieces together so you can move quickly throughout the process and experiment with different pieces without having to go back and rebuild everything and so that's that's primarily through data integration um but then beyond that you can kind of use roboflow as a source of record as you're moving through the process to know that okay once i got from here to here i was at this state and here's what my data set looked like and as joseph was elaborating on it's really all about the data set and to use that as kind of a central locus as you're you're working through the process yeah yeah i mean one thing we really care a lot about is interoperability and allowing the best tool to be used at each component part of the process so collect your images from any given security camera raspberry pi video camera whatever it is label them wherever you would want to label them reviews outsource labeling bring them in inspect understand try out a bunch of different image and model formats convert to various places um but yeah we think a lot about how do we accelerate people's workflows and really be the tool to empower a whole generation of applications and developers that are going to build and are building our future of understanding the real world around us um and so when it comes to computer vision pipelines and machine learning pipelines more generally i think the the key takeaways are really get something working make it easy to improve and continue to focus on your data um you follow those those key quick tips and you can pretty quickly start have industry leading applications it's a wide open field well that was a lot in one one fireside chat would you say yeah yeah thank you thank you all for joining don't forget to like this video subscribe for additional verbal full content and as always happy model building good luck see you in the comments

Original Description

Refining the machine learning pipeline to create computer vision models is paramount to iterating on your computer vision model. We discuss the computer vision process fireside in this video and how to make it through each piece of the process, and then to return to the loop, improving your model. ✅ Subscribe: https://bit.ly/rf-yt-sub

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Roboflow · Roboflow · 53 of 60

← Previous Next →

YOLOv3 PyTorch Notebook Tutorial

YOLOv3 PyTorch Notebook Tutorial

How to Train YOLOv4 on a Custom Dataset (PyTorch)

How to Train YOLOv4 on a Custom Dataset (PyTorch)

How to Train YOLOv5 on a Custom Dataset

How to Train YOLOv5 on a Custom Dataset

How to Use the Roboflow Dataset Health Check

How to Use the Roboflow Dataset Health Check

What is Mean Average Precision (mAP)?

What is Mean Average Precision (mAP)?

How to Use the Roboflow Model Library

How to Use the Roboflow Model Library

How to Train EfficientDet in TensorFlow 2 Object Detection

How to Train EfficientDet in TensorFlow 2 Object Detection

How to Train YOLO v4 Tiny (Darknet) on a Custom Dataset

How to Train YOLO v4 Tiny (Darknet) on a Custom Dataset

Ask the Roboflow Team Anything - Episode 1

Ask the Roboflow Team Anything - Episode 1

Exploring The COCO Dataset

Exploring The COCO Dataset

Community Spotlight: Improving Uno with Computer Vision

Community Spotlight: Improving Uno with Computer Vision

Mosaic Data Augmentation - Deep Dive

Mosaic Data Augmentation - Deep Dive

Hands on with the OAK-1

Hands on with the OAK-1

Glenn Jocher: What is New in YOLO v5?

Glenn Jocher: What is New in YOLO v5?

How to Use Amazon Rekognition Custom Labels and Roboflow to Build an Object Detection Model

How to Use Amazon Rekognition Custom Labels and Roboflow to Build an Object Detection Model

An Interview with Brandon Gilles, Luxonis Founder and OAK Chief Architect

An Interview with Brandon Gilles, Luxonis Founder and OAK Chief Architect

How to Train a Custom Mobile Object Detection Model (with YOLOv4 Tiny and TensorFlow Lite)

How to Train a Custom Mobile Object Detection Model (with YOLOv4 Tiny and TensorFlow Lite)

Tackling the Small Object Problem in Object Detection

Tackling the Small Object Problem in Object Detection

Fast.ai v2 Released - What's New?

Fast.ai v2 Released - What's New?

Teaser: Roboflow Train (1-Click Computer Vision AutoML)

Teaser: Roboflow Train (1-Click Computer Vision AutoML)

How to Train a Custom Resnet34 Image Classification Model

How to Train a Custom Resnet34 Image Classification Model

How to Label Images for Object Detection with CVAT

How to Label Images for Object Detection with CVAT

Deploy YOLOv5 to Jetson Xavier NX at 30 FPS

Deploy YOLOv5 to Jetson Xavier NX at 30 FPS

Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz

Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz

Getting Started with VoTT - Computer Vision Annotation

Getting Started with VoTT - Computer Vision Annotation

How to Manage Classes in Object Detection (Rename, Combine, Balance)

How to Manage Classes in Object Detection (Rename, Combine, Balance)

How to Train YOLOv4 on a Custom Dataset in Darknet

How to Train YOLOv4 on a Custom Dataset in Darknet

Is Grayscale a Preprocessing or Augmentation Step in Computer Vision?

Is Grayscale a Preprocessing or Augmentation Step in Computer Vision?

Getting Started with Image Data Augmentation

Getting Started with Image Data Augmentation

Glenn Jocher: Image Augmentation in YOLO v5 and Beyond

Glenn Jocher: Image Augmentation in YOLO v5 and Beyond

GA Hosts Roboflow - Healthcare and AI

GA Hosts Roboflow - Healthcare and AI

How do self driving cars know when to stop?

How do self driving cars know when to stop?

What is PASCAL VOC XML?

What is PASCAL VOC XML?

AutoML Showdown: Google vs Amazon vs Microsoft

AutoML Showdown: Google vs Amazon vs Microsoft

How is computer vision changing manufacturing?

How is computer vision changing manufacturing?

The Alphabet in American Sign Language

The Alphabet in American Sign Language

Luxonis OAK-D: Computer Vision on Device

Luxonis OAK-D: Computer Vision on Device

How to Train a Custom Faster R-CNN Model with Facebook AI's Detectron2 | Use Your Own Dataset

How to Train a Custom Faster R-CNN Model with Facebook AI's Detectron2 | Use Your Own Dataset

TensorFlow vs PyTorch: Fireside

TensorFlow vs PyTorch: Fireside

Occlusion Techniques in Computer Vision

Occlusion Techniques in Computer Vision

A Customizable Web Application for Your Computer Vision Model

A Customizable Web Application for Your Computer Vision Model

Model Tradeoffs and the Future of Computer Vision

Model Tradeoffs and the Future of Computer Vision

Designing an Augmented Reality Board Game App

Designing an Augmented Reality Board Game App

YOLOv4 - Advanced Tactics

YOLOv4 - Advanced Tactics

How to Use CreateML and Build a Computer Vision iPhone App | AR Object Detection

How to Use CreateML and Build a Computer Vision iPhone App | AR Object Detection

Fireside Chat: Computer Vision in Agriculture

Fireside Chat: Computer Vision in Agriculture

Scaled-YOLOv4 Tops EfficientDet: Research Rundown

Scaled-YOLOv4 Tops EfficientDet: Research Rundown

What is Image Preprocessing?

What is Image Preprocessing?

Building a Community of Creators with BlkArthouse and Von Deon

Building a Community of Creators with BlkArthouse and Von Deon

How to Train Scaled-YOLOv4 to Detect Custom Objects

How to Train Scaled-YOLOv4 to Detect Custom Objects

Intro to Computer Vision: Fireside

Intro to Computer Vision: Fireside

The Best Way to Annotate Images for Object Detection

The Best Way to Annotate Images for Object Detection

The Computer Vision Process: Fireside

The Computer Vision Process: Fireside

How to Annotate Images with Your Team Using Roboflow

How to Annotate Images with Your Team Using Roboflow

Introducing the Roboflow Object Count Histogram

Introducing the Roboflow Object Count Histogram

How Fast is the M1 at Machine Learning? Benchmarking Apple's M1 and Intel's Chips

How Fast is the M1 at Machine Learning? Benchmarking Apple's M1 and Intel's Chips

CLIP: OpenAI's amazing new zero-shot image classifier

CLIP: OpenAI's amazing new zero-shot image classifier

How I hacked my Nest camera to run custom models

How I hacked my Nest camera to run custom models

Getting Started with the Roboflow Inference API

Getting Started with the Roboflow Inference API

Transfer Learning in Computer Vision | What, How, Why

Transfer Learning in Computer Vision | What, How, Why

This video teaches the computer vision process, from defining the problem to deploying a model, and highlights the importance of data quality and representation, with tools like Roboflow and Rebelflow simplifying the process. It emphasizes the need to get something working, make it easy to improve, and focus on data. By following these steps, developers can create effective computer vision models and improve their performance over time.

Key Takeaways

Define the problem
Collect images
Label data
Pre-process images
Train a model
Deploy a model
Collect additional data
Create safeguards for model incorrectness

💡 Data quality and representation are crucial for model performance, and narrowing the scope to something achievable is key to creating a successful computer vision model.

🔒 Pro feature: Ask AI to explain this lesson →

More on: ML Pipelines

View skill →

Building a Dog Breed Identifier App from scratch - DogNet

Building a Dog Breed Identifier App from scratch - DogNet

Aladdin Persson

Complete Dockers For Data Science Tutorial In One Shot

Complete Dockers For Data Science Tutorial In One Shot

Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus

Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus

Abonia Sojasingarayar

Vertex Pipelines: Qwik Start

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Automate R scripts with GitHub Actions: Deploy a model

Related AI Lessons

How to Learn a Hard Technical Skill Without Burning Out

Learn how to acquire hard technical skills without burnout by creating a sustainable learning plan

Dev.to · Anas Kalthoum | FreeBrain

After interviewing over 100 ML Candidates. Last Week Someone Walked In and Made Me Take Notes.

Learn what makes a standout ML candidate after interviewing over 100 applicants

Medium · Machine Learning

How AI Learns with Less Labeled Data

Discover how AI can learn with less labeled data, a crucial aspect of machine learning beyond model selection

Medium · Machine Learning

Mastering TypeScript — Understanding the TypeScript Compiler (tsc) from Scratch — Lesson 2

Learn the basics of the TypeScript compiler to write better JavaScript code

Medium · JavaScript

Learn Deep Learning by Hand (Beginner's Guide - Part 1)