Luxonis OAK-D: Computer Vision on Device

Roboflow · Beginner ·👁️ Computer Vision ·5y ago

Skills: CV Basics80%ML Pipelines70%LLMOps60%

Key Takeaways

The Luxonis OAK-D is a computer vision device that can perform real-time object detection and depth triangulation, and Roboflow provides a tutorial on how to use it with their Python package to deploy custom models for tasks like American Sign Language letter identification.

Full Transcript

today we're discussing the luxonis oak d i mean this thing is kind of like a raspberry pi on steroids built for computer vision it's got a 4k camera it's got the ability to do real time object detection it smashed records on kickstarter raising over 1.2 million i mean what makes this device so special yeah we are really excited about this device over here at rebel flow and the reasons for that are numerous first of all it's got a very high resolution camera it has the ability to triangulate depth which we'll go into a little bit more here in a bit and then it also has the ability to do real-time inference by parsing neural network tensors very fast on the back here with the movidius vpu and the movidius vpu is essentially a replacement for a gpu so this is what the computer is going to be using as it's taking images in transforming them into tensors running that through a model very quickly and then spitting inference back out for you at the end and so the other thing you know that i was touching on is the depth component which i think is uh particularly game changing and uh joseph tell us a little bit about how how that works sure yeah so we caught up with brandon from lexanus in it in a prior video don't forget to like and subscribe and he describes how the luxonos team identified if you're able to triangulate the distance of an object not only can you do object detection to identify that for example you have a car on the horizon but if you know the constant distance between one camera and the other two cameras on the side then using the depth ai software platform you can infer and identify measurements and so i mean spatial ai unlocks new capabilities for example if you're a commercial fishery you can only catch and keep fish that are of a specific length and if you want to be able to do say automated identification of both your count of your catch and ensure that you have a sufficient batch that you're able to keep something like this would basically handle both those tasks at once now i mean to be clear not every task requires a sense of depth and object detection and 2d object detection still unlocks massive capabilities as we'll see today but i do think that this unlocks uh new things that i can't wait to see when it ships to everyone in december 2020 but jacob i understand that you've uh got something set up for us to try out today you mind introducing what it is we'll be doing yeah so today we have a tutorial ready for you basically on how to use this technology and a demonstration of what it can do and so the task we've chosen to tackle is actually the identification of the alphabet in american sign language and we're able to do this thanks to a data set provided by roboflow user david lee um but let's go ahead and dive right in and see if we can get the computer to identify uh different letters in in sign language so what do you say should try it out i'm excited let's dive in awesome the first step is to gather a data set here you can see we have american sign language letters hosted publicly on roboflow courtesy of roboflow user david lee let's take a look at a couple of these images the next step is to train a model with our new data we'll go ahead and go through this notebook and form a custom trained model to identify american sign language using state of the art object detection technology while we're working through the notebook we'll also check to make sure that our model can make inference on test images once we're satisfied with our model we'll go ahead and export it to a representation that is rentable on depth ai now we're going to go live here i have alexanus oak d it's plugged into my computer where i have the custom weights loaded in and we're going to go ahead and kick it off so here we go it's just a single python command to kick off our custom model and you can see here that a video pane pops up where the device is actually doing real-time inference so now let's go ahead and test it out to see if it can identify letters like o or v or maybe it can even teach me some sign language that was awesome great work thanks yeah it was pretty exciting to be able to build this so quickly and um it's amazing the state of the technology is in today and i just kind of wonder though like how could we how could we make it better collect more data right i mean if you get more data in your inference condition your model will only improve so data of doing hand signs with different backgrounds and all these different things um i would always keep track of just like you showed in roboflow how many letters you have of each class and then yeah continue to to train and deploy to luxonis um can you go into a bit more depth about how i can do this too at home or anyone else yeah certainly so everything we've done here today is publicly open source to you uh via blog post below we have all the code and all the instructions to be able to do the same thing with your own custom task so we look forward to seeing all the different applications that you bring forward uh both leveraging roboflow depth ai and the luxonis devices so don't forget to like and subscribe and thanks so much for joining us today happy training

Original Description

Roboflow discusses the breakthrough computer vision technology in the Luxonis OpenCV AI Kit. Let us know what you think of the OAK-D below! Full OAK-D Deploy Tutorial https://blog.roboflow.com/luxonis-oak-d-custom-model/ Deploying with the Roboflow Python Package (roboflowoak): https://help.roboflow.com/en_US/guides/roboflow-python-package-for-oak-deployment

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Roboflow · Roboflow · 37 of 60

← Previous Next →

YOLOv3 PyTorch Notebook Tutorial

YOLOv3 PyTorch Notebook Tutorial

How to Train YOLOv4 on a Custom Dataset (PyTorch)

How to Train YOLOv4 on a Custom Dataset (PyTorch)

How to Train YOLOv5 on a Custom Dataset

How to Train YOLOv5 on a Custom Dataset

How to Use the Roboflow Dataset Health Check

How to Use the Roboflow Dataset Health Check

What is Mean Average Precision (mAP)?

What is Mean Average Precision (mAP)?

How to Use the Roboflow Model Library

How to Use the Roboflow Model Library

How to Train EfficientDet in TensorFlow 2 Object Detection

How to Train EfficientDet in TensorFlow 2 Object Detection

How to Train YOLO v4 Tiny (Darknet) on a Custom Dataset

How to Train YOLO v4 Tiny (Darknet) on a Custom Dataset

Ask the Roboflow Team Anything - Episode 1

Ask the Roboflow Team Anything - Episode 1

Exploring The COCO Dataset

Exploring The COCO Dataset

Community Spotlight: Improving Uno with Computer Vision

Community Spotlight: Improving Uno with Computer Vision

Mosaic Data Augmentation - Deep Dive

Mosaic Data Augmentation - Deep Dive

Hands on with the OAK-1

Hands on with the OAK-1

Glenn Jocher: What is New in YOLO v5?

Glenn Jocher: What is New in YOLO v5?

How to Use Amazon Rekognition Custom Labels and Roboflow to Build an Object Detection Model

How to Use Amazon Rekognition Custom Labels and Roboflow to Build an Object Detection Model

An Interview with Brandon Gilles, Luxonis Founder and OAK Chief Architect

An Interview with Brandon Gilles, Luxonis Founder and OAK Chief Architect

How to Train a Custom Mobile Object Detection Model (with YOLOv4 Tiny and TensorFlow Lite)

How to Train a Custom Mobile Object Detection Model (with YOLOv4 Tiny and TensorFlow Lite)

Tackling the Small Object Problem in Object Detection

Tackling the Small Object Problem in Object Detection

Fast.ai v2 Released - What's New?

Fast.ai v2 Released - What's New?

Teaser: Roboflow Train (1-Click Computer Vision AutoML)

Teaser: Roboflow Train (1-Click Computer Vision AutoML)

How to Train a Custom Resnet34 Image Classification Model

How to Train a Custom Resnet34 Image Classification Model

How to Label Images for Object Detection with CVAT

How to Label Images for Object Detection with CVAT

Deploy YOLOv5 to Jetson Xavier NX at 30 FPS

Deploy YOLOv5 to Jetson Xavier NX at 30 FPS

Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz

Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz

Getting Started with VoTT - Computer Vision Annotation

Getting Started with VoTT - Computer Vision Annotation

How to Manage Classes in Object Detection (Rename, Combine, Balance)

How to Manage Classes in Object Detection (Rename, Combine, Balance)

How to Train YOLOv4 on a Custom Dataset in Darknet

How to Train YOLOv4 on a Custom Dataset in Darknet

Is Grayscale a Preprocessing or Augmentation Step in Computer Vision?

Is Grayscale a Preprocessing or Augmentation Step in Computer Vision?

Getting Started with Image Data Augmentation

Getting Started with Image Data Augmentation

Glenn Jocher: Image Augmentation in YOLO v5 and Beyond

Glenn Jocher: Image Augmentation in YOLO v5 and Beyond

GA Hosts Roboflow - Healthcare and AI

GA Hosts Roboflow - Healthcare and AI

How do self driving cars know when to stop?

How do self driving cars know when to stop?

What is PASCAL VOC XML?

What is PASCAL VOC XML?

AutoML Showdown: Google vs Amazon vs Microsoft

AutoML Showdown: Google vs Amazon vs Microsoft

How is computer vision changing manufacturing?

How is computer vision changing manufacturing?

The Alphabet in American Sign Language

The Alphabet in American Sign Language

Luxonis OAK-D: Computer Vision on Device

Luxonis OAK-D: Computer Vision on Device

How to Train a Custom Faster R-CNN Model with Facebook AI's Detectron2 | Use Your Own Dataset

How to Train a Custom Faster R-CNN Model with Facebook AI's Detectron2 | Use Your Own Dataset

TensorFlow vs PyTorch: Fireside

TensorFlow vs PyTorch: Fireside

Occlusion Techniques in Computer Vision

Occlusion Techniques in Computer Vision

A Customizable Web Application for Your Computer Vision Model

A Customizable Web Application for Your Computer Vision Model

Model Tradeoffs and the Future of Computer Vision

Model Tradeoffs and the Future of Computer Vision

Designing an Augmented Reality Board Game App

Designing an Augmented Reality Board Game App

YOLOv4 - Advanced Tactics

YOLOv4 - Advanced Tactics

How to Use CreateML and Build a Computer Vision iPhone App | AR Object Detection

How to Use CreateML and Build a Computer Vision iPhone App | AR Object Detection

Fireside Chat: Computer Vision in Agriculture

Fireside Chat: Computer Vision in Agriculture

Scaled-YOLOv4 Tops EfficientDet: Research Rundown

Scaled-YOLOv4 Tops EfficientDet: Research Rundown

What is Image Preprocessing?

What is Image Preprocessing?

Building a Community of Creators with BlkArthouse and Von Deon

Building a Community of Creators with BlkArthouse and Von Deon

How to Train Scaled-YOLOv4 to Detect Custom Objects

How to Train Scaled-YOLOv4 to Detect Custom Objects

Intro to Computer Vision: Fireside

Intro to Computer Vision: Fireside

The Best Way to Annotate Images for Object Detection

The Best Way to Annotate Images for Object Detection

The Computer Vision Process: Fireside

The Computer Vision Process: Fireside

How to Annotate Images with Your Team Using Roboflow

How to Annotate Images with Your Team Using Roboflow

Introducing the Roboflow Object Count Histogram

Introducing the Roboflow Object Count Histogram

How Fast is the M1 at Machine Learning? Benchmarking Apple's M1 and Intel's Chips

How Fast is the M1 at Machine Learning? Benchmarking Apple's M1 and Intel's Chips

CLIP: OpenAI's amazing new zero-shot image classifier

CLIP: OpenAI's amazing new zero-shot image classifier

How I hacked my Nest camera to run custom models

How I hacked my Nest camera to run custom models

Getting Started with the Roboflow Inference API

Getting Started with the Roboflow Inference API

Transfer Learning in Computer Vision | What, How, Why

Transfer Learning in Computer Vision | What, How, Why

The Luxonis OAK-D is a powerful computer vision device that can perform real-time object detection and depth triangulation, and Roboflow provides a tutorial on how to use it with their Python package to deploy custom models for tasks like American Sign Language letter identification. The tutorial covers gathering a dataset, training a model, and deploying it on the OAK-D device.

Key Takeaways

Gather a dataset for the task
Train a model using the dataset
Export the model for deployment on the OAK-D
Deploy the model on the OAK-D using the Roboflow Python package
Test the model on the OAK-D

💡 The Luxonis OAK-D can perform real-time object detection and depth triangulation, making it a powerful tool for computer vision tasks.

🔒 Pro feature: Ask AI to explain this lesson →

More on: CV Basics

View skill →

Identify Horses or Humans with TensorFlow and Vertex AI

Building a Dog Breed Identifier App from scratch - DogNet

Building a Dog Breed Identifier App from scratch - DogNet

Aladdin Persson

Apply OpenGL Texturing and Camera Systems

Apply OpenGL Texturing and Camera Systems

Aerial Image Segmentation with PyTorch

Aerial Image Segmentation with PyTorch

How to Install Stable Diffusion - automatic1111

How to Install Stable Diffusion - automatic1111

Sebastian Kamph

NVIDIA RTXGI Unreal Engine 4 Plugin: Introduction and Setup

NVIDIA RTXGI Unreal Engine 4 Plugin: Introduction and Setup

NVIDIA Developer

Related Reads

Modern Image Search Technology Explained: Features, Benefits & Uses

Learn about modern image search technology and its features, benefits, and uses

Medium · Deep Learning

Modern Image Search Technology Explained: Features, Benefits & Uses

Learn about modern image search technology and its features, benefits, and uses

Deep Learning Project FAQs on Faster R-CNN and YOLO for interview

Learn key concepts and differences between Faster R-CNN and YOLO for deep learning project interviews

Medium · Deep Learning

How the Internet Works: From Typing a URL to Seeing a Website

Learn how the internet works by understanding the process of typing a URL to seeing a website, and why it matters for developers and users alike

Dev.to · Juma Evans

Marketing management for ugc net| Important topics of marketing management ugc net commerce dec 2023

Bhoomi Learning Centre~Dr. Muskan