Luxonis OAK-D: Computer Vision on Device
Key Takeaways
The Luxonis OAK-D is an on-device computer vision camera that can run real-time object detection and triangulate depth. Roboflow provides a tutorial on deploying custom models to it, including with its Python package (roboflowoak), for tasks such as American Sign Language letter identification.
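To make the depth triangulation idea concrete, here is a minimal sketch of the standard pinhole stereo relation (depth = focal length × baseline / disparity) and of how a depth estimate turns a pixel measurement into a real-world length. It is not the DepthAI implementation. The function names and the numeric values (focal length, baseline, disparity) are illustrative assumptions, not figures from the video.

```python
def depth_from_disparity(focal_px: float, baseline_m: float, disparity_px: float) -> float:
    """Depth in meters from the standard stereo relation Z = f * B / d.

    focal_px: focal length in pixels (assumed value, not from the source)
    baseline_m: fixed distance between the two cameras, in meters
    disparity_px: horizontal pixel shift of the same point between the two views
    """
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px


def object_length_m(pixel_length: float, depth_m: float, focal_px: float) -> float:
    """Approximate real-world length of an object seen roughly face-on."""
    return pixel_length * depth_m / focal_px


# Hypothetical numbers chosen only for illustration.
FOCAL_PX = 800.0
BASELINE_M = 0.075

depth = depth_from_disparity(FOCAL_PX, BASELINE_M, disparity_px=40.0)
length = object_length_m(pixel_length=200.0, depth_m=depth, focal_px=FOCAL_PX)
print(f"depth: {depth:.2f} m, object length: {length:.3f} m")
```

The same two steps are what make a task like measuring a fish possible: a detector supplies the pixel extent of the object, and the stereo pair supplies its distance.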
Full Transcript
Today we're discussing the Luxonis OAK-D. This thing is kind of like a Raspberry Pi on steroids, built for computer vision. It's got a 4K camera, it's got the ability to do real-time object detection, and it smashed records on Kickstarter, raising over 1.2 million. What makes this device so special?

Yeah, we are really excited about this device over here at Roboflow, and the reasons for that are numerous. First of all, it's got a very high-resolution camera. It has the ability to triangulate depth, which we'll go into a little more in a bit. And it also has the ability to do real-time inference by parsing neural network tensors very fast on the back here with the Movidius VPU. The Movidius VPU is essentially a replacement for a GPU: it's what the device uses as it takes images in, transforms them into tensors, runs them through a model very quickly, and then spits inference back out for you at the end. The other thing I was touching on is the depth component, which I think is particularly game-changing. Joseph, tell us a little bit about how that works.

Sure. We caught up with Brandon from Luxonis in a prior video (don't forget to like and subscribe), and he describes how the Luxonis team identified that if you're able to triangulate the distance of an object, you can go beyond object detection, which identifies that, for example, you have a car on the horizon. If you know the constant distance between one camera and the other two cameras on the side, then using the DepthAI software platform you can infer and identify measurements. So spatial AI unlocks new capabilities. For example, if you're a commercial fishery, you can only catch and keep fish that are of a specific length. If you want to do automated identification of both the count of your catch and whether you have a sufficient batch that you're able to keep, something like this would basically handle both those tasks at once.

Now, to be clear, not every task requires a sense of depth, and 2D object detection still unlocks massive capabilities, as we'll see today. But I do think that this unlocks new things that I can't wait to see when it ships to everyone in December 2020. Jacob, I understand that you've got something set up for us to try out today. Do you mind introducing what it is we'll be doing?

Yeah, so today we have a tutorial ready for you on how to use this technology and a demonstration of what it can do. The task we've chosen to tackle is the identification of the alphabet in American Sign Language, and we're able to do this thanks to a dataset provided by Roboflow user David Lee. Let's go ahead and dive right in and see if we can get the computer to identify different letters in sign language. So what do you say, should we try it out?

I'm excited. Let's dive in.

Awesome. The first step is to gather a dataset. Here you can see we have American Sign Language letters hosted publicly on Roboflow, courtesy of Roboflow user David Lee. Let's take a look at a couple of these images.

The next step is to train a model with our new data. We'll go through this notebook and train a custom model to identify American Sign Language using state-of-the-art object detection technology. While we're working through the notebook, we'll also check to make sure that our model can make inferences on test images. Once we're satisfied with our model, we'll export it to a representation that is runnable on DepthAI.

Now we're going to go live. Here I have a Luxonis OAK-D. It's plugged into my computer, where I have the custom weights loaded, and we're going to kick it off. So here we go: it's just a single Python command to kick off our custom model, and you can see that a video pane pops up where the device is doing real-time inference. Now let's test it out to see if it can identify letters like O or V, or maybe it can even teach me some sign language.

That was awesome. Great work.

Thanks. Yeah, it was pretty exciting to be able to build this so quickly, and it's amazing where the state of the technology is today. I just wonder, though, how could we make it better?

Collect more data, right? If you get more data from your inference conditions, your model will only improve: data of people doing hand signs with different backgrounds and all these different things. I would always keep track, just like you showed in Roboflow, of how many letters you have of each class, and then continue to train and deploy to Luxonis. Can you go into a bit more depth about how I can do this too at home, or anyone else?

Yeah, certainly. Everything we've done here today is publicly available and open source via the blog post below. We have all the code and all the instructions to do the same thing with your own custom task. We look forward to seeing all the different applications that you bring forward leveraging Roboflow, DepthAI, and the Luxonis devices. So don't forget to like and subscribe, and thanks so much for joining us today. Happy training!
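The advice about keeping track of how many examples you have of each letter can be sketched in a few lines. This is a rough illustration, not Roboflow's own tooling: the helper names, the sample labels, and the 50% threshold are all assumptions made for the example.

```python
from collections import Counter


def class_counts(labels):
    """Count how many annotations exist for each class label."""
    return Counter(labels)


def underrepresented(counts, min_fraction=0.5):
    """Return classes with fewer than min_fraction of the largest class's count.

    min_fraction is an arbitrary illustrative threshold, not a recommended value.
    """
    if not counts:
        return []
    largest = max(counts.values())
    return sorted(label for label, n in counts.items() if n < min_fraction * largest)


# Hypothetical annotation labels for a sign language letter dataset.
labels = ["A", "A", "A", "A", "B", "B", "C"]
counts = class_counts(labels)
print(dict(counts))
print("collect more examples of:", underrepresented(counts))
```

Classes flagged this way are natural candidates for the next round of data collection before retraining and redeploying.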
Original Description
Roboflow discusses the breakthrough computer vision technology in the Luxonis OpenCV AI Kit.
Let us know what you think of the OAK-D below!
Full OAK-D Deploy Tutorial
https://blog.roboflow.com/luxonis-oak-d-custom-model/
Deploying with the Roboflow Python Package (roboflowoak):
https://help.roboflow.com/en_US/guides/roboflow-python-package-for-oak-deployment