NVIDIA's NEW Open Multimodal Intelligence - Nemotron 3 Nano Omni

Sam Witteveen · Beginner ·🧠 Large Language Models ·2w ago

Skills: Multimodal LLMs90%

In this video, we look at the latest Nemotron model from Nvidia, Nemotron 3 Nano Omni, which is a multi-modal model which is built to be small, fast, and fully multi-modal for agents supporting text, images, videos and audio. Blog: https://developer.nvidia.com/blog/nvidia-nemotron-3-nano-omni-powers-multimodal-agent-reasoning-in-a-single-efficient-open-model/ HF Blog: https://huggingface.co/blog/nvidia/nemotron-3-nano-omni-multimodal-intelligence HF Model: https://huggingface.co/nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps: 00:00 Intro 00:12 NVIDIA models released in the past 00:59 Nemotron 3 Nano Omni 02:26 PinchBench 03:31 Nemotron 3 Nano Omni Paper 04:16 Nemotron 3 Nano Paper 05:28 Nemotron 3 Nano Omni Hugging Face 05:50 OpenRouter and NVIDIA Cloud 06:25 Demo

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: Multimodal LLMs

View skill →

INSTALL NEW UNCENSORED FaceGen Ai WebUI LOCALLY in 1 CLICK!

INSTALL NEW UNCENSORED FaceGen Ai WebUI LOCALLY in 1 CLICK!

Google Veo 3 Tutorial: How to create AI Videos in Flow, Gemini or Google Vids?

Google Veo 3 Tutorial: How to create AI Videos in Flow, Gemini or Google Vids?

AI Tool Journey

NVIDIA Clara Guardian Virtual Patient Assistant

NVIDIA Clara Guardian Virtual Patient Assistant

NVIDIA Developer

Building Multimodal Search and RAG

Building Multimodal Search and RAG

Midjourney Trick: Consistent Character in Different Images

Midjourney Trick: Consistent Character in Different Images

Ollama Multimodal: EASILY setup Llava locally & Integrate API

Ollama Multimodal: EASILY setup Llava locally & Integrate API

Related AI Lessons

Thursday Thoughts: The Models We Can't Run

Explore the limitations of running latest AI models and their implications on the AI community

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Big Tech firms are investing billions in AI, driving growth and transformation, while prioritizing safety and responsible adoption

35 ChatGPT Prompts for Recruiters (That Actually Work in 2026)

Learn 35 effective ChatGPT prompts for recruiters to streamline their workflow in 2026

Dev.to · ClawGear

Stop Writing Like a Robot: The Prompt That Makes ChatGPT Sound Human

Learn how to craft prompts that make ChatGPT sound human, overcoming lifeless AI writing

Medium · ChatGPT

Chapters (9)

Intro

0:12 NVIDIA models released in the past

0:59 Nemotron 3 Nano Omni

2:26 PinchBench

3:31 Nemotron 3 Nano Omni Paper

4:16 Nemotron 3 Nano Paper

5:28 Nemotron 3 Nano Omni Hugging Face

5:50 OpenRouter and NVIDIA Cloud

6:25 Demo

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems

Dave Ebbelaar (LLM Eng)