Gemini 1.5 API: Chat with Videos, Images and Audios in your app (Gemini API Code tutorial)

Name: Gemini 1.5 API: Chat with Videos, Images and Audios in your app (Gemini API Code tutorial)
Uploaded: 2024-06-18T18:47:19+00:00
Channel: Enric Domingo - AI Engineering
Description: Tutorial exploring the basics of the new Google Gemini 1.5 Pro and Flash models, how they compare to the OpenAI GPT-4o and GPT-4 Turbo, how to get the G...

Enric Domingo - AI Engineering · Beginner ·🧠 Large Language Models ·1y ago

Tutorial exploring the basics of the new Google Gemini 1.5 Pro and Flash models, how they compare to the OpenAI GPT-4o and GPT-4 Turbo, how to get the Google API Key, how to send requests to the Gemini API, and chatting with all kinds of files (videos, images, audios, etc.). Finally, we implement the Gemini 1.5 API models to the OmniChat App, a custom Streamlit Python webapp chatbot that we started developing on my last video in order to use GPT-4o from the OpenAI API with images and text. Blog: https://medium.com/@enricdomingo/how-i-add-gemini-1-5-pro-api-to-my-app-chat-with-videos-images-an…

Watch on YouTube ↗ (saves to browser)

Chapters (6)

Intro to Gemini 1.5

2:23 How to use the Gemini API

5:30 Adding images to the prompt

11:26 Gemini 1.5 into our chatbot webapp

25:20 Pushing to GitHub and deploying the app online

27:40 Testing video prompts

Next Up

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems

Dave Ebbelaar (LLM Eng)