Gemini 1.5 API: Chat with Videos, Images and Audios in your app (Gemini API Code tutorial)

Enric Domingo - AI Engineering · Beginner ·🧠 Large Language Models ·1y ago
Tutorial exploring the basics of the new Google Gemini 1.5 Pro and Flash models, how they compare to the OpenAI GPT-4o and GPT-4 Turbo, how to get the Google API Key, how to send requests to the Gemini API, and chatting with all kinds of files (videos, images, audios, etc.). Finally, we implement the Gemini 1.5 API models to the OmniChat App, a custom Streamlit Python webapp chatbot that we started developing on my last video in order to use GPT-4o from the OpenAI API with images and text. Blog: https://medium.com/@enricdomingo/how-i-add-gemini-1-5-pro-api-to-my-app-chat-with-videos-images-an…
Watch on YouTube ↗ (saves to browser)

Chapters (6)

Intro to Gemini 1.5
2:23 How to use the Gemini API
5:30 Adding images to the prompt
11:26 Gemini 1.5 into our chatbot webapp
25:20 Pushing to GitHub and deploying the app online
27:40 Testing video prompts
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)