Comparing Google's Gemini Pro vs OpenAI's ChatGPT4: Which model has better visual reasoning?

DeepLearning Hero · Beginner ·🧠 Large Language Models ·2y ago
In this video, we're going to test Gemini Pro and ChatGPT4 side by side on a gamut of tasks requiring various levels of visual reasoning and specialized skills. These tasks include; common sense reasoning, aesthetic understanding, data analysis, math, etc. Watch the full video to see how these models perform on these tasks. 00:00 - Introduction 03:18 - Common sense reasoning 04:51 - Aesthetic understanding 05:55 - Entity recognition 08:12 - Data analysis 10:53 - Design analysis & Text extraction 12:35 - Misc labelling 15:06 - Math 16:08 - Final thoughts
Watch on YouTube ↗ (saves to browser)

Chapters (9)

Introduction
3:18 Common sense reasoning
4:51 Aesthetic understanding
5:55 Entity recognition
8:12 Data analysis
10:53 Design analysis & Text extraction
12:35 Misc labelling
15:06 Math
16:08 Final thoughts
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)