[LLM News] Claude 3.5 Sonnet, Open-Sora, Context Caching, PlanRAG, Safe SuperIntelligence Inc

Elvis Saravia · Beginner · 🧠 Large Language Models · 1y ago

Skills: LLM Foundations 90%, Prompt Craft 80%, Fine-tuning LLMs 70%

Another exciting episode of LLM News!

Links mentioned in the video:

00:00 Claude 3.5 Sonnet - https://www.anthropic.com/news/claude-3-5-sonnet
04:04 Gen-3 Alpha - https://runwayml.com/blog/introducing-gen-3-alpha/
05:05 Safe SuperIntelligence Inc - https://x.com/ssi/status/1803472825476587910
06:35 Meta AI Research - https://about.fb.com/news/2024/06/releasing-new-ai-research-models-to-accelerate-innovation-at-scale/
07:49 DeepSeek-Coder-V2 - https://youtu.be/0Xp7K2rHcZg?si=khDjNLSQPKSZVn0j
10:29 Memory Tuning - https://youtu.be/Bs36gxpKcqk?si=SicIqfAday3q15tE
12:28 V2A - https://deepmind.google/discover/blog/generating-audio-for-video/
15:06 Local III - https://changes.openinterpreter.com/log/local-iii
16:26 Open-Sora - https://github.com/hpcaitech/Open-Sora
17:30 tokencost - https://github.com/AgentOps-AI/tokencost
18:27 MCTSr - https://arxiv.org/pdf/2406.07394
19:45 Gemini Context Caching - https://youtu.be/987Pd89EDPs?si=aeAGjSmwUj22sVTp
21:17 Long-Context LLMs - https://x.com/omarsar0/status/1804184820806766875
23:48 RAG to Rich - https://x.com/omarsar0/status/1803254134289895555
25:09 PlanRAG - https://x.com/omarsar0/status/1803262374574448757
26:37 LangChain Criticism - https://www.octomind.dev/blog/why-we-no-longer-use-langchain-for-building-our-ai-agents

Claude 3.5 Sonnet overview: https://youtu.be/h9CERBnVOmQ

To learn more about building with LLMs, check out our upcoming live training: https://maven.com/dair-ai/prompt-engineering-llms

#ai #machinelearning #science #chatgpt

Reach out to elvissaravia@dair.ai if you would like to sponsor the LLM News.




