[LLM News] Claude 3.5 Sonnet, Open-Sora, Context Caching, PlanRAG, Safe SuperIntelligence Inc
Another exciting episode of LLM News!
Links mentioned in the video:
00:00 Claude 3.5 Sonnet - https://www.anthropic.com/news/claude-3-5-sonnet
04:04 Gen-3 Alpha - https://runwayml.com/blog/introducing-gen-3-alpha/
05:05 Safe SuperIntelligence Inc - https://x.com/ssi/status/1803472825476587910
06:35 Meta AI Research - https://about.fb.com/news/2024/06/releasing-new-ai-research-models-to-accelerate-innovation-at-scale/
07:49 DeepSeek-Coder-V2 - https://youtu.be/0Xp7K2rHcZg?si=khDjNLSQPKSZVn0j
10:29 Memory Tuning - https://youtu.be/Bs36gxpKcqk?si=SicIqfAday3q15tE
12:28 V2A - https://deepmind.…
Watch on YouTube ↗
(saves to browser)
Chapters (16)
Claude 3.5 Sonnet - https://www.anthropic.com/news/claude-3-5-sonnet
4:04
Gen-3 Alpha - https://runwayml.com/blog/introducing-gen-3-alpha/
5:05
Safe SuperIntelligence Inc - https://x.com/ssi/status/1803472825476587910
6:35
Meta AI Research - https://about.fb.com/news/2024/06/releasing-new-ai-research-m
7:49
DeepSeek-Coder-V2 - https://youtu.be/0Xp7K2rHcZg?si=khDjNLSQPKSZVn0j
10:29
Memory Tuning - https://youtu.be/Bs36gxpKcqk?si=SicIqfAday3q15tE
12:28
V2A - https://deepmind.google/discover/blog/generating-audio-for-video/
15:06
Local III - https://changes.openinterpreter.com/log/local-iii
16:26
Open-Sora - https://github.com/hpcaitech/Open-Sora
17:30
tokencost - https://github.com/AgentOps-AI/tokencost
18:27
MCTSr - https://arxiv.org/pdf/2406.07394
19:45
Gemini Context Caching - https://youtu.be/987Pd89EDPs?si=aeAGjSmwUj22sVTp
21:17
Long-Context LLMs - https://x.com/omarsar0/status/1804184820806766875
23:48
RAG to Rich - https://x.com/omarsar0/status/1803254134289895555
25:09
PlanRAG - https://x.com/omarsar0/status/1803262374574448757
26:37
LangChain Criticism - https://www.octomind.dev/blog/why-we-no-longer-use-langcha
Playlist
Uploads from Elvis Saravia · Elvis Saravia · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
How to contribute to dair.ai?
Elvis Saravia
dair-ai.github.io/_posts at master · dair-ai/dair-ai.github.io
Elvis Saravia
New story – Medium
Elvis Saravia
101 ways to solve search (by Pratik Bhavsar)
Elvis Saravia
TLDR Generation of Scientific Documents | ML Interview #1 with Isabel Cachola
Elvis Saravia
Sentiment Analysis: Key Milestones, Challenges and New Directions
Elvis Saravia
Discriminative Adversarial Search for Abstractive Summarization (by Thomas Scialom)
Elvis Saravia
Question Understanding: COVID-Q: 1,600+ Questions about COVID-19
Elvis Saravia
Getting Started with NLP
Elvis Saravia
Building tools and frameworks for large-scale social media mining (by Dr. Juan M. Banda)
Elvis Saravia
TextAttack: A Framework for Data Augmentation and Adversarial Training in NLP
Elvis Saravia
Dive into Deep Learning (Study Group): Introduction to Deep Learning | Session 1
Elvis Saravia
Dive into Deep Learning (Study Group): Multilayer Perceptrons | Session 4
Elvis Saravia
How I read and annotate ML papers
Elvis Saravia
Keep Learning ML (Session 1) | DSV, CompLex, Modern tools for emotions
Elvis Saravia
Dive into Deep Learning (Study Group): Preliminaries | Session 2
Elvis Saravia
Keep Learning ML #2 | Language-conditioned policy learning, Effective ML Testing, EagerPy
Elvis Saravia
Dive into Deep Learning (Study Group): Linear Neural Networks | Session 3
Elvis Saravia
Dive into Deep Learning (Study Group): Multilayer Perceptrons | Session 4
Elvis Saravia
Keep Learning ML #3 | Contrastively Trained Structured World Models
Elvis Saravia
Dive into Deep Learning (Study Group): Deep Learning Computation with PyTorch | Session 5
Elvis Saravia
Dive into Deep Learning (Study Group): Convolutional Neural Networks | Session 6
Elvis Saravia
Dive into Deep Learning (Study Group): Modern CNNs | Session 7
Elvis Saravia
101 ways to solve neural search with Jina
Elvis Saravia
(Hopefully-Reusable) Life Lessons for PhD Students in NLP
Elvis Saravia
How to save the world and forward your career in 5 easy steps | Women in NLP Talks
Elvis Saravia
Prompt Engineering Overview
Elvis Saravia
Getting Started with the OpenAI Playground
Elvis Saravia
LM-Guided Chain of Thought
Elvis Saravia
Elements of a Prompt
Elvis Saravia
Reasoning with Intermediate Revision and Search with LLMs #chatgpt #ai #llms #science #programming
Elvis Saravia
General Tips for Designing Prompts
Elvis Saravia
Efficient Infinite Context Transformers #ai #machinelearning #research #llms #science
Elvis Saravia
Best Practices and Lessons Learned on Synthetic Data for Language Models #ai #machinelearning #genai
Elvis Saravia
Reducing Hallucinations in Structured Outputs via RAG #chatgpt #ai #llms #programming
Elvis Saravia
Basic Prompt Examples for LLMs
Elvis Saravia
LLM In Context Recall is Prompt Dependent #llms #ai #chatgpt #machinelearning
Elvis Saravia
Zero-shot Prompting Explained
Elvis Saravia
RAG Faithfulness #llms #ai #gpt4
Elvis Saravia
Understanding LLM Settings
Elvis Saravia
Llama 3 is here! | First impressions and thoughts
Elvis Saravia
Llama 3 is Here! #ai #llms #llama3
Elvis Saravia
Microsoft introduces Phi-3 | The most capable small language model?
Elvis Saravia
Microsoft introduces Phi-3! #ai #llms #microsoft
Elvis Saravia
Make Your LLM Fully Utilize the Context #ai #llms #machinelearning
Elvis Saravia
When to Retrieve? #ai #llms #machinelearning
Elvis Saravia
Training an LLM to effectively use information retrieval
Elvis Saravia
State-of-the-art open-source LLM judges #ai #machinelearning #gpt4
Elvis Saravia
Better and Faster LLMs via Multi-token Prediction
Elvis Saravia
AlphaMath Almost Zero #ai #science #machinelearning
Elvis Saravia
SWE-Agent | An LLM-based Software Engineering Agent
Elvis Saravia
[LLM NEWS] AlphaFold 3, xLSTM, OpenAI's Model Spec, DeepSeek-V2, OpenDevin CodeAct 1.0
Elvis Saravia
LLM-powered tool for web scraping #ai #chatgpt #engineering
Elvis Saravia
Learn about LLMs in this NEW course #ai #chatgpt #engineering
Elvis Saravia
[LLM NEWS] KANs, Gemma 10M Context, OpenAI Updates?, Automatic Prompt Engineering, Tokenizer Arena
Elvis Saravia
[LLM News] GPT4-o, Project Astra, Veo, Copilot+ PCs, Gemini 1.5 Flash, Chameleon
Elvis Saravia
Enhancing Answer Selection in LLMs #ai #machinelearning #engineering
Elvis Saravia
Exploring Capabilities of Long-Context LLMs
Elvis Saravia
On exploring LLMs #ai #promptengineering #chatgpt
Elvis Saravia
Transformers Can Do Arithmetic with the Right Embeddings #ai #machinelearning #engineering
Elvis Saravia
DeepCamp AI