[LLM NEWS] KANs, Gemma 10M Context, OpenAI Updates?, Automatic Prompt Engineering, Tokenizer Arena

Elvis Saravia · Beginner ·📰 AI News & Updates ·1y ago
The Top AI and LLMs news. Links mentioned in the video: 00:00 OpenAI Updates? - https://twitter.com/OpenAI/status/1788987793613725786 02:13 Automatic Prompt Engineering - https://twitter.com/AnthropicAI/status/1788958483565732213 08:05 Consistency LLMs - https://twitter.com/omarsar0/status/1788594039865958762 10:00 Tokenizer Arena - https://huggingface.co/spaces/Cognitive-Lab/Tokenizer_Arena 11:55 Gemma 10M Context Window - https://twitter.com/siddrrsh/status/1788632667627696417 14:25 Evaluation from Chip Huyen - https://twitter.com/chipro/status/1788972359900389475 17:37 Evaluation from Jas…
Watch on YouTube ↗ (saves to browser)

Chapters (8)

OpenAI Updates? - https://twitter.com/OpenAI/status/1788987793613725786
2:13 Automatic Prompt Engineering - https://twitter.com/AnthropicAI/status/1788958483
8:05 Consistency LLMs - https://twitter.com/omarsar0/status/1788594039865958762
10:00 Tokenizer Arena - https://huggingface.co/spaces/Cognitive-Lab/Tokenizer_Arena
11:55 Gemma 10M Context Window - https://twitter.com/siddrrsh/status/17886326676276964
14:25 Evaluation from Chip Huyen - https://twitter.com/chipro/status/17889723599003894
17:37 Evaluation from Jason Liu - https://twitter.com/jxnlco/status/178855805309411769
19:55 KANs - https://arxiv.org/abs/2404.19756v2

Playlist

Uploads from Elvis Saravia · Elvis Saravia · 55 of 60

1 How to contribute to dair.ai?
How to contribute to dair.ai?
Elvis Saravia
2 dair-ai.github.io/_posts at master · dair-ai/dair-ai.github.io
dair-ai.github.io/_posts at master · dair-ai/dair-ai.github.io
Elvis Saravia
3 New story – Medium
New story – Medium
Elvis Saravia
4 101 ways to solve search (by Pratik Bhavsar)
101 ways to solve search (by Pratik Bhavsar)
Elvis Saravia
5 TLDR Generation of Scientific Documents | ML Interview #1 with Isabel Cachola
TLDR Generation of Scientific Documents | ML Interview #1 with Isabel Cachola
Elvis Saravia
6 Sentiment Analysis: Key Milestones, Challenges and New Directions
Sentiment Analysis: Key Milestones, Challenges and New Directions
Elvis Saravia
7 Discriminative Adversarial Search for Abstractive Summarization (by Thomas Scialom)
Discriminative Adversarial Search for Abstractive Summarization (by Thomas Scialom)
Elvis Saravia
8 Question Understanding: COVID-Q: 1,600+ Questions about COVID-19
Question Understanding: COVID-Q: 1,600+ Questions about COVID-19
Elvis Saravia
9 Getting Started with NLP
Getting Started with NLP
Elvis Saravia
10 Building tools and frameworks for large-scale social media mining (by Dr. Juan M. Banda)
Building tools and frameworks for large-scale social media mining (by Dr. Juan M. Banda)
Elvis Saravia
11 TextAttack: A Framework for Data Augmentation and Adversarial Training in NLP
TextAttack: A Framework for Data Augmentation and Adversarial Training in NLP
Elvis Saravia
12 Dive into Deep Learning (Study Group): Introduction to Deep Learning | Session 1
Dive into Deep Learning (Study Group): Introduction to Deep Learning | Session 1
Elvis Saravia
13 Dive into Deep Learning (Study Group): Multilayer Perceptrons | Session 4
Dive into Deep Learning (Study Group): Multilayer Perceptrons | Session 4
Elvis Saravia
14 How I read and annotate ML papers
How I read and annotate ML papers
Elvis Saravia
15 Keep Learning ML  (Session 1) | DSV, CompLex, Modern tools for emotions
Keep Learning ML (Session 1) | DSV, CompLex, Modern tools for emotions
Elvis Saravia
16 Dive into Deep Learning (Study Group): Preliminaries | Session 2
Dive into Deep Learning (Study Group): Preliminaries | Session 2
Elvis Saravia
17 Keep Learning ML #2 | Language-conditioned policy learning, Effective ML Testing, EagerPy
Keep Learning ML #2 | Language-conditioned policy learning, Effective ML Testing, EagerPy
Elvis Saravia
18 Dive into Deep Learning (Study Group): Linear Neural Networks | Session 3
Dive into Deep Learning (Study Group): Linear Neural Networks | Session 3
Elvis Saravia
19 Dive into Deep Learning (Study Group): Multilayer Perceptrons | Session 4
Dive into Deep Learning (Study Group): Multilayer Perceptrons | Session 4
Elvis Saravia
20 Keep Learning ML #3 | Contrastively Trained Structured World Models
Keep Learning ML #3 | Contrastively Trained Structured World Models
Elvis Saravia
21 Dive into Deep Learning (Study Group): Deep Learning Computation with PyTorch |  Session 5
Dive into Deep Learning (Study Group): Deep Learning Computation with PyTorch | Session 5
Elvis Saravia
22 Dive into Deep Learning (Study Group): Convolutional Neural Networks | Session 6
Dive into Deep Learning (Study Group): Convolutional Neural Networks | Session 6
Elvis Saravia
23 Dive into Deep Learning (Study Group): Modern CNNs | Session 7
Dive into Deep Learning (Study Group): Modern CNNs | Session 7
Elvis Saravia
24 101 ways to solve neural search with Jina
101 ways to solve neural search with Jina
Elvis Saravia
25 (Hopefully-Reusable) Life Lessons for PhD Students in NLP
(Hopefully-Reusable) Life Lessons for PhD Students in NLP
Elvis Saravia
26 How to save the world and forward your career in 5 easy steps | Women in NLP Talks
How to save the world and forward your career in 5 easy steps | Women in NLP Talks
Elvis Saravia
27 Prompt Engineering Overview
Prompt Engineering Overview
Elvis Saravia
28 Getting Started with the OpenAI Playground
Getting Started with the OpenAI Playground
Elvis Saravia
29 LM-Guided Chain of Thought
LM-Guided Chain of Thought
Elvis Saravia
30 Elements of a Prompt
Elements of a Prompt
Elvis Saravia
31 Reasoning with Intermediate Revision and Search with LLMs #chatgpt #ai #llms #science #programming
Reasoning with Intermediate Revision and Search with LLMs #chatgpt #ai #llms #science #programming
Elvis Saravia
32 General Tips for Designing Prompts
General Tips for Designing Prompts
Elvis Saravia
33 Efficient Infinite Context Transformers #ai #machinelearning #research #llms #science
Efficient Infinite Context Transformers #ai #machinelearning #research #llms #science
Elvis Saravia
34 Best Practices and Lessons Learned on Synthetic Data for Language Models #ai #machinelearning #genai
Best Practices and Lessons Learned on Synthetic Data for Language Models #ai #machinelearning #genai
Elvis Saravia
35 Reducing Hallucinations in Structured Outputs via RAG #chatgpt #ai #llms #programming
Reducing Hallucinations in Structured Outputs via RAG #chatgpt #ai #llms #programming
Elvis Saravia
36 Basic Prompt Examples for LLMs
Basic Prompt Examples for LLMs
Elvis Saravia
37 LLM In Context Recall is Prompt Dependent  #llms #ai #chatgpt #machinelearning
LLM In Context Recall is Prompt Dependent #llms #ai #chatgpt #machinelearning
Elvis Saravia
38 Zero-shot Prompting Explained
Zero-shot Prompting Explained
Elvis Saravia
39 RAG Faithfulness #llms #ai #gpt4
RAG Faithfulness #llms #ai #gpt4
Elvis Saravia
40 Understanding LLM Settings
Understanding LLM Settings
Elvis Saravia
41 Llama 3 is here! | First impressions and thoughts
Llama 3 is here! | First impressions and thoughts
Elvis Saravia
42 Llama 3 is Here! #ai #llms #llama3
Llama 3 is Here! #ai #llms #llama3
Elvis Saravia
43 Microsoft introduces Phi-3 | The most capable small language model?
Microsoft introduces Phi-3 | The most capable small language model?
Elvis Saravia
44 Microsoft introduces Phi-3! #ai #llms #microsoft
Microsoft introduces Phi-3! #ai #llms #microsoft
Elvis Saravia
45 Make Your LLM Fully Utilize the Context #ai #llms #machinelearning
Make Your LLM Fully Utilize the Context #ai #llms #machinelearning
Elvis Saravia
46 When to Retrieve? #ai #llms #machinelearning
When to Retrieve? #ai #llms #machinelearning
Elvis Saravia
47 Training an LLM to effectively use information retrieval
Training an LLM to effectively use information retrieval
Elvis Saravia
48 State-of-the-art open-source LLM judges #ai #machinelearning #gpt4
State-of-the-art open-source LLM judges #ai #machinelearning #gpt4
Elvis Saravia
49 Better and Faster LLMs via Multi-token Prediction
Better and Faster LLMs via Multi-token Prediction
Elvis Saravia
50 AlphaMath Almost Zero #ai #science #machinelearning
AlphaMath Almost Zero #ai #science #machinelearning
Elvis Saravia
51 SWE-Agent | An LLM-based Software Engineering Agent
SWE-Agent | An LLM-based Software Engineering Agent
Elvis Saravia
52 [LLM NEWS] AlphaFold 3, xLSTM, OpenAI's Model Spec, DeepSeek-V2, OpenDevin CodeAct 1.0
[LLM NEWS] AlphaFold 3, xLSTM, OpenAI's Model Spec, DeepSeek-V2, OpenDevin CodeAct 1.0
Elvis Saravia
53 LLM-powered tool for web scraping #ai #chatgpt #engineering
LLM-powered tool for web scraping #ai #chatgpt #engineering
Elvis Saravia
54 Learn about LLMs in this NEW course #ai #chatgpt #engineering
Learn about LLMs in this NEW course #ai #chatgpt #engineering
Elvis Saravia
[LLM NEWS] KANs, Gemma 10M Context, OpenAI Updates?, Automatic Prompt Engineering, Tokenizer Arena
[LLM NEWS] KANs, Gemma 10M Context, OpenAI Updates?, Automatic Prompt Engineering, Tokenizer Arena
Elvis Saravia
56 [LLM News] GPT4-o, Project Astra, Veo, Copilot+ PCs, Gemini 1.5 Flash, Chameleon
[LLM News] GPT4-o, Project Astra, Veo, Copilot+ PCs, Gemini 1.5 Flash, Chameleon
Elvis Saravia
57 Enhancing Answer Selection in LLMs #ai #machinelearning #engineering
Enhancing Answer Selection in LLMs #ai #machinelearning #engineering
Elvis Saravia
58 Exploring Capabilities of Long-Context LLMs
Exploring Capabilities of Long-Context LLMs
Elvis Saravia
59 On exploring LLMs #ai #promptengineering #chatgpt
On exploring LLMs #ai #promptengineering #chatgpt
Elvis Saravia
60 Transformers Can Do Arithmetic with the Right Embeddings #ai #machinelearning #engineering
Transformers Can Do Arithmetic with the Right Embeddings #ai #machinelearning #engineering
Elvis Saravia
AI Is Quietly Replacing Entry-Level Jobs
Next Up
AI Is Quietly Replacing Entry-Level Jobs
Full Disclosure