Understand the AI Engineer's Tech Stack

DataCamp · Intermediate ·🤖 AI Agents & Automation ·4mo ago

Key Takeaways

The video discusses the modern AI engineer's tech stack, covering tools for data processing, model development, evaluation, deployment, and agentic workflows, with a focus on open-source and closed-source models, including Claude, Sonnet, GPT5, Open AAI, Anthropic, Base 10, Foundation Lab, Cloud Opus 4.6, Kimmy, and GLM.

Full Transcript

Hello everyone and welcome to today's session. My name is Ree and I'll be your moderator for today. We're about to get started. Uh we are a little bit late starting today. However, we've got a couple of minutes before we do actually get going. So, if you haven't done so already, make sure that you register for today's session. You can do so by scanning the QR code that's on screen. that'll take you to all of the future webinars and the today's webinar uh that we've got scheduled for now and the future as well. So yeah, make sure to scan that QR code. You can also head over to datacamp.com/webinars where you'll find this session as well as all of our future sessions as well. I'll put a link to that in the chat now. But yeah, make sure you register for today's session. That means that uh you can get the recording and resources sent to you when we're done just in case you need to pop off for the live session today. Uh with that being said, uh we are also going to be uh sharing some resources that will be uh useful for what we're going to be talking about today. So uh please do keep your eyes peeled in the chat for those. They'll be posted very shortly and they will also be in the video description on YouTube as well. I'll notify everyone in the chat as well. If you have any questions or comments at any point throughout the session today, let us know in the chat. We're going to be running through your questions for the last 10 minutes of the session. So make sure you stick around for those as well. However, if you want to leave a comment and then see if we answer it afterwards and you need to jump off, leave the comment and catch up with the recording. So, yeah, make sure that you register for today's session so you don't miss a thing. Brilliant. I'll run through these messages one more time for everyone else that's just joined and then we'll get going. So, hello. If you've just joined, my name is Ree and I'll be your moderator for today's session. We are just about to get started uh for today. Um, if you haven't done so already, make sure that you sign up and register for this session. The link to do so, uh, is in the comments. You can head over to datacamp.com/webinars. That's where you'll see all of our future sessions, uh, aside from the upcoming campaigns and virtual conferences we've got coming up. However, we will mention those at uh, a later point during today's session. So, yeah, make sure you registered for today's session. If you have any questions or comments, let us know in the chat. We're going to be running through your questions for the last 10 minutes of the session. Uh, and I think that's pretty much everything from me. Please do check out the resources as well. They'll be in the YouTube video description as well as the comments very shortly. Also, uh, the slides uh we are going to be able to uh potentially share with those with you at after the end of the session. So, the only way to get them is to sign up for the recording. Make sure you sign up uh for today's session either by going to datacump.com/webinars or scanning the QR code that's on screen. Brilliant. And with that, I will hand you over to Richie for uh today's session. So, yeah, Richie, please take it away. >> Hi there, data scamps and data champs. Welcome to the webinar. We'll get started in just a few moments time. In the meantime, please do let us know where you're joining from. Uh great to see a few of you saying hi in the chat there. So, uh we've got Ove saying happy to be here. We've got uh Runa Runa is that uh saying hi everyone. Dehane saying hi. Uh great to have you all here. All right, so uh today uh let's kick things off. Um in case you haven't noticed, there are a lot of AI tools. It's crazy. So for any given problem, there are at least half a dozen pieces of software to help you solve it. And that's brilliant, but it's also confusing. So today, we're going to explore the AI engineers tech stack. That means we're going to figure out which combination of tools you need in order to build AI products effectively. Our guest is Alex Kerr. He's an applied AI engineer at Base 10. Welcome, Alex. Great to have you here. >> Hi, that invite. >> Wonderful. So, Alex is an AI engineer. He's an entrepreneur. He's also an investor. Uh, so at the AI inference platform base 10, he helps customers build AI solutions. He's also a venture partner at TA researched capital where he invests in early stage companies and he previously founded the AI project incubator uh P AI where he continues to serve as chairman and pre previously Alex was a decision scientist at launch darkly. He's also been an editor at Stanford High and a machine learning scientist at Neurable. So uh uh quite a career history. Uh I'm going to let you take it away. Tell us all about the uh the AI tech stack. Awesome. Okay. Um, hi everyone. My name is Alex. Uh, currently I'm an engineer at Base 10 and I'm very excited today to talk to you about um how open- source models are secretly powering um a lot of AI coding at scale. And I'll be sharing uh my personal favorite models uh for coding that are nonobvious but highly valuable if you've only considered uh closed source options like anthropic and open AAI and then share some workflows that I use day-to-day um as a developer and and with some remarks on how um I should uh we should sort of adapt as developers um as the ground beneath our sh uh feet are shifting very quickly with um all all the new updates to to the stack. So, um, so here's a little bit about me. Um, I joined B 10 earlier this year at the intersection of go to market and engineering. I split my time between contributing to open source projects and creating content to help developers build better AI applications in production. Um, and uh, Richie went over some of my previous experiences, so I'll skip that. And uh, fun fact is in my free time, I like uh, being obsessed over calisthenic skills and reading as many uh, books uh, in fiction on non-fiction to get closer to my good reads goal. Um and I want to start off to say that um what um we work with a variety of customers today at base 10 and a large number of the most popular AI applications today that you might have used uh is on this list and because of this we have a great front row view of how Agentic systems are built across different domains such as healthcare coding and um sales tools And um today in pace of AI which is coding um it has proliferated over the past uh few years because it has well- definfined examples uh across the web and uh patterns and um ways you can check the correctness of code that is being generated. Um so yeah let's dive in um to go over the agenda briefly. Uh first we'll go over some of the limitations and walls uh builders are facing today with closed source options like GPT5 and Sonnet um 4.6 and why we still continue to use closed source models. Uh I still love jamming with claude on some tasks, but I tell you why it might not be the best option in production. Then we'll contrast this with the rise of uh open source models. I'll touch on three of my favorite state-of-the-art models uh today. Then we'll move into how to take advantage of these open source models uh in your current dev workflow uh from something that is the least handholding and opinionated to some of the more opinionated options and uh more custom options. And lastly, we'll go over a brief case study of how at base 10 at the infra layer in inference, we increase the performance of these coding models to bring users to light. U it's sort of a under the hood sneak peek of the optimization we make uh to making these models run fast. Um that I think you will find very interesting. And lastly, I'll conclude with some takeaways. uh as uh how to think about the world as a uh like AI native builders. So the starting with the problems of open source models, I usually say these are great for prototyping your applications, but the issues really start to emerge at scale. Uh the first is rate limits. So when you're not paying a foundation lab uh millions of dollars, they'll limit the amount of tokens you can push through per minute or per second. Um the second is the costs. Uh there are huge costs that are multiples more than open source somewhere on the order of 5 to 10x when you compare um something like a cloud opus 4.6 versus uh Kimmy or GLM, which I'll dive into later. The third is reliability. Um reliability is concerned as your mercy at these uh closed source labs uh service level agreements. Um if they have outages then you you sort of are also out um if you're building an app that customers are actively using. And lastly, for uh speed and control, um often these um closed source endpoints are throughput optimized, which means they're trying to serve uh millions of developers. So your individual uh token throughput could become an issue if there are particularly uh noisy neighbors around you. And then for control, if you want to build a truly differentiated AI product in this market and want to use techniques like RL or SFT, um closed source have limited offerings and you often um cannot fine-tune slash do not have ownership of these models at the end of the day. So why do we use closed source models to start with? It's because um they're really smart and with high quality.

Original Description

Session resources: https://bit.ly/3Oq5YfE Alex Ker, an Applied AI Engineer at Baseten, will break down the modern AI engineer’s tech stack. You’ll learn which tools are used for data processing, model development, evaluation, deployment, and agentic workflows—and how to choose the right ones for your projects.
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from DataCamp · DataCamp · 0 of 60

← Previous Next →
1 SQL Server Tutorial: Date manipulation
SQL Server Tutorial: Date manipulation
DataCamp
2 R Tutorial: Intermediate Interactive Data Visualization with plotly in R
R Tutorial: Intermediate Interactive Data Visualization with plotly in R
DataCamp
3 R Tutorial: Adding aesthetics to represent a variable
R Tutorial: Adding aesthetics to represent a variable
DataCamp
4 R Tutorial: Moving Beyond Simple Interactivity
R Tutorial: Moving Beyond Simple Interactivity
DataCamp
5 Python Tutorial: Why use ML for marketing? Strategies and use cases
Python Tutorial: Why use ML for marketing? Strategies and use cases
DataCamp
6 Python Tutorial: Preparation for modeling
Python Tutorial: Preparation for modeling
DataCamp
7 Python Tutorial: Machine Learning modeling steps
Python Tutorial: Machine Learning modeling steps
DataCamp
8 R Tutorial: The prior model
R Tutorial: The prior model
DataCamp
9 R Tutorial: Data & the likelihood
R Tutorial: Data & the likelihood
DataCamp
10 R Tutorial: The posterior model
R Tutorial: The posterior model
DataCamp
11 R Tutorial: An Introduction to plotly
R Tutorial: An Introduction to plotly
DataCamp
12 R Tutorial: Plotting a single variable
R Tutorial: Plotting a single variable
DataCamp
13 R Tutorial: Bivariate graphics
R Tutorial: Bivariate graphics
DataCamp
14 Python Tutorial: Customer Segmentation in Python
Python Tutorial: Customer Segmentation in Python
DataCamp
15 Python Tutorial: Time cohorts
Python Tutorial: Time cohorts
DataCamp
16 Python Tutorial: Calculate cohort metrics
Python Tutorial: Calculate cohort metrics
DataCamp
17 Python Tutorial: Cohort analysis visualization
Python Tutorial: Cohort analysis visualization
DataCamp
18 R Tutorial: Building Dashboards with flexdashboard
R Tutorial: Building Dashboards with flexdashboard
DataCamp
19 R Tutorial: Anatomy of a flexdashboard
R Tutorial: Anatomy of a flexdashboard
DataCamp
20 R Tutorial: Layout basics
R Tutorial: Layout basics
DataCamp
21 R Tutorial: Advanced layouts
R Tutorial: Advanced layouts
DataCamp
22 Python Tutorial: Time Series Analysis in Python
Python Tutorial: Time Series Analysis in Python
DataCamp
23 Python Tutorial: Correlation of Two Time Series
Python Tutorial: Correlation of Two Time Series
DataCamp
24 Python Tutorial: Simple Linear Regressions
Python Tutorial: Simple Linear Regressions
DataCamp
25 Python Tutorial: Autocorrelation
Python Tutorial: Autocorrelation
DataCamp
26 R Tutorial: The gapminder dataset
R Tutorial: The gapminder dataset
DataCamp
27 R Tutorial: The filter verb
R Tutorial: The filter verb
DataCamp
28 R Tutorial: The arrange verb
R Tutorial: The arrange verb
DataCamp
29 R Tutorial: The mutate verb
R Tutorial: The mutate verb
DataCamp
30 R Tutorial: What is cluster analysis?
R Tutorial: What is cluster analysis?
DataCamp
31 R Tutorial: Distance between two observations
R Tutorial: Distance between two observations
DataCamp
32 R Tutorial: The importance of scale
R Tutorial: The importance of scale
DataCamp
33 R Tutorial: Measuring distance for categorical data
R Tutorial: Measuring distance for categorical data
DataCamp
34 Python Tutorial: Plotting multiple graphs
Python Tutorial: Plotting multiple graphs
DataCamp
35 Python Tutorial: Customizing axes
Python Tutorial: Customizing axes
DataCamp
36 Python Tutorial: Legends, annotations, & styles
Python Tutorial: Legends, annotations, & styles
DataCamp
37 Python Tutorial: Introduction to iterators
Python Tutorial: Introduction to iterators
DataCamp
38 Python Tutorial: Playing with iterators
Python Tutorial: Playing with iterators
DataCamp
39 Python Tutorial: Using iterators to load large files into memory
Python Tutorial: Using iterators to load large files into memory
DataCamp
40 SQL Tutorial: Introduction to Relational Databases in SQL
SQL Tutorial: Introduction to Relational Databases in SQL
DataCamp
41 SQL Tutorial: Tables: At the core of every database
SQL Tutorial: Tables: At the core of every database
DataCamp
42 SQL Tutorial: Update your database as the structure changes
SQL Tutorial: Update your database as the structure changes
DataCamp
43 Python Tutorial: Classification-Tree Learning
Python Tutorial: Classification-Tree Learning
DataCamp
44 Python Tutorial: Decision-Tree for Classification
Python Tutorial: Decision-Tree for Classification
DataCamp
45 Python Tutorial: Decision-Tree for Regression
Python Tutorial: Decision-Tree for Regression
DataCamp
46 Python Tutorial: Census Subject Tables
Python Tutorial: Census Subject Tables
DataCamp
47 Python Tutorial: Census Geography
Python Tutorial: Census Geography
DataCamp
48 Python Tutorial: Using the Census API
Python Tutorial: Using the Census API
DataCamp
49 R Tutorial: A/B Testing in R
R Tutorial: A/B Testing in R
DataCamp
50 R Tutorial: Baseline Conversion Rates
R Tutorial: Baseline Conversion Rates
DataCamp
51 R Tutorial: Designing an Experiment - Power Analysis
R Tutorial: Designing an Experiment - Power Analysis
DataCamp
52 R Tutorial: Introduction to qualitative data
R Tutorial: Introduction to qualitative data
DataCamp
53 R Tutorial: Understanding your qualitative variables
R Tutorial: Understanding your qualitative variables
DataCamp
54 R Tutorial: Making Better Plots
R Tutorial: Making Better Plots
DataCamp
55 SQL Tutorial: OLTP and OLAP
SQL Tutorial: OLTP and OLAP
DataCamp
56 SQL Tutorial: Storing data
SQL Tutorial: Storing data
DataCamp
57 SQL Tutorial: Database design
SQL Tutorial: Database design
DataCamp
58 Python Tutorial: Introduction to spaCy
Python Tutorial: Introduction to spaCy
DataCamp
59 Python Tutorial: Statistical Models
Python Tutorial: Statistical Models
DataCamp
60 Python Tutorial: Rule-based Matching
Python Tutorial: Rule-based Matching
DataCamp

This video provides an overview of the AI engineer's tech stack, covering the pros and cons of open-source and closed-source models, and how to choose the right tools for AI engineering. The speaker, Alex Kerr, shares his personal favorite models and discusses the limitations of each approach.

Key Takeaways
  1. Evaluate the trade-offs between open-source and closed-source models
  2. Choose the right tools for AI engineering
  3. Design agentic workflows
  4. Select appropriate tools for AI project development
  5. Consider the limitations of open-source models, including rate limits, costs, reliability, and speed and control
  6. Consider the limitations of closed-source models, including limited offerings and no ownership of models
💡 Open-source models are great for prototyping but have issues at scale, while closed-source models are throughput optimized but have limited offerings and no ownership of models.

Related Reads

📰
No AI Model Passes the Real-Time Teamwork Test: GPTNT Benchmark Results
No AI model passes the GPTNT benchmark test, which evaluates real-time teamwork and communication, highlighting the limitations of current AI models
Dev.to AI
📰
Anthropic Launches Claude Science: AI Becomes a Scientific Instrument
Anthropic's Claude Science uses AI to analyze complex scientific problems, like protein folding, in real-time, revolutionizing research efficiency
Dev.to AI
📰
I built a drop-in AI chatbot widget for React that works with any provider — here's why
Learn how to create a drop-in AI chatbot widget for React that works with any provider, avoiding vendor lock-in and repetitive development
Dev.to AI
📰
AI Agents Are Now Handling Java Migrations - Here's What That Means for You
AI agents can now handle Java migrations, reducing costs and time, and it's crucial to understand the implications for software development and maintenance
Dev.to AI
Up next
Building Great Agent Skills: The Missing Manual
AI Engineer
Watch →