CodeAct and Manus
Key Takeaways
The video discusses CodeAct, a framework that powers AI agent startups like Manus, and its alternative approach to function calling, where the LLM generates and executes code to solve problems.
Full Transcript
Let's talk about Kodak, the framework that powers AI agent startups like Manis. Manis recently went a bit viral when it came out with an AI agent that can do a bunch of things for you by essentially interacting with your computer like a human. And then they got a $75 million investment from Benchmark at a $500 million valuation, but it's now being reviewed by the US government because Manis is apparently a Chinese company with interesting legal structures like being incorporated in the Cayman Islands and having offices in a bunch of different countries. Anyway, Kodak is interesting because it offers an alternative to function calling, which you're probably already familiar with. But as a quick refresher, function calling basically involves telling the LLM about potential tools, and then the LLM will generate structured commands, often in JSON, to invoke those predefined tools. Importantly, the LM doesn't actually do any of the execution of the functions. It just gives you what you should use to call that particular tool. With Kodak, you actually have the LLM just write code, execute it, and then work through problems in multiple turns like an AI agent. This agent can approach problems in a more flexible way and can also adjust things as it goes. The paper found that by using Kodact rather than more traditional methods like function calling, the authors were able to achieve better results on benchmarks. Now, the paper is almost a year old at this point, but it's an interesting alternative to function calling. I'm Diana and I break down texts and AI without the hype. Follow for more.
Original Description
CodeAct, one of the frameworks / AI Agent implementations that powers startups like Manus, offers an interesting alternative to function calling.
Rather than having the LLM output structured outputs used to call a function, CodeAct proposes that you should let the LLM actually code, act, and revise based on what it observes. On benchmarks, they're able to achieve better performance using this approach.
Compared to function calling, CodeAct allows for more flexible solutions.
----
Hi, Iโm Diana ๐ Iโm a 4x first enterprise software PM, co-founded my own startup, and worked in venture capital! I double click into important trends in tech, startups, and AI without the hype. Follow for more!
----
#codeact #llm #aiagents #manus
Watch on YouTube โ
(saves to browser)
Sign in to unlock AI tutor explanation ยท โก30
More on: Prompt Craft
View skill โRelated AI Lessons
โก
โก
โก
โก
5 prompt engineering techniques to get the best out of a legacy project
Dev.to ยท Marco Coelho
The Real Reason Prompt Engineering Isn't Going Away
Dev.to AI
Common Prompt Engineering Mistakes and How to Avoid Them
Medium ยท ChatGPT
Day 5: Prompt Engineering Basics (For DevOps & Cloud Engineers)
Medium ยท AI
๐
Tutor Explanation
DeepCamp AI