Skip to content

Commit fb24df4

Browse files
authored
Merge pull request #895 from YoungPhlo/docs/community-streams
docs: Add What Did You Get Done This Week #4 summaries and timestamps
2 parents 216e312 + 83dffb4 commit fb24df4

File tree

3 files changed

+267
-80
lines changed

3 files changed

+267
-80
lines changed

docs/community/Streams/12-2024/2024-12-03.md

+6
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,9 @@
1+
---
2+
sidebar_position: 1
3+
title: "AI Agent Dev School Part 2"
4+
description: "Building Complex AI Agents with Actions, Providers, & Evaluators"
5+
---
6+
17
# AI Agent Dev School Part 2
28

39
**Building Complex AI Agents with Actions, Providers, & Evaluators**
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,104 @@
1+
---
2+
sidebar_position: 2
3+
title: "AI Agent Dev School Part 3"
4+
description: "Form-Filling Frenzy & Eliza's Wild Ride"
5+
---
6+
7+
# AI Agent Dev School Part 3
8+
9+
**Form-Filling Frenzy & Eliza's Wild Ride**
10+
11+
Date: 2024-12-05
12+
YouTube Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU
13+
14+
## Timestamps
15+
16+
**00:00:00** - Intro & Housekeeping:
17+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=0
18+
- Recap of previous sessions (Typescript, plugins, actions)
19+
- Importance of staying on the latest Eliza branch
20+
- How to pull latest changes and stash local modifications
21+
22+
**00:08:05** - Building a Form-Filling Agent:
23+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=485
24+
- Introduction to Providers & Evaluators
25+
- Practical use case: Extracting user data (name, location, job)
26+
- Steps for a provider-evaluator loop to gather info and trigger actions
27+
28+
**00:16:15** - Deep Dive into Evaluators:
29+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=975
30+
- Understanding "Evaluator" in Eliza's context
31+
- When they run, their role in agent's self-reflection
32+
33+
**00:27:45** - Code walkthrough of the "Fact Evaluator":
34+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=1675
35+
- Code walkthrough of the "Fact Evaluator"
36+
37+
**00:36:07** - Building a User Data Evaluator:
38+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=2167
39+
- Starting from scratch, creating a basic evaluator
40+
- Registering the evaluator directly in the agent (no plugin)
41+
- Logging evaluator activity and inspecting context
42+
43+
**00:51:50** - Exploring Eliza's Cache Manager:
44+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=3110
45+
- Shaw uses Code2Prompt to analyze cache manager code
46+
- Applying cache manager principles to user data storage
47+
48+
**01:06:01** - Using Claude AI for Code Generation:
49+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=3961
50+
- Pasting code into Claude and giving instructions
51+
- Iterative process: Refining code and providing feedback to Claude
52+
53+
**01:21:18** - Testing the User Data Flow:
54+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=4878
55+
- Running the agent and interacting with it
56+
- Observing evaluator logs and context injections
57+
- Troubleshooting and iterating on code based on agent behavior
58+
59+
**01:30:27** - Adding a Dynamic Provider Based on Completion:
60+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=5427
61+
- Creating a new provider that only triggers after user data is collected
62+
- Example: Providing a secret code or access link as a reward
63+
64+
**01:37:16** - Q&A with the Audience:
65+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=5836
66+
- Python vs. TypeScript agents
67+
- Pre-evaluation vs. post-evaluation hooks
68+
- Agent overwhelm with many plugins/evaluators
69+
- Agentic app use cases beyond chat
70+
- Running stateless agents
71+
- Building AIXBT agents
72+
73+
**01:47:31** - Outro and Next Steps:
74+
- Link: https://www.youtube.com/watch?v=Y1DiqSVy4aU&t=6451
75+
- Recap of key learnings and the potential of provider-evaluator loops
76+
- Call to action: Share project ideas and feedback for future sessions
77+
78+
## Summary
79+
80+
This is the third part of the live stream series "AI Agent Dev School" hosted by Shaw from ai16z, focusing on building AI agents using the Eliza framework.
81+
82+
**Key takeaways:**
83+
84+
* **Updating Eliza:** Shaw emphasizes staying up-to-date with the rapidly evolving Eliza project due to frequent bug fixes and new features. He provides instructions on pulling the latest changes from the main branch on GitHub.
85+
* **Focus on Providers and Evaluators:** The stream focuses on building a practical provider-evaluator loop to demonstrate a popular use case for AI agents – filling out a form by extracting user information.
86+
* **Form Builder Example:** Shaw walks the audience through building a "form provider" that gathers a user's name, location, and job. This provider utilizes a cache to store already extracted information and instructs the agent to prompt the user for any missing details.
87+
* **Evaluator Role:** The evaluator continually checks the cache for the completeness of user data. Once all information is extracted, the evaluator triggers an action to send the collected data to an external API (simulated in the example).
88+
* **Live Coding and AI Assistance:** Shaw live codes the example, using tools like "Code2Prompt" and Claude AI to help generate and refine the code. He advocates for writing code in a human-readable manner, utilizing comments to provide context and guidance for both developers and AI assistants.
89+
* **Agentic Applications:** Shaw highlights the potential of agentic applications to replicate existing website functionality through conversational interfaces, bringing services directly to users within their preferred social media platforms.
90+
* **Community Engagement:** Shaw encourages active participation from the community, suggesting contributions to the project through pull requests and feedback on desired features and patterns for future Dev School sessions.
91+
92+
**Overall, this live stream provided a practical tutorial on building a common AI agent use case (form filling) while emphasizing the potential of the Eliza framework for developing a wide range of agentic applications.**
93+
94+
## Hot Takes
95+
96+
1. **"I'm just going to struggle bus some code today." (00:09:31,664)** - Shaw embraces a "struggle bus" approach, showcasing live coding with errors and debugging, reflecting the reality of AI agent development. This contrasts with polished tutorials, highlighting the iterative and messy nature of this new technology.
97+
98+
2. **"I'm actually not gonna put this in a plugin. I'm gonna put this in the agent... just so you can see what happens if you were to, like, make your own agent without using a plugin at all." (00:37:24,793)** - Shaw goes against the Eliza framework's plugin structure, showing viewers how to bypass it entirely. This bold move emphasizes flexibility, but could spark debate on best practices and potential drawbacks.
99+
100+
3. **"I really don't remember conversations from people very well, like verbatim, but I definitely remember like the gist, the context, the really needy ideas." (00:24:48,180)** - Shaw draws a controversial parallel between human memory and the Eliza agent's fact extraction. Reducing human interaction to "needy ideas" is provocative, questioning the depth of social understanding AI agents currently possess.
101+
102+
4. **"It's just an LLM. It's just making those numbers up. It could be off. I don't really buy the confidence here." (01:13:56,971)** - Shaw dismisses the confidence scores generated by the Large Language Model (LLM), revealing a distrust of these black-box outputs. This skepticism is crucial in a field where relying solely on AI's self-assessment can be misleading.
103+
104+
5. **"Dude, that's a $250 million market cap token. Let's get that shit in Bubba Cat." (01:45:34,809)** - Shaw throws out a blunt, market-driven statement regarding the AIXBT token. Bringing finance directly into the technical discussion highlights the intertwined nature of AI development and potential financial incentives, a topic often tiptoed around.

0 commit comments

Comments
 (0)