Generative Agents
11.06.23
Setup - Reality show
Scientific Peer Reviewer
Moreover, emoticons are used to show the state of an agent
2
Observations
Scientific Peer Reviewer
The limit for what an agent can observe is based on a visibility threshold encoded in the sandbox environment
Observations include actions that
3
Retrieval
Scientific Peer Reviewer
4
Reflections
Scientific Peer Reviewer
[Reflection Tag] Then the agent answers these questions using retrieval on their memory
5
Planning - coherent actions
Scientific Peer Reviewer
6
Movement
Scientific Peer Reviewer
7
Metrics - Believability
Scientific Peer Reviewer
8
Emergent Behaviors
Scientific Peer Reviewer
Gauge emergent behaviors such as information diffusion: network density
9
Strengths
Scientific Peer Reviewer
10
Weaknesses (by authors)
Scientific Peer Reviewer
11
Weaknesses (by me)
Scientific Peer Reviewer
12
Rating Time!
Scientific Peer Reviewer
Strength
Threat
Weakness
Opportunity
13
Rating Time!
Scientific Peer Reviewer
Strength
Threat
Weakness
Opportunity
Final Rating: 2/5
14
Historian
Interactive Agents
15
Historian
Interactive Agents
16
Historian
Interactive Agents
Building believable proxies of human behavior
17
Historian
What are Believable Agents?
18
Historian
Chess
19
Historian
DOTA 2
20
Historian
DOTA 2
21
Historian
Simulated Worlds
22
Historian
Simulated Worlds
Breakup ← relationship(dating,x,y)
23
Historian
Real World
24
Historian
Real World
25
Historian
Takeaways
How can we leverage LLMs to create believable agents in different worlds?
26
Historian
References
27
Agents
Data Analyst
28
Agent Memories (Per simulation day)
Data Analyst
29
Agent Observations (Per simulation day)
Event
Reflection
Data Analyst
30
Agent Observations (Per simulation day)
Chat
Data Analyst
31
Agent Movement (Per simulation timestep)
Data Analyst
32
Initial Memory
Data Analyst
33
Environment
Data Analyst
34
Agent Spatial Memories (Per simulation day)
Data Analyst
35
Evaluation
Data Analyst
36
Controlled Evaluation
Data Analyst
37
Controlled Evaluation
Data Analyst
38
Elo Rank Rating
K-factor: Constant for maximum change, Score SA: Actual score {1, 0.5, 0}, EA: Expected score (Probability of A winning)
Data Analyst
RA = 𝛍A�𝝈B = 𝝈A
39
Elo Rank Rating
Data Analyst
40
TrueSkill Rank Rating
Data Analyst
41
TrueSkill Rank Rating - Simple 1v1 case
Data Analyst
42
TrueSkill Rank Rating - Simple 1v1 case
Data Analyst
43
TrueSkill Rank Rating
Data Analyst
44
TrueSkill Rank Rating - Update Rules
Approximate Message Passing
Factor Graphs
Data Analyst
Proprietary details
45
TrueSkill Rank Rating - Simple 1v1 case
β2 : Variance of Performance around skill of each player�ε: Draw margin
Data Analyst
Expected Win
Expected Draw
46
Controlled Evaluation
Data Analyst
Scores
47
Controlled Evaluation
Data Analyst
𝜇 = 29.89; 𝜎 = 0.72
𝜇 = 26.88; 𝜎 = 0.69
𝜇 = 25.64; 𝜎 = 0.68
𝜇 = 22.95; 𝜎 = 0.69
𝜇 = 21.21; 𝜎 = 0.70
48
End-to-End Evaluation: Emergent Social Behaviours
Network Density (Relationships) | Information Diffusion (% population) | Network Density (Relationships) | Information Diffusion �(% population) |
0.164 (52 connections) | Sam’s Mayoral Candidacy : 1 (4%)��Isabella’s party : 1 (4%)�� | 0.74 (222 connections) | Sam’s Mayoral Candidacy : 8 (32%)��Isabella’s party : 13 (52%) |
Initial State | End State |
Hacker
50
Hacker
51
Main Modifications:
Code:
https://github.com/user074/generative_agents
Hacker
52
Generative Agents: Interactive Simulacra of Human Behavior
Academic Researcher
53
Generative Agents: Interactive Simulacra of Human Behavior
Academic Researcher
54
Generative Agents for Human-Robot Interaction Simulations
Academic Researcher
GOAL: Understanding human-robot interactions to ensure robots are in-tune with human emotions, enhance user satisfaction, and integrate into society as trusted collaborators.
55
Content
Academic Researcher
56
Components
Academic Researcher
57
Generative Robotic Agent
Academic Researcher
Each agent’s identity, including occupation and human-robot interactions, is described in a one-paragraph natural language seed memory.
58
Agent Behavior and Interaction
Academic Researcher
59
Simulation Environment
Academic Researcher
60
Content
Academic Researcher
61
Learning Agent Modeling
Academic Researcher
CHALLENGES:
62
Learning Agent Modeling
Academic Researcher
CHALLENGES:
IDEAS:
63
Learning Agent Control
Academic Researcher
CHALLENGES:
64
Learning Agent Control
Academic Researcher
CHALLENGES:
IDEAS:
65
Learning Agent Interaction
Academic Researcher
CHALLENGES:
66
Learning Agent Interaction
Academic Researcher
CHALLENGES:
IDEAS:
67
Conclusion
Academic Researcher
Integrating generative human and robotic agents allows:
68
Problems in Video Games
Industry Practitioner / Entrepreneur
Image from:
https://www.kotaku.com.au/2014/09/why-npcs-always-have-the-same-one-liners/
https://gaming.ebaumsworld.com/pictures/47-funny-memes-to-level-up-with/87081377/
Programmed conversation
Illogical reaction
Homogeneous characters
69
Why?
Industry Practitioner / Entrepreneur
Image by DALL·E 3
70
Our solution: Generative Agents!
Industry Practitioner / Entrepreneur
Image from RPG Maker
High-level setup/background
Description generation
Agent generation
71
Our product
Industry Practitioner / Entrepreneur
Image by DALL·E 3
72
Our vision
Industry Practitioner / Entrepreneur
Image by DALL·E 3
Our collaborators
73
Safety and security
Industry Practitioner / Entrepreneur
74