Agents and the Genesis Mission
Background and technology recommendations
These slides provide some background and initial recommendations concerning appropriate agentic technologies for use in Genesis Mission projects.
They are very much a work in progress.
Comments and questions to foster@anl.gov
Why agents?
The Genesis Mission seeks to create autonomous, high-throughput, continuously operating scientific systems that integrate computation, experiment, data, and models across the national research infrastructure
Agents have promise as a source of the operational intelligence needed to coordinate, adapt, and accelerate discovery at scale, by:
Questions that we address here
These slides will be updated; check back soon
Agents background
What is an “agent”?
An agent is a persistent, stateful process that acts on behalf of a user or system. An agent may:
We can think of an agent as a scientific assistant that can reason, act, and coordinate on our behalf
query
thought
answer
tools
action
observation
finish
Two common patterns: Workflows & agent networks
A
Manager
B
C
A
B
C
Workflows / chains
Agent Networks
Other distinctions
In summary
Scientific agents span a spectrum:
No single technology covers all cases. Choose based on autonomy, structure, scale, and environment.
Agent frameworks
There are many agent frameworks
LangGraph
Microsoft Agent Framework (prev. AutoGen)
Academy (UChicago)
CrewAI
Goose (Linux Foundation)
Google agent SDK
SmolAgents (Hugging Face)
LlamaIndex
…
We recommend: see below
Agent frameworks simplify the creation of multi-step (LLM) workflows
They do so by managing prompts, context, state, memory, and tool invocation to support reliable reasoning and action
Different frameworks focus on different types of activities
None apart from Academy address explicitly the unique needs of DOE science (e.g., federation, HPC)
There are benefits to focusing on a few frameworks in GM
Our recommendations (a work in progress!)
These roles are complementary rather than competitive. For example, teams may use LangGraph to manage execution of federated Academy agents.
LangGraph: Structured LLM Workflows
(and predecessor LangChain)
Microsoft AutoGen: Conversational Multi-Agent Systems
(also known as Microsoft Agent Framework)
Academy: Persistent Scientific Agents at Scale
Our initial thoughts concerning tech applicability
Your Need | Best Fit |
Deterministic pipelines | LangGraph or workflow tools |
Multi-step reasoning with LLM | LangGraph |
Conversational multi-agent behavior | AutoGen |
Persistent agents that call each other | AutoGen or Academy |
Distributed, secure, long-lived agents | Academy |
Autonomous labs and instrument control | Academy |
In brief: LangGraph = controlled reasoning and workflows
AutoGen = conversational agent interactions
Academy = scientific autonomy at scale
We are developing example code to show how to combine frameworks: e.g., LangGraph for agent logic and Academy for scaling
Agent technologies compared
Feature | LangGraph | Microsoft AutoGen | Academy |
Primary Purpose | Structured, deterministic LLM-driven workflows | Conversational multi-agent coordination | Persistent, distributed scientific agents across CI (HPC, labs, data) |
Core Abstraction | Graph of steps (nodes) with optional state | Agents exchanging messages | Registered, long-lived agents with capabilities and identities |
Decision Type | Internal decisions (choose tools, branches) | External decisions (which agent to talk to) | Both: internal tool use + external delegation across systems |
Execution Model | Reproducible, stepwise reasoning | Dialogue-driven collaboration | Persistent, event-driven, multi-institution execution |
Best At | Multi-step LLM planning; tool graphs; controllable workflows | Rapid multi-agent prototyping; committee-of-experts patterns | Instrument/HPC orchestration; autonomous labs; scientific mission agents |
State Handling | Local per-run state | Conversational history per interaction | Durable, cross-session state; long-lived agent processes |
Environment Scope | Single runtime or workflow | Single machine or cloud app | Federated cyberinfrastructure: HPC, instruments, storage, authentication |
Typical Use Cases | Tool-chains, retrieval + reasoning, structured pipelines | Brainstorming agents, experts debating, social agents | Simulation–experiment loops, monitoring agents, lab automation, autonomy |
AmSC and ModCon work to support agent use in GM
AmSC and ModCon work to support agent use in GM
Very much a work in progress! But:
Please tell us of other relevant efforts and missing needs!