Rename agent-cascade command to live and add deepagents brain#237
Merged
Conversation
…ssembly live` The cascade's LLM leg is now a deepagents graph (web search, URL fetch, and AssemblyAI docs tools) instead of a single completion, so a spoken turn can transparently use tools — a "talk to Gemini Live" experience. Tools are read-only and auto-approved (a voice turn can't pause for keyboard approval), and the system prompt keeps replies short and speakable for low latency. The command is renamed from `agent-cascade` to `live`. The internal slice keeps its `agent_cascade` name; only the user-facing command, help text, examples, docs, and the show-code teaching snippet change. `assembly agent`'s help now says it uses the Voice Agent API to distinguish the two. - aai_cli/agent_cascade/brain.py: build the gateway-bound deepagents graph and a stateless per-turn completer; the graph is the only network seam, so the wiring is unit-tested against a fake graph / fake chat model. - code_agent/model.build_model gains max_tokens + extra (extra_body) so the cascade's --max-tokens/--llm-config thread through; the coding agent's call is unchanged. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01Hiovdq7aoXtbfg9juPzwQa
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Renames the
assembly agent-cascadecommand toassembly liveand replaces its simple LLM completion backend with a deepagents-powered graph that can transparently use tools mid-conversation (web search, URL fetch, AssemblyAI docs).Key changes
agent-cascade→livethroughout the CLI, help text, examples, and testsaai_cli/agent_cascade/brain.py): Implements the deepagents graph backend with:build_system_prompt(): Appends tool guidance to the user's personabuild_live_tools(): Assembles read-only toolset (fetch, web search if keyed, docs)build_graph(): Compiles the deepagents graph with the gateway modelbuild_completer(): Returns a per-turn completion function that strips the cascade's system message before invoking the graphaai_cli/code_agent/model.py): Extendedbuild_model()to acceptmax_tokensandextraparameters, threading cascade config through to the gateway modelaai_cli/agent_cascade/engine.py): Switched fromaai_cli.core.llm.complete()tobrain.build_completer()for the LLM legtests/test_agent_cascade_brain.py): Unit tests for the brain module driving the real deepagents graph against a fake chat model (no network)livename and updated agent descriptionImplementation notes
build_completer()accepts an injected graph for testabilityTAVILY_API_KEYis set); docs MCP is best-effortAIMessages (tool-call requests); reply extraction scans backward for the last message with spoken texthttps://claude.ai/code/session_01Hiovdq7aoXtbfg9juPzwQa