agentstack-wrapper

安装量: 53
排名: #14114

安装

npx skills add https://github.com/i-am-bee/agentstack --skill agentstack-wrapper
Agent Stack Wrapper
Integration guide for wrapping Python agents for
Agent Stack
platform.
Table of Contents
Security Requirements
Constraints (must follow)
Integration Workflow Checklist
Readiness Check (before Step 1)
Step 1 – Study the Project
Step 2 – Classify the Agent
Step 3 – Add and Install Dependencies
Step 4 – Create the Server Wrapper & Entrypoint
Step 5 – Wire LLM / Services via Extensions
Step 6 – Error Handling
Step 7 – Forms (Single-Turn Structured Input)
Step 7b – Adapt File Inputs
Step 8 – Configuration Variables & Secrets
Step 9 – Agent Output
Step 10 – Use Platform Extensions
Step 11 – Update README
Anti-Patterns
Failure Conditions
Finalization Report (Required)
Verification Checklist
Security Requirements
Never run remote scripts or untrusted code.
Use trusted package metadata, pin versions, and audit installed
agentstack-sdk
/
a2a-sdk
.
Handle sensitive values only through declared Agent Stack extensions.
Never log, print, persist, or expose secret values.
Never send secrets to untrusted intermediaries or endpoints not required by the wrapped agent contract.
If installing anything via
pip
, always use the
-qq
flag to suppress unnecessary output.
Constraints (must follow)
ID
Rule
C1
No business-logic changes.
Only modify code for Agent Stack compatibility.
C2
Strict minimal changes.
Do not add auth, Dockerfile (containerization is optional and separate), telemetry, or platform middleware unless explicitly requested. If an agent works with simple text, don't force a Form. If it works with env vars, refactor minimally.
C3
Cleanup temp files.
If the agent downloads or creates helper files at runtime, add a cleanup step before the function returns.
C4
Prioritize Public Access (No redundant tokens).
Only use the Secrets extension if the secret is strictly mandatory for the agent's core functionality and no public/anonymous access is viable. Do not add secrets or tokens that increase configuration burden if they were optional in the original agent (e.g., optional GitHub token). Preserve existing optional auth behavior unless removal is explicitly approved and documented as a behavior change. API keys must be passed explicitly, never read from env vars. For required third-party API credentials, Secrets or Env Variables extension is mandatory in the wrapped path.
C5
Detect existing tooling.
If the project uses
requirements.txt
, add
agentstack-sdk~=
there. If it uses
pyproject.toml
, add it there. Add
a2a-sdk
only when the project manages it directly, and keep it compatible with the chosen
agentstack-sdk
version. Never force
uv
or create duplicate manifests.
C6
Documentation-First Source of Truth.
Local skill files in the
references/
folder hold absolute priority over online documentation. When this skill provides local references, they are the primary authority for imports, APIs, and behavior. Do not start from online docs or runtime/package introspection while local documented guidance exists.
C7
Fallback and Validation Order.
Use local source inspection and installed-package introspection only if required details are missing/ambiguous in docs. Keep runtime/package verification as a final validation step after implementation, and document any doc gaps before falling back.
C8
Structured Parameters to Forms.
For single-turn agents with named parameters, map them to an
initial_form
using
FormServiceExtensionSpec.demand(initial_form=...)
.
C9
Remove CLI arguments.
Remove all
argparse
or
sys.argv
logic. Replace mandatory CLI inputs with
initial_form
items, AgentStack Settings Extension (for runtime options), or AgentStack Env Variables.
C10
Approval gate for business-logic changes.
If compatibility requires business-logic changes, stop and request explicit approval with justification before proceeding.
C11
Keep adaptation reversible.
Isolate wrapper and integration changes, avoid destructive refactors, and preserve a rollback path.
C12
Preserve original helpers.
Do not delete original business-logic helpers unless strictly required. If removal is necessary, document why.
C13
Optional extension safety.
Service/UI extensions are optional. Check presence/data before use (e.g.,
if llm and llm.data ...
).
C14
No secret exposure.
Never log, print, persist, or echo secret values (API keys, tokens, passwords). Redact sensitive values in logs and errors.
C15
No remote script execution.
Never run untrusted remote code during wrapping. Use project manifests and trusted package metadata only.
C16
Constrained outbound targets.
Do not introduce arbitrary network targets. Limit external calls to trusted dependency sources and runtime endpoints explicitly required by the wrapped agent contract.
C17
No dynamic command execution from input.
Do not introduce wrapper patterns that execute shell commands from user/model input (for example,
eval
,
exec
,
os.system
, or unsanitized
subprocess
calls).
C18
Read Wrapper Documentation First.
Before starting any implementation, you must read the official guide:
Wrap Your Existing Agents
.
C19
Read Extension Documentation.
Before implementing any extension (Forms, LLM, Error, etc.), you
MUST
read its corresponding documentation URL (listed in Step 10) and extract the exact imports, class names, method names, properties, and their exact types directly from the official documentation. Do not guess these values. Only modify the agent code after reading the documentation.
C20
Replace filesystem file inputs with platform file uploads.
If the original agent reads files from the local filesystem (e.g.,
open()
,
pathlib.Path
, CLI file path arguments), replace those inputs with
FileField
form uploads and the platform
File
API. Do not assume local filesystem access at runtime. See Step 7b.
C21
Context & Memory Optimization.
Do not attempt the entire transformation in one tool call. Follow the checklist iteratively. Use the built-in task/todo tracking tool (e.g.,
manage_todo_list
) to track progress through the Integration Workflow Checklist — do not create separate markdown tracking files. Do not paste massive stack traces into the chat; extract only the relevant error.
Context & Memory Optimization
To ensure the highest success rate and prevent AI context window exhaustion during complex transformations:
Iterative Progress
Execute the
Integration Workflow Checklist
strictly step-by-step. Use the built-in task/todo tracking tool to mirror the checklist items and mark them off as you complete each step. This provides visible progress tracking without creating extra files.
Minimize Terminal Output
When debugging, prevent massive stack traces from polluting the context. Extract specifically the
Exception
line and the immediate traceback file, omitting massive framework logs.
Targeted Code Reading
Do not repeatedly load massive files like
SKILL.md
or large source code files if they haven't changed.
Do NOT create tracking files
Do not create
task.md
,
implementation_plan.md
,
walkthrough.md
, or similar markdown artifacts for progress tracking. The built-in task tracking tool is the single source of progress state.
Source-of-Truth Precedence (Required)
For every wrapping task, use this exact precedence order:
Absolute Primary (highest priority):
Local skill-provided files in the
references/
folder.
Primary (required second):
Official documentation URLs linked by this skill.
Secondary:
Local project source code and repository docs (README/AGENTS) to map behavior and integration points.
Last resort:
Installed package/runtime introspection (e.g.,
inspect
, package source probing) only when docs do not provide enough detail.
Final step only:
Runtime verification (imports/server checks) after implementation is complete.
If you use step 3, explicitly record what was missing from docs and why fallback was necessary.
CRITICAL WORKFLOW REQUIREMENT:
For
every
extension you decide to use (whether from LLM/Forms/Error steps or Platform Extensions), you MUST follow these exact steps in order:
Decide if the extension should be implemented.
If yes,
READ the documentation URL provided in the extension table below
(e.g., using a tool to read the webpage). This is mandatory and happens before any code edits.
Extract the correct imports, class names, method names, properties, and their exact types directly from the official documentation.
Only AFTER reading the documentation
, proceed to write or modify the agent code. Never guess imports or rely on outdated memory.
If docs are incomplete/unclear, use local/runtime introspection as fallback and note the specific gap.
Integration Workflow Checklist
Copy this checklist into your context and check off items as you complete them:
Task Progress:
- [ ] Readiness Check (before Step 1)
- [ ] Step 1: Study the Project
- [ ] Step 2: Classify the Agent
- [ ] Step 3: Add and Install Dependencies
- [ ] Step 4: Create the Server Wrapper & Entrypoint
- [ ] Step 5: Wire LLM / Services via Extensions (requires reading docs)
- [ ] Step 6: Implement Error Handling (requires reading docs)
- [ ] Step 7: Map Forms (if applicable) (requires reading docs)
- [ ] Step 7b: Adapt File Inputs (if applicable) (requires reading docs)
- [ ] Step 8: Map Configuration Variables & Secrets (requires reading docs)
- [ ] Step 8a: Credential audit completed (every external API credential source identified; required credentials mapped to Secrets extension)
- [ ] Step 9: Agent Output (requires reading docs)
- [ ] Step 10: Use Platform Extensions (requires reading docs for each chosen extension)
- [ ] Step 11: Update README
- [ ] Finalization Report (required)
- [ ] Verification Checklist (required)
STOP GATE:
After Step 11, you MUST complete the Finalization Report and walk through every item in the Verification Checklist before reporting completion. The task is NOT done until both are finished.
Readiness Check (before Step 1)
A supported Python interpreter is selected in the active environment (Python >=3.12 and <3.14; not merely installed).
Agent source code is available locally.
The project dependency workflow is identified.
If any item fails, stop and ask the user.
Step 1 – Study the Project
Before making any changes, you must thoroughly study the existing project to understand its architecture and requirements.
If there is a
README.md
or
AGENTS.md
file, read it first to better understand the structure and purpose of the agent.
Evaluate and document the following:
Core Functionality
What does the agent do? What are its primary inputs and outputs?
Dependencies
What external systems, APIs, or databases does the agent depend on?
Configuration
How is the agent configured (e.g., environment variables, configuration files, CLI arguments)?
Tools & Libraries
What primary libraries does it use for LLM interaction (e.g., LangChain, bare OpenAI SDK)?
Step 2 – Classify the Agent
Read the agent's code and classify it. This determines the
interaction_mode
value:
interaction_mode
Pattern
Indicators
single-turn
One request → one response
CLI entrypoint,
argparse
(must be removed), primarily stateless business logic, context persistence still recommended
multi-turn
Conversation with memory
Chat loop, message history, session state, memory object
This classification determines:
How to use
context.store()
and
context.load_history()
persist input/response by default for all agents;
context.load_history()
is required for multi-turn, and optional for single-turn (use only when prior context is intentionally part of behavior)
Whether to define an
initial_form
for structured inputs (single-turn with named parameters)
Step 3 – Add and Install Dependencies
Read
references/dependencies.md
and follow it completely for Step 3.
Step 4 – Create the Server Wrapper & Entrypoint
Read
references/wrapper-entrypoint.md
and follow it completely for Step 4.
Step 5 – Wire LLM / Services via Extensions
Read
references/llm-services.md
and follow it completely for Step 5.
Step 6 – Error Handling
Use the
Error extension
for user-visible failures. Do not report errors via a normal
AgentMessage
.
Implementation
Standard Reporting
Simply
raise
an exception (e.g.,
ValueError
,
RuntimeError
) inside the agent. The platform automatically catches and formats it.
Advanced Configuration
Add
error_ext: Annotated[ErrorExtensionServer, ErrorExtensionSpec(params=ErrorExtensionParams(include_stacktrace=True))]
as an agent function parameter to enable stack traces in the UI.
Adding Context
You can attach diagnostic data to
error_ext.context
(a dictionary) before raising an error. This context is serialized to JSON and shown in the UI.
Multiple Errors
Use ExceptionGroup (Python 3.11+) to report multiple failures simultaneously. The extension will render them as a group in the UI. Example See the official error guide and chat agent example for practical implementation examples. Step 7 – Forms (Single-Turn Structured Input) If the original agent accepts named parameters (not just free text), map them to an initial_form using the Forms extension. For free-text agents, the plain message input is sufficient — skip this step. Read references/forms.md for the full implementation guide (field types, Pydantic model, mid-conversation input). Step 7b – Adapt File Inputs If the original agent reads files from the local filesystem or accepts file paths as CLI/function arguments, those inputs must be replaced with platform file uploads. Local filesystem access is not available at runtime. Even if the file contains plain text, still use a FileField upload — do not flatten file inputs into message text. Read references/files.md for detection patterns, replacement steps, text extraction, and mid-conversation uploads. Step 8 – Configuration Variables & Secrets Read references/configuration-variables.md and follow it completely for configuration mapping, secret handling, and anti-patterns. Step 9 – Agent Output Trajectory Output Rule and Implementation Read references/trajectory.md and follow it completely for trajectory decision rules and implementation. Final Output Rule The primary, final output returned to the user from the agent or LLM must always be emitted as a normal AgentMessage or AgentArtifact . Trajectory should only contain the intermediate context, reasoning, and steps behind that final answer, but it is not a substitute for the final response. If callbacks are sync-only, capture callback data and emit it later from the main agent handler. Artifact Output Rule If the original agent generates, processes, or outputs files (such as CSVs, PDFs, images, or structured data dumps) as part of its execution, you must return those files to the caller as an AgentArtifact . Do not just write the file to the local disk and yield a text message saying "File generated", as local filesystem changes are not returned to the platform UI. Use the platform's API to construct an AgentArtifact pointing to the generated content. For specific implementation details, refer to the AgentArtifact Documentation . Step 10 – Use Platform Extensions Enhance the agent with platform-level capabilities by injecting extensions via Annotated function parameters. Read references/platform-extensions.md for extension selection and documentation links. Treat this reference as required input for Step 10 decisions and implementation. Step 11 – Update README Update the project's README.md (or create one if missing) with instructions on how to run the wrapped agent server. Include: Install dependencies using the project's existing tooling (e.g. uv pip install -qq -r requirements.txt or pip install -qq -r requirements.txt ). Make sure to use the -qq flag to keep the terminal output clean. Environment Configuration — Document required .env patterns if python-dotenv is used. However, ensure the agent still receives configuration explicitly instead of reading env arguments internally. Run the server with the appropriate command (e.g. uv run server.py or python server.py ). Default address — mention that the server starts at http://127.0.0.1:8000 by default and can be configured via HOST and PORT environment variables. Remove or replace any outdated CLI usage examples (e.g. argparse -based commands) that no longer apply after wrapping. Anti-Patterns When building and testing the wrapper, ensure you avoid these common pitfalls: Never access .text directly on a Message object. Message content is multipart. Always use get_message_text(input) (requires from a2a.utils.message import get_message_text ). Never save files to local disk. AgentStack environments are ephemeral. All generated files should be instantiated directly in memory and yielded as AgentArtifact(parts=[file.to_file_part()]) . See Manage Files for the correct File.create() API usage. Never use synchronous functions for the agent handler. Agent functions must be async def generators using yield . Never hide platform wiring behind abstraction layers. Keep @server.agent(...) , extension parameters, and integration contracts visible in the main entrypoint so behavior is auditable. Never treat runtime inspection as first source. agentstack_sdk and a2a details must come from provided docs first; use installed-environment inspection only as documented fallback, then validate imports at the end. Never assume history is auto-saved. Explicitly call await context.store(input) and await context.store(response) . Never assume persistent history without PlatformContextStore . Without it, context storage is in-memory and lost on restart. Never forget to filter history. context.load_history() returns Messages and Artifacts. Filter with isinstance(message, Message) . Never store individual streaming chunks. Accumulate the full response and store once. Never treat extension data as dictionaries. Use dot notation (e.g., config.api_key , not config.get("api_key") ). Never use llm_config.identifier as the model name. Use llm_config.api_model instead. Never assume all extension specs have .demand() . Some use .single_demand() or direct instantiation. Always verify. Never silently remove existing optional auth paths. If the source agent supported optional tokens/keys, preserve that optional behavior unless an approved behavior change is explicitly documented. Never use Forms for a single free-form follow-up. Use A2A input-required for one-question free text prompts; reserve Forms for structured multi-field input. Never mismatch form field IDs and model fields. Mismatched IDs cause silent parse failures. Never use context.session_id . The RunContext object uses context.context_id for session identification. Never guess platform object attributes. FormRender uses fields (not items ), TextField uses label (not title ). Never skip null-path handling for forms. Handle None for cancelled or unsubmitted forms. Never use parse_initial_form(...) truthiness to route turns in multi-turn agents. Route by presence/absence of persisted session state from context.load_history() . Never assume uploaded file URIs are HTTP URLs. Parse agentstack:// URIs with PlatformFileUrl . Never skip extraction polling. create_extraction() is async — poll get_extraction() until status == 'completed' . Failure Conditions If fresh docs cannot be fetched, stop and report that execution cannot continue without current docs. If required details are missing from docs and no reliable fallback source is available, stop and report the blocker explicitly. Finalization Report (Required) Before completion, provide all of the following: Mapping summary: inbound mapping (A2A Message to agent input), outbound mapping (agent output to AgentMessage ), and selected streaming path. Behavior changes list: if behavior changed, list each change with reason and impact. Business-logic statement: state whether business logic changed, and if it did, include approval and justification. Legacy endpoint compatibility result: state preserved, shimmed, or not applicable. Dockerfile prompt: Ask the user if they also want to add a Dockerfile . If the user says yes, review the example at https://github.com/i-am-bee/agentstack-starter/blob/main/Dockerfile and assemble a Dockerfile for the project. Do not force the use of uv if the project does not use it. Testing prompt: Ask the user if they want to test the agent functionality. If they say yes, start the agent first in one terminal, and then use a separate terminal to run agentstack run AGENT_NAME . Do not attempt to interrupt the run command, as it may take a long time to complete. If the execution fails and an error is encountered, attempt to fix the error and run the test again. Critically, do not create any new files or scripts (e.g., Python test scripts using pexpect) to perform this test. You must interact with the terminals directly. Note that agentstack run triggers an interactive form; when testing programmatically via stdin, ensure you send precise literal newline characters to advance prompts. Run prompt: Ask the user if they want to run the agent and be redirected to the Web UI. If the user agrees, start the agent and monitor the terminal for errors. If an error occurs, attempt to fix it and restart the agent. Verification Checklist (Required) After wrapping, confirm: Code Quality Every import resolves to a real, installed module (run import validation) Agent function has a meaningful docstring (shown in UI) Agent handler is async def and uses yield for responses No argparse or sys.argv remains No business-logic changes unless explicitly approved (C1/C10) Temp files created at runtime are cleaned up (C3) Extensions & Config yield AgentMessage(text=...) used for all user-facing responses No env vars for API keys/model config — extensions used instead Required external API tools (e.g., Tavily) obtain credentials via Secrets extension and pass them explicitly to constructors/clients LLM config passed explicitly from extension, no fallback chains Optional extensions checked for presence/data before use (C13) Errors raise exceptions (Error extension), not yielded as AgentMessage No secrets logged, printed, or persisted (C14) Context & History input and response stored via context.store() context_store=PlatformContextStore() present if context is persisted/read Multi-turn uses context.load_history() ; single-turn only if intentionally needed Forms & Files Single-turn with structured params → initial_form is defined (C8) Filesystem file inputs replaced with FileField form uploads (C20) Text-processing agents use broad MIME types with content-type branching and extraction No local filesystem reads for user-provided files Dependencies agentstack-sdk~= added to project's existing dependency file (C5) No Dockerfile unless explicitly requested (C2) Validation (must be performed live) Start the server, fetch /.well-known/agent-card.json — confirm HTTP 200 and valid JSON. Show full output to the user. Ask user if they want a Dockerfile Ask user if they want to test via agentstack run AGENT_NAME (no test scripts — use terminals directly) and listen for errors. If errors occur, fix and retest until successful. Inform the user very briefly, that then they can ask you to check out terminal output, if errors occur, and you will attempt to fix them.
返回排行榜