The agent framework built for real work.
Secure by design. Efficient by architecture. Precise in execution. Delfhos is the AI agent framework you can actually trust in production.
Works with the tools you already use
// efficient. Reproducible benchmarks.
Efficient by design. Lower cost, lower latency, less noise.
Token costs spiral out of control
Verbose frameworks often include excessive metadata and redundant internal loops, causing context windows to fill rapidly and costs to balloon.
Delfhos: 2.0x Fewer Tokens
Efficient Code-Act architecture.
Latency that users actually feel
Complex chain abstractions create significant overhead. Users often face long "thinking" pauses that break the flow of real-time applications.
Delfhos: 2.2x Faster Execution
Streamlined task orchestration.
Opaque execution, debugging unknowns
Black-box abstractions in ReAct frameworks hinder traceability and exception handling. Code-act frameworks like Delfhos reduce potential errors and improve auditability.
Delfhos: Full Code-Level Auditability
Native Python traceability and robust management.
| System Audit | Delfhos | LangChain |
|---|---|---|
| Trace Depth | Full | Limited |
| Success Rate | 100% | 100% |
// precise. Less code, more control.
// precise. One reasoning step. Real results.
Why Code-Act is the precise way to run agents
Real-world task: "Find the 5 biggest open invoices in the database and add them to a new Google Sheet."
Traditional Agent Loop (ReAct)
4-6 LLM Calls"I need to query the database. First I must search for an SQL tool and learn how to use it."
High friction: Frequent context-switching between user intent and internal discovery logic.
"Fetching results... OK, I have 5 rows. Now I must look for a Google Sheets tool to proceed."
High friction: Forcing the model to pause and parse raw outputs back into text summaries repeatedly.
"Preparing the sheet... I hope the row data formatting from two steps ago is still in context."
Risk: Accumulated prompt history leads to 'hallucinations' or loss of specific data points.
Sequential reasoning forces the model to pause, parse text output, and re-invoke the LLM for every small logical step, growing the context exponentially.
Delfhos Code-Act Engine
1 LLM Call// secure. Production-safe by default.
Secure by design.
Not bolted on.
Most frameworks hand your agent a full set of keys and hope for the best. Delfhos forces you to declare exactly which actions each connection is allowed to take — and which require human approval before they run.
Three ways to handle human approval
Set confirm=True on any connection. Choose the mode that fits your workflow.
Interactive
Agent pauses and prompts you in the terminal before any flagged action. Zero setup — works out of the box.
Callback
Pass an async function. Route approvals to Slack, email, a web UI — any system you control.
Programmatic
Auto-approve or deny based on your own logic. Write tests that verify what your agent asks for before it acts.
// secure · efficient · precise — everything you need to ship.
Per-Action Allow Lists
Restrict any connection to only the actions you permit. allow=["read"] means the agent literally cannot send emails, no matter what it's told.
Human-in-the-Loop
Set confirm=True and every flagged action pauses for your approval — interactively, via callback, or programmatically in tests.
Code-Act Engine
The LLM generates a Python script, not a list of function calls. One reasoning pass handles multi-step tasks — no context ballooning across tool calls.
Built-in Tool Suite
Gmail, SQL, Google Sheets, Drive, Docs, Calendar, WebSearch, and MCP servers — all production-ready, no glue code required.
Auto Retry on Failure
When generated code fails, Delfhos feeds the traceback back to the LLM and retries automatically. No silent failures.
Any LLM, Any Provider
OpenAI, Anthropic, Google, xAI, Ollama — swap models per role. Use a lightweight model for routing and a powerful one for code generation.
From zero to a production agent in under a minute.
Install the library
Connect your tools
Connect Gmail, SQL, Drive, Sheets, or any custom tool with a connection object. Set allow lists and confirm gates to control exactly what your agent can do.
Run and monitor
Delfhos handles the observation loop, error correction, and tracing. Just call agent.run(task).