YaoApp/yao

Item: YaoApp/yao
Rating: 3
Author: RepoPilot

✨ Build AI agents and web apps — with a single binary.

Mixed

Solo project — review before adopting

ConcernsDependency

non-standard license (Other); single-maintainer (no co-maintainers visible)

HealthyFork & modify

Has a license, tests, and CI — clean foundation to fork and modify.

HealthyLearn from

Documented and popular — useful reference codebase to read through.

HealthyDeploy as-is

No critical CVEs, sane security posture — runnable as-is.

⚠Solo or near-solo (1 contributor active in recent commits)
⚠Non-standard license (Other) — review terms
✓Last commit 1d ago
✓Other licensed
✓CI configured
✓Tests present

What would improve this?

→Use as dependency Concerns → Mixed if: clarify license terms

Computed from maintenance signals — commit recency, contributor breadth, bus factor, license, CI, tests

Informational only. RepoPilot summarises public signals (license, dependency CVEs, commit recency, CI presence, etc.) at the time of analysis. Signals can be incomplete or stale. Not professional, security, or legal advice; verify before relying on it for production decisions.

Embed the "Forkable" badge

Paste into your README — live-updates from the latest cached analysis.

Variant:

[![RepoPilot: Forkable](https://repopilot.app/api/badge/yaoapp/yao?axis=fork)](https://repopilot.app/r/yaoapp/yao)

Paste at the top of your README.md — renders inline like a shields.io badge.

▸Preview social card

This card auto-renders when someone shares https://repopilot.app/r/yaoapp/yao on X, Slack, or LinkedIn.

Ask AI about yaoapp/yao

Grounded in the actual source code. Pick a starter question or write your own.

What does this repo do, in one paragraph?How would I get started using it?What are the main alternatives?Show me the entry point.

Or write your own question →

Onboarding doc

Onboarding: YaoApp/yao

Generated by RepoPilot · 2026-06-24 · Source

🎯Verdict

WAIT — Solo project — review before adopting

Last commit 1d ago
Other licensed
CI configured
Tests present
⚠ Solo or near-solo (1 contributor active in recent commits)
⚠ Non-standard license (Other) — review terms

<sub>Computed from maintenance signals — commit recency, contributor breadth, bus factor, license, CI, tests</sub>

⚡TL;DR

Yao is a single-binary runtime for building AI agents and full-stack web applications, combining TypeScript hooks, LLM executors (OpenAI, Anthropic), CLI sandboxes with VNC support, and built-in REST/GraphQL APIs with vector search. It ships as one executable with no external dependencies (no Node.js, Python, or containers required on the host). Monorepo organized by domain: agent/ contains the core agent framework with assistant/ submodule (hooks, chat, history, MCP support), .github/workflows/ handle multi-platform builds (Linux ARM64/x64, macOS, Docker). Go binaries generated via Makefile; TypeScript compiled into V8 engine at runtime. Data models, REST routes, and UI pages are JSON/YAML config files executed by the runtime.

👥Who it's for

Full-stack developers and AI engineers building conversational agents, autonomous workflows, or internal tools who want to ship complex applications without managing separate frontend/backend deployments or AI infrastructure. Also appeals to teams needing sandbox isolation for code execution and enterprise search (vector + knowledge graphs).

🌱Maturity & risk

Actively developed and production-capable—the repo shows CI/CD workflows (pr-test.yml, unit-test.yml, release workflows for Linux/macOS/Docker), comprehensive test coverage in agent/ and assistant/ directories, and real-world performance benchmarks (hook/REALWORLD_PERFORMANCE_REPORT.md). However, the agent/ framework appears to be the focus of recent iteration rather than stable long-term maintenance.

Go-heavy codebase (13.5MB) with embedded V8 engine and Docker/container dependencies, making builds complex—note the custom GitHub Actions (setup-yao, setup-db). Risk of lock-in to Yao's specific TypeScript hook pattern; no clear migration path if you outgrow it. Smaller community than alternatives; rely heavily on official docs at yaoagents.com.

Active areas of work

Active work on agent assistance framework (agent/assistant/) with focus on hook pipeline, MCP (Model Context Protocol) integration, and memory APIs. Recent additions include build_mcp_test.go, cache_test.go, and chat_test.go suggesting improvements to multi-step reasoning and context caching. Release workflows show ongoing binaries for macOS, Linux, and Docker.

🚀Get running

git clone https://github.com/YaoApp/yao.git
cd yao
make build
./yao version

See Makefile for build targets (make test, make dev). Requires Go 1.20+ and optional Docker for sandbox features.

Daily commands:

make build              # Compile Go binary
./yao start             # Run dev server (default :8000)
make test               # Run all Go tests
make bench              # Run benchmarks in agent/assistant/hook/

See Makefile for full targets. Dev mode watches for changes and reloads.

🗺️Map of the codebase

agent/assistant/assistant.go — Core Assistant struct and initialization—defines the main abstraction wrapping LLM execution, hooks, and sandbox orchestration.
agent/assistant/hook/hook.go — Hook execution engine (Create/Next lifecycle)—processes all request/response boundaries and drives the control-flow pipeline.
agent/assistant/sandbox.go — Sandbox execution layer—manages isolated code execution context and security boundaries for agent scripts.
agent/caller/orchestrator.go — Request orchestrator—routes calls through Create Hook → Executor → Next Hook loop with multi-turn conversation state.
agent/assistant/load.go — Configuration loader—parses assistant definitions from YAML/JSON, merges system prompts, and initializes the runtime.
agent/assistant/llm.go — LLM provider abstraction—handles OpenAI, Anthropic, and other model integrations with streaming and token management.
agent/assistant/types.go — Core data structures (Assistant, Thread, Message, Context)—defines the domain model for agents and conversations.

🛠️How to make changes

Add a Custom LLM Provider

Define provider struct in agent/assistant/llm.go following the LLMClient interface (agent/assistant/llm.go)
Implement Chat() and Stream() methods for model invocation and token handling (agent/assistant/llm.go)
Register provider in assistant load logic to parse config YAML provider field (agent/assistant/load.go)
Add integration tests in agent/assistant/llm_test.go verifying streaming and error cases (agent/assistant/llm.go)

Add a Custom Hook Action

Define hook action struct in agent/assistant/hook/types.go extending HookAction interface (agent/assistant/hook/types.go)
Implement Execute() method in agent/assistant/hook/create.go or next.go depending on timing (agent/assistant/hook/create.go)
Parse YAML hook config in assistant definition to instantiate your action type (agent/assistant/load.go)
Add unit tests in agent/assistant/hook/create_test.go validating action execution and state mutation (agent/assistant/hook/create_test.go)

Integrate a New Search Backend (KB, DB, Web)

Create search provider struct implementing SearchProvider interface in agent/assistant/search.go (agent/assistant/search.go)
Implement Query() and Auth() methods for retrieval and access control (agent/assistant/search.go)
Register provider in search orchestrator's auto-routing logic to detect when to invoke (agent/assistant/search.go)
Add integration tests in agent/assistant/search_auth_integration_test.go validating retrieval and permission enforcement (agent/assistant/search_auth_integration_test.go)

Create a New Assistant Configuration

Define YAML file in your app directory with system prompt, model, hooks, and scripts sections (agent/assistant/load.go)
Reference hook scripts in agent/assistant/hook directory for Create and Next phases (agent/assistant/hook/script.go)
Load assistant at runtime via agent/assistant/load.go Load() function with config path (agent/assistant/load.go)
Invoke via agent/caller/orchestrator.go Call() passing thread and user message (agent/caller/orchestrator.go)

🔧Why these technologies

Go — Single binary distribution, fast startup, concurrency for streaming/multi-hook execution, suitable for edge deployment.
YAML/JSON Configuration — Declarative assistant definitions enable non-code customization of prompts, hooks, and system behavior.
Streaming I/O — Real-time token streaming from LLMs reduces latency perception and enables interactive agent responses.
Sandbox/Container Isolation — Protects host system from untrusted agent scripts and enforces security boundaries (permissions, resource limits).
Hook Pipeline (Create/Next) — Decouples request transformation, validation, and orchestration from core LLM execution; enables reusable constraint/routing logic.

⚖️Trade-offs already made

Single binary with embedded sandbox (sandbox_v2.go) vs. containerized agents
- Why: Simplifies deployment and reduces operational overhead; customers get instant, self-contained executables.
- Consequence: Sandbox resource limits are harder to enforce; multi-tenancy requires careful permission isolation. Scales to moderate workloads; large-scale deployments may need external orchestration.
Synchronous hook pipeline (Create → Execute → Next) vs. async job queue
- Why: Predictable latency, simpler state management, supports real-time streaming feedback.
- Consequence: Long-running hooks block request; loops increase end-to-end latency. Heavy compute offloaded to external services or CLI agents.
In-memory conversation history + cache vs. external state store
- Why: Fast retrieval, no network round-trips, single binary self-contained.
- Consequence: History lost on process restart unless persisted; multi-process deployments require shared store (Redis/DB).
Unified search abstraction (KB, DB, Web) with auth enforcement vs. caller-managed retrieval
- Why: Automatic augmentation reduces prompt engineering burden; row-level access control enforced transparently.
- Consequence: undefined

🪤Traps & gotchas

V8 Engine Lock: TypeScript hooks are compiled into the Go binary at runtime via embedded V8—you cannot hot-reload TypeScript without restarting the entire process. 2. Docker for Sandbox: CLI Agent mode requires Docker daemon running; no workaround for environments where Docker is unavailable. 3. MCP Transport: The repo supports process, SSE, and STDIO MCP transports but doesn't bundle them—you must configure transport in your agent config. 4. No Schema Validation: Agent definitions are JSON/YAML with minimal built-in validation; typos in hook names or memory scopes fail at runtime, not parse time.

🏗️Architecture

💡Concepts to learn

Hook Pipeline (Create/Next Middleware) — The core design pattern in Yao—every request flows through Create (pre-processing) and Next (post-processing) hooks, allowing you to inject custom logic without modifying the agent executor. Understanding this pattern is essential to extending Yao.
Model Context Protocol (MCP) — Yao's native mechanism for connecting external tools and skills to agents; you define tools in MCP format and Yao routes function calls through process, SSE, or STDIO transports.
Embedded V8 Engine — Yao compiles TypeScript hooks into the binary via V8—no separate Node.js process needed. This enables single-binary deployment but constrains runtime flexibility.
Memory Scopes (Request/Session/User/Team) — Multi-tenant context isolation pattern—Yao segregates state at four levels so agents in the same process don't leak user data. Critical for SaaS deployments.
GraphRAG (Graph + Vector Search Hybrid) — Yao's advanced retrieval strategy combines vector embeddings with knowledge graph traversal—enables semantic search on both unstructured text and relational data.
CLI Agent / Sandbox Isolation — Yao can execute Claude Code or custom CLI runners in isolated Docker containers with VNC desktop—enables computer-use agents without host-level access.
Server-Sent Events (SSE) Transport — One of Yao's MCP transport options—allows streaming tool results back to agents in real-time without polling, essential for long-running skill execution.

anthropics/anthropic-sdk-go — Official Anthropic SDK for Go—Yao uses Anthropic as an LLM executor option; understanding its API shapes Yao's chat pipeline.
openai/openai-go — Official OpenAI SDK for Go—primary LLM executor in Yao; chat completion and embedding calls route through this.
modelcontextprotocol/sdk-go — Official MCP SDK for Go—Yao's native tool integration relies on this for process, SSE, and STDIO transports.
nodejs/node — Not directly related, but Yao's embedded V8 engine is forked from Node.js internals; understanding V8 helps debug TypeScript hook performance.
duckdb/duckdb — Alternative in-process SQL engine—Yao uses custom data models and REST query routing, but DuckDB shows patterns for embedded DB systems in single binaries.

🪄PR ideas

To work on one of these in Claude Code or Cursor, paste: Implement the "<title>" PR idea from CLAUDE.md, working through the checklist as the task list.

Add comprehensive test coverage for agent/assistant/mcp.go

The MCP (Model Context Protocol) implementation in mcp.go exists but there is no corresponding mcp_test.go file, despite heavy test coverage in adjacent modules (build_mcp_test.go, build_prompts_test.go, etc.). MCP is critical for agent functionality, and this gap could hide regressions in protocol handling, context injection, and model integration.

[ ] Create agent/assistant/mcp_test.go with unit tests for MCP initialization and context building
[ ] Add tests for MCP request/response handling and error cases
[ ] Test integration between mcp.go and the hook system (agent/assistant/hook/)
[ ] Add benchmarks for MCP context creation similar to agent/assistant/hook/create_bench_test.go

Add GitHub Actions workflow for end-to-end agent/assistant tests

The repo has unit-test.yml and unit-test-v1.yml workflows, but no dedicated CI workflow for the agent subsystem's integration tests. With complex components like chat.go, history.go, loop.go, and the hook system, an E2E test workflow could catch cross-module regressions before release.

[ ] Create .github/workflows/test-agent-e2e.yml to run agent-specific integration tests
[ ] Use .github/actions/setup-yao to initialize the binary
[ ] Test agent creation, chat flows, and hook execution with real LLM mocking (or sandbox-v2.env)
[ ] Run against build outputs from build-linux.yml and build-macos.yml to verify agent works across platforms

Document agent/assistant/llm.go LLM integration contract with examples

The llm.go file is central to agent behavior (handling OpenAI, Anthropic, etc.), but there is no corresponding documentation file explaining the LLM interface, expected inputs/outputs, or how to add a new provider. The README.md only mentions 'OpenAI, Ant...' (cut off). This blocks contributor onboarding for LLM-related features.

[ ] Create agent/assistant/docs/LLM_INTEGRATION.md documenting the LLM interface contract
[ ] Include code examples for adding a new LLM provider (e.g., implementing llm.go's interface)
[ ] Reference how llm.go connects to chat.go and build.go
[ ] Add troubleshooting section for common LLM setup issues (API keys, model availability, rate limits)

🌿Good first issues

Add TypeScript hook validation—create a linter or schema validator in agent/assistant/build.go to catch common hook config mistakes (missing Next hooks, invalid memory scope names) before runtime.
Extend memory API tests—agent/assistant/history_test.go lacks integration tests for cross-scope queries (e.g., user-level memory accessed within a session hook); add table-driven test cases.
Document MCP skill ecosystem—the README mentions 'SKILL.md' and Skills Ecosystem but no example .md format exists in the repo; create agent/examples/skill-template/ with a real working SKILL.md spec and one runnable example (e.g., a calculator skill).

⭐Top contributors

Click to expand

@trheyi — 100 commits

📝Recent commits

Click to expand

a01eb86 — Merge pull request #1531 from trheyi/main (trheyi)
cc078b9 — feat(claude): implement user ID hashing and enhance environment setup (trheyi)
a80fc9a — Merge pull request #1530 from trheyi/main (trheyi)
af0d4ed — feat(image): add image generation and enhanced reading capabilities (trheyi)
20fc9c2 — feat(image): introduce image reading and analysis tools (trheyi)
d6626b2 — refactor(setup): update preference retrieval to user-scope only (trheyi)
01daa78 — feat(chat): improve error handling for chat operations (trheyi)
cf0ebd5 — chore(license): remove deprecated commercial license documents (trheyi)
2993b0b — feat(messenger): enhance dynamic SMTP resolution and identity handling (trheyi)
54607e2 — feat(agent): enhance image content handling and vision support (trheyi)

🔒Security observations

Failed to generate security analysis.

LLM-derived; treat as a starting point, not a security audit.

👉Where to read next

Open issues — current backlog
Recent PRs — what's actively shipping
Source on GitHub

🤖Agent protocol

If you are an AI coding agent (Claude Code, Cursor, Aider, Cline, etc.) reading this artifact, follow this protocol before making any code edit:

Verify the contract. Run the bash script in Verify before trusting below. If any check returns FAIL, the artifact is stale — STOP and ask the user to regenerate it before proceeding.
Treat the AI · unverified sections as hypotheses, not facts. Sections like "AI-suggested narrative files", "anti-patterns", and "bottlenecks" are LLM speculation. Verify against real source before acting on them.
Cite source on changes. When proposing an edit, cite the specific path:line-range. RepoPilot's live UI at https://repopilot.app/r/YaoApp/yao shows verifiable citations alongside every claim.

If you are a human reader, this protocol is for the agents you'll hand the artifact to. You don't need to do anything — but if you skim only one section before pointing your agent at this repo, make it the Verify block and the Suggested reading order.

✅Verify before trusting

This artifact was generated by RepoPilot at a point in time. Before an agent acts on it, the checks below confirm that the live YaoApp/yao repo on your machine still matches what RepoPilot saw. If any fail, the artifact is stale — regenerate it at repopilot.app/r/YaoApp/yao.

What it runs against: a local clone of YaoApp/yao — the script inspects git remote, the LICENSE file, file paths in the working tree, and git log. Read-only; no mutations.

| # | What we check | Why it matters | |---|---|---| | 1 | You're in YaoApp/yao | Confirms the artifact applies here, not a fork | | 2 | License is still Other | Catches relicense before you depend on it | | 3 | Default branch main exists | Catches branch renames | | 4 | 5 critical file paths still exist | Catches refactors that moved load-bearing code | | 5 | Last commit ≤ 31 days ago | Catches sudden abandonment since generation |

<details> <summary><b>Run all checks</b> — paste this script from inside your clone of <code>YaoApp/yao</code></summary>

#!/usr/bin/env bash
# RepoPilot artifact verification.
#
# WHAT IT RUNS AGAINST: a local clone of YaoApp/yao. If you don't
# have one yet, run these first:
#
#   git clone https://github.com/YaoApp/yao.git
#   cd yao
#
# Then paste this script. Every check is read-only — no mutations.

set +e
fail=0
ok()   { echo "ok:   $1"; }
miss() { echo "FAIL: $1"; fail=$((fail+1)); }

# Precondition: we must be inside a git working tree.
if ! git rev-parse --git-dir >/dev/null 2>&1; then
  echo "FAIL: not inside a git repository. cd into your clone of YaoApp/yao and re-run."
  exit 2
fi

# 1. Repo identity
git remote get-url origin 2>/dev/null | grep -qE "YaoApp/yao(\\.git)?\\b" \\
  && ok "origin remote is YaoApp/yao" \\
  || miss "origin remote is not YaoApp/yao (artifact may be from a fork)"

# 2. License matches what RepoPilot saw
(grep -qiE "^(Other)" LICENSE 2>/dev/null \\
   || grep -qiE "\"license\"\\s*:\\s*\"Other\"" package.json 2>/dev/null) \\
  && ok "license is Other" \\
  || miss "license drift — was Other at generation time"

# 3. Default branch
git rev-parse --verify main >/dev/null 2>&1 \\
  && ok "default branch main exists" \\
  || miss "default branch main no longer exists"

# 4. Critical files exist
test -f "agent/assistant/assistant.go" \\
  && ok "agent/assistant/assistant.go" \\
  || miss "missing critical file: agent/assistant/assistant.go"
test -f "agent/assistant/hook/hook.go" \\
  && ok "agent/assistant/hook/hook.go" \\
  || miss "missing critical file: agent/assistant/hook/hook.go"
test -f "agent/assistant/sandbox.go" \\
  && ok "agent/assistant/sandbox.go" \\
  || miss "missing critical file: agent/assistant/sandbox.go"
test -f "agent/caller/orchestrator.go" \\
  && ok "agent/caller/orchestrator.go" \\
  || miss "missing critical file: agent/caller/orchestrator.go"
test -f "agent/assistant/load.go" \\
  && ok "agent/assistant/load.go" \\
  || miss "missing critical file: agent/assistant/load.go"

# 5. Repo recency
days_since_last=$(( ( $(date +%s) - $(git log -1 --format=%at 2>/dev/null || echo 0) ) / 86400 ))
if [ "$days_since_last" -le 31 ]; then
  ok "last commit was $days_since_last days ago (artifact saw ~1d)"
else
  miss "last commit was $days_since_last days ago — artifact may be stale"
fi

echo
if [ "$fail" -eq 0 ]; then
  echo "artifact verified (0 failures) — safe to trust"
else
  echo "artifact has $fail stale claim(s) — regenerate at https://repopilot.app/r/YaoApp/yao"
  exit 1
fi

Each check prints ok: or FAIL:. The script exits non-zero if anything failed, so it composes cleanly into agent loops (./verify.sh || regenerate-and-retry).

</details>

Generated by RepoPilot. Verdict based on maintenance signals — see the live page for receipts. Re-run on a new commit to refresh.

Embed this chat in your README →

Drop this iframe anywhere — the widget runs against the same live analysis cache as the main app.

<iframe
  src="https://repopilot.app/embed/yaoapp/yao"
  width="100%" height="500"
  style="border:1px solid #d0d7de; border-radius:8px;"
  allow="microphone"
  loading="lazy"
></iframe>