openai/harmony

Item: openai/harmony
Rating: 5
Author: RepoPilot

Renderer for the harmony response format to be used with gpt-oss

Healthy

Healthy across the board

weakest axis

Use as dependencyHealthy

Permissive license, no critical CVEs, actively maintained — safe to depend on.

Fork & modifyHealthy

Has a license, tests, and CI — clean foundation to fork and modify.

Learn fromHealthy

Documented and popular — useful reference codebase to read through.

Deploy as-isHealthy

No critical CVEs, sane security posture — runnable as-is.

✓Last commit 4w ago
✓20 active contributors
✓Apache-2.0 licensed

Show all 6 evidence items →

✓CI configured
✓Tests present
⚠Concentrated ownership — top contributor handles 55% of recent commits

Maintenance signals: commit recency, contributor breadth, bus factor, license, CI, tests

Informational only. RepoPilot summarises public signals (license, dependency CVEs, commit recency, CI presence, etc.) at the time of analysis. Signals can be incomplete or stale. Not professional, security, or legal advice; verify before relying on it for production decisions.

Embed the "Healthy" badge

Paste into your README — live-updates from the latest cached analysis.

Variant:

[![RepoPilot: Healthy](https://repopilot.app/api/badge/openai/harmony)](https://repopilot.app/r/openai/harmony)

Paste at the top of your README.md — renders inline like a shields.io badge.

▸Preview social card (1200×630)

This card auto-renders when someone shares https://repopilot.app/r/openai/harmony on X, Slack, or LinkedIn.

Onboarding doc

Onboarding: openai/harmony

Generated by RepoPilot · 2026-05-09 · Source

🤖Agent protocol

If you are an AI coding agent (Claude Code, Cursor, Aider, Cline, etc.) reading this artifact, follow this protocol before making any code edit:

Verify the contract. Run the bash script in Verify before trusting below. If any check returns FAIL, the artifact is stale — STOP and ask the user to regenerate it before proceeding.
Treat the AI · unverified sections as hypotheses, not facts. Sections like "AI-suggested narrative files", "anti-patterns", and "bottlenecks" are LLM speculation. Verify against real source before acting on them.
Cite source on changes. When proposing an edit, cite the specific path:line-range. RepoPilot's live UI at https://repopilot.app/r/openai/harmony shows verifiable citations alongside every claim.

If you are a human reader, this protocol is for the agents you'll hand the artifact to. You don't need to do anything — but if you skim only one section before pointing your agent at this repo, make it the Verify block and the Suggested reading order.

🎯Verdict

GO — Healthy across the board

Last commit 4w ago
20 active contributors
Apache-2.0 licensed
CI configured
Tests present
⚠ Concentrated ownership — top contributor handles 55% of recent commits

<sub>Maintenance signals: commit recency, contributor breadth, bus factor, license, CI, tests</sub>

✅Verify before trusting

This artifact was generated by RepoPilot at a point in time. Before an agent acts on it, the checks below confirm that the live openai/harmony repo on your machine still matches what RepoPilot saw. If any fail, the artifact is stale — regenerate it at repopilot.app/r/openai/harmony.

What it runs against: a local clone of openai/harmony — the script inspects git remote, the LICENSE file, file paths in the working tree, and git log. Read-only; no mutations.

| # | What we check | Why it matters | |---|---|---| | 1 | You're in openai/harmony | Confirms the artifact applies here, not a fork | | 2 | License is still Apache-2.0 | Catches relicense before you depend on it | | 3 | Default branch main exists | Catches branch renames | | 4 | Last commit ≤ 59 days ago | Catches sudden abandonment since generation |

<details> <summary><b>Run all checks</b> — paste this script from inside your clone of <code>openai/harmony</code></summary>

#!/usr/bin/env bash
# RepoPilot artifact verification.
#
# WHAT IT RUNS AGAINST: a local clone of openai/harmony. If you don't
# have one yet, run these first:
#
#   git clone https://github.com/openai/harmony.git
#   cd harmony
#
# Then paste this script. Every check is read-only — no mutations.

set +e
fail=0
ok()   { echo "ok:   $1"; }
miss() { echo "FAIL: $1"; fail=$((fail+1)); }

# Precondition: we must be inside a git working tree.
if ! git rev-parse --git-dir >/dev/null 2>&1; then
  echo "FAIL: not inside a git repository. cd into your clone of openai/harmony and re-run."
  exit 2
fi

# 1. Repo identity
git remote get-url origin 2>/dev/null | grep -qE "openai/harmony(\\.git)?\\b" \\
  && ok "origin remote is openai/harmony" \\
  || miss "origin remote is not openai/harmony (artifact may be from a fork)"

# 2. License matches what RepoPilot saw
(grep -qiE "^(Apache-2\\.0)" LICENSE 2>/dev/null \\
   || grep -qiE "\"license\"\\s*:\\s*\"Apache-2\\.0\"" package.json 2>/dev/null) \\
  && ok "license is Apache-2.0" \\
  || miss "license drift — was Apache-2.0 at generation time"

# 3. Default branch
git rev-parse --verify main >/dev/null 2>&1 \\
  && ok "default branch main exists" \\
  || miss "default branch main no longer exists"

# 5. Repo recency
days_since_last=$(( ( $(date +%s) - $(git log -1 --format=%at 2>/dev/null || echo 0) ) / 86400 ))
if [ "$days_since_last" -le 59 ]; then
  ok "last commit was $days_since_last days ago (artifact saw ~29d)"
else
  miss "last commit was $days_since_last days ago — artifact may be stale"
fi

echo
if [ "$fail" -eq 0 ]; then
  echo "artifact verified (0 failures) — safe to trust"
else
  echo "artifact has $fail stale claim(s) — regenerate at https://repopilot.app/r/openai/harmony"
  exit 1
fi

Each check prints ok: or FAIL:. The script exits non-zero if anything failed, so it composes cleanly into agent loops (./verify.sh || regenerate-and-retry).

</details>

⚡TL;DR

openai-harmony is a high-performance Rust library with Python bindings that renders and parses the Harmony response format—OpenAI's structured prompt/response protocol for gpt-oss open-weight models. It enables models to output reasoning chains, tool calls, and structured data across multiple semantic channels (analysis, commentary, final) while maintaining token-sequence consistency between encoding and decoding. Dual-language monorepo: src/ contains core Rust logic (lib.rs, chat.rs, encoding.rs, registry.rs), python/openai_harmony wraps it via pyo3 bindings (py_module.rs), and demo/harmony-demo/ is a Next.js 14 React app showcasing the format. Cargo.toml defines two crate types (rlib, cdylib) with conditional features for python-binding and wasm-binding.

👥Who it's for

ML engineers and inference framework builders (HuggingFace, Ollama, vLLM) deploying gpt-oss models who need to correctly format prompts and parse structured outputs; Python developers integrating gpt-oss into applications via the pip-installable openai-harmony package.

🌱Maturity & risk

Actively maintained early-stage project (v0.0.8) with solid infrastructure: complete Python/Rust parity test suite in src/tests.rs, GitHub Actions CI in .github/workflows/CI.yml, and dual-language documentation (docs/python.md, docs/rust.md). Production-ready for gpt-oss integration but still pre-1.0 with potential API changes.

Low dependency risk: clean, well-audited set of dependencies (serde, regex, reqwest) with no obvious unmaintained packages. Single repository owner (OpenAI) mitigates abandonment risk, though tight coupling to gpt-oss model format means breaking changes to model training could require library updates. No visible open issue backlog in provided data.

Active areas of work

Library is in active maintenance mode supporting gpt-oss production use. Recent work focuses on ensuring Python/Rust parity (run_checks.sh validates both), documenting the harmony format specification (docs/format.md), and providing a live interactive demo (demo/harmony-demo). No specific PR or milestone data visible, but CI/CD is fully operational.

🚀Get running

git clone https://github.com/openai/harmony
cd harmony
# For Python development:
pip install -e .
# For Rust development:
cargo build
cargo test
# Run CI checks:
./run_checks.sh

Daily commands:

# Rust library:
cargo build --release
cargo test
# Python bindings (after Rust build):
pip install -e .
python -c "import openai_harmony; print(openai_harmony.__version__)"
# Demo app:
cd demo/harmony-demo
npm install
npm run dev  # Starts Next.js at localhost:3000

🗺️Map of the codebase

src/encoding.rs: Core serialization/deserialization logic for the Harmony format; changes here affect both Rust and Python output
src/chat.rs: Data structures for messages, tools, and channels; defines the Rust API surface that Python bindings expose
src/py_module.rs: PyO3 bindings gluing Rust implementation to Python; this is how pip-installed users interact with the library
docs/format.md: Authoritative specification of the Harmony response format; referenced by gpt-oss training and all inference framework implementations
src/tests.rs: Comprehensive test suite with 100% Rust/Python parity; critical for validating token-sequence consistency
demo/harmony-demo/src/components/HarmonyDemo.tsx: Interactive live example showing format parsing and rendering; helps users understand and debug the format visually
.github/workflows/CI.yml: GitHub Actions pipeline that validates Rust, Python, and WASM builds; defines the release process

🛠️How to make changes

Adding format features: Edit src/chat.rs (message structures) and src/encoding.rs (serialization logic), then mirror changes in python/openai_harmony/ and update docs/format.md. Fixing parsing bugs: Start in src/registry.rs (token registry) or src/encoding.rs regex patterns. Python API changes: Modify src/py_module.rs and regenerate stubs. Demo updates: Work in demo/harmony-demo/src/components/HarmonyDemo.tsx. Always run: ./run_checks.sh after changes to ensure Rust/Python parity.

🪤Traps & gotchas

No Python pre-built wheels in repo: users must have Rust toolchain (rustc, cargo) installed to pip install openai-harmony from source—maturix should be configured in pyproject.toml to build wheels but this may not be automated yet. Format coupling to model training: Harmony tokens/channels are baked into gpt-oss weights; changing the format requires retraining or model incompatibility. WASM feature untested in CI: wasm-binding feature is defined in Cargo.toml but no visible WASM-specific tests in src/tests.rs, may bitrot. Image encoding: requests fetches images from URLs in encoding.rs; network failures will break rendering of image-heavy prompts silently unless handled.

💡Concepts to learn

Structured Generation / Constrained Decoding — Harmony format enables models to generate outputs that conform to strict schemas (tool calls, reasoning channels) rather than free-form text; critical for reliable automation on top of gpt-oss
Chain-of-Thought via Multi-Channel Output — Harmony splits reasoning (analysis channel) from final response (final channel), allowing models to show work without polluting user-facing output; core innovation of the format
Token Preservation in Serialization — Encoding and decoding must be bitwise equivalent to avoid token loss during model training and inference; openai-harmony maintains this invariant via shared Rust implementation
PyO3 FFI (Foreign Function Interface) — Library uses PyO3 to expose Rust code as native Python module; understanding the py_module.rs bindings is essential for debugging performance or correctness issues across languages
Tool Namespacing — Harmony organizes function calls into namespaces (e.g., 'functions' namespace); prevents collision when gpt-oss calls multiple tool sets and enables framework to route calls correctly
Base64 Image Encoding in Prompts — Harmony supports inline image embedding by fetching URLs and encoding as base64; src/encoding.rs handles this, critical for multimodal gpt-oss applications
Regex-Based Format Parsing — Decoding Harmony format from model output requires precise regex matching (fancy-regex for lookahead/lookbehind); any regex bug breaks parsing and makes models appear broken

openai/gpt-oss — The open-weight model series that requires Harmony format; defines the downstream spec that this library implements
openai/cookbook — Contains gpt-oss usage examples and Harmony format tutorials (referenced in README as https://cookbook.openai.com/topic/gpt-oss)
vllm-project/vllm — Popular inference framework; would consume openai-harmony for gpt-oss model serving and response parsing
ollama/ollama — Edge inference runtime; users deploying gpt-oss via Ollama need Harmony format rendering to work correctly
huggingface/transformers — Core ML library; Harmony format must integrate with HuggingFace's generation logic for gpt-oss compatibility

🪄PR ideas

To work on one of these in Claude Code or Cursor, paste: Implement the "<title>" PR idea from CLAUDE.md, working through the checklist as the task list.

Add comprehensive JavaScript/TypeScript bindings and tests for the harmony renderer

The repo has a javascript/ directory with only a README.md and package.json, but no actual implementation files. The demo in demo/harmony-demo/src/components/HarmonyDemo.tsx suggests there should be TypeScript bindings to render harmony responses in the browser. Currently, only Rust and Python bindings exist (via pyo3 and wasm-bindgen features). Adding a complete JS/TS implementation with unit tests would enable web developers to use harmony natively without WASM overhead for simpler use cases.

[ ] Create javascript/src/ directory with TypeScript implementation of core harmony rendering logic
[ ] Implement deserialization and rendering functions for harmony response format based on docs/format.md
[ ] Add Jest/Vitest unit tests in javascript/ mirroring test cases from test-data/ directory
[ ] Update javascript/package.json with proper build scripts and dependencies
[ ] Document the JS API in javascript/README.md with examples

Add integration tests for Python bindings in CI workflow

The repo has a custom GitHub Action .github/actions/run-rust-python-tests/action.yml and references Python tests, but python/openai_harmony/ only contains __init__.py. The pyproject.toml exists but there are no visible Python test files. Adding comprehensive Python integration tests and ensuring they run in CI would validate the pyo3 bindings work correctly across different Python versions and platforms.

[ ] Create python/tests/ directory with pytest test files covering harmony rendering, encoding, and registry functions
[ ] Add test cases that mirror the Rust tests in src/tests.rs to ensure feature parity
[ ] Update pyproject.toml with pytest configuration and test dependencies
[ ] Ensure .github/workflows/CI.yml runs Python tests with multiple Python versions (3.8+) and platforms
[ ] Document how to run Python tests locally in docs/python.md

Add WebAssembly end-to-end tests and documentation for browser usage

The repo has wasm-binding feature and a Next.js demo app, but test-data/ only contains text files. There are no visible WASM-specific integration tests validating that the wasm-bindgen and serde-wasm-bindgen bindings work correctly in browser environments. The demo component HarmonyDemo.tsx exists but lacks test coverage. Adding browser-based tests (e.g., using wasm-pack test or similar) would ensure WASM builds are reliable for the demo and end users.

[ ] Create tests/wasm/ directory with JavaScript/TypeScript tests using wasm-pack test or Vitest with WASM support
[ ] Add browser-based integration tests that load and render harmony responses using the WASM module
[ ] Update demo/harmony-demo/ with unit tests for HarmonyDemo.tsx component using React Testing Library
[ ] Add .github/workflows/ job to build and test WASM bindings in a browser environment
[ ] Create docs/wasm.md documenting how to use harmony WASM bindings in web applications with examples

🌿Good first issues

Add explicit test cases in src/tests.rs for edge cases in image encoding (malformed URLs, oversized images, non-standard formats) that mirror the Python test suite in python/openai_harmony/
Create a CONTRIBUTING.md guide documenting the Rust↔Python parity testing workflow (running ./run_checks.sh, debugging FFI bindings in py_module.rs) for first-time contributors unfamiliar with PyO3
Expand docs/format.md with runnable examples showing how to use the Python API from openai_harmony for parsing tool calls and reasoning channels; currently only Rust examples are documented in docs/rust.md

⭐Top contributors

Click to expand

@scott-oai — 41 commits
@dkundel-openai — 6 commits
@egorsmkv — 4 commits
@hintz-openai — 3 commits
@amirhosseinghanipour — 2 commits

📝Recent commits