quantopian/zipline

Item: quantopian/zipline
Rating: 3
Author: RepoPilot

Zipline, a Pythonic Algorithmic Trading Library

Mixed

Stale — last commit 2y ago

HealthyDependency

Permissive license, no critical CVEs, actively maintained — safe to depend on.

HealthyFork & modify

Has a license, tests, and CI — clean foundation to fork and modify.

HealthyLearn from

Documented and popular — useful reference codebase to read through.

MixedDeploy as-is

last commit was 2y ago; Scorecard "Token-Permissions" is 0/10

⚠Stale — last commit 2y ago
⚠Concentrated ownership — top contributor handles 55% of recent commits
⚠Scorecard: marked unmaintained (0/10)
✓12 active contributors
✓Apache-2.0 licensed
✓CI configured
✓Tests present

What would improve this?

→Deploy as-is Mixed → Healthy if: 1 commit in the last 180 days

Maintenance signals: commit recency, contributor breadth, bus factor, license, CI, tests + OpenSSF Scorecard

Informational only. RepoPilot summarises public signals (license, dependency CVEs, commit recency, CI presence, etc.) at the time of analysis. Signals can be incomplete or stale. Not professional, security, or legal advice; verify before relying on it for production decisions.

Embed the "Safe to depend on" badge

Paste into your README — live-updates from the latest cached analysis.

Variant:

[![RepoPilot: Safe to depend on](https://repopilot.app/api/badge/quantopian/zipline?axis=dependency)](https://repopilot.app/r/quantopian/zipline)

Paste at the top of your README.md — renders inline like a shields.io badge.

▸Preview social card

This card auto-renders when someone shares https://repopilot.app/r/quantopian/zipline on X, Slack, or LinkedIn.

Ask AI about quantopian/zipline

Grounded in the actual source code. Pick a starter question or write your own.

What does this repo do, in one paragraph?How would I get started using it?What are the main alternatives?Show me the entry point.

Or write your own question →

Onboarding doc

Onboarding: quantopian/zipline

Generated by RepoPilot · 2026-06-21 · Source

🎯Verdict

WAIT — Stale — last commit 2y ago

12 active contributors
Apache-2.0 licensed
CI configured
Tests present
⚠ Stale — last commit 2y ago
⚠ Concentrated ownership — top contributor handles 55% of recent commits
⚠ Scorecard: marked unmaintained (0/10)

<sub>Maintenance signals: commit recency, contributor breadth, bus factor, license, CI, tests + OpenSSF Scorecard</sub>

⚡TL;DR

Zipline is an event-driven backtesting engine and live-trading framework for algorithmic trading strategies, written in Python. It powers Quantopian's backtesting platform and enables traders to develop, test, and execute quantitative trading algorithms using pandas DataFrames for data I/O and built-in technical analysis (moving averages, linear regression, etc.). Monolithic Pythonic package structure: core engine in zipline/ directory handles event-driven simulation, data ingestion, and performance calculation; conda/ contains recipe specs for vendored dependencies (bcolz, numexpr, logbook, alembic); ci/ holds platform-specific CI scripts (appveyor, travis); examples/ (implicit) contain runnable trading algorithms using the zipline.api interface.

👥Who it's for

Quantitative traders and algorithmic trading developers who need to backtest trading strategies against historical market data and deploy them for live trading. Also used by Quantopian's community platform users and the FactSet-backed managed service for professionals.

🌱Maturity & risk

Production-ready and actively maintained. The codebase has 4M+ lines of Python, is deployed at scale on Quantopian's platform, includes comprehensive CI/CD via GitHub Actions (.github/workflows/ci.yaml, .appveyor.yml, .travis.yml), and supports Python 2.7, 3.5, 3.6+. The project shows active development with conda package management and Docker support (Dockerfile, Dockerfile-dev), indicating mature infrastructure.

Moderate risk: Python 2.7 support is legacy and EOL; dependency on Quantopian's ecosystem (empyrical, alphalens, pyfolio) creates coupling; large monolithic codebase (4M LoC) may have high maintenance burden. Check last commit date and open issue backlog on GitHub, as Quantopian's business changes could impact maintenance velocity.

Active areas of work

Cannot determine from file list alone—check .github/workflows/ci.yaml execution logs and recent commit history for active development. The presence of .pre-commit-config.yaml and modern CI (GitHub Actions) suggests ongoing maintenance and code quality enforcement.

🚀Get running

git clone https://github.com/quantopian/zipline.git
cd zipline
./etc/dev-install  # or: pip install -e .[dev]
python -m zipline ingest-quandl

Daily commands: Development setup via ./etc/dev-install script; run backtests using zipline run -f <algo>.py or programmatically via zipline.run_algorithm(); ingest market data first with python -m zipline ingest-quandl. See README.rst 'Quickstart' for the dual moving average example.

🗺️Map of the codebase

zipline/__init__.py — Entry point for the Zipline library; defines the public API and core imports that every contributor must understand
zipline/algorithm.py — Core TradingAlgorithm class that orchestrates the entire backtesting event loop—essential to understand the framework's backbone
zipline/data/bundles — Data bundle system that handles ingestion and storage of market data, critical for understanding how data flows into the engine
zipline/finance/performance.py — Performance tracking and risk metric calculation system that produces backtest results
zipline/gens/tradesimulation.py — Trade simulation engine that drives the main event loop and processes each trading day
setup.py — Project configuration and dependency declaration; critical for understanding build requirements and library versions
.github/workflows/ci.yaml — CI/CD pipeline definition that enforces code quality standards and prevents regressions

🛠️How to make changes

Add a New Data Bundle

Create a new module in zipline/data/bundles/ (e.g., my_data_source.py) with an ingest function that reads data and yields OHLCV tuples (zipline/data/bundles/my_data_source.py)
Register the bundle in zipline/data/bundles/init.py using @register_bundle_ingest('my-bundle') decorator (zipline/data/bundles/__init__.py)
Test data ingestion by running: zipline ingest -b my-bundle (zipline/data/bundles)

Add a New Performance Metric

Implement a metric calculation function in zipline/finance/metrics/ or extend existing metric classes (zipline/finance/metrics)
Integrate the metric into zipline/finance/performance.py's PerformanceTracker.to_dict() method to include in backtest results (zipline/finance/performance.py)
Add unit tests in tests/finance/ to validate metric computation across various scenarios (tests/finance)

Add a New Algorithm Context Method

Define the method in the TradingAlgorithm class in zipline/algorithm.py or as a mixin in zipline/api.py (zipline/algorithm.py)
Export the method signature in zipline/api.py so users can access it within their algorithm's initialize() and handle_data() functions (zipline/api.py)
Write integration tests demonstrating the method's behavior during a backtest in tests/test_algorithm.py (tests/test_algorithm.py)

Support a New Asset Class

Extend the Equity/Future classes in zipline/assets/assets.py with new asset type (e.g., Crypto, Commodity) (zipline/assets/assets.py)
Update zipline/assets/asset_finder.py to handle symbol resolution and metadata lookup for the new asset class (zipline/assets/asset_finder.py)
Modify zipline/data/data_portal.py to fetch correct data feeds and handle price adjustments specific to the new asset class (zipline/data/data_portal.py)
Add tests in tests/assets/ to verify creation, lookup, and trading of the new asset type (tests/assets)

🔧Why these technologies

Python with Cython/NumPy — Enables fast numerical computation for backtest loops and financial calculations while maintaining ease of use for algorithm writers
Event-driven simulation loop (generators) — Allows efficient, memory-conscious iteration through time steps without loading entire datasets; mirrors real trading flow
SQLite/BColz for data storage — Provides persistent, efficient storage of OHLCV bars and corporate actions with fast columnar access patterns
Pandas DataFrames — De facto standard for time-series financial data; simplifies metric computation and result presentation
Trading-calendars library — Encapsulates exchange-specific trading sessions and holidays, allowing backtests across multiple exchanges and time zones

⚖️Trade-offs already made

Event-driven loop vs. end-of-day batch processing
- Why: Allows intraday algorithm logic and more realistic fill simulation
- Consequence: Higher memory footprint during backtests and more complex state management; slower for daily-only strategies
Single-threaded simulation with GIL
- Why: Simplifies state consistency and prevents race conditions in portfolio/order tracking
- Consequence: Cannot parallelize a single backtest across cores; multiple backtests must run independently
Bundle-based data ingestion vs. live API calls
- Why: Ensures reproducibility and offline capability; decouples algorithm from data provider uptime
- Consequence: Requires upfront data download and storage; stale data risk if not refreshed regularly
Full portfolio revaluation each day
- Why: Ensures accurate mark-to-market and performance metrics
- Consequence: Performance is O(n_assets) per simulation step; can be slow with thousands of positions

🚫Non-goals (don't propose these)

Real-time live trading execution (backtest-only; live trading requires separate module/deployment)
Cryptocurrency derivatives or highly exotic instruments (focuses on equities, futures, and simple options)
Multi-currency portfolio hedging or FX conversion (must be handled in user strategy code)
Machine learning model training (integrates with external ML libraries; not a native ML framework)
Regulatory compliance or audit logging (user responsibility to implement for production use)

🪤Traps & gotchas

Data ingestion required: Must run zipline ingest-quandl or equivalent before first backtest; missing data bundle causes silent failures. Timezone handling: All data assumed UTC internally; incorrect timestamp handling causes order execution errors. Memory-intensive: Large backtests on full market data require substantial RAM; bcolz compression helps but not always enough. Pandas version sensitivity: Some older versions have breaking API changes (e.g., Panel deprecation); check requirements/. Windows support: .appveyor.yml and ci/appveyor/ indicate Windows CI exists but may lag behind Linux testing.

🏗️Architecture

💡Concepts to learn

Event-driven simulation — Zipline's entire architecture revolves around simulating market events (OHLC bars, dividends, splits) in chronological order and triggering algorithm callbacks; understanding event dispatch is essential to extending the backtester.
Adjustments pipeline (splits & dividends) — Zipline automatically adjusts historical prices for corporate actions; this is non-obvious but critical—incorrect adjustment logic breaks strategy backtests and causes Sharpe ratio miscalculations.
Columnar storage with bcolz — Zipline uses bcolz (a columnar, compressed storage format) to store OHLCV data efficiently; understanding columnar vs. row-based storage explains memory efficiency gains and query performance.
Portfolio rebalancing and order matching — Zipline implements realistic order matching (limit/market orders, fills, slippage) against simulated market prices; understanding order lifecycle prevents logic bugs in algorithm trade execution.
Context/state persistence across bars — User algorithms store state in the 'context' object between bars; this closure-like pattern is core to Zipline's API and differs from imperative trading systems.
Performance metrics (Sharpe, Sortino, drawdown) — Zipline computes risk-adjusted returns on-the-fly during backtests; understanding these metrics (via empyrical) is critical for evaluating strategy performance objectively.
Data bundle abstraction — Zipline's bundle system allows pluggable data sources (Quandl, CSV, custom); extending bundles/ lets users integrate proprietary or alternative data without modifying the core engine.

quantopian/empyrical — Computes tear sheet metrics (Sharpe, sortino, drawdown) for Zipline backtest results; sister library used for performance analysis.
quantopian/pyfolio — Post-backtest analysis and visualization of returns, risk, and factor attribution; complements Zipline output.
quantopian/alphalens — Alpha factor research library that integrates with Zipline for signal generation and testing.
pandas-dev/pandas — Fundamental dependency for all data handling, historical series, and portfolio DataFrames in Zipline.
ultralytics/yolov5 — Not related—example of Python ML framework users might combine with Zipline for ML-based trading strategies.

🪄PR ideas

To work on one of these in Claude Code or Cursor, paste: Implement the "<title>" PR idea from CLAUDE.md, working through the checklist as the task list.

Add GitHub Actions workflow for macOS and Linux matrix testing

The repo has .github/workflows/ci.yaml and windows_ci.yml, but lacks a unified cross-platform test matrix. The .travis.yml and .appveyor.yml files suggest legacy CI infrastructure. A modern GitHub Actions workflow with matrix strategy (Python 3.8-3.11, macOS/Linux) would replace legacy CI, provide faster feedback, and reduce maintenance overhead of multiple CI systems.

[ ] Review existing .github/workflows/ci.yaml and .travis.yml to extract test commands and dependencies
[ ] Create .github/workflows/test-matrix.yml with matrix for Python versions (3.8, 3.9, 3.10, 3.11) × OS (ubuntu-latest, macos-latest)
[ ] Include coverage reporting step that uploads to coverage service referenced in .coveragerc
[ ] Test locally with act tool and verify all conda dependencies from conda/ directory are handled
[ ] Add job to run doctests from docs/notebooks/ to ensure notebook examples stay current

Add type hints and mypy configuration for zipline core modules

The repo lacks .mypy.ini or type checking configuration despite being a mature library used in production at Quantopian. Adding gradual type hints to core trading logic modules (implied by backtesting, event-driven system) would improve maintainability, catch bugs at development time, and make the codebase more accessible to contributors. Start with high-impact modules.

[ ] Create setup.cfg or mypy.ini with mypy configuration (start with --ignore-missing-imports for gradual adoption)
[ ] Identify core modules in source tree (likely zipline/oms/, zipline/algorithm.py, zipline/events.py based on event-driven architecture)
[ ] Add type hints to 2-3 critical modules and run mypy to establish baseline
[ ] Add mypy to .pre-commit-config.yaml to enforce types on new commits
[ ] Document type-checking setup in CONTRIBUTING.rst for contributors

Create integration test suite for conda package builds

The conda/ directory contains 15+ package recipes (bcolz, ta-lib, logbook, etc.) and ci/make_conda_packages.py exists, but there's no visible integration test workflow validating that conda packages build correctly and interdependencies resolve. This is critical since Zipline's installation complexity is a known friction point for users.

[ ] Review ci/make_conda_packages.py and conda package structure in conda/*/meta.yaml files
[ ] Create ci/test_conda_build.sh script that builds packages in dependency order and runs basic import tests
[ ] Add GitHub Actions workflow .github/workflows/conda-build.yml to test conda package builds on Python 3.8+ for Linux/macOS
[ ] Test that conda/zipline/meta.yaml correctly declares all dependencies from recipes in conda/ subdirectories
[ ] Document conda package maintenance workflow in conda/README.md with troubleshooting guide

🌿Good first issues

Add unit tests for zipline/data/bundles/quandl.py—currently likely under-tested for edge cases like missing OHLC fields or split/dividend adjustments.
Expand zipline/api.py docstrings with type hints and examples for order_target(), record(), and symbol()—critical for onboarding new algorithmic traders.
Create missing integration tests in tests/ for the dual moving average algorithm from README.rst—ensures Quickstart example remains functional across releases.

⭐Top contributors

Click to expand

@richafrank — 55 commits
[@Joe Jevnik](https://github.com/Joe Jevnik) — 16 commits
[@Scott Sanderson](https://github.com/Scott Sanderson) — 12 commits
[@Gerry Manoim](https://github.com/Gerry Manoim) — 5 commits
@gerrymanoim — 3 commits

📝Recent commits

Click to expand

014f1fc — Merge pull request #2782 from quantopian/missing-dep (richafrank)
a79d19f — BLD: Fixing CI bcolz installation (richafrank)
5526f35 — MAINT: Added missing direct dependency to requirements.in (richafrank)
60f0fa4 — REL: Release 1.4.1 (richafrank)
922db29 — DOC: Add a description to build docs for anaconda users (#2772) (yoshinobu1579)
a119e91 — Merge pull request #2780 from quantopian/conda-forge-docs (richafrank)
580ca94 — DOC: Updated issues in whatsnew (richafrank)
bdd3eb6 — DOC: Added label to install docs (to link from dev guidelines) (richafrank)
2a26402 — DOC: Added pypi zipline version badge (richafrank)
e9c83d6 — DOC: Update docs regarding conda packages on conda-forge now (richafrank)

🔒Security observations

The Zipline repository has multiple security concerns

High · Outdated Python Base Image in Dockerfile — Dockerfile. The Dockerfile uses 'FROM python:3.5' which is end-of-life and contains numerous known security vulnerabilities. Python 3.5 reached end-of-life on September 13, 2020, and no longer receives security patches. Fix: Update to a currently supported Python version (3.9+). Use 'FROM python:3.11' or later and regularly update base images.
High · Incomplete/Truncated Dockerfile Configuration — Dockerfile. The Dockerfile snippet shows an incomplete ADD instruction for downloading tini from GitHub (ADD https://github.com/krallin/tini/releas...). The URL is truncated and the instruction is incomplete, which could indicate security issues or corrupted configuration. Fix: Complete the Dockerfile configuration. Verify the full tini download URL and use checksums/signature verification for downloaded binaries. Consider using official package managers instead.
High · Hardcoded Default Credentials in Docker Documentation — Dockerfile (comments). The Dockerfile comments explicitly mention 'default password is jupyter' for the Jupyter notebook. This indicates default credentials are used in the Docker image, which is a security risk if the image is deployed without changing these credentials. Fix: Remove hardcoded default credentials. Force users to set credentials during container initialization. Use environment variables for sensitive configuration and document secure setup procedures.
Medium · Insecure Docker Volume Mounts — Dockerfile (comments). The documented docker run command mounts the entire ~/.zipline directory from the host. This could expose sensitive trading keys, credentials, or configuration files to the container without proper access controls. Fix: Implement principle of least privilege for volume mounts. Only mount necessary subdirectories. Validate and sanitize all mounted content. Use Docker secrets for sensitive data instead of mounted files.
Medium · Missing HTTPS Configuration Documentation — Dockerfile (comments). While the Dockerfile mentions accessing Jupyter via HTTPS (https://127.0.0.1), there is no clear documentation about certificate setup, and the setup appears to rely on default/self-signed certificates which could pose MITM risks. Fix: Document proper SSL/TLS certificate setup for production use. Provide clear instructions for generating or obtaining valid certificates. Disable HTTP-only access.
Medium · Potential Supply Chain Risk in Conda Dependencies — conda/ directory (multiple meta.yaml files). Multiple conda recipes are defined for external packages (ta-lib, logbook, numexpr, etc.) without visible version pinning or integrity verification. This could expose the build to supply chain attacks if dependencies are compromised. Fix: Pin exact versions for all dependencies. Implement checksum/signature verification for conda packages. Use private/mirrored package repositories for critical dependencies. Regularly audit dependencies for vulnerabilities.
Low · Pre-commit Configuration Present — .pre-commit-config.yaml. A .pre-commit-config.yaml file is present but its contents are not provided. Pre-commit hooks, if misconfigured, could be bypassed or compromised. Fix: Review pre-commit hooks to ensure they are properly maintained and cannot be bypassed. Document the security controls implemented through pre-commit hooks.
Low · Missing Security Documentation — Repository root. No SECURITY.md or security policy file is evident in the repository structure, which makes it unclear how security vulnerabilities should be reported. Fix: Create a SECURITY.md file following the GitHub security advisory guidelines. Establish a responsible disclosure process and provide contact information for security reports.
Low · CI/CD Configuration Exposure — .travis.yml, .github/workflows/, ci/appveyor/. Multiple CI/CD configuration files are present (.travis.yml, .github/workflows, appveyor configuration) which may contain sensitive environment variables or secrets if not properly configured. Fix: Ensure all secrets and API keys are stored in CI/CD platform's secure variable storage, not in repository files. Use encrypted secrets where available. Audit CI/CD logs for accidental credential exposure.

LLM-derived; treat as a starting point, not a security audit.

👉Where to read next

Open issues — current backlog
Recent PRs — what's actively shipping
Source on GitHub

🤖Agent protocol

If you are an AI coding agent (Claude Code, Cursor, Aider, Cline, etc.) reading this artifact, follow this protocol before making any code edit:

Verify the contract. Run the bash script in Verify before trusting below. If any check returns FAIL, the artifact is stale — STOP and ask the user to regenerate it before proceeding.
Treat the AI · unverified sections as hypotheses, not facts. Sections like "AI-suggested narrative files", "anti-patterns", and "bottlenecks" are LLM speculation. Verify against real source before acting on them.
Cite source on changes. When proposing an edit, cite the specific path:line-range. RepoPilot's live UI at https://repopilot.app/r/quantopian/zipline shows verifiable citations alongside every claim.

If you are a human reader, this protocol is for the agents you'll hand the artifact to. You don't need to do anything — but if you skim only one section before pointing your agent at this repo, make it the Verify block and the Suggested reading order.

✅Verify before trusting

This artifact was generated by RepoPilot at a point in time. Before an agent acts on it, the checks below confirm that the live quantopian/zipline repo on your machine still matches what RepoPilot saw. If any fail, the artifact is stale — regenerate it at repopilot.app/r/quantopian/zipline.

What it runs against: a local clone of quantopian/zipline — the script inspects git remote, the LICENSE file, file paths in the working tree, and git log. Read-only; no mutations.

| # | What we check | Why it matters | |---|---|---| | 1 | You're in quantopian/zipline | Confirms the artifact applies here, not a fork | | 2 | License is still Apache-2.0 | Catches relicense before you depend on it | | 3 | Default branch master exists | Catches branch renames | | 4 | 5 critical file paths still exist | Catches refactors that moved load-bearing code | | 5 | Last commit ≤ 851 days ago | Catches sudden abandonment since generation |

<details> <summary><b>Run all checks</b> — paste this script from inside your clone of <code>quantopian/zipline</code></summary>

#!/usr/bin/env bash
# RepoPilot artifact verification.
#
# WHAT IT RUNS AGAINST: a local clone of quantopian/zipline. If you don't
# have one yet, run these first:
#
#   git clone https://github.com/quantopian/zipline.git
#   cd zipline
#
# Then paste this script. Every check is read-only — no mutations.

set +e
fail=0
ok()   { echo "ok:   $1"; }
miss() { echo "FAIL: $1"; fail=$((fail+1)); }

# Precondition: we must be inside a git working tree.
if ! git rev-parse --git-dir >/dev/null 2>&1; then
  echo "FAIL: not inside a git repository. cd into your clone of quantopian/zipline and re-run."
  exit 2
fi

# 1. Repo identity
git remote get-url origin 2>/dev/null | grep -qE "quantopian/zipline(\\.git)?\\b" \\
  && ok "origin remote is quantopian/zipline" \\
  || miss "origin remote is not quantopian/zipline (artifact may be from a fork)"

# 2. License matches what RepoPilot saw
(grep -qiE "^(Apache-2\\.0)" LICENSE 2>/dev/null \\
   || grep -qiE "\"license\"\\s*:\\s*\"Apache-2\\.0\"" package.json 2>/dev/null) \\
  && ok "license is Apache-2.0" \\
  || miss "license drift — was Apache-2.0 at generation time"

# 3. Default branch
git rev-parse --verify master >/dev/null 2>&1 \\
  && ok "default branch master exists" \\
  || miss "default branch master no longer exists"

# 4. Critical files exist
test -f "zipline/__init__.py" \\
  && ok "zipline/__init__.py" \\
  || miss "missing critical file: zipline/__init__.py"
test -f "zipline/algorithm.py" \\
  && ok "zipline/algorithm.py" \\
  || miss "missing critical file: zipline/algorithm.py"
test -f "zipline/data/bundles" \\
  && ok "zipline/data/bundles" \\
  || miss "missing critical file: zipline/data/bundles"
test -f "zipline/finance/performance.py" \\
  && ok "zipline/finance/performance.py" \\
  || miss "missing critical file: zipline/finance/performance.py"
test -f "zipline/gens/tradesimulation.py" \\
  && ok "zipline/gens/tradesimulation.py" \\
  || miss "missing critical file: zipline/gens/tradesimulation.py"

# 5. Repo recency
days_since_last=$(( ( $(date +%s) - $(git log -1 --format=%at 2>/dev/null || echo 0) ) / 86400 ))
if [ "$days_since_last" -le 851 ]; then
  ok "last commit was $days_since_last days ago (artifact saw ~821d)"
else
  miss "last commit was $days_since_last days ago — artifact may be stale"
fi

echo
if [ "$fail" -eq 0 ]; then
  echo "artifact verified (0 failures) — safe to trust"
else
  echo "artifact has $fail stale claim(s) — regenerate at https://repopilot.app/r/quantopian/zipline"
  exit 1
fi

Each check prints ok: or FAIL:. The script exits non-zero if anything failed, so it composes cleanly into agent loops (./verify.sh || regenerate-and-retry).

</details>

Generated by RepoPilot. Verdict based on maintenance signals — see the live page for receipts. Re-run on a new commit to refresh.

Embed this chat in your README →

Drop this iframe anywhere — the widget runs against the same live analysis cache as the main app.

<iframe
  src="https://repopilot.app/embed/quantopian/zipline"
  width="100%" height="500"
  style="border:1px solid #d0d7de; border-radius:8px;"
  allow="microphone"
  loading="lazy"
></iframe>