# Active Graph

> An event-sourced reactive graph runtime for long-running, auditable, agentic systems.

Active Graph is an event-sourced reactive graph runtime for
long-running, auditable, agentic systems. Behaviors react to
events, mutate a shared graph, and emit more events; the
event log is the source of truth, so every run is replayable,
forkable, and diff-able from its log. Install with
`pip install activegraph` and run `activegraph quickstart`
for the bundled fixture-backed demo.


# Quickstart

# Quickstart

Ten minutes from install to a working custom behavior. By the end of this tutorial you'll have run the framework, written your own code in it, saved a run, inspected it from the outside, and used the fork-and-diff primitive that's specific to Active Graph and uncommon in other agent frameworks. The seven steps build on each other; do them in order.

If you finish in less than ten minutes, the tutorial isn't broken — you read faster than the average. If you finish in more than fifteen, something is rough; file an issue at [GitHub](https://github.com/yoheinakajima/activegraph/issues) with where you got stuck.

## 1. Install

```bash
pip install activegraph
activegraph --version
```

The bare install includes the runtime, the SQLite store, and the bundled Diligence pack. No API key needed for this tutorial.

## 2. Run the bundled demo

```bash
activegraph quickstart
```

This runs the bundled Diligence pack against recorded fixtures — three companies, three diligence memos, no network, about twenty seconds. The run is byte-deterministic; every machine produces the same output, which is why the snapshot test for this command works.

You'll see four sections of output: a header naming the pack and companies, a long trace, one of the produced memos rendered in full, and two prose sections ("what just happened" and "try next"). Don't worry about reading every trace line yet — step 3 is where we'll look at the trace.

The key beat: that memo came from nothing but a fixture-backed demo, in seconds, with the same output on your machine as on mine. The framework's pitch is "auditable agentic systems"; the deterministic demo is that pitch made tangible.

## 3. Read the trace

Scroll back up to the trace block in the output. The lines starting with `[goal.created]`, `[behavior.started]`, `[llm.requested]`, `[object.created]`, and so on are **events** — the framework's append-only record of everything that happened. Active Graph models the world as a graph of objects connected by typed edges; events are how the graph changes over time.

A few specific lines to find:

- `[pack.loaded]` near the top — when the Diligence pack registered its behaviors, tools, and prompt templates.
- `[goal.created] user: "Diligence: Northwind Robotics"` — when the runtime received its first goal.
- A `[behavior.started]` for `diligence.company_planner` followed immediately by `[object.created] company#1` — the planner behavior fired in response to the goal, and produced a `company` object on the graph.
- `[llm.requested]` and `[llm.responded]` pairs — every LLM call was served by the bundled fixture provider (`RecordedDiligenceProvider`), so no network requests fired. The trace shows `cost=$0.00X latency=0.Xs` on each `llm.responded` line; in a production run against a real provider those numbers would be real costs and real latencies.
- `[runtime.idle] queue empty, budget remaining` at the end of each goal's events — the runtime finished all the work it could do and stopped.

Two layers worth distinguishing now so the vocabulary lands cleanly later: the **provider** is what produces LLM responses (here, the fixture provider; in production, an `AnthropicProvider`). The runtime's **replay cache** is a separate layer that records `llm.responded` events and serves them back when a run replays under strict-replay mode or when `Runtime.fork(at_event=...)` is called in-process — that's where you'll see `cache_hit=true` in the trace. See [`concepts/replay`](https://docs.activegraph.ai/concepts/replay/index.md) and [`concepts/forking`](https://docs.activegraph.ai/concepts/forking/index.md) for the deep dive.

That trace is the framework's audit trail. The same artifact you just read for fun is what you'd read while debugging a production incident. Most agent frameworks don't have this kind of trace, and that's one of the things that makes Active Graph different.

We'll come back to events in more detail in [`concepts/events`](https://docs.activegraph.ai/concepts/events/index.md). For now: the trace is the truth; everything else is a projection of it.

## 4. Write a custom behavior

```bash
activegraph quickstart --interactive
```

This walks you through writing your first behavior. A **behavior** is the framework's unit of reactive code — a Python function decorated with `@behavior` that subscribes to events and produces more events (new objects, new relations, custom events).

The interactive command scaffolds a starter behavior at `./activegraph_quickstart/my_first_behavior.py` and prompts you to edit it. The TODO in the scaffold is a small problem: flag any claim that mentions revenue growth above 25%. You can parse the text with a regex; the framework supplies the integration with the graph.

Open the file in your editor, replace the TODO with the parsing logic, and save. The full file is short — fewer than twenty lines when you're done.

Don't worry about getting the regex perfect. The goal of this step is to feel the shape of writing a behavior: a decorator declaring when it fires, a function body that reads from the event and writes to the graph.

We'll go deeper on behaviors in [`concepts/behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md).

## 5. Run your behavior

Back in the terminal, type `continue` at the prompt. The framework loads your file fresh, runs the Diligence pack against one company, and reports how many times your behavior fired.

Scroll the trace and find your behavior's lines:

```text
[behavior.started]    growth_flagger  (matched object.created: claim#NN)
[event.emitted]       growth.flagged claim_id=claim#NN growth=28
[behavior.completed]  growth_flagger
```

That's your code, running in the same runtime as the Diligence pack, firing on the same events, producing events that downstream behaviors could subscribe to. Your behavior is a first-class citizen of the graph — there's no separate "user behavior" path.

Iterate as much as you want — edit the file, type `continue`, edit again. When you're done, type `quit`. Your file persists at `./activegraph_quickstart/my_first_behavior.py`; keep it, modify it, or delete the directory.

## 6. Save and inspect

The fixture-backed run from step 2 saved itself to `/tmp/activegraph_quickstart/quickstart_demo_run.db`. Try inspecting it:

```bash
activegraph inspect sqlite:////tmp/activegraph_quickstart/quickstart_demo_run.db
```

You'll see a summary: run id, state, budget snapshot, registered behaviors, the tail of recent events. The same data the trace showed, but as a query surface — you read it from outside the run, which means you can read it after the run finishes, after the process exits, after a restart. Active Graph runs persist.

Try a focused query:

```bash
activegraph inspect sqlite:////tmp/activegraph_quickstart/quickstart_demo_run.db \
    --event evt_006
```

That prints the full payload of one event. The event id is what an error message would name if something went wrong; `--event` is how you'd start investigating.

`activegraph inspect --help` shows the full surface. The [CLI reference](https://docs.activegraph.ai/reference/cli/index.md) is the canonical doc; the [debugging cookbook](https://docs.activegraph.ai/cookbook/debugging/index.md) walks through diagnostic workflows that build on these primitives.

## 7. Fork and diff

The closer. Forking is the framework's most differentiated capability — most agent frameworks can't do this. A fork is a new run that shares the parent's event log up to a chosen point, then diverges from there. Combined with a diff against the parent, fork answers the question "what would have happened if I'd configured this differently?"

The full fork-with-override workflow uses both a Python snippet and the `activegraph diff` CLI command. Drop this into a file (`fork_and_diff.py`) and run it:

```python
import sqlite3

from activegraph import Runtime
from activegraph.packs.diligence import DiligenceSettings, pack as diligence_pack
from activegraph.packs.diligence.fixtures import (
    RecordedDiligenceProvider,
    THREE_COMPANIES,
)
from activegraph.store import open_store
from activegraph.store.sqlite import SQLiteEventStore

DB_PATH = "/tmp/activegraph_quickstart/quickstart_demo_run.db"
PARENT_URL = f"sqlite:///{DB_PATH}"
PARENT_RUN = "quickstart_demo_run"
FORK_RUN = "quickstart_cautious"

# Tutorial-only: remove any prior fork so this snippet is re-runnable.
# Real workflows handle fork-id collisions intentionally — pick a
# unique FORK_RUN per experiment instead of deleting the prior one.
with sqlite3.connect(DB_PATH) as _conn:
    deleted = _conn.execute(
        "DELETE FROM events WHERE run_id = ?", (FORK_RUN,)
    ).rowcount
    _conn.execute("DELETE FROM runs WHERE run_id = ?", (FORK_RUN,))
    if deleted:
        print(f"Removed previous fork ({deleted} events) to re-run cleanly.")

# Pick the goal event for the first company as the fork point.
parent_store = open_store(PARENT_URL, run_id=PARENT_RUN)
fork_at = next(
    e.id for e in parent_store.iter_events()
    if e.type == "goal.created"
)

# Copy the parent's events up to the fork point into a new run.
SQLiteEventStore.fork_run(
    DB_PATH,
    parent_run_id=PARENT_RUN,
    new_run_id=FORK_RUN,
    at_event_id=fork_at,
    label="cautious",
    created_at="2026-01-01T00:00:00Z",
)

# Load the fork. The provider matches the parent run's
# RecordedDiligenceProvider, so cached LLM responses from the parent
# replay byte-identically and no network or API key is needed.
fork_rt = Runtime.load(
    PARENT_URL,
    run_id=FORK_RUN,
    llm_provider=RecordedDiligenceProvider(companies=THREE_COMPANIES),
)
fork_rt.load_pack(
    diligence_pack,
    settings=DiligenceSettings(
        llm_model="claude-sonnet-4-5",
        confidence_threshold_for_review=0.9,
    ),
)
fork_rt.run_until_idle()
fork_rt.save_state()

print(f"forked: {FORK_RUN}  (parent: {PARENT_RUN})")
print(f"next:   activegraph diff {PARENT_URL} \\")
print(f"            --run-a {PARENT_RUN} --run-b {FORK_RUN}")
```

The snippet does the fork half of fork-and-diff; the diff half is a CLI command that reads the same SQLite file from outside. The snippet's final two `print` lines spell out the exact command — copy it from the terminal output, or use the block below:

```bash
activegraph diff sqlite:////tmp/activegraph_quickstart/quickstart_demo_run.db \
    --run-a quickstart_demo_run \
    --run-b quickstart_cautious
```

You'll see five counts (shared events, parent-only events, fork-only events, divergent objects, divergent relations) and a list of objects that exist in both runs with different state. On the bundled fixtures the diff produces 61 divergent objects and 49 divergent relations — the threshold change fans out further than you'd guess. The first divergent object is where the threshold change started producing different work.

What you just did: ran the same starting state through a different decision, and got a structural comparison of the results. Hypothesis testing on an agentic system, without losing the parent run. This is what fork-and-diff means in this framework.

The fork-and-diff workflow will collapse into a single `activegraph fork --set` CLI command in v1.1 ([CONTRACT v1.1 #1](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md#v11-1-cli-flags-specd-but-not-implemented)); the Python form above is the v1.0 canonical recipe. For the conceptual deep-dive on forks (shared lineage, cache replay, the strict-vs-permissive replay distinction), read [`concepts/forking`](https://docs.activegraph.ai/concepts/forking/index.md).

## What to read next

You've now run the framework, written your own behavior, persisted a run, queried it from outside, and forked it. That's the loop; everything else is depth on one of these primitives.

In rough order of usefulness from here:

- [`concepts/graph`](https://docs.activegraph.ai/concepts/graph/index.md) and [`concepts/behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — the mental model. Read both in one sitting; together they take about fifteen minutes.
- [`cookbook/common-patterns`](https://docs.activegraph.ai/cookbook/common-patterns/index.md) — recurring idioms with copy-pasteable code. Eight patterns, most of which apply to the kind of agentic systems you'd build on this framework.
- [`cookbook/debugging`](https://docs.activegraph.ai/cookbook/debugging/index.md) — the operator- facing diagnostic walkthrough. Useful when something goes wrong; useful before something goes wrong because it teaches you how the framework's audit trail actually works.
- [`reference/cli`](https://docs.activegraph.ai/reference/cli/index.md) — the full CLI surface.
- [`concepts/failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — the framework's stance on what counts as a recoverable failure. Short, load-bearing, worth reading once.
- [Authoring packs](https://docs.activegraph.ai/quickstart/guides/writing-behaviors.md) and [Writing LLM behaviors](https://docs.activegraph.ai/quickstart/guides/writing-llm-behaviors.md) — for when you're ready to build something larger than a single behavior.

If you're back here on Monday, you found what you were looking for.
# Concepts

# Graph

The graph is the world state of an Active Graph run. Objects sit on it as typed nodes; relations connect them as typed edges. Behaviors react to changes in the graph by emitting more changes. Goals are the inputs operators push in from the outside.

The graph isn't a control-flow structure. It models what the system **knows about**, not what the system **does next**. That's the load-bearing distinction between Active Graph and workflow-graph frameworks (LangGraph, the various DAG runners) — the nodes here are facts and entities, not steps. Steps are behaviors, and behaviors live alongside the graph, not inside it.

## Graph as projection of the event log

The graph is the projection of an append-only event log. Every mutation — `add_object`, `patch_object`, `add_relation`, every behavior fire — emits an event. The event lands in the store, and the graph in memory is updated. `Runtime.load(url, run_id=...)` reconstructs the graph by replaying the events; nothing else is persisted.

This is the framework's most foundational invariant. Other concepts pages link here for it:

- [`events`](https://docs.activegraph.ai/concepts/events/index.md) documents the event types that drive the projection.
- [`replay`](https://docs.activegraph.ai/concepts/replay/index.md) is the operation that uses the projection property to reconstruct state.
- [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) creates a new run by copying a prefix of the event log; the forked graph is the projection of that prefix.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) is why the framework refuses to silently produce events that don't represent real work — the projection would lie.

You can read the graph state at any time:

```python
graph.all_objects()                       # every object
graph.objects(type="claim")               # filtered by type
graph.relations(source=claim_id)          # outgoing edges
graph.relations(target=claim_id)          # incoming edges
graph.relations(type="depends_on")        # by edge type
graph.get_object(object_id)               # by id
```

`graph.relations(source=, target=, type=)` is the canonical filter API on `Graph`; all three kwargs compose by AND, and calling with no kwargs returns every relation. `graph.get_relations(object_id=, type=, direction=)` is an alias preserved for backward compatibility; new code should use `graph.relations(...)`.

But you can't mutate it except through events. There's no `graph.objects["x"] = ...` setter; every mutation goes through a method that emits an event.

## Objects

Objects are typed entities. The type is a string declared by the pack that owns the object type (`@pack(object_types=[...])`) or freeform if no pack declares it. The data is a dict of JSON-encodable values:

```python
claim = graph.add_object("claim", {
    "text": "Q3 revenue grew 28% YoY.",
    "confidence": 0.85,
})
```

Object ids are framework-generated (`IDGen`), monotonic per run, and unique per run. The pack format can declare a schema (Pydantic model) for the object type; if so, the data is validated at `add_object` — see [`pack-schema-violation`](https://docs.activegraph.ai/reference/errors/pack-schema-violation/index.md).

## Relations

Relations are typed edges between objects. The type is a string, the endpoints are object ids, and optional data is a dict on the edge itself:

```python
graph.add_relation(claim.id, evidence.id, "supports", {"strength": 0.9})
```

Relations have ids too (also framework-generated). A relation type can carry a behavior — see [`relations`](https://docs.activegraph.ai/concepts/relations/index.md) for the distinction between passive, rule, and agentic relations.

## Goals

Goals are the inputs operators push in from outside. A goal isn't an object on the graph; it's an event of type `goal.created` that behaviors subscribed to it react to:

```python
rt.run_goal("Diligence: Northwind Robotics")
```

Behaviors on `goal.created` fire first; their output (objects, relations, more events) triggers other behaviors, and the runtime loop continues until the queue is empty.

## What's NOT on the graph

- **Control flow.** The runtime's behavior dispatch is not modeled as graph nodes. The graph models the work product (objects, relations); behaviors are the framework's reactive code.
- **Configuration.** Pack settings, budget limits, the runtime's store URL — none of these are graph state. They're constructor arguments.
- **The event log itself.** The graph is a *projection* of the log; the log itself lives in the store. Read it via `graph.events` (in-memory) or `activegraph inspect` (operator-side).

## What's related

- [`events`](https://docs.activegraph.ai/concepts/events/index.md) — the append-only history that drives the graph projection.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — the reactive code that mutates the graph in response to events.
- [`relations`](https://docs.activegraph.ai/concepts/relations/index.md) — the typed-edge primitive and its optional behaviors.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why the framework refuses to silently bypass the event log.

# Type system

Active Graph has three layers of types: **event types** (framework- defined), **object types** (developer-defined), and **relation types** (developer-defined). One layer is the fixed vocabulary the framework speaks; the other two are the domain vocabulary the developer chooses. A maintainer reading this page for the first time will most likely arrive looking for the answer to one question: *are there framework base types I need to know about?* The answer is no — for objects and relations. The framework ships zero base object types and zero base relation types. The Diligence pack's `claim / evidence / question / memo / …` ontology is an example, not a base.

This page covers the three layers, how they compose, the patch-lifecycle states (the fourth small framework-defined vocabulary), and design guidance for the developer-defined layers.

## The framework-defined layer: event types

Every event has a `type` — a string discriminator that says what happened. The framework emits a fixed set of dotted-namespace event types; user code may emit additional types via `graph.emit` (any string is valid, the dot-namespaced convention is recommended). The fixed set is the framework's vocabulary; the things you can build on top of it.

The complete set of framework-emitted event types:

### Lifecycle

- **`goal.created`** — an operator pushed a goal into the run (`rt.run_goal("…")`). Behaviors subscribed to `goal.created` fire first; the runtime loop continues from their output.
- **`runtime.idle`** — the runtime queue is empty and there is budget remaining; the loop is paused, ready to resume on the next emit.
- **`runtime.budget_exhausted`** — the per-run budget (LLM tokens, wall-clock seconds, behavior fires) was hit; the loop stops with this event as its terminal record.

### Graph mutations

- **`object.created`** — `graph.add_object(...)` succeeded. Payload carries the full object — id, type, data, version, provenance.
- **`object.removed`** — `graph.remove_object(...)` succeeded.
- **`relation.created`** — `graph.add_relation(...)` succeeded. Payload carries source, target, type, data, provenance.
- **`relation.removed`** — `graph.remove_relation(...)` succeeded.

### Behavior dispatch

- **`behavior.scheduled`** — the runtime queued a behavior for dispatch. One per matching subscription on the triggering event.
- **`behavior.started`** — the behavior body began executing.
- **`behavior.completed`** — the body returned without raising.
- **`behavior.failed`** — the body raised; the runtime caught the exception and emitted this event. Payload carries the reason code and structured failure context. See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the events-not-exceptions principle and [`reference/errors`](https://docs.activegraph.ai/reference/errors/index.md) for the closed reason-code taxonomy.
- **`relation_behavior.started`** — a `@relation_behavior` body began; sibling of `behavior.started`, carries the bound relation.

### Patterns

- **`pattern.matched`** — a Cypher-subset pattern subscription matched. Emitted before `behavior.started` for the matched bindings; carries the binding map. See [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md).

### LLM and tools

- **`llm.requested`** / **`llm.responded`** — every LLM call appears as a request/response pair in the event log. Payload carries prompt content hash, model name, recorded-fixture key (in fixture-replay runs), and the response body.
- **`tool.requested`** / **`tool.responded`** — every tool call, same shape. Payload carries the tool name, input, output, and cache-hit status.

### Patches

- **`patch.proposed`** — `graph.propose_patch(...)` or `ctx.propose_object(...)` recorded a proposal. Carries the target id, observed version, intended diff, proposer identity.
- **`patch.applied`** — the proposal succeeded (or `graph.patch_object(...)` shortcut ran). Carries the resulting object version and the computed diff.
- **`patch.rejected`** — the proposal was refused (version conflict, policy refusal, or explicit `reject_patch`). Carries the rejection reason.

### Approvals

- **`approval.proposed`** — a policy-gated mutation produced a pending approval. Carries the approval id and the object/patch it gates.
- **`approval.granted`** — `runtime.approve(approval_id)` resolved a pending approval; the gated mutation lands.

### Pack lifecycle

- **`pack.loaded`** — `runtime.load_pack(...)` succeeded. Carries the pack name, version, object/relation types, behaviors, tools, policies, prompt content hashes, and the canonical settings dump. The pack-load order participates in the replay contract — a loaded run replays the same `pack.loaded` event at the same point in the log.

This list is the framework's stable vocabulary. The cookbook, trace formatter, replay engine, observability metrics, and CLI inspect command all key off these types. Custom event types from user code live alongside them and follow the same shape; the framework treats unknown types as opaque payload carriers.

## The developer-defined layer: object types

`graph.add_object(type, data)` accepts **any string** as the type. There is no central enum, no required `register_object_type(...)` call, no schema-definition step. The framework's stance is that an object type is whatever string identifies the role an object plays in your domain.

```python
graph.add_object("claim", {"text": "Q3 revenue grew 28% YoY.", "confidence": 0.85})
graph.add_object("memo",  {"company_id": "obj_007", "summary": "…"})
graph.add_object("topic", {"name": "battery thermal runaway"})
```

These three calls each produce an `object.created` event with the given type string. The framework does not check the type against anything. The data dict is JSON-encodability-validated and otherwise opaque.

If you come from a typed-schema background (databases, Pydantic, GraphQL, Protobuf), expect a schema-definition step and don't find it — there isn't one. This is intentional. The framework's abstraction surface is *events and reactions*, not *entity-relationship diagrams*. Schemas are useful when you have them; the optional-validation path below shows how to add one.

### Optional: pack-level schema validation

A pack can declare an object type with a Pydantic schema, and the runtime validates `add_object(type, data)` against the schema **after the pack is loaded**:

```python
from pydantic import BaseModel, Field
from activegraph.packs import ObjectType, Pack

class Claim(BaseModel):
    text: str
    confidence: float = Field(ge=0.0, le=1.0)

pack = Pack(
    name="my_pack",
    version="0.1.0",
    object_types=[ObjectType(name="claim", schema=Claim, description="…")],
    # …
)
```

After `runtime.load_pack(pack)`, `add_object("claim", data)` validates `data` against `Claim`; a mismatch raises [`pack-schema-violation`](https://docs.activegraph.ai/reference/errors/pack-schema-violation/index.md). **Validation is post-load and not retroactive** — objects of type `claim` created before the pack loaded stay as-is; objects of types no loaded pack contributes pass through unchanged. This preserves the no-pack default: any string works, any data shape works, you opt into a schema by loading a pack that declares one.

See [`authoring-packs`](https://docs.activegraph.ai/guides/authoring-packs/#4-object-types-and-relation-types) for the full pack-side mechanics.

### Why the type lives on the data, not in a central schema

Validation, when you want it, happens at the **binding moment** — where a behavior consumes an object type, it can declare what fields it expects. A behavior that fires on `object.created` filtered to `type="claim"` and reads `event.payload["object"]["data"]["text"]` is the de facto consumer-side schema: if the field isn't there, the behavior raises and the runtime emits `behavior.failed`. The framework's stance is that this consumer-side discipline carries the weight a central schema would, with the upside that domain ontologies can evolve without a migration step.

## The developer-defined layer: relation types

Same model. `graph.add_relation(source, target, type)` accepts any string. No central registry. A pack may declare endpoint-type rules — "`supports` connects `evidence` to `claim`" — and the runtime enforces them after the pack loads:

```python
from activegraph.packs import RelationType

RelationType(
    name="supports",
    source_types=("evidence",),
    target_types=("claim",),
    description="Evidence supports a claim.",
)
```

Without a pack-declared rule, any source/target/type combination is allowed. Pack-declared rules raise [`pack-schema-violation`](https://docs.activegraph.ai/reference/errors/pack-schema-violation/index.md) on a forbidden endpoint pair.

A relation type can also carry behavior — `@relation_behavior` attaches a rule or LLM body to a type so the type itself owns coordination logic between its endpoints. The relation kind (passive / rule / agentic) is a property of the *type*, not of any individual relation instance. See [`relations`](https://docs.activegraph.ai/concepts/relations/index.md) for that distinction.

## How the three layers compose

The framework's vocabulary is the event types; the domain vocabulary is the object and relation types the developer chooses. The two interlock through behaviors:

1. An operator pushes a `goal.created` event (framework type).
1. A behavior subscribed to `goal.created` runs and creates an object — `graph.add_object("topic", …)` (developer type).
1. The runtime emits an `object.created` event (framework type) carrying the new `topic` object (developer type) in its payload.
1. Behaviors subscribed to `object.created` filtered to `type="topic"` fire — perhaps emitting `tool.requested` (framework type) for a web search, perhaps creating `query` objects (developer type).
1. The cycle continues — every developer-typed mutation produces a framework-typed event; every framework-typed event can trigger more developer-typed mutations.

The discipline: the framework speaks a small fixed vocabulary about *what happened*; the developer speaks a domain vocabulary about *what kind of thing it happened to*.

## Patch lifecycle states

The fourth small framework-defined vocabulary: a patch's `status` field. Three values, defined on `core/patch.py`:

- **`proposed`** — the patch was recorded as a `patch.proposed` event but has not yet been applied or rejected.
- **`applied`** — the patch reached its terminal "applied" state via `graph.apply_patch(patch_id)` (or the `patch_object` auto-apply shortcut). Emits `patch.applied`.
- **`rejected`** — the patch reached its terminal "rejected" state via `graph.reject_patch(patch_id, reason)` or via the optimistic-concurrency version check at apply time. Emits `patch.rejected`.

`proposed` is the only non-terminal state. Re-applying or re-rejecting a terminal patch raises [`invalid-patch-lifecycle-state`](https://docs.activegraph.ai/reference/errors/invalid-patch-lifecycle-state/index.md). See [`patches`](https://docs.activegraph.ai/concepts/patches/index.md) for the canonical lifecycle prose; this list exists here so the type-system page enumerates every framework-defined vocabulary in one place.

## Designing an ontology

Because object and relation types are developer-defined, **the ontology is part of the system you're building**. Three rules that survive scrutiny across the v0.7 / v0.9 / external-research- agent ontologies the framework has been built and tested against:

**Object types are nouns describing roles in the domain, not data bags.** `claim`, `evidence`, `question`, `risk` each name a role something plays in a diligence workflow; a behavior that fires on `object.created` type-filtered to one of them knows what kind of thing it's reacting to. A generic `record` or `entity` type that holds arbitrary data is a smell — the type discriminator has collapsed and behaviors lose the ability to subscribe selectively. The external user-test on a deep-research agent surfaced this explicitly: a first pass used `data` as the type for everything, and behaviors had to inspect payload shape to dispatch. The second pass split into `topic / query / fact / report`, and behaviors became one-liners on `on=["object.created"], where=lambda e: e.payload["object"]["type"] == "topic"`.

**Relation types are verbs or predicates describing meaningful structure.** `supports`, `contradicts`, `depends_on`, `references`, `derived_from` each describe a relationship that something downstream cares about. A generic `related_to` is a smell — it collapses the type discriminator the same way a generic object type does, and pattern subscriptions on the relation type stop being useful. Verbs that read naturally in the call site (`graph.add_relation(evidence, claim, "supports")` reads as "evidence supports claim") are the heuristic.

**Keep the vocabulary small.** Eight to fifteen object types covers most domains. The Diligence pack ships eight object types and six relation types and is intentionally on the small end of that range — packs that try to model everything tend to model nothing. New types earn their place when an actual behavior or query needs to distinguish them; future-proofing with speculative types pollutes the ontology without adding behavior.

The discipline carries the weight that a central schema would: the *type itself* is the consumer-side contract. When a behavior fires on `type="claim"` it expects `claim` semantics; when it emits a `supports` relation it commits to `supports` semantics. Multiple behaviors agreeing on what those names mean is the ontology, and it's encoded in the behavior bodies — not in a schema file.

## Worked example: the Diligence pack ontology

The shipped Diligence pack is a concrete, well-designed type vocabulary. It is **an example ontology, not framework base types** — you would design your own for your domain. The pack is documented here so the design pattern is visible.

Eight object types (`activegraph/packs/diligence/object_types.py`):

| Type            | Role                                           |
| --------------- | ---------------------------------------------- |
| `company`       | The target of a diligence run.                 |
| `document`      | A source document the researcher pulled in.    |
| `question`      | A research question generated from the thesis. |
| `claim`         | A factual statement about the company.         |
| `evidence`      | A verbatim quote supporting a claim.           |
| `contradiction` | A detected conflict between two claims.        |
| `risk`          | A material risk identified during diligence.   |
| `memo`          | The final diligence memo for a company.        |

Six relation types:

| Type           | Endpoints (source → target)      | Meaning                                       |
| -------------- | -------------------------------- | --------------------------------------------- |
| `addresses`    | `claim` → `question`             | A claim addresses a research question.        |
| `supports`     | `evidence` → `claim`             | Evidence supports a claim.                    |
| `contradicts`  | `claim` → `claim`                | Two claims are in conflict.                   |
| `references`   | `{claim, memo}` → `document`     | A claim or memo references a source document. |
| `derived_from` | `{claim, evidence}` → `document` | Provenance back to a source document.         |
| `mitigates`    | `{evidence, claim}` → `risk`     | Evidence or a claim mitigates a risk.         |

Each object type carries a Pydantic schema (validated when the pack is loaded); each relation type pins its endpoints. Together they form a small graph ontology that a small set of behaviors (claim extractor, contradiction detector, memo synthesizer) operates on. None of these types are special to the framework; load a different pack and you get a different ontology.

The Diligence pack is the [reference pack](https://docs.activegraph.ai/reference/api/packs/diligence/index.md); [`authoring-packs`](https://docs.activegraph.ai/guides/authoring-packs/index.md) is the how-to for building your own.

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — objects and relations as projections of the event log; the "graph as projection" principle.
- [`events`](https://docs.activegraph.ai/concepts/events/index.md) — the append-only history and how framework event types drive behavior dispatch.
- [`relations`](https://docs.activegraph.ai/concepts/relations/index.md) — the three relation kinds (passive / rule / agentic) and when to attach behavior to a relation type.
- [`patches`](https://docs.activegraph.ai/concepts/patches/index.md) — the patch lifecycle in full; this page only enumerates the state values.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — the `behavior.failed` reason-code taxonomy that lives on the event payload.
- [`authoring-packs`](https://docs.activegraph.ai/guides/authoring-packs/index.md) — declaring object types, relation types, and their Pydantic schemas in a pack.
- [Diligence pack reference](https://docs.activegraph.ai/reference/api/packs/diligence/index.md) — the worked example ontology rendered from source.

# Events

An event is an immutable record of something that happened in a run. Events are append-only — once an event lands in the store, nothing modifies it. The graph state is a projection of the event log (see [`graph`](https://docs.activegraph.ai/concepts/graph/index.md)); behaviors fire by subscribing to events and producing more events.

The event log is the source of truth. Everything else — the graph, the trace, the audit history — is derived from it.

## The structure

Every event has:

- `id` — framework-generated, monotonic per run, unique per run.
- `type` — a string discriminator. Framework events use a dotted namespace (`object.created`, `behavior.completed`, `runtime.idle`); user code emits custom types via `graph.emit` (any string is valid, but the dot-namespaced convention is recommended).
- `payload` — a dict of JSON-encodable values. The framework enforces JSON encodability at emit time; see [`non-serializable-event-error`](https://docs.activegraph.ai/reference/errors/non-serializable-event-error/index.md).
- `actor` — who or what produced the event. `"user"` for goals pushed in from outside, `"runtime"` for framework-emitted events, a behavior name for behavior-emitted events.
- `caused_by` — the id of the event that triggered the behavior that produced this one. The causal chain is reconstructable by walking `caused_by` back to a root event (`goal.created`, typically).
- `timestamp` — ISO 8601, set at emit time. Used for the trace display; behavior bodies must not depend on it for determinism (see [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — the determinism contract).

## The framework event types

Events emitted by the runtime itself fall into a small set of families:

- **Lifecycle**: `goal.created`, `runtime.idle`, `runtime.budget_exhausted` — boundary events around a run.
- **Object mutations**: `object.created`, `object.patched`, `object.removed` — every graph mutation lands as one of these.
- **Relation mutations**: `relation.created`, `relation.removed`.
- **Behavior dispatch**: `behavior.started`, `behavior.completed`, `behavior.failed`, `behavior.scheduled` — what the runtime did while running behaviors.
- **Pattern matching**: `pattern.matched` — emitted before `behavior.started` when the behavior used a pattern subscription; carries the match count.
- **LLM / tool**: `llm.requested`, `llm.responded`, `tool.requested`, `tool.responded` — every LLM call and every tool call appears as a request/response pair.
- **Patches**: `patch.proposed`, `patch.applied`, `patch.rejected` — the patch lifecycle.
- **Approvals**: `approval.proposed`, `approval.granted` — the policy-gated approval lifecycle.
- **Pack lifecycle**: `pack.loaded` — emitted once per `runtime.load_pack` call, carries the pack name, version, and prompt content hashes.

Custom event types from user code live alongside these and follow the same shape. Behaviors subscribe to either set with the same `on=` argument.

## Append-only and what that means

Once an event is in the store, it doesn't change. No edit, no delete, no truncate (except via the explicit `truncate_after` primitive, which is operator-side, not behavior-side). This is the property that makes replay work: [`Runtime.load`](https://docs.activegraph.ai/guides/operating-in-production/index.md) reads the event log and produces the same graph state every time.

Three consequences:

- **There's no "current value" of an object outside its event history.** An object's data is the result of applying every `object.created` and `object.patched` event for that object id, in order. The in-memory `Object.data` dict is a cache of that computation, not an authoritative store.
- **Operations that look like mutations are emissions.** `add_object` emits `object.created`; `patch_object` emits `object.patched`; `remove_object` emits `object.removed`. The graph in memory updates as a side effect of the emit.
- **The audit trail is automatic.** Anything that happened in a run is in the event log. Nothing else is needed for audit — there's no separate audit-log subsystem because the event log is the audit log.

## Events vs exceptions

The framework distinguishes two failure modes: exceptions for caller-actionable problems the caller can catch at the call site, events for non-fatal stops the audit trail should record and the runtime should continue past. Behavior failures, tool failures, budget exhaustion, and approval denials are events. Construction errors, lookup misses, replay divergence, and pattern syntax errors are exceptions.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the full principle and why the framework treats them differently. The principle was load-bearing across the v1.0 audit and is referenced from most other concept pages — `failure-model.md` is the canonical statement.

## Reading the event log

The event log is available three ways:

```python
# In-memory, current run:
for event in graph.events:
    ...

# From the store, by run id:
from activegraph.store import open_store
store = open_store(url, run_id)
for event in store.iter_events():
    ...

# CLI, operator-side:
# activegraph inspect <url> --run-id <run> --tail 50
# activegraph inspect <url> --event <event_id>
```

The trace printer (`Runtime.print_trace()`) is the human-readable projection of the event log — same data, formatted with tags and short summaries for visual scanning. The trace is informational; the events are the data.

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — the projection of the event log. Owns the "graph as projection" principle.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — the reactive code that subscribes to events.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — the events-vs-exceptions distinction.
- [`replay`](https://docs.activegraph.ai/concepts/replay/index.md) — the operation that uses the append-only property to reconstruct state.

# Behaviors

A behavior is the framework's unit of reactive code. It subscribes to events, runs when its subscription matches, and produces more events (new objects, new relations, patches, custom events). The runtime dispatches behaviors against events in the queue until the queue is empty.

Behaviors are how a developer adds custom logic to the framework. Most code that ships with a pack is behaviors. Most code a developer writes is behaviors.

A behavior is **not an agent.** It doesn't decide what to do — it reacts. The decision is the subscription rule; the work is the body. An agentic-feeling system emerges from many small behaviors firing in response to each other's outputs, not from one agent-orchestrator behavior calling everything else.

## The decorator

```python
from activegraph import behavior

@behavior(
    name="contradiction_detector",
    on=["object.created"],
    where={"object.type": "claim"},
    pattern="(c:claim)-[:contradicts]->(other:claim)",
    view={"around": "event.payload.object.id", "depth": 1},
    activate_after=1,
)
def contradiction_detector(event, graph, ctx):
    for match in ctx.matches:
        ...
```

Every argument is a separate activation condition; the behavior fires when **all** of them hold:

- `on=` — event types the behavior subscribes to. Most behaviors subscribe to a single type (`object.created`, `goal.created`, custom event names). Match-all is allowed with `on=["*"]` but rarely useful.
- `where=` — a dict-shaped filter on the event payload. Equality on values; nested keys via dotted paths.
- `pattern=` — a Cypher-subset pattern subscription. The behavior fires only when the pattern matches the graph at event time. See [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) for the locked subset and grammar.
- `view=` — a scoped view of the graph passed to the behavior body via the `ctx.view` accessor. Default is the full graph; narrow via `around=` + `depth=` to limit what the behavior reads.
- `activate_after=` — schedule the behavior to fire N events after the triggering event. Integer event count only; wall-clock units are refused (see [`invalid-activate-after`](https://docs.activegraph.ai/reference/errors/invalid-activate-after/index.md)).

## The signature

```python
def my_behavior(event, graph, ctx):
    ...
```

- `event` — the triggering event, with `id`, `type`, `payload`, `actor`, `caused_by`, `timestamp`.
- `graph` — the graph as it existed at event time, scoped by the `view=` argument.
- `ctx` — the runtime-bound context, with `.matches` (pattern bindings), `.view` (the scoped graph), `.propose_object` (the approval-gated add path), and a few framework-internal hooks.

The body mutates the graph by calling `graph.add_object`, `graph.patch_object`, `graph.add_relation`, `graph.remove_object`, or emits arbitrary events via `graph.emit(type, payload)`. Each mutation lands as an event in the log; downstream behaviors react.

## The three behavior kinds

- **Regular `@behavior`** (function or class) — the workhorse. Reacts to events, mutates the graph. Synchronous, deterministic.
- **`@llm_behavior`** — wraps a function whose return value comes from an LLM call. The framework handles the prompt assembly, the provider call, the cache, the tool loop, and the schema validation; the body receives the parsed LLM output and turns it into graph mutations. See the [LLM behavior guide](https://docs.activegraph.ai/concepts/guides/writing-llm-behaviors.md).
- **`@relation_behavior`** — attached to a relation type rather than an event type. Fires when an event affects an endpoint of the relation. See [`relations`](https://docs.activegraph.ai/concepts/relations/index.md).

## The determinism contract

Behavior bodies must be **deterministic given their inputs**. Same event, same graph state, same view → same mutations. This is the load-bearing assumption that makes replay and forking work. Two practical consequences:

- **No `random`, no `datetime.now()`, no `uuid.uuid4()` in behavior bodies.** If you need randomness or wall-clock time, get it from the event (which carries the recorded timestamp) or from the runtime's deterministic id generator (`graph.ids`).
- **No I/O outside the framework's primitives.** Network calls go through `@tool` so the framework can cache and replay them. LLM calls go through `@llm_behavior` so the prompt-hash cache works. Direct `requests.get` in a behavior body breaks replay determinism in a way the framework can't recover from.

The framework doesn't enforce determinism with static analysis; the discipline is on the developer. The cost of breaking it is a fork that produces a different result from its parent — see [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md).

## The failure model

When a behavior body raises, the runtime catches the exception and emits a `behavior.failed` event with the original exception's type, message, and (for LLM/tool errors) the structured `reason` code. The exception does NOT escape to your code — the loop continues, other behaviors keep firing, and the operator sees the failure in the trace.

Code that wants to react to failures subscribes to `behavior.failed`. The retry-behavior pattern is the canonical idiom:

```python
@behavior(
    on=["behavior.failed"],
    where={"reason": ["llm.network_error", "tool.timeout"]},
)
def retry_transient(event, graph, ctx):
    ...
```

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the events-not-exceptions principle and [`llm-behavior-error`](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md) / [`tool-error`](https://docs.activegraph.ai/reference/errors/tool-error/index.md) for the LLM/tool failure shapes specifically.

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — the world state behaviors react to and mutate.
- [`events`](https://docs.activegraph.ai/concepts/events/index.md) — the append-only history behaviors subscribe to.
- [`relations`](https://docs.activegraph.ai/concepts/relations/index.md) — the typed-edge primitive and `@relation_behavior`.
- [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) — the Cypher-subset pattern subscription primitive.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — what happens when a behavior body raises.
- [Writing behaviors](https://docs.activegraph.ai/concepts/guides/writing-behaviors.md) — the how-to guide.

# Relations

A relation is a typed edge between two objects on the graph. Like objects, relations have a type (string), an id (framework-generated), optional data (dict of JSON-encodable values), and they live in the event log — created by `add_relation`, removed by `remove_relation`, each transition emitted as an event.

What makes relations distinctive in this framework is that the relation type itself can carry behavior. A relation isn't just a passive edge for the graph projection to render; it can be a rule that fires when its endpoints change, or an agentic actor with its own LLM-backed reasoning. The relation type is the unit of coordination logic between its endpoints.

This is the framework's most differentiated primitive. Most graph frameworks have nodes-with-behavior; relations-with-behavior is where Active Graph diverges.

## The three relation kinds

Three flavors of relation type, on a spectrum of how much logic the relation itself owns:

- **Passive.** No behavior attached. The relation is structural data — it exists, pattern subscriptions can match on it, behaviors on the endpoints can read it. The vast majority of relations are passive (`supports`, `contradicts`, `cites`, `depends_on`).
- **Rule.** A `@relation_behavior` attached to the type. Fires deterministically when an event affects either endpoint of any relation of that type. Used for coordination logic that semantically belongs to the relationship, not to either endpoint (e.g., a `depends_on` relation that auto-blocks the dependent when the dependency changes status).
- **Agentic.** A `@relation_behavior` that wraps an LLM call (same `@llm_behavior` machinery, but anchored on relation events). Used when the coordination logic needs LLM reasoning — e.g., a `contradicts` relation that drafts a contradiction-resolution memo when both endpoint claims change.

The three flavors share the same event types (`relation.created`, `relation.removed`) and the same data representation. The flavor is a property of the relation *type*, not of any individual relation instance.

## The `@relation_behavior` decorator

```python
from activegraph import relation_behavior

@relation_behavior(
    name="auto_unblock",
    relation_type="depends_on",
    on=["task.completed"],
)
def auto_unblock(relation, event, graph, ctx):
    if event.payload["task_id"] == relation.source:
        graph.patch_object(relation.target, {"status": "open"})
```

The body receives the `relation` (the typed edge instance), the triggering `event`, the `graph`, and the `ctx`. The relation behavior fires once per relation that matches — if three `depends_on` edges all point at the same source and the source's `task.completed` event fires, the body runs three times, once per edge, each call with that edge as `relation`.

The decorator's `relation_type=` argument narrows dispatch to one type. Other arguments (`on=`, `where=`, `pattern=`) work the same as on regular `@behavior`. See [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) for the full activation model.

## When to use a relation behavior vs a regular behavior

The test: **does the coordination logic semantically belong to the relationship, not to either endpoint?**

- A `depends_on` relation auto-unblocking the dependent when the dependency completes → relation behavior. The unblock logic is about the relationship, not about either task in isolation.
- A `claim` getting flagged when its `confidence` drops below 0.5 → regular behavior on `object.patched`. The flag is about the claim itself; no relationship is involved.
- A `contradicts` relation drafting a resolution memo when both endpoints change → agentic relation behavior. The reasoning needs both endpoints' state; it's relationship logic, not endpoint logic.

When the test is ambiguous (the logic could go either way), default to regular behaviors. They're more discoverable — they show up under the endpoint's type in `inspect --behaviors`, and the coordination logic appears as a single behavior fire rather than N fires (one per matching edge).

## Pattern subscriptions and relations

Pattern subscriptions match on relations naturally. The Cypher-subset syntax `(a:type1)-[r:rel_type]->(b:type2)` binds both endpoints and optionally the relation itself. See [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) for the binding rules and when to use the `r` variable vs the bare `-[:rel_type]->` form.

A behavior with a pattern subscription that mentions a relation type fires when the pattern matches — which is a different activation mechanism from `@relation_behavior` (which subscribes to events on relation endpoints rather than to graph structure). Both are valid; pick by which question you're asking: "fire when this edge plus this surrounding structure exist" (pattern) vs "fire when something happens to either end of any edge of this type" (relation behavior).

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — the world state relations sit on. Relations are projections of `relation.created` / `relation.removed` events, same as objects.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — the broader behavior model. `@relation_behavior` is a sibling of `@behavior` / `@llm_behavior`.
- [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) — pattern subscriptions that match on relation structure.
- [Writing relation behaviors](https://docs.activegraph.ai/concepts/guides/writing-behaviors.md) — practical how-to; the decision rules for relation vs regular behavior get more attention there.

# Patches

A patch is a proposed mutation to the graph, recorded as an event before the mutation happens. Patches are how the framework keeps the audit trail honest about who proposed what change, what version of the target they observed, and whether the change succeeded or was refused.

A direct `graph.patch_object(target, diff)` call also lands in the event log (as `object.patched`), but the patch primitive is different: it's a **two-phase** operation. The first phase records the proposal as a `patch.proposed` event, with the proposer's identity, the version of the target they observed, and the intended diff. The second phase applies (success) or rejects (refusal), emitting `patch.applied` or `patch.rejected`.

The two phases let policies, behaviors, or operators sit between proposal and application. A pack's `memo_approval` policy is the canonical example — `ctx.propose_object` produces a pending approval, the operator (or an auto-approve setting) calls `runtime.approve(id)`, and the object lands only at that point. Without the two-phase shape, the policy would have to fire after the mutation, which is too late.

## The lifecycle

A patch begins in `'proposed'` and ends in exactly one of two terminal states:

```text
            proposed ──apply──> applied
                |
                └──reject────> rejected
```

Both transitions are one-shot. A `'proposed'` patch becomes `'applied'` exactly once (via `graph.apply_patch(patch_id)`) or `'rejected'` exactly once (via `graph.reject_patch(patch_id, reason)`). Re-calling either on an already-terminal patch raises [`invalid-patch-lifecycle-state`](https://docs.activegraph.ai/reference/errors/invalid-patch-lifecycle-state/index.md) — the framework refuses to emit a duplicate `patch.applied` event because that would break the replay contract.

Each transition emits an event:

- `patch.proposed` — carries the proposer, target id, observed version, diff, and any provenance metadata.
- `patch.applied` — carries the patch id, the resulting object version, and the mutation outcome.
- `patch.rejected` — carries the patch id and the rejection reason.

The events sit in the log alongside everything else. Downstream behaviors can subscribe to them, the trace renders them, and replay reconstructs the full proposal-and-decision sequence.

## Optimistic concurrency on object versions

Every object carries a version that increments on each mutation. When a behavior proposes a patch, the proposal records the version of the target at proposal time. When `apply_patch` runs, it checks whether the target's current version still matches the recorded one. If not, the patch is refused with a version-conflict reason.

The rule: **two behaviors that observed the same starting version can both propose patches, but only the first to apply succeeds.** The second sees the version drifted and reads its own outcome from the rejected event — usually re-reading the target and proposing a new patch against the new version.

The concurrency model is optimistic by design. Locks would serialize behavior dispatch and break the parallel-firing model that pattern subscriptions and event fan-out depend on. Version checks at apply-time keep the audit trail honest without serializing.

## When to use patches vs direct mutation

The test: **is this change durable or audit-critical?**

- **Yes** — use a patch. Pack policies gating writes, multi-step workflows where the proposal needs to survive operator review, any state change a downstream behavior might subscribe to via `patch.proposed`. The two-phase shape is the right primitive here.
- **No** — direct mutation is fine. Adding a new object, appending to a graph that has no concurrency contention, emitting an event whose payload doesn't represent durable state. `graph.add_object` and `graph.emit` cover most of this.

The default is direct mutation. Patches are for the cases where the two-phase shape earns its weight — when proposal and decision are semantically distinct operations the audit trail should record separately. Most behaviors mutate directly; a small number of policy-gated behaviors propose.

## The events-not-exceptions principle applied to patches

Patch rejection is a `patch.rejected` event, not an exception. A behavior that proposes a patch and finds it rejected reads the rejection from the event log; the runtime continues without interrupting. The rejection is not a failure — it's a normal outcome of the two-phase shape.

The exception case is misuse of the primitive: calling `apply_patch` on a patch that's already in a terminal state. That fires [`invalid-patch-lifecycle-state`](https://docs.activegraph.ai/reference/errors/invalid-patch-lifecycle-state/index.md) because the caller can fix the bug at the call site (check status before applying) and silently no-op'ing would emit a duplicate event.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — the world state patches modify. Patches are projections of `patch.proposed`, `patch.applied`, and `patch.rejected` events, same as objects and relations.
- [`events`](https://docs.activegraph.ai/concepts/events/index.md) — the append-only history that records every patch transition.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — what proposes and applies patches. `ctx.propose_object` is the policy-gated path.
- [`policies`](https://docs.activegraph.ai/concepts/policies/index.md) — the mechanism that gates patches through approval flows.
- [`replay`](https://docs.activegraph.ai/concepts/replay/index.md) — the operation that reconstructs the full proposal-and-decision sequence from the event log.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why patch rejection is an event but patch-lifecycle misuse is an exception.
- [`invalid-patch-lifecycle-state`](https://docs.activegraph.ai/reference/errors/invalid-patch-lifecycle-state/index.md) — the exception for misuse of the patch primitive.

# Views

A view is a scoped read of the graph. Behaviors observe the graph through views; patches and direct mutations are how they write back. Patches and views are the read/write counterparts in the framework's behavior model — patches own the write side and the audit trail, views own the read side and the cost surface.

A view is computed per-invocation. The framework doesn't cache views across behavior fires; each call to a behavior receives a freshly-scoped view of the graph as it exists at event time. That's the read-side equivalent of patches' optimistic concurrency on the write side — both primitives let parallel behaviors operate on consistent snapshots without locks.

## The scoping arguments

Views are declared on the behavior decorator and accessed via `ctx.view` in the body:

```python
@behavior(
    on=["object.created"],
    where={"object.type": "claim"},
    view={"around": "event.payload.object.id", "depth": 2},
)
def claim_with_neighbors(event, graph, ctx):
    claim = ctx.view.get_object(event.payload["object"]["id"])
    for related in ctx.view.objects():
        ...
```

Two arguments control the scope:

- `around=` — an expression evaluated against the triggering event that names the object the view centers on. Most commonly the triggering object's id (`event.payload.object.id`); also accepts a literal id, a list of ids, or `None` for a full-graph view.
- `depth=` — how many relation hops to include from the `around=` center. `depth=0` includes only the center object; `depth=1` includes its direct neighbors; `depth=2` includes neighbors of neighbors.

The full graph is available via `ctx.view` regardless of scope — the scope determines what the view's accessor methods return by default, not what's reachable. A scoped view's `objects()` returns objects in the scope; the underlying `graph` is still accessible if a behavior needs the unscoped read.

## Read-only contract

Views never mutate. The view accessor methods (`objects()`, `relations()`, `get_object()`) return existing graph data; there's no write path through `ctx.view`. Mutations go through `graph` (or `ctx.propose_object` for the policy-gated path), not through the view.

The separation is intentional. A behavior that observes through a narrow view and mutates through the full graph is the common pattern; the framework refuses to fuzz the read/write surfaces because mutations through a scoped accessor would silently miss relevant state outside the scope.

## How views compose with patterns

Pattern subscriptions and view scoping serve different jobs:

- The **pattern** selects which events fire the behavior. The pattern matcher reads the full graph (it has to, to evaluate the structural conditions), and produces `ctx.matches` — one entry per binding combination that satisfies the pattern.
- The **view** scopes what the behavior body reads during execution. Once the behavior is firing, the view determines what `ctx.view.objects()` returns.

The two can be different scopes. A pattern can match on a two-hop structural condition while the view is one-hop — the match identifies the event, the view bounds the work.

Pattern bindings (`ctx.matches[i].bindings`) are object ids; the behavior can look them up against `ctx.view` when they're in scope, or against `graph` directly when the pattern matched on objects outside the view's scope.

See [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) for the pattern subscription model in detail.

## Why scope views

Scoping is the framework's main cost-efficiency lever for LLM behaviors. An LLM behavior passes its view to the prompt assembler as serialized objects; the bigger the view, the bigger the prompt, the higher the cost per call.

A behavior on a single claim probably doesn't need the full diligence pack's graph in its prompt — `view={"around": "event.payload.object.id", "depth": 1}` keeps the prompt focused and predictable. The cost saving compounds: 100 claim-extraction calls × 50% smaller prompt × $X/token adds up.

Non-LLM behaviors benefit too, more subtly — narrow views are cheaper to construct and iterate. The cost is smaller per-call but the rule still holds: scope to what the behavior actually needs.

## What a view is not

Three things views explicitly are not:

- **Not a query language.** The framework deliberately doesn't have a query language beyond pattern subscriptions. Views are scoping declarations, not queries. If you find yourself wanting to filter view results by complex conditions, you're reaching for the wrong primitive — use a pattern subscription instead.
- **Not a graph snapshot.** Views are computed per-invocation, not cached. A behavior firing twice on two events gets two fresh views; the framework doesn't cache or invalidate.
- **Not a subscription primitive.** Patterns subscribe; views scope. The behavior fires because of `on=` / `where=` / `pattern=`; the view only determines what the body reads after it fires.

The negative space matters because views are easy to over-interpret as "the LangChain retriever" or "the query DSL." They're neither. They're scoping declarations on the read side of behaviors.

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — the world state views observe.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — where `view=` is declared and `ctx.view` is used.
- [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) — the subscription primitive that determines when behaviors fire; views determine what they read.
- [`patches`](https://docs.activegraph.ai/concepts/patches/index.md) — the write-side counterpart. Behaviors read through views and write through patches (or direct mutation).
- [Writing LLM behaviors](https://docs.activegraph.ai/concepts/guides/writing-llm-behaviors.md) — practical guidance on view scoping for cost efficiency.

# Frames

A frame is a bounded context for behavior dispatch. Events carry a `frame_id`; behaviors can filter their subscription by `frame_id`; a run can contain multiple frames running in parallel without their behaviors crossing wires.

Frames are an optional primitive. Most uses of the framework don't need them — a single-frame run is the default and covers every example in the quickstart, the diligence pack, and the cookbook patterns. **If you're not sure whether you need frames, you probably don't.** Use them when you need to scope behavior dispatch beyond a single event-type filter.

## What frames are for

Three use cases where frames earn their weight:

- **Multi-tenant graph state.** One runtime, many tenants. Each tenant gets a frame; behaviors filter by `frame_id` to keep tenant A's events from triggering work on tenant B's data. Without frames, the same separation would require either a per-tenant runtime (heavy) or `where=` filters on every behavior (error-prone).
- **Parallel hypothesis exploration before fork is appropriate.** When the framework is reasoning about multiple hypotheses simultaneously and you want each to be a distinct context but don't yet want the fork primitive's cost (a fork is a separate run; frames are sub-contexts within one run). Useful for short-lived parallel reasoning that converges back to a single output.
- **Structured conversations.** When a long-running goal has multiple distinct sub-tasks, each with its own behavior dispatch logic, frames are the bounded-context primitive for the sub-task. Each sub-task's events stay in its own frame; the sub-task's behaviors filter on the frame_id.

## How frames scope dispatch

A behavior with `frame_id="..."` in its `where=` clause fires only on events from that frame:

```python
@behavior(
    on=["object.created"],
    where={"frame_id": "tenant_a"},
)
def tenant_a_only(event, graph, ctx):
    ...
```

Without the filter, the behavior fires on events from every frame in the run. The runtime doesn't auto-scope behaviors by frame — the explicit filter is the contract.

Frame ids are strings, framework- or developer-generated. Framework-generated frames use the same monotonic id pattern as events; developer-generated frames can use semantically-meaningful names (`tenant_a`, `hypothesis_left`, `sub_task_42`).

## Relationship to runs

A run is the framework's top-level unit (one event log, one store binding). A frame is a sub-context within a run.

- **One run, one frame** — the default. Every event in the run belongs to the same frame; behaviors don't need to filter by `frame_id`.
- **One run, many frames** — the use cases above. Events from different frames coexist in the same event log; behaviors that care about isolation filter explicitly.
- **Many runs, many frames** — also valid; each run's frames are independent. Common when multi-tenant systems shard tenants across multiple runs and also frame within each.

Frames don't cross runs. A frame is run-scoped — moving a frame between runs would mean copying events, which is what fork and migrate are for.

## Frames vs forks

Both let parallel computations proceed in isolation. The difference is durability and replay:

- **Fork** is a separate run with shared event-log lineage up to the fork point. Forks are durable and replayable independently. Use fork when the parallel branches might diverge permanently or need independent persistence.
- **Frames** are sub-contexts within one run. The frame's events live in the same event log as everything else; replay is the whole run. Use frames when the parallel contexts are short-lived or semantically belong together.

A common pattern is to start parallel work in frames and fork only the branches that prove worth keeping. See [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) for the fork primitive.

## What's related

- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — where `frame_id` filters appear in `where=`.
- [`events`](https://docs.activegraph.ai/concepts/events/index.md) — events carry `frame_id`; the field is in the event structure.
- [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) — the durable parallel-context primitive; complements frames.
- [`replay`](https://docs.activegraph.ai/concepts/replay/index.md) — frames replay as part of their run; there's no per-frame replay primitive.

# Pattern subscriptions

Behaviors fire on event types by default (`@behavior(on=["object.created"])`). For richer triggers — match an event against the current graph and fire only when a specific structural pattern holds — behaviors subscribe to a **pattern** instead.

Pattern subscriptions are a first-class activation primitive, alongside event-type subscriptions and `where=` filters. A behavior can use any combination of the three; all three conditions must hold for the behavior to fire.

## Syntax

Patterns are written in a strict subset of Cypher:

```python
@behavior(
    name="risk_escalator",
    pattern="(c:claim)-[:supports]->(e:evidence) WHERE c.confidence > 0.7",
)
def risk_escalator(event, graph, ctx):
    for match in ctx.matches:
        claim = match.bindings["c"]
        evidence = match.bindings["e"]
        ...
```

`ctx.matches` is a list of `Match` objects, one per distinct binding combination that satisfies the pattern. Iteration is the developer's responsibility — the framework does not collapse matches into a single fire-per-event; each match is exposed and the behavior body decides what to do with them.

## What the v0.7 subset supports

- **Node patterns:** `(var:type {prop: value, ...})`. Properties are equality-only; comparisons go in `WHERE`.
- **Relationship patterns:** `(a)-[var:rel_type]->(b)` and `(a)<-[var:rel_type]-(b)`. Direction is required.
- **Multi-hop:** `(a)-[:r1]->(b)-[:r2]->(c)`.
- **`WHERE` clauses:** comparisons (`=`, `<>`, `<`, `<=`, `>`, `>=`), `AND`, `NOT`, `NOT EXISTS { ... }`.

The full grammar is enforced by the parser in `activegraph/runtime/patterns.py`. Anything outside the subset raises [`UnsupportedPatternError`](https://docs.activegraph.ai/reference/errors/unsupported-pattern-error/index.md) at behavior-registration time, not at match time — the parser validates the pattern when the decorator runs.

## What the subset deliberately refuses

The subset is small on purpose. A fuzzy superset of Cypher would let patterns appear to match input they did not actually match, which would corrupt the audit trail that pattern-driven behaviors are designed to preserve. Specifically refused (each with a documented workaround in the error message that fires):

- **OR in WHERE clauses.** Register two behaviors, one per branch of the disjunction.
- **`RETURN`, `WITH`, multiple `MATCH`.** Patterns observe; they don't compose pipelines. Express the pipeline as multiple behaviors chained through emitted events.
- **Variable-length paths (`-[*]-`).** Unbounded match cost. Express as N separate one-hop patterns if the lengths are bounded.
- **`OPTIONAL MATCH`.** No null binding. Register a second behavior whose pattern is the optional sub-pattern.
- **Aggregation, `UNWIND`, `UNION`.** Iterate in the behavior body instead — `ctx.matches` is the iteration surface.
- **`CREATE`, `MERGE`, `SET`, `DELETE`, `DETACH`.** Patterns observe; they don't mutate. Mutations go in the behavior body via `graph.add_object`, `graph.patch_object`, `graph.remove_object`.

CONTRACT v0.7 #8 locked the subset and is the canonical reference for why each refusal stands.

## Composition with event-type and `where=` subscriptions

Pattern subscriptions combine with the other activation conditions:

```python
@behavior(
    name="contradiction_detector",
    on=["object.created"],
    where={"object.type": "claim"},
    pattern="(c:claim)-[:contradicts]->(other:claim)",
)
```

This behavior fires when **all three** conditions hold: an `object.created` event occurred, the new object's type is `claim`, and the new claim has an outgoing `contradicts` edge to another claim. The pattern is evaluated against the graph at the time the event fires; the new object is present in the match if the pattern references it.

## When to use the relationship variable

`(a)-[r:type]->(b)` binds `r` to the relation object so the behavior body can read its properties. `(a)-[:type]->(b)` binds nothing; the relation is part of the match but its properties aren't available. Use the variable form when the behavior needs to read the relation; omit it when the relation is just a structural constraint.

## Tracing pattern fires

Each behavior fire produced by a pattern subscription emits a `pattern.matched` event ahead of the `behavior.started` event. The trace shows how many matches the pattern produced for that fire:

```text
[pattern.matched]    evt_042  contradiction_detector  matches=2
[behavior.started]   contradiction_detector
```

The match count is also in the event's payload for downstream code that wants to subscribe to pattern matches without owning the behavior.

## Related

- [`UnsupportedPatternError`](https://docs.activegraph.ai/reference/errors/unsupported-pattern-error/index.md) — what fires when a pattern uses syntax outside the subset.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — the broader behavior model. Pattern subscriptions are one of three activation conditions.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — the framework's stance on what counts as a recoverable failure. The "refuse rather than fuzzy-match" choice for the pattern subset is one application of the broader principle.

# Policies

A policy is a runtime-attached rule that gates changes to the graph before they land. A behavior proposing a graph mutation under a policy doesn't get a direct apply — the change becomes an **approval** in `proposed` state. An operator (or an auto-approve setting) then approves the proposal, and the change lands.

Policies are how the framework lets an operator sit in the loop without rewriting the behavior. The behavior says "I want to add this memo"; the policy says "memos require explicit approval"; the operator says "yes, approve it." The same behavior runs in dev (auto-approve) and prod (explicit-approve) without code changes.

## What gets gated

Two operations can be policy-gated:

- **Object proposals via `ctx.propose_object(type, data, reason)`.** Instead of an immediate `add_object`, the framework creates a pending approval and emits `approval.proposed`. The object lands only when the approval is granted.
- **Patches.** A patch declared as policy-gated takes the same proposed-and-approved path, except the patch lifecycle lives in the patch event types (`patch.proposed` / `patch.applied`) rather than approval event types. See [`patches`](https://docs.activegraph.ai/concepts/patches/index.md) for the patch state machine.

Object proposals are the more common shape. The diligence pack's `memo_approval` and `risk_approval` policies are the canonical examples: the pack declares which object types require approval, the operator decides per-instance.

Not every change is policy-gated. Direct `graph.add_object`, `graph.patch_object`, and `graph.emit` calls land immediately; they're for changes the behavior author decided don't need operator review. The behavior chooses by calling the proposal method instead of the direct method.

## The approval lifecycle

```text
            proposed ──approve──> granted
                |
                └──deny────────> denied
```

Both transitions are one-shot. A proposed approval becomes granted exactly once (via `runtime.approve(id, approved_by=...)`) or denied exactly once (via `runtime.deny(id, denied_by=..., reason=...)`). Calling either on an already-terminal approval raises [`approval-not-found-error`](https://docs.activegraph.ai/reference/errors/approval-not-found-error/index.md) — the approval id is consumed by the transition.

Each transition emits an event:

- `approval.proposed` — carries the proposal kind (`object` / `patch`), the type, the data, the reason from the proposing behavior, and the pack that owns the gating policy.
- `approval.granted` — carries the approval id, the approver identity, and the resulting object id (or applied patch id).
- `approval.denied` — carries the approval id, the denier identity, and the denial reason.

The events sit in the log alongside everything else. Downstream behaviors can subscribe to them; replay reconstructs the full proposal-and-decision sequence; the trace renders them.

## Declaring policies

Packs declare policies as part of their `Pack(...)` declaration:

```python
from activegraph.packs import Pack, PackPolicy

pack = Pack(
    name="diligence",
    version="0.1.0",
    policies=[
        PackPolicy(
            name="memo_approval",
            requires_approval=["memo"],
            settings_key="auto_approve_memos",
        ),
        ...
    ],
    ...
)
```

`requires_approval` lists the object types the policy gates. `settings_key` names the pack-settings boolean that controls auto-approve behavior; when `True`, the framework approves every proposal automatically and the behavior runs as if the policy weren't there. When `False`, every proposal pauses until an operator decides.

The pack ships with the policies; the runtime instance decides auto-approve via its `DiligenceSettings(auto_approve_memos=...)`. That separation lets one pack run in different approval modes across environments.

## How a behavior proposes

A behavior that wants its change to flow through a policy calls `ctx.propose_object` instead of `graph.add_object`:

```python
@behavior(name="memo_synthesizer", on=["claim.completed"])
def memo_synthesizer(event, graph, ctx):
    ...
    ctx.propose_object(
        "memo",
        data={"title": "Diligence memo", "body": "..."},
        reason="diligence run complete",
    )
```

The propose call returns an approval id. The runtime decides whether to apply immediately (auto-approve setting is `True`) or queue the proposal (setting is `False`). Either way, the behavior body completes; the approval lifecycle continues independently.

If the behavior tries `ctx.propose_object` outside a runtime-bound context — typically a test fixture or a refactored helper — it raises [`runtime-context-required-error`](https://docs.activegraph.ai/reference/errors/runtime-context-required-error/index.md).

## The operator-facing recovery

When auto-approve is off, the operator drives the lifecycle:

```python
for pa in rt.pending_approvals():
    print(pa.id, pa.kind, pa.object_type, pa.reason)

# Approve one:
rt.approve(approval_id, approved_by="operator-jane")

# Or deny:
rt.deny(approval_id, denied_by="operator-jane", reason="not yet")
```

The CLI surface for production approval workflows is in the [operating guide](https://docs.activegraph.ai/guides/operating-in-production/index.md).

## The events-not-exceptions principle applied

A denied approval is an event (`approval.denied`), not an exception. A behavior whose proposal gets denied doesn't see a raised exception — it sees the denial in the event log if it subscribes to `approval.denied`. The runtime continues; the behavior author writes a retry-or-escalate behavior if denial needs a response.

The exception case is misuse of the primitive — passing a nonexistent approval id to `approve` / `deny` — which fires `ApprovalNotFoundError`. See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`patches`](https://docs.activegraph.ai/concepts/patches/index.md) — the durable-change primitive that policies gate. Approvals and patches share the proposed-and- decided shape; patches are the lower-level primitive.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — where `ctx.propose_object` is called from.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why denials are events.
- [`approval-not-found-error`](https://docs.activegraph.ai/reference/errors/approval-not-found-error/index.md) — the exception for misuse of the approval API.
- [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) — production workflows for the operator side.

# Replay

Replay is reconstructing graph state from the event log. The graph is a projection of the log (see [`graph`](https://docs.activegraph.ai/concepts/graph/index.md)); replay is the operation that computes the projection. Every time you load a run from a store, fork a run, or strict-check a run, replay is what runs underneath.

The framework guarantees that **replay is deterministic** given the event log. Two replays of the same log produce byte-identical graph state. That guarantee is the foundation for forking, strict-mode validation, and the audit-trail contract.

## What replay does

Three operations trigger replay:

- **`Runtime.load(url, run_id=...)`** — loads a persisted run. Replay reads every event from the store and rebuilds the in-memory graph state.
- **`runtime.fork(at_event=...)`** — creates a new run sharing the parent's events up to the fork point. Replay reconstructs the shared prefix in the fork; new behavior fires after the fork point execute fresh. See [`forking`](https://docs.activegraph.ai/concepts/forking/index.md).
- **`runtime.replay()`** (explicit) — re-applies the in-memory event log. Less common; used by tests and migration scripts that need to verify replay determinism without going through the store.

The framework doesn't separate "load" from "replay" in the public API — `Runtime.load` is the canonical entry point. Replay is the verb the load uses.

## The cache layer

For LLM and tool calls to replay deterministically, the framework caches their responses by content hash:

- **LLM responses** are keyed on the prompt's full content hash (system message + user messages + model + tool definitions + output schema). Replay reads `llm.responded` events from the log, indexes them by their corresponding `llm.requested`'s prompt hash, and returns the cached response when a behavior re-fires with the same prompt.
- **Tool responses** are keyed on the tool's name plus a deterministic hash of its arguments. Same mechanism — replay reads `tool.responded` events and serves them to re-firing behaviors.

The cache makes replay cheap: no LLM calls, no tool execution, just event-log reads. The cost is the disk space for the responses in the store, which is bounded by the run's size.

## Strict mode vs permissive mode

Replay runs in one of two modes:

- **Permissive replay** (`replay_strict=False`, the default for `Runtime.load`). Events are re-emitted from the log; the runtime trusts the recording. The cache serves responses for any behavior whose prompt hash matches a recorded one. Behaviors whose prompt hash doesn't match get fresh LLM/tool calls (with the caveat that those calls land as new events in the new run's log, not the parent's).
- **Strict replay** (`replay_strict=True`). Behaviors re-fire against the recorded seed and the framework compares the live event stream against the recorded one. Any drift fires [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) pinned to the first divergent event id.

Strict mode is for verifying that the run is replayable — a green strict replay proves the run is reproducible. Permissive mode is for development workflows where behaviors are still being edited and divergence is expected. The fork primitive runs strict by default because a fork's value is its shared lineage with its parent.

## The determinism contract

Replay determinism rests on the [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) determinism contract: same event, same graph state, same view → same mutations. Three rules from that contract that replay specifically depends on:

- **No `random`, `datetime.now()`, or `uuid.uuid4()` in behavior bodies.** If the body needs these, get them from the event (which carries the recorded timestamp) or from the runtime's deterministic id generator.
- **No I/O outside the framework's primitives.** Direct `requests.get` in a behavior body breaks replay — the response isn't in the cache.
- **No mutable global state across behavior fires.** A counter in a module-level variable that increments per fire would diverge under replay.

The framework doesn't statically enforce these rules. A behavior that breaks them runs fine on first execution; replay or fork discovers the violation as [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md).

## When replay is invoked

The triggers, restated for reference:

- **Store load** — every `Runtime.load(url, run_id=...)` runs replay during construction. The graph state is rebuilt from the event log before any new work happens.
- **Fork** — `runtime.fork(at_event=...)` runs replay up to the fork point in the new run, then resumes live execution from there.
- **Explicit replay** — `runtime.replay()` rebuilds graph state from the current in-memory event log. Uncommon outside of tests and migration code.

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — the projection replay computes. Owns the "graph as projection of event log" principle.
- [`events`](https://docs.activegraph.ai/concepts/events/index.md) — the append-only history replay reads.
- [`behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) — the determinism contract that makes replay work.
- [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) — the operation that runs replay up to the fork point.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — events vs exceptions; why divergence is an exception rather than a silent event.
- [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) — the strict-mode error case.

# Forking

A fork is a branch from a parent run at a specific event. The fork shares the parent's event log up to the fork point; from there, it has its own independent log. The two runs can be diffed, configured differently, and inspected side-by-side without touching the parent's state.

Forking is what lets the framework answer "what would have happened if I'd done X differently?" — a question agentic systems need to answer routinely and most frameworks can't answer at all. The shared-lineage model plus the [cache layer](https://docs.activegraph.ai/concepts/replay/index.md) makes fork cheap (no LLM re-execution for the shared prefix) and honest (the fork's lineage is verifiable from the event log).

## The shared-lineage model

A fork copies events from the parent run, in order, up to the `--at-event` cutoff. The cutoff is **inclusive** — events at the cutoff id and before are in the fork; events after are not.

```text
parent:  evt_001 ... evt_042 evt_043 evt_044 ... (continues)
                            |
                            +- fork from evt_042
                            v
fork:    evt_001 ... evt_042 evt_045 evt_046 ... (fork's own work)
```

The fork's events 1 through 42 are the parent's. Event 45 onward is the fork's own; event ids don't collide because the fork has its own run id and its own monotonic id generator.

The shared prefix doesn't re-execute when the fork starts. The framework [replays](https://docs.activegraph.ai/concepts/replay/index.md) the prefix against the fork's in-memory graph, then resumes live execution from the cutoff. LLM and tool responses for the shared prefix are served from the cache — no new LLM calls, no new tool calls — which keeps fork cheap.

## The CLI surface

The `--set` flag is part of the v1.1 release

The `--set <pack>.<key>=<value>` flag below is documented in CONTRACT v1.0 but lands in v1.1 (see [CONTRACT v1.1 #1](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md#v11-1-cli-flags-specd-but-not-implemented)). Until then, use the Python-API form documented in [Fork with a pack-setting override (v1.0 — Python API)](https://docs.activegraph.ai/cookbook/common-patterns/#fork-with-a-pack-setting-override-v10-python-api) for fork-with-override workflows.

```bash
activegraph fork <parent-url> \
    --run-id <parent-run> \
    --at-event <event-id> \
    --label <human-readable> \
    --set <pack>.<setting>=<value> \
    --record
```

Three flags shape the fork:

- **`--at-event`** — the cutoff. Required.
- **`--set <key>=<value>`** — override a pack setting in the fork. The key is a dotted path into pack settings only (`diligence.confidence_threshold_for_review=0.9` is in scope; `runtime.budget.max_cost_usd=10` is out of scope). Multiple `--set` flags compose; type coercion is Pydantic's job; unknown keys fail loud at fork-time with a `RegistrationError`-style message naming the typo and the valid keys.
- **`--record`** — mark the fork as a re-recording. Behaviors whose prompts changed since the parent run will be re-recorded rather than cache-hit; new cache entries land in the fork's events.

`--set` is the primitive that makes "what if I'd configured this differently?" cheap. The semantics — pack settings only, fail loud on typos — are documented in the [CLI reference](https://docs.activegraph.ai/concepts/reference/cli/index.md).

## How the cache replays

For events before the fork point, the cache serves recorded responses by content hash:

- **LLM call with the same prompt hash** → cached response from the parent run's `llm.responded` event.
- **Tool call with the same args hash** → cached response from the parent run's `tool.responded` event.
- **LLM or tool call whose hash drifted from the parent** — expected only after `--set` changed something upstream. Without `--record`, the fork's strict replay fires [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md); with `--record`, the fork accepts the new responses and records them as its own.

The cache is per-store, indexed by run id. A fork that needs to re-execute the same prompt the parent already ran in a different run can't reach across — caches don't cross runs. (Migration is the primitive for moving runs across stores; see [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md).)

## When to fork vs when to use frames

Both let parallel computations proceed in isolation. The difference is durability and replay:

- **Fork** — a separate run with shared event-log lineage up to the fork point. Forks are durable, replayable, diffable. Use fork when the parallel branches might diverge permanently or need independent persistence.
- **Frames** ([`frames.md`](https://docs.activegraph.ai/concepts/frames/index.md)) — sub-contexts within one run. The frame's events live in the same event log as everything else; replay is the whole run. Use frames when the parallel contexts are short-lived or semantically belong together.

The decision rule: **if you'd want to inspect, diff, or migrate the two branches independently after the fact, use fork. If the branches converge back to a single output within the same run, use frames.**

A common pattern is to start parallel work in frames, then fork only the branches worth keeping. Frames are the cheap parallel primitive; forks are the durable one.

## What's related

- [`graph`](https://docs.activegraph.ai/concepts/graph/index.md) — the world state the fork projects from its event log.
- [`events`](https://docs.activegraph.ai/concepts/events/index.md) — the append-only history forks share up to the cutoff.
- [`replay`](https://docs.activegraph.ai/concepts/replay/index.md) — the operation that reconstructs the shared prefix in the fork.
- [`frames`](https://docs.activegraph.ai/concepts/frames/index.md) — the in-run parallel primitive that complements fork.
- [`patches`](https://docs.activegraph.ai/concepts/patches/index.md) — patches in a fork are independent of the parent's patches once the fork point passes.
- [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) — the error case when strict-mode replay finds a divergent prompt hash or event stream.
- [CLI reference](https://docs.activegraph.ai/concepts/reference/cli/index.md) — the full surface for `activegraph fork` and the surrounding commands.

# Failure model

The framework distinguishes two kinds of failure, and the distinction governs how you write behaviors, how you read errors, and how you build on top of the runtime.

## The principle

> **Exceptions are for caller-facing failures the caller can reasonably catch and act on. Non-fatal stops — budget exhaustion, behavior failures, tool failures, approval denials — are events in the log. The distinction: exceptions interrupt control flow; events extend the audit trail. When in doubt, an event.**

Behaviors that fail during a run don't raise out to your code. The runtime catches the exception, emits a `behavior.failed` event with the original exception's type, message, and `reason` code in the payload, and the loop continues. Other behaviors keep firing. The operator sees the failure in the trace; downstream code that subscribes to `behavior.failed` can react (alert, retry-with-different-args, escalate).

The same shape applies to tools: a `ToolError` raised inside a tool body becomes a `tool.responded` event with `error.reason` set, and the calling behavior's loop reads the structured failure and decides what to do.

The same shape applies to budget exhaustion: when a `max_*` limit is hit, the runtime emits `runtime.budget_exhausted` with the dimension in the payload and stops gracefully. No exception escapes to your code — you read the event from `runtime.status()` or from the trace.

## When exceptions are the right answer

Exceptions are for failures the caller is making **right now, at this line of code**, and can reasonably catch:

- Constructing a runtime with conflicting arguments (`InvalidRuntimeConfiguration`)
- Looking up a behavior or tool that isn't registered (`BehaviorNotFoundError`, `ToolNotFoundError`)
- Passing a malformed store URL (`InvalidStoreURL`)
- Replaying a run whose recorded event stream doesn't match the live re-run (`ReplayDivergenceError`)
- Calling `runtime.approve(id)` on an id that doesn't exist (`ApprovalNotFoundError`)

These all interrupt the call. The caller catches the exception, fixes the input, and tries again. There's no audit-trail entry to preserve because the call never produced one.

## The exception hierarchy

Every framework exception inherits from `ActiveGraphError`. Seven categories live one level down:

```text
ActiveGraphError
├── ConfigurationError      construction-time / API-call argument errors
├── RegistrationError       behavior/tool/pack registration problems
├── ExecutionError          runtime execution problems (escaped to the caller)
├── ReplayError             replay/fork divergence
├── StorageError            persistence problems
├── PatternError            pattern subscription syntax errors
└── PackError               pack-specific runtime problems
```

Catch `ActiveGraphError` to catch every framework exception. Catch a category base to catch every leaf in that category. Catch a specific leaf when the recovery is leaf-specific.

The category leaves also multi-inherit from Python builtins where it preserves existing catch sites: `EventNotFoundError` is also a `KeyError`, `InvalidStoreURL` is also a `ValueError`, etc. Existing code catching the builtin keeps working; new code can catch the category for richer context.

## The structured event types

`behavior.failed`, `tool.responded` (with error), `runtime.budget_exhausted`, `approval.denied` — each carries a `reason` field with a stable discriminator code so downstream code can branch on the failure mode without parsing prose. The codes are documented in [Reference: Events](https://docs.activegraph.ai/concepts/reference/events/index.md).

## Observing failures in caller code

The framework gives you two surfaces for noticing failures without subscribing a `@behavior` to `behavior.failed`:

**1. The WARNING log line.** Every `behavior.failed` emission produces one log line at `WARNING` level on the `activegraph.runtime` logger:

```text
WARNING activegraph.runtime: behavior failed: my_behavior (reason=llm.network_error)
```

The structured log record carries `behavior`, `event_id`, `reason`, `error_type`, `error_message`, and a `doc_url` pointing at the reason's documentation page. Operators tail logs and click through to the doc-page from the URL.

Opt out via the standard Python logging API:

```python
import logging
logging.getLogger("activegraph.runtime").setLevel(logging.ERROR)
```

**2. The `Runtime.errors` property.** After a run, inspect failures programmatically without parsing event payloads:

```python
rt.run_goal("...")
for err in rt.errors:
    if err.reason == "llm.network_error":
        retry_with_backoff(err.event_id)
```

Each `err` is a `BehaviorFailure` named tuple with five fields plus the `behavior.failed` event id:

| Field             | Meaning                                            |
| ----------------- | -------------------------------------------------- |
| `behavior`        | the failing behavior's name                        |
| `event_id`        | the triggering event's id                          |
| `reason`          | the v0.6 #11 reason code (None for raw exceptions) |
| `exception_type`  | the Python exception class name                    |
| `message`         | the exception's `str(...)`                         |
| `failed_event_id` | the `behavior.failed` event's id                   |

The property reads from the graph's event log on each access — the events are the source of truth and the property is a structured projection. No caching, no listeners, no new runtime state.

The two surfaces use different field names for the same values: the WARNING log line uses the v0.8 #6 structured-logging schema keys (`error_type` / `error_message`), while `BehaviorFailure` uses Python-conventional attribute names (`exception_type` / `message`). The values are identical — only the names differ.

The two surfaces are *additive* and don't change the failure model: events stay the durable record, behaviors that fail still don't raise out of `run_goal()`, and existing code subscribing to `behavior.failed` keeps working unchanged.

## "When in doubt, an event"

If you're writing a behavior and you're about to raise an exception because something downstream "should never happen," ask:

- Can the caller reasonably catch and act on this?
- Is the failure attributable to a specific event in the log?

If the answer to the first is "no" and the answer to the second is "yes," emit an event instead. The audit trail is the durable record; exceptions are just the runtime's way of refusing the current call.

This rule is what kept `BehaviorFailedError` and `BudgetExhaustedError` out of the framework's exception hierarchy. Both were considered during the v1.0 error-rewrite series and rejected because their information already lives in events. Adding them as exceptions would have surfaced two parallel failure surfaces — one in the trace, one in caller code — and the divergence is exactly the kind of subtle inconsistency that makes a framework feel unreliable six months in.
# Guides

# Operating Active Graph

This document is for **operators**: people responsible for running an Active Graph runtime as part of a system other people depend on. The README is for developers writing behaviors. The audience is different and so is this document.

If you are evaluating Active Graph, read the README first. If you have a behavior that doesn't run on your machine, the README will help. If you have a behavior that runs fine on your machine but you need to put it somewhere a team can rely on it, you are in the right place.

The companion example is [`examples/operate_a_run.py`](https://github.com/yoheinakajima/activegraph/blob/main/examples/operate_a_run.py). Read it alongside this guide — every CLI command and library call shown here appears there. If the two ever disagree, the example is right.

______________________________________________________________________

## The operator surface

The framework treats the boundary between itself and the world it runs in as a load-bearing contract. Five primitives compose that surface; together they make a run inspectable, observable, and recoverable without reading source code:

1. **Postgres** as a second `EventStore`, behind the same protocol as SQLite. Same schema, same semantics, different driver.
1. **Structured logging** with a documented JSON schema. One log line per event, every line carries `run_id` / `event_id` when applicable.
1. **Metrics**: a three-method `Metrics` protocol with a `NoOpMetrics` default and a reference `PrometheusMetrics` implementation. The runtime emits a fixed, documented set of counters, histograms, and gauges. Custom backends (OpenTelemetry, Datadog, statsd) implement the protocol — three methods.
1. **`activegraph` CLI**: `inspect`, `replay`, `fork`, `diff`, `export-trace`, `migrate`, `pack`, `quickstart`. The CLI is a thin wrapper around library APIs; anything it does, programmatic callers can do too.
1. **Runtime introspection**: `runtime.status(recent=N)` returns a frozen snapshot of queue depth, budget remaining, registered behaviors, recent events, and current frame. The CLI's `inspect` command sits on top of this primitive.

The operator surface introduced in v0.8 has been extended additively since: v0.9 added the pack format (and `activegraph pack` for listing/scaffolding); v1.0 added per-error reference pages ([Reference: Errors](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md)) that every error message links to, plus operator-targeted CLI follow-on flags (`inspect --event <id>` for divergence triage, `inspect --behaviors` for replay length mismatches, `inspect --pack-version` for prompt-hash audits, `migrate --skip-corrupted` for corrupted-payload recovery, `fork --record` for intentional re-recording).

What the framework deliberately does **not** ship: a web UI, an HTTP server, a distributed runtime, real-time subscriptions, multi-model LLM routing, or streaming LLM responses. The framework is small, sharp, and operable. Plug in adapters at the boundaries where you need them.

______________________________________________________________________

## Persistence: SQLite vs Postgres

SQLite is the default and the right answer for solo work, demos, ephemeral runs, and most single-machine production cases. The event log fits in one file, WAL mode gives you crash-safe writes, and you have no operational dependencies.

Postgres is the right answer when:

- More than one process or machine needs to inspect a run (the operator on a laptop, a dashboard, a CLI on a jump box, a CI job).
- You already operate Postgres and want one fewer storage system.
- You want to put the JSONB column behind a read replica or pipe it into your data warehouse.

Both stores conform to the same `EventStore` protocol. The runtime, the CLI, and every library API treat them identically. **Migration is one-directional and explicit** (see below).

### Connection URLs

Stores are addressed by URL throughout the framework — runtime, CLI, library APIs. The schemes follow the SQLAlchemy convention:

- `sqlite:///relative/path.db` (**three** slashes — relative path)
- `sqlite:////absolute/path/to/run.db` (**four** slashes — absolute path; the leading `/` of the absolute path adds the fourth slash)
- `postgres://user:password@host:port/dbname`
- `postgresql://user:password@host:port/dbname` (same scheme)

A path with no scheme is an error. The framework will not guess. `activegraph inspect run.db` will fail with a message pointing here. Use `sqlite:///run.db` (relative) or `sqlite:////tmp/run.db` (absolute).

### Postgres setup

```bash
# Postgres 16 or newer, anywhere reachable from the runtime.
createdb activegraph_prod
# Schema is created lazily on first connection. No migration step.
pip install 'activegraph[postgres]'   # pulls psycopg>=3.1,<4
```

The first time the runtime opens a Postgres URL it issues `CREATE TABLE IF NOT EXISTS` for `events`, `runs`, and `meta`, mirroring the SQLite schema with Postgres-native types (`BIGSERIAL`, `JSONB`, `TIMESTAMPTZ`). Schema version is stored in `meta` and verified on every open. A schema version mismatch is a hard error — the runtime refuses to operate on a log it does not understand.

### Connection management

`PostgresEventStore` accepts:

1. A URL string. The store opens a single dedicated connection.
1. A `psycopg.Connection` you already have. The store does not own its lifecycle — you must close it.
1. A `psycopg_pool.ConnectionPool`. The store will `getconn()` / `putconn()` around each operation.

For production, pass a pool. The framework does not ship its own pool because we are not in a position to make tuning decisions for your deployment.

```python
import psycopg_pool
from activegraph.store.postgres import PostgresEventStore

pool = psycopg_pool.ConnectionPool(
    conninfo="postgres://localhost/activegraph_prod",
    min_size=2,
    max_size=10,
)
store = PostgresEventStore(pool, run_id="run_01J...")
```

### Migration (transaction-per-run)

```bash
activegraph migrate --from sqlite:///path/to/dev.db \
                    --to   postgres://localhost/activegraph_prod
```

Migration semantics:

- Each run in the source migrates in **a single transaction** against the destination. If a run fails partway, that run's destination state is unchanged (Postgres rolls back).
- Migration is **idempotent** at the event level: writes use `INSERT ... ON CONFLICT DO NOTHING` against the `UNIQUE(id, run_id)` index. Re-running migration after a partial failure resumes safely.
- Runs are migrated independently. A bad run does not block the others.
- The default migrates **all** runs in the source. To pick one: `--run-id <id>`.
- A per-run report is printed at the end (machine-readable with `--json`). Each entry is `{run_id, status, events_migrated, error?}`. The CLI exit code is non-zero iff any run failed.
- Migration is **not bidirectional**. There is no `sync` mode and no rollback. To go back, migrate the other direction.

When migration is the right tool: you are graduating a run from a laptop SQLite file to a shared Postgres database, or moving a historical archive between Postgres instances. When it is the wrong tool: you are trying to keep two stores in sync. Don't.

______________________________________________________________________

## Structured logging

The framework emits structured logs through stdlib `logging`. **It does not auto-configure logging on import** — a library that does is hostile to operators who have already configured their own. By default the framework logs to `logging.getLogger("activegraph")` and lets your config handle the rest.

If you want the opinionated setup:

```python
from activegraph.observability import configure_logging
configure_logging(level="INFO", json_output=True)
```

That installs a JSON formatter on the `activegraph` logger hierarchy. Every log line becomes one JSON object on one line, suitable for ingestion by Loki, Splunk, BigQuery, Cloud Logging, or any other line- oriented log aggregator.

### Log schema

Every line is a JSON object. These fields appear when applicable. Fields that don't apply are **omitted**, not nulled:

| Field             | Type   | When                                                         |
| ----------------- | ------ | ------------------------------------------------------------ |
| `timestamp`       | string | always (ISO 8601, UTC)                                       |
| `level`           | string | always (`DEBUG` / `INFO` / `WARNING` / `ERROR` / `CRITICAL`) |
| `logger`          | string | always (e.g. `activegraph.runtime`)                          |
| `message`         | string | always                                                       |
| `run_id`          | string | any log line associated with a specific run                  |
| `event_id`        | string | log lines about a specific event                             |
| `behavior`        | string | log lines about a specific behavior invocation               |
| `tool`            | string | log lines about a tool invocation                            |
| `model`           | string | log lines about an LLM call                                  |
| `cache_hit`       | bool   | LLM/tool calls; true if served from cache                    |
| `cost_usd`        | string | LLM calls that incurred cost (Decimal-as-string)             |
| `latency_seconds` | number | LLM/tool/behavior calls with measured latency                |
| `reason`          | string | failure log lines (see reason taxonomy)                      |
| `error_type`      | string | failure log lines                                            |
| `error_message`   | string | failure log lines                                            |

The schema is **the operator contract**. Dashboards built against these field names will keep working across framework versions. Breaking the schema is a breaking change.

### Level discipline

| Level    | What                                                         |
| -------- | ------------------------------------------------------------ |
| DEBUG    | View construction, prompt assembly, cache lookup, queue ops  |
| INFO     | Every event emitted, every behavior invoked, every tool call |
| WARNING  | Budget approaching limits, retries, pattern eval slowness    |
| ERROR    | `behavior.failed` with non-budget reasons                    |
| CRITICAL | Event log inconsistency, schema mismatch, replay divergence  |

INFO is a high-volume stream in any active run. Operators typically filter at WARNING for production dashboards and crank to DEBUG when debugging.

There are no `print()` calls anywhere in the framework. The trace printer (`runtime.print_trace()`) is a developer tool, not an operator tool — it prints to stdout, does not log, and is independent of the logging configuration.

### Payload redaction

LLM behaviors include rendered prompts in DEBUG logs. Tool responses include their full payloads. Goals can contain anything the user typed. If your environment requires redaction (PII, secrets, customer data):

```python
def redact(payload: dict) -> dict:
    return {k: ("<redacted>" if k == "email" else v) for k, v in payload.items()}

configure_logging(level="INFO", json_output=True, payload_redactor=redact)
```

The redactor runs on every payload that would otherwise appear in a log message. It does not affect the event log itself — the source of truth keeps the original. Redaction is a logging concern.

______________________________________________________________________

## Metrics

The framework emits metrics through a three-method `Metrics` protocol:

```python
class Metrics(Protocol):
    def counter(self, name: str, tags: dict[str, str], value: float = 1.0) -> None: ...
    def histogram(self, name: str, tags: dict[str, str], value: float) -> None: ...
    def gauge(self, name: str, tags: dict[str, str], value: float) -> None: ...
```

That's it. Three methods. No timers (use a histogram with a latency value). No summaries (Prometheus-specific). No custom types.

```python
from activegraph.observability import PrometheusMetrics
rt = Runtime(graph, metrics=PrometheusMetrics())
```

The default is `NoOpMetrics`, which does nothing. The runtime is fully functional with no metrics configured.

`PrometheusMetrics` lazy-imports `prometheus_client`. Install with `pip install 'activegraph[prometheus]'`.

For OpenTelemetry, Datadog, statsd, or anything else: write a class with three methods. We do not ship adapters.

### Standard metrics

These metrics are emitted by the runtime. Names follow Prometheus conventions (snake_case with underscores, `_total` for counters, `_seconds` for duration histograms, `_usd` for cost histograms). They are the **operator contract**: dashboards built against these names keep working across framework versions.

| Name                                               | Type      | Tags                 |
| -------------------------------------------------- | --------- | -------------------- |
| `activegraph_events_emitted_total`                 | counter   | `event_type`         |
| `activegraph_behaviors_invoked_total`              | counter   | `behavior`           |
| `activegraph_behaviors_failed_total`               | counter   | `behavior`, `reason` |
| `activegraph_behaviors_duration_seconds`           | histogram | `behavior`           |
| `activegraph_llm_calls_total`                      | counter   | `model`              |
| `activegraph_llm_cache_hits_total`                 | counter   | `model`              |
| `activegraph_llm_failed_total`                     | counter   | `model`, `reason`    |
| `activegraph_llm_tokens_in`                        | histogram | `model`              |
| `activegraph_llm_tokens_out`                       | histogram | `model`              |
| `activegraph_llm_cost_usd`                         | histogram | `model`              |
| `activegraph_tools_calls_total`                    | counter   | `tool`               |
| `activegraph_tools_cache_hits_total`               | counter   | `tool`               |
| `activegraph_tools_failed_total`                   | counter   | `tool`, `reason`     |
| `activegraph_tools_duration_seconds`               | histogram | `tool`               |
| `activegraph_queue_depth`                          | gauge     | (none)               |
| `activegraph_budget_cost_remaining_usd`            | gauge     | `run_id`             |
| `activegraph_budget_events_remaining`              | gauge     | `run_id`             |
| `activegraph_patterns_evaluated_total`             | counter   | (none)               |
| `activegraph_patterns_evaluation_duration_seconds` | histogram | (none)               |
| `activegraph_replay_divergence_detected_total`     | counter   | `reason`             |

**Adding a metric is a public API change.** The list is documented and test-pinned. New metrics get added in named releases, not silently.

### Cardinality rule (locked)

> `run_id` MAY appear as a tag on **gauges of active state** (where cardinality is bounded by the number of concurrently active runs). `run_id` MUST NOT appear as a tag on **counters or histograms**.

This rule prevents the most common Prometheus operational disaster: unbounded cardinality from per-run labels accumulating forever. The budget gauges are the only exception, and they live only for the duration of a run.

The conformance suite enforces this rule against the standard metric list. If you implement a custom `Metrics` backend, do the same.

### Tag conventions

Standard tag keys are: `event_type`, `behavior`, `tool`, `model`, `reason`, `run_id` (gauges only). Boolean tags (`cache_hit` is modeled as a separate counter rather than a tag — see `activegraph_llm_cache_hits_total`). If your backend distinguishes booleans from strings, you won't have to special-case.

Custom tags beyond the standard set are fine but may explode cardinality. The cardinality rule above is your guide.

______________________________________________________________________

## Runtime introspection

`runtime.status(recent: int = 20)` returns a `RuntimeStatus` — a frozen dataclass. Calling it is cheap: no graph traversal, no event log scan. It is safe to call from any thread.

```python
status = rt.status()
print(status.run_id, status.state, status.queue_depth)
for ev in status.recent_events:
    print(ev.id, ev.type)
```

Shape:

```python
@dataclass(frozen=True)
class RuntimeStatus:
    run_id: str
    state: Literal["idle", "running", "stopped", "exhausted"]
    queue_depth: int
    events_processed: int
    budget: BudgetSnapshot
    frame: FrameSnapshot | None
    registered_behaviors: list[BehaviorInfo]
    recent_events: list[EventSummary]
```

`recent_events` length is `recent` (default 20). The CLI's `inspect --tail N` passes through to this argument.

There is **no `last_error` field**. Errors are events. Filter `recent_events` for type `behavior.failed`, or query the event store directly for a window-independent view.

______________________________________________________________________

## CLI

The `activegraph` binary is a thin wrapper around library APIs. Every subcommand calls into Python; nothing is implemented in the CLI itself. A programmatic user can do everything the CLI does.

```text
activegraph inspect <url> [--run-id <id>] [--tail N] [--json]
                          [--event <id> | --behaviors | --pack-version]
activegraph replay <url> --run-id <id> [--json]
activegraph fork <url> --run-id <id> --at-event <id> --label <label>
                       [--to <url>] [--record] [--json]
activegraph diff <url> --run-a <id> --run-b <id> [--json]
activegraph export-trace <url> --run-id <id> [--format text|jsonl] [-o PATH]
activegraph migrate --from <url> --to <url> [--run-id <id>] [--skip-corrupted] [--json]
activegraph pack list
activegraph pack new <name>
```

`--event`, `--behaviors`, and `--pack-version` on `inspect` are mutually-exclusive selectors that focus the output on one section instead of the full snapshot. See the [CLI reference](https://docs.activegraph.ai/reference/cli/index.md) for the full surface; the [debugging cookbook](https://docs.activegraph.ai/cookbook/debugging/index.md) walks through diagnostic workflows that build on these flags.

### Exit codes

| Code | Meaning                                                       |
| ---- | ------------------------------------------------------------- |
| 0    | Success                                                       |
| 1    | Generic error                                                 |
| 2    | Usage error (bad arguments, missing options)                  |
| 3    | Not found (run id does not exist, file does not exist)        |
| 4    | Corruption (schema version mismatch, event log inconsistency) |
| 5    | Divergence (replay-strict failure)                            |

These are documented contract. Wrap the CLI in shell scripts or CI jobs against these codes.

### `inspect`

Default-mode prints a human-readable snapshot of the most recent run in the store (or `--run-id <id>` for a specific run). `--json` prints the same data as a single JSON object — the same shape as the `RuntimeStatus` returned by the library.

```bash
activegraph inspect sqlite:///run.db
activegraph inspect postgres://localhost/agdb --run-id run_01J... --tail 50 --json
```

Three v1.0 selectors focus the output on one section instead of the full snapshot:

- `--event <id>` prints the full payload of one event. Used when an error message names an event id — `ReplayDivergenceError` always does.
- `--behaviors` prints only the registered-behaviors section. Used when diagnosing a replay length mismatch: compare which behaviors fire now against which fired in the recorded run.
- `--pack-version` prints every `pack.loaded` event in the run with prompt content-hash summaries. Used to confirm the pack version (or prompt drift) responsible for a divergence.

The three are mutually exclusive — they're selectors, not filters.

### `replay`

Opens the store, rebuilds the graph by replaying the log (no behaviors fire), and prints a summary: event count, object count, relation count. Useful for sanity-checking a run after a crash or after a migration.

```bash
activegraph replay sqlite:///run.db --run-id run_01J...
```

### `fork`

Creates a new run by copying events from `--run-id` up to and including `--at-event`. `--to <url>` defaults to the source store; pass a different URL to fork across stores. Prints the new run id and the number of events copied.

```bash
activegraph fork sqlite:///run.db \
  --run-id run_01J... \
  --at-event evt_42 \
  --label investigate-alternative-thesis
```

The forked run is dormant — nothing is running it. To continue from the fork point, load it with `Runtime.load(url, run_id=<new_run_id>)` and call `run_until_idle()`.

Pass `--record` to mark the fork as an intentional re-recording (used after a `ReplayDivergenceError` when the divergence was intentional). The flag appends `-recording` to the label and prints follow-on guidance. The fork-with-pack-setting-override workflow (the canonical recipe behind `--set`, landing in v1.1) is documented under [Cookbook: common patterns — Fork with a pack-setting override](https://docs.activegraph.ai/cookbook/common-patterns/#fork-with-a-pack-setting-override-v10-python-api).

### `diff`

Structural diff between two runs in the same store. Prints shared and divergent event counts, divergent objects, and divergent relations. The library equivalent is `parent.diff(other)`.

```bash
activegraph diff sqlite:///run.db --run-a run_a --run-b run_b
```

### `export-trace`

Dump a run's event log to a file or stdout.

- `--format text` (default) — the human-readable trace printer output.
- `--format jsonl` — one JSON event per line. Suitable for ingestion by any log aggregator.

```bash
activegraph export-trace sqlite:///run.db --run-id run_01J... --format jsonl -o run.jsonl
```

### `migrate`

See [Migration](#migration-transaction-per-run) above. The v1.0 `--skip-corrupted` flag lets a migration recover the readable subset when source events have corrupted JSON payloads instead of failing the whole run; the skipped event ids appear in the per-run report's `skipped_events`. The resulting destination run is partial — the operator is on notice.

______________________________________________________________________

## Runbook

### A run is stuck

Call `runtime.status()` (or `activegraph inspect`). Check `state`:

- `idle` — the queue is empty, the budget is fine, the run is waiting for new input. This is the normal terminal state for a goal-driven run. Not stuck.
- `exhausted` — the run hit a budget limit. The `budget` field shows which dimension. Raise the limit or accept the partial result.
- `running` — the run is actually working. `queue_depth` should be decreasing. If it's increasing or steady, a behavior is producing events faster than the runtime processes them. Check the trace.
- `stopped` — the runtime is loaded but no `run_until_idle()` call is in progress. Call it.

### A run is over budget

`runtime.status().budget` shows used vs. limits across dimensions (events, behavior calls, LLM calls, tool calls, cost, depth, seconds). The `activegraph_budget_*` gauges expose the same data. Set up an alert on `cost_remaining_usd < threshold` to catch runs before they exhaust.

To resume a budget-exhausted run with a higher limit:

```python
rt = Runtime.load(url, run_id=stuck_run_id, budget={"max_cost_usd": "10.0"})
rt.run_until_idle()
```

### Replay diverges

You called `Runtime.load(..., replay_strict=True)` and got `ReplayDivergenceError`. The runtime's re-execution of recorded behaviors produced different events than the log. Causes, in order of likelihood:

1. A behavior reads from a non-deterministic source (clock, `random`, network) it didn't read on the original run.
1. A behavior depends on a value (an LLM response, a tool result) that was cached on the original run but no longer is.
1. The framework version changed and an event payload shape changed. Check the v0.8 schema-mismatch guard caught this; if not, file an issue.

The error pins the offending event id. Look at it. The fix is in your behavior, not the framework.

### Postgres connection saturated

You are passing a `psycopg.Connection` per request. Use a `psycopg_pool.ConnectionPool` and pass that instead. The framework calls `getconn()` / `putconn()` around each operation.

### Trace lines do not appear in my log aggregator

`runtime.print_trace()` prints to stdout. It is not a log. To get events into your aggregator:

- Use `activegraph export-trace --format jsonl` ad-hoc.
- Or write a behavior that subscribes to the event types of interest and emits a structured log record. The framework's logging will carry it through.

______________________________________________________________________

## Capacity planning

These are reference numbers from a single Postgres 16 instance on commodity hardware. They are not benchmarks; they are the order of magnitude an operator should expect.

- **Event log writes**: a single connection sustains a few thousand events per second. With a pool of 10 and writes spread across runs, tens of thousands per second is achievable.
- **Event log reads**: replay of a 100k-event run on a warm cache takes single-digit seconds. Plan for that on a cold start.
- **Storage**: ~1-2 KB per event in JSONB form (including indexes). A million-event run is around 1.5 GB.
- **Run concurrency**: bounded by your connection pool size, not by the framework. The runtime itself is single-threaded.

If your runs are big enough that any of this is a concern, the framework's single-process design is the next constraint you will hit. That is a v1.0+ conversation.

______________________________________________________________________

## What this guide is not

This guide will not tell you how to set up Postgres, configure Prometheus, or operate Grafana. Those are well-documented elsewhere and the framework's integration with them is intentionally generic.

This guide will not recommend SLOs, alerts, or dashboards. Your business context determines those. The metrics list above is the foundation; what you build on top is yours.

This guide will not stay current with every release. The locked contracts — log schema, metric names, exit codes, status shape — will. Examples may drift; the contracts will not.

# Pack Authoring Guide

A **pack** is a Python package that bundles object types, relation types, behaviors, tools, prompts, and policies for a specific domain. Packs are how a developer goes from "I installed activegraph" to "I have a working diligence system in ten minutes."

This document is the canonical reference for the pack format. It is companion reading to `examples/diligence_real_run.py` (the killer demo / executable spec) and [`CONTRACT.md`](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md) (the locked design decisions — v0.9 introduced the pack format; v1.0 added the per-pack-error reference catalog under [Reference: Errors](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md)). When this guide and the contract disagree, the contract wins.

______________________________________________________________________

## TL;DR

```python
# my_pack/__init__.py
from pathlib import Path
from pydantic import BaseModel, Field
from activegraph.packs import (
    Pack, ObjectType, RelationType, PackPolicy,
    behavior, llm_behavior, tool,        # pack-aware decorators
    load_prompts_from_dir,
)

class MyPackSettings(BaseModel):
    threshold: float = 0.5

class Insight(BaseModel):
    text: str
    confidence: float = Field(ge=0.0, le=1.0)

@llm_behavior(
    name="insight_extractor",
    on=["object.created"],
    where={"object.type": "document"},
    output_schema=Insight,
)
def insight_extractor(event, graph, ctx, out, *, settings: MyPackSettings):
    if out.confidence >= settings.threshold:
        graph.add_object("insight", out.model_dump())

pack = Pack(
    name="my_pack",
    version="0.1.0",
    description="Extracts insights from documents.",
    object_types=[ObjectType(name="insight", schema=Insight)],
    behaviors=[insight_extractor],
    prompts=load_prompts_from_dir(Path(__file__).parent / "prompts"),
    settings_schema=MyPackSettings,
)
```

```toml
# pyproject.toml
[project.entry-points."activegraph.packs"]
my-pack = "my_pack:pack"
```

```python
# user code
from activegraph import Runtime
from activegraph.packs import load_by_name

rt = Runtime(graph)
rt.load_pack(load_by_name("my_pack"), settings=MyPackSettings(threshold=0.8))
rt.run_goal("...")
```

That's the whole contract. The rest of this guide explains why each piece is shaped the way it is, and the conventions third-party pack authors are expected to follow.

______________________________________________________________________

## 1. A pack is a Python package, not a manifest

There is no `pack.yaml`. There is no `manifest.json`. There is a Python module that exports a single `pack` symbol of type `Pack`.

Why: packs need to express real logic (behaviors, prompts, policies) and Python is the right language for that. A declarative manifest would shove logic into prose comments or jinja templates, which is how every "configuration as data" framework eventually grows a half-broken DSL. Python is the DSL.

Convention: a pack package has the layout

```text
my_pack/
  pyproject.toml
  my_pack/
    __init__.py         # exports `pack`
    object_types.py     # Pydantic schemas + ObjectType list
    relation_types.py   # RelationType list (optional)
    behaviors.py        # @behavior / @llm_behavior / @relation_behavior
    tools.py            # @tool
    settings.py         # the Pydantic settings model
    prompts/
      <prompt_name>.md  # one per LLM behavior, with TOML frontmatter
    fixtures/           # recorded LLM responses + tool outputs (optional)
      __init__.py
      <fixtures>.py
    docs/
      README.md
      settings.md
      behaviors.md
      prompts.md
  tests/
    test_pack_loads.py  # smoke test
  README.md
```

The scaffolding command (`activegraph pack new <name>`) generates this layout.

______________________________________________________________________

## 2. Pack-aware decorators: import path matters

Pack code uses **pack-aware** decorators imported from `activegraph.packs`:

```python
from activegraph.packs import behavior, llm_behavior, relation_behavior, tool
```

These have **identical signatures** to the decorators imported from `activegraph`. The only behavioral difference is that pack-aware decorators do not register anything globally — they attach metadata to the function, return a `Behavior` / `LLMBehavior` / `RelationBehavior` / `Tool` object, and that's it.

Why: a pack module is safe to import without a runtime. Importing the diligence pack must not put `claim_extractor` into the global behavior registry, where it would silently fire in any `Runtime(graph)` call regardless of whether the pack was loaded.

Pack tests can construct a pack, assert its shape, and verify it loads cleanly without ever instantiating a runtime.

**Inside a pack, never import decorators from `activegraph` directly.** The `tests/test_pack_loads.py` smoke test verifies this by importing the pack and checking that `activegraph.behaviors.decorators._REGISTRY` and `activegraph.tools.decorators._TOOL_REGISTRY` are empty.

______________________________________________________________________

## 3. The `Pack` dataclass

```python
@dataclass(frozen=True, eq=False)
class Pack:
    name: str
    version: str
    description: str = ""
    object_types: tuple[ObjectType, ...] = ()
    relation_types: tuple[RelationType, ...] = ()
    behaviors: tuple = ()
    tools: tuple = ()
    policies: tuple[PackPolicy, ...] = ()
    prompts: tuple[PackPrompt, ...] = ()
    settings_schema: type = EmptySettings
```

**Frozen**: mutation after construction raises. This forces packs to be declarative even though they're written in Python.

**`eq=False`**: equality and hashing are based on `(name, version)`, not on field-by-field comparison. Behaviors are dataclasses and are not hashable; full structural equality would not work. The `(name, version)` key is what idempotent loading and replay hinge on.

**Tuples, not lists**: tuples are hashable and signal immutability. List arguments are converted to tuples in `__post_init__` for convenience.

`Pack.__post_init__` validates:

- `name` is a non-empty lowercase ASCII identifier (matches `^[a-z][a-z0-9_]*$`)
- `version` is non-empty
- object types have unique names within the pack
- relation types have unique names within the pack
- behavior names are unique within the pack
- tool names are unique within the pack
- prompts have unique names within the pack
- `settings_schema` is a Pydantic `BaseModel` subclass

Validation failures raise `PackValidationError` at construction — not at load.

______________________________________________________________________

## 4. Object types and relation types

A pack declares its object types with Pydantic schemas:

```python
from pydantic import BaseModel, Field
from activegraph.packs import ObjectType

class Claim(BaseModel):
    text: str
    confidence: float = Field(ge=0.0, le=1.0)
    source_url: str | None = None

object_types = [
    ObjectType(
        name="claim",
        schema=Claim,
        description="A factual statement with confidence.",
    ),
]
```

When the pack is loaded, `graph.add_object("claim", data=...)` validates `data` against `Claim`. Validation errors raise `PackSchemaViolation` (subclass of `ValueError`) and no object is created. The exception lists the field name, the violating value, and the constraint that failed.

**Load-order asymmetry** (v0.9 #5): validation applies only to objects created **after** the pack loads. Objects created before the `pack.loaded` event are not retroactively validated. The `pack.loaded` event is part of the event log, so replay enforces the same load order.

Relation types are simpler:

```python
from activegraph.packs import RelationType

relation_types = [
    RelationType(
        name="addresses",
        source_types=("claim",),
        target_types=("question",),
        description="A claim addresses a question.",
    ),
    RelationType(
        name="supports",
        source_types=("evidence",),
        target_types=("claim",),
    ),
]
```

`source_types` and `target_types` are tuples of object type names. Empty (the default) means "any". Mismatches raise `PackSchemaViolation` at `graph.add_relation` time, same as object types.

Object types and relation types declared by a pack are **global to the runtime**, not pack-scoped. Two packs declaring object type `claim` with different schemas raise `PackConflictError` at load time — you cannot have two definitions of `claim` in one runtime.

______________________________________________________________________

## 5. Behaviors are namespace-prefixed

A behavior declared in a pack with `name="claim_extractor"` is registered as `diligence.claim_extractor`. The fully-qualified form is the **canonical** identifier:

- the trace prints `[behavior.started] diligence.claim_extractor`
- metrics labels read `{behavior="diligence.claim_extractor"}`
- error messages name the prefixed form
- `runtime.status().registered_behaviors` lists prefixed names
- the replay manifest uses prefixed names

Lookups from user code are **lenient**. A short name resolves when unambiguous; the load-time conflict check makes "unambiguous" a load-time invariant:

```python
rt.get_behavior("claim_extractor")           # works when unambiguous
rt.get_behavior("diligence.claim_extractor") # always works
```

Same rule for tools (`diligence.fetch_company_docs`). LLM behaviors with `tools=["fetch_company_docs"]` resolve the short name through the same rule — short forms work when only one pack declares the tool.

Why this asymmetry: the canonical form is what shows up in operational artifacts where ambiguity is dangerous (a trace, a metric query, an error log). User code, on the other hand, is checked at load time, so leniency is safe — the runtime guarantees the short name is unambiguous before any user lookup happens.

______________________________________________________________________

## 6. Tools are pack-scoped by default

```python
from activegraph.packs import tool

@tool(name="fetch_company_docs", input_schema=FetchInput, output_schema=FetchOutput)
def fetch_company_docs(args, ctx):
    ...
```

This tool is registered as `diligence.fetch_company_docs`. To opt into the global tool namespace:

```python
@tool(name="public_helper", export_globally=True, ...)
def public_helper(args, ctx):
    ...
```

`export_globally=True` registers the tool under its short name **also**. The pack-prefixed name is always available. This is intended for infrastructure packs that explicitly provide tools for other packs to use. The default is scoped so that pack tools cannot silently collide with each other or with user-defined tools.

______________________________________________________________________

## 7. Settings: three forms, typed injection is primary

Every pack declares a `settings_schema` — a Pydantic `BaseModel` subclass. If a pack has no configurable settings, use the shipped `EmptySettings`:

```python
from activegraph.packs import EmptySettings, Pack

pack = Pack(..., settings_schema=EmptySettings)
```

The user provides settings at load time:

```python
rt.load_pack(pack, settings=DiligenceSettings(
    llm_model="claude-sonnet-4-5",
    confidence_threshold_for_review=0.7,
))
```

If `settings_schema` accepts construction with no arguments (all fields default), `settings=` may be omitted. Otherwise omitting raises `PackSettingsMissingError`.

Behaviors access settings in **one of three forms**, in order of preference:

### Form 1: typed parameter injection (primary)

The runtime inspects the handler's signature. Parameters beyond the standard `(event, graph, ctx)` or `(event, graph, ctx, out)` whose type annotation matches a loaded pack's `settings_schema` are injected by keyword:

```python
@llm_behavior(name="claim_extractor", ...)
def claim_extractor(event, graph, ctx, out, *, settings: DiligenceSettings):
    if out.confidence < settings.confidence_threshold_for_review:
        return
    ...
```

Type-checker-friendly. IDE-friendly. Refactor-safe. **Recommended for all new in-pack behaviors.** Use keyword-only (`*,`) so the runtime always invokes by keyword and the parameter name is clear.

### Form 2: `ctx.settings` (secondary)

`ctx.settings` returns the settings instance for the pack that owns the currently-executing behavior. Convenient when you don't want a type annotation:

```python
def claim_extractor(event, graph, ctx, out):
    if out.confidence < ctx.settings.confidence_threshold_for_review:
        return
```

Equivalent to Form 1 at runtime. Use when the type is obvious from the file context.

### Form 3: `ctx.pack_settings("other_pack")` (cross-pack, rare)

```python
def my_behavior(event, graph, ctx):
    memory_settings = ctx.pack_settings("memory")
    if memory_settings is None:
        return  # memory pack not loaded
    ...
```

String-keyed. Returns `None` for unloaded packs. **Using `ctx.pack_settings("diligence")` from inside the diligence pack is a code smell** — use Form 1 or Form 2. This form exists for the rare case where a behavior needs to read another pack's settings.

______________________________________________________________________

## 8. Prompts: TOML frontmatter, content-hash replay

Pack prompts live in `prompts/` inside the pack package. Each prompt is a markdown file with TOML frontmatter between `---` delimiters:

```markdown
---
version = "1.0.0"
name = "claim_extractor"   # optional; defaults to filename without .md
---
You extract factual claims from a document.

For each claim, return:
- text (verbatim, ≤ 200 chars)
- confidence (0.0–1.0, calibrated)
- supporting evidence (verbatim quote)

Do not invent claims. If the document does not support a claim, do not return it.
```

Parsed with `tomllib` (stdlib, Python 3.11+). No external YAML parser is used; the codebase deliberately stays YAML-free.

The frontmatter MUST include `version`. Other keys are advisory.

Load prompts with the helper:

```python
from pathlib import Path
from activegraph.packs import load_prompts_from_dir

pack = Pack(
    ...,
    prompts=load_prompts_from_dir(Path(__file__).parent / "prompts"),
)
```

`load_prompts_from_dir`:

- scans `*.md` files in the directory
- parses TOML frontmatter (raises `PackPromptLoadError` on malformed)
- computes a SHA-256 hash of the body, truncated to 16 hex chars (`"sha256:abcd...ef01"`)
- returns a tuple of `PackPrompt(name, version, body, content_hash)`

### The hash, not the version, is the replay contract

When the pack loads, the runtime emits a `pack.loaded` event whose payload includes a `prompts` map: `{prompt_name: {"version": "1.0.0", "hash": "sha256:..."}, ...}`.

On replay, the same event must be emitted with the same hashes. If you edit a prompt and don't bump the version, replay fires `ReplayDivergenceError` — the hash caught it. The error message includes the declared version on both sides so an operator sees "v1.0.0 → v1.0.0 — version unchanged, content drift," not just an opaque hash mismatch.

Bumping the declared version is good operator practice (it shows up in the trace and in `pack.loaded` payloads), but it is not the source of truth for correctness. The hash is. This is by design: humans forget; hashes don't.

### Referencing prompts from behaviors

Each `@llm_behavior` resolves its prompt by name. If the behavior is declared in a pack and the pack has a prompt with the same `name=`, that prompt is used as the behavior's prompt template:

```python
# prompts/claim_extractor.md   ← frontmatter version=1.0.0
@llm_behavior(name="claim_extractor", ...)
def claim_extractor(...):
    ...
```

If you need an explicit override, pass `prompt_template="..."` to `@llm_behavior` directly. Inline templates are also content-hashed and pinned in `pack.loaded`.

______________________________________________________________________

## 9. Policies

```python
from activegraph.packs import PackPolicy

policies = [
    PackPolicy(
        name="memo_approval",
        requires_approval=("memo",),  # object types
    ),
    PackPolicy(
        name="risk_approval",
        requires_approval=("risk",),
    ),
]
```

Loaded policies modify how `graph.add_object` behaves: objects of the listed types are emitted as `object.proposed` (not `object.created`) and require `rt.approve(id)` before becoming visible in the projected graph.

Policy names are pack-scoped via the same prefixing rule: `diligence.memo_approval`.

`DiligenceSettings.auto_approve_memos: bool = True` (default true so the demo flows without manual intervention) lets the pack flip the gating off. Set to `False` to see the approval flow.

______________________________________________________________________

## 10. Discovery via Python entry points

Packs register themselves under the `activegraph.packs` entry point group:

```toml
# pyproject.toml of any pack
[project.entry-points."activegraph.packs"]
diligence = "activegraph.packs.diligence:pack"
```

The framework can enumerate installed packs:

```python
from activegraph.packs import discover, load_by_name

for entry in discover():
    print(entry.name, entry.version)
```

`pip install activegraph-my-extension` then `runtime.load_pack( load_by_name("my-extension"))` Just Works. This is the third-party distribution mechanism.

`discover()` is cached per process; call `clear_discovery_cache()` to force a re-scan (useful in tests that install packages dynamically).

______________________________________________________________________

## 11. Fixtures and reproducible demos

A pack that ships a demo should ship recorded fixtures alongside, so the demo runs without API keys and produces byte-for-byte identical output:

```text
activegraph_my_pack/
  fixtures/
    __init__.py
    companies.py   # canned LLM responses + tool outputs
```

The convention is:

- Fixtures live inside the pack package, NOT in the framework and NOT in the user's `tests/` directory.
- A `RecordedProvider` class (matching the `LLMProvider` protocol) is exported from `pack.fixtures` and is used by the demo.
- Fixture builders are pure-Python — no I/O at import time, no network, no sleeping.
- The demo runs in under 30 seconds in CI.

The shipped Diligence pack does this. Look at `activegraph/packs/diligence/fixtures/` for the reference layout.

______________________________________________________________________

## 12. Pack discovery and loading: idempotency

`runtime.load_pack(pack, settings=...)` is **idempotent on `(name, version)`**. Calling it twice with the same `(name, version)` is a no-op (no second `pack.loaded` event, no re-prefixing).

Loading the same `name` with a different `version` raises `PackVersionConflictError` — install conflicts. The runtime cannot hold two versions of the same pack.

Loading two distinct packs that conflict on object types, relation types, behavior names, tool names, or policy names raises `PackConflictError`. The error names both packs and the conflicting identifier. **Conflict detection runs before any state mutation** — a failed `load_pack` leaves the runtime unchanged.

______________________________________________________________________

## 13. The `pack.loaded` event

```json
{
  "id": "evt_005",
  "type": "pack.loaded",
  "payload": {
    "name": "diligence",
    "version": "0.1.0",
    "description": "Investment diligence ...",
    "object_types": ["company", "document", "question", "claim", ...],
    "relation_types": ["supports", "contradicts", ...],
    "behaviors": ["diligence.question_generator", "diligence.researcher", ...],
    "tools": ["diligence.fetch_company_docs", ...],
    "policies": ["diligence.memo_approval", "diligence.risk_approval"],
    "prompts": {
      "question_generator": {"version": "1.0.0", "hash": "sha256:..."},
      ...
    },
    "settings": {<JSON-serialized settings>}
  }
}
```

`pack.loaded` lives in the event log. The trace renders it; the JSONL export includes it; `activegraph inspect` surfaces it. It is NOT suppressed from the queue — pack-aware behaviors can subscribe to `pack.loaded` to bootstrap (the shipped Diligence pack does not, but the option exists).

Re-loading an already-loaded pack does not emit a second `pack.loaded`. The settings payload is canonical-JSON-serialized so diffs between runs surface settings drift.

______________________________________________________________________

## 14. Pack scaffolding: `activegraph pack new <name>`

```sh
activegraph pack new my-pack
cd my-pack
pip install -e .
pytest                                # smoke test passes
python -c "import my_pack; print(my_pack.pack)"
```

The scaffolding command produces a package that:

- declares `activegraph` as a dependency
- registers itself under the `activegraph.packs` entry point
- has empty stubs for object types, behaviors, tools, settings
- has a `tests/test_pack_loads.py` smoke test that imports the pack, asserts no global registry side effects, loads it into a fresh runtime, and asserts the `pack.loaded` event appears

The package name (directory and Python package) is the kebab-to-snake transformation of the pack name: `pack new diligence-extension` produces `diligence-extension/` with internal package `diligence_extension/`.

`activegraph pack list` enumerates every pack the framework can discover in the current Python environment (entry-point name, version, and dotted import path). Useful for verifying that `pip install activegraph-extension` registered correctly before calling `load_by_name`.

______________________________________________________________________

## 15. Trust model and packs as code

**Packs are not sandboxed.** A pack is a Python package. Installing a pack is equivalent to installing any Python package: it can read your files, make network calls, exec arbitrary code in your process. Trust at install time, not at runtime.

The runtime does not enforce any pack-specific privilege restrictions. There is no allowlist, no capability system, no syscall filter. If you don't trust a pack's source, don't install it. This is the same model as `pip` and as Python itself.

This decision is locked. See CONTRACT v0.9 #12.

______________________________________________________________________

## 16. Backward compatibility

The pack format is a strict addition. All v0–v0.9 tests pass unchanged in v1.0. Global decorators behave exactly as before. The `Graph.add_object` path is unchanged in the no-packs-loaded case.

If you have a v0.7-era custom diligence example (`examples/ diligence_with_tools.py`), it continues to work. The pack does not replace it; the pack is a different audience (using a pre-built system) than the example (building a custom system from primitives).

______________________________________________________________________

## 17. Where to look in the reference implementation

- `activegraph/packs/__init__.py` — public Pack API, decorators, exceptions, prompt loader.
- `activegraph/packs/loader.py` — `Runtime.load_pack` internals, conflict detection, namespace prefixing, settings injection.
- `activegraph/packs/discovery.py` — entry point enumeration.
- `activegraph/packs/scaffold.py` — `activegraph pack new`.
- `activegraph/packs/diligence/` — the reference pack. Read this end-to-end before writing your own.
- `examples/diligence_real_run.py` — the killer demo / executable spec for the pack format.
- `tests/test_packs_*.py` — every property in this document is tested.

Implementation details may evolve. The contract in `CONTRACT.md` v0.9 is the binding reference. This guide explains the *why*.
# Cookbook

# Common patterns

Recurring idioms with copy-pasteable code. Each pattern is one sub-section: the code, a short rationale, and a pointer at the concept page that owns the underlying primitive.

If you're writing a new behavior and one of these patterns fits, use it — the patterns are how the framework's primitives compose to solve the everyday shapes that come up in agentic systems. If none of them fit, you're probably reaching for a primitive in a new way, and the [concepts](https://docs.activegraph.ai/concepts/graph/index.md) section is the right next stop.

## Retry behaviors on transient failures

The canonical pattern for handling LLM or tool failures that are non-deterministic (network errors, rate limits, timeouts). `behavior.failed` events carry the original `reason` code; a retry behavior subscribes to them with a `where=` filter on the codes that warrant retry:

```python
from activegraph import behavior

@behavior(
    name="retry_transient",
    on=["behavior.failed"],
    where={
        "reason": [
            "llm.network_error",
            "llm.rate_limited",
            "tool.timeout",
            "tool.network_error",
        ],
    },
)
def retry_transient(event, graph, ctx):
    attempt = (event.payload.get("attempt") or 0) + 1
    if attempt > 3:
        return
    graph.emit("retry.requested", {
        "for_event": event.payload["triggering_event_id"],
        "attempt": attempt,
        "behavior": event.payload["behavior"],
    })
```

Retries are first-class graph citizens (CONTRACT v0.6 #13). Every retry appears in the trace and can be forked from. Per-behavior caps live in the behavior body; the framework doesn't have a global retry policy. See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for why `behavior.failed` is an event rather than an exception that escapes to your code.

## Fork-and-diff to compare alternative hypotheses

When you want to know "what would happen if I changed this setting," fork from a point before the setting takes effect, run the fork with the override, and diff.

The `fork --set` flag is part of the v1.1 release

The CLI form below shows the `--set <pack>.<key>=<value>` flag documented in CONTRACT v1.0. The flag itself lands in v1.1 (see [CONTRACT v1.1 #1](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md#v11-1-cli-flags-specd-but-not-implemented)). Until then, use the Python-API form in [Fork with a pack-setting override (v1.0 — Python API)](#fork-with-a-pack-setting-override-v10-python-api) below.

```bash
# Find the event before the setting matters (usually the goal
# event or a pack.loaded event):
activegraph inspect <store> --run-id <run> --tail 50

# Fork with the override (v1.1):
activegraph fork <store> --run-id <run> --at-event <evt> \
    --label cautious \
    --set diligence.confidence_threshold_for_review=0.9 \
    --record

# Diff the two runs:
activegraph diff <store> --run-a <parent> --run-b <fork>
```

The diff prints shared events, parent-only events, fork-only events, and divergent objects. The first divergent object tells you where the override started producing different work. See [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) for the cutoff semantics and the `--set` rules (pack-settings-only, fail-loud-on-typo).

## Fork with a pack-setting override (v1.0 — Python API)

The canonical home for the fork-with-override workflow until the CLI's `--set` flag lands in v1.1. The Python form does the same thing the CLI form will: copies the parent's events up to the fork point, then resumes execution under different pack settings.

```python
from activegraph import Graph, IDGen, FrozenClock, Runtime
from activegraph.packs.diligence import DiligenceSettings, pack as diligence_pack
from activegraph.packs.diligence.fixtures import (
    RecordedDiligenceProvider, THREE_COMPANIES, company_goal,
)
from activegraph.store import open_store

PARENT_URL = "sqlite:////tmp/activegraph_quickstart/quickstart_demo_run.db"
PARENT_RUN = "quickstart_demo_run"
FORK_RUN = "quickstart_cautious_fork"

# Find a fork point — typically the goal.created event for the
# company you want to re-run with the override.
parent_store = open_store(PARENT_URL, run_id=PARENT_RUN)
fork_at = next(
    e.id for e in parent_store.iter_events()
    if e.type == "goal.created"
)

# Copy parent events up to the fork point into the new run.
from activegraph.store.sqlite import SQLiteEventStore
SQLiteEventStore.fork_run(
    "/tmp/activegraph_quickstart/quickstart_demo_run.db",
    parent_run_id=PARENT_RUN,
    new_run_id=FORK_RUN,
    at_event_id=fork_at,
    label="cautious",
    created_at="2026-01-01T00:00:00Z",
)

# Load the fork and run it with the override settings.
fork_rt = Runtime.load(PARENT_URL, run_id=FORK_RUN)
fork_rt.load_pack(
    diligence_pack,
    settings=DiligenceSettings(
        llm_model="claude-sonnet-4-5",
        confidence_threshold_for_review=0.9,  # ← the override (was 0.7)
    ),
)
fork_rt.run_until_idle()
fork_rt.save_state()
```

Diff the two runs from the CLI as usual:

```bash
activegraph diff sqlite:////tmp/activegraph_quickstart/quickstart_demo_run.db \
    --run-a quickstart_demo_run \
    --run-b quickstart_cautious_fork
```

The diff shows the structural difference produced by the threshold change. When `--set` lands in v1.1, the same workflow collapses to a single CLI command; until then, this is the canonical recipe.

## Pattern subscriptions for cross-object reactivity

When a behavior should fire only when a specific structural relationship exists in the graph, use a pattern subscription instead of an event-type filter:

```python
@behavior(
    name="risk_escalator",
    pattern="(c:claim)-[:supports]->(e:evidence) WHERE c.confidence > 0.7",
)
def risk_escalator(event, graph, ctx):
    for match in ctx.matches:
        claim = match.bindings["c"]
        evidence = match.bindings["e"]
        ...
```

The pattern matcher reads the full graph; the behavior body operates on `ctx.matches`, one entry per binding combination that satisfies the pattern. See [`patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) for the supported Cypher subset and what's deliberately refused.

## `ctx.propose_object` for policy-gated writes

When an object should require approval before landing — memos, risks, anything an operator should review — use `ctx.propose_object` instead of `graph.add_object`:

```python
@behavior(name="memo_synthesizer", on=["claims.complete"])
def memo_synthesizer(event, graph, ctx):
    ...
    ctx.propose_object(
        "memo",
        data={"title": "Diligence memo", "body": "..."},
        reason="diligence run complete",
    )
```

The proposal lands as `approval.proposed`. If the pack's auto-approve setting is on, the framework approves immediately and the object lands. If off, the proposal sits until `rt.approve(id)` is called. See [`policies`](https://docs.activegraph.ai/concepts/policies/index.md) for the full lifecycle.

The operator-side enumeration pattern:

```python
for pa in rt.pending_approvals():
    print(pa.id, pa.kind, pa.object_type, pa.reason)
    rt.approve(pa.id, approved_by="reviewer")
```

## Scoped views for cost-efficient LLM behaviors

When an LLM behavior only needs to read a few neighbors of the triggering object, narrow the view to bound prompt size and cost:

```python
@behavior(
    name="claim_summarizer",
    on=["object.created"],
    where={"object.type": "claim"},
    view={"around": "event.payload.object.id", "depth": 1},
)
def claim_summarizer(event, graph, ctx):
    claim = ctx.view.get_object(event.payload["object"]["id"])
    for neighbor in ctx.view.objects():
        ...
```

`around=` + `depth=` scope what `ctx.view` returns. The prompt assembler serializes the view; smaller view, smaller prompt. LLM behaviors that pass the full graph to the prompt assembler are the canonical source of unbounded cost growth in agentic systems — scoping is the answer. See [`views`](https://docs.activegraph.ai/concepts/views/index.md).

## `@relation_behavior` for coordination logic between endpoints

When the logic semantically belongs to a relationship, not to either endpoint, use `@relation_behavior`:

```python
from activegraph import relation_behavior

@relation_behavior(
    name="auto_unblock",
    relation_type="depends_on",
    on=["task.completed"],
)
def auto_unblock(relation, event, graph, ctx):
    if event.payload["task_id"] == relation.source:
        graph.patch_object(relation.target, {"status": "open"})
```

The behavior fires once per matching edge. See [`relations`](https://docs.activegraph.ai/concepts/relations/index.md) for the decision rule between relation behaviors and regular behaviors.

## Emit a custom event for cross-behavior signaling

When two behaviors need to coordinate but neither owns the trigger, emit a custom event from one and subscribe from the other:

```python
@behavior(name="produce", on=["object.created"])
def produce(event, graph, ctx):
    ...
    graph.emit("memo.ready_for_review", {"memo_id": memo.id})


@behavior(name="review", on=["memo.ready_for_review"])
def review(event, graph, ctx):
    ...
```

Custom event names use dot-namespace convention (`my.feature.event`); behaviors subscribing by name pick them up. The events land in the trace alongside framework events. See [`events`](https://docs.activegraph.ai/concepts/events/index.md).

## Save state across processes

When a long-running goal needs to survive process restart, attach a SQLite store and call `save_state` at quiescence:

```python
rt = Runtime(graph, persist_to="/path/to/run.db")
rt.run_goal("...")
rt.save_state()
```

To resume later:

```python
rt = Runtime.load("sqlite:////path/to/run.db", run_id=rt.run_id)
rt.run_until_idle()
```

Restoring loads the event log and replays it. Behaviors fire fresh after the replay; the framework treats them as a continuation of the original run. See [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) for the full operator-facing surface.

# Debugging

The framework's audit trail is the same artifact whether you're investigating a production bug or developing a new behavior. The event log records what happened; the trace renders it for human scanning; `activegraph inspect` slices it for narrow questions; the fork primitive lets you re-run with one variable changed to isolate cause.

This page is the diagnostic walkthrough: how to use the framework's operator surface to answer the common debugging questions in order, from "what just happened" to "why did it happen that way."

## Read the trace first

```python
rt.print_trace()
```

The trace is the event log rendered with one event per line, tags in brackets, short summaries. Read it top-to-bottom for a small run, scan for tag patterns in a large one. Every event the framework emits has a tag and a payload summary; nothing happens that isn't in the trace.

For a saved run:

```bash
activegraph inspect <store-url> --run-id <run> --tail 100
```

The `--tail` flag bounds output. Increase it (`--tail 500`, or no `--tail` for the full log) when the question is "what led up to the failure" rather than "what failed."

## Narrow with `activegraph inspect`

The CLI has selectors for the three diagnostic questions that come up repeatedly:

```bash
# Print one event's full payload (every error message names an
# event id; this is the next click):
activegraph inspect <store> --event <event-id>

# List behaviors registered in this run (compare against what
# fired in the trace to spot missing dispatch):
activegraph inspect <store> --behaviors

# Show pack versions and prompt content hashes (compare against
# replay-divergence errors to spot prompt drift):
activegraph inspect <store> --pack-version
```

The three selectors are mutually exclusive — they're focused queries, not filters on the full status output. See the [CLI reference](https://docs.activegraph.ai/cookbook/reference/cli/index.md) for the full surface.

## Read `behavior.failed` events

Most failures inside a goal run land as `behavior.failed` events, not as escaped exceptions. The runtime catches the behavior's exception, captures the type/message/reason, emits the event, and keeps going. The trace shows them inline:

```text
[behavior.failed]   evt_NNN  your.behavior  reason=llm.parse_error
```

To read the full payload (the original exception, the provider/tool response, any `payload_extras`):

```bash
activegraph inspect <store> --event <behavior.failed-id>
```

The structured `reason` codes group failures by recovery shape; the [error catalog](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md) has per-reason recovery prose. The [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) page covers why these are events rather than escaped exceptions.

## Walk the causal chain

Every event carries `caused_by` — the id of the event that triggered the behavior that produced this one. Walking the chain backward reconstructs the causal path from any event to its root.

```python
# Walk backward from a specific event:
event = rt.graph.get_event(event_id)
chain = []
while event is not None:
    chain.append(event)
    event = rt.graph.get_event(event.caused_by) if event.caused_by else None

for e in reversed(chain):
    print(f"{e.id}  {e.type}  {e.actor}")
```

The chain ends at a root event — usually `goal.created` (an operator-pushed goal) or a custom event from outside the runtime loop. Reading the chain is the answer to "why did this fire?"

## Fork-and-replay-in-isolation for narrowing bugs

When the question is "is the bug in this specific behavior, or upstream?", fork the run at a point before the suspected behavior fires, re-run with the behavior modified or removed, and diff:

```bash
# Fork from the event just before the suspect behavior fired:
activegraph fork <store> --run-id <run> --at-event <evt-before> \
    --label suspect-removed \
    --record

# Run the fork through your modified behavior set, then diff:
activegraph diff <store> --run-a <parent-run> --run-b <fork-run>
```

The diff output shows shared events, parent-only events, fork-only events, and divergent objects. The first divergent object tells you where the new behavior set started producing different work. [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) covers the fork primitive in detail; the [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) page covers what fires when strict-mode fork detects drift.

## When the bug is in your prompt

LLM behaviors are debug-instrumented by default. Every call lands two events:

```text
[llm.requested]    evt_NNN  your.behavior  model=... prompt_hash=...
[llm.responded]    evt_NNN  your.behavior  cache_hit=false ...
```

To read the full prompt the framework assembled:

```bash
activegraph inspect <store> --event <llm.requested-id>
```

The payload includes the system message, the messages list, and the tool definitions — exactly what went to the provider. If the behavior's body parses the response wrong, the `llm.responded` event has the raw response. If the prompt itself is wrong, the `llm.requested` event is the source.

For prompt-hash drift across runs (the "this worked yesterday" case), compare `--pack-version` between the two runs. The prompt content hash is in the `pack.loaded` event; if the hashes differ, the prompt template changed.

## Reproducing intermittent failures

If a failure only happens sometimes (rate limits, race conditions in external systems, model temperature variance), the trace from the failing run is still the most reliable artifact. Save it:

```bash
activegraph export-trace <store> --run-id <run> --to failure.jsonl
```

Then re-run with `RecordedLLMProvider` against the saved fixtures — the recorded provider replays the recorded responses deterministically, so the failing path runs the same way every time. The fixture-missing path (no recorded response for a given prompt) raises [`llm-behavior-error`](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md) with `reason=llm.fixture_missing`, which is informative on its own.

## What's related

- [The trace](https://docs.activegraph.ai/concepts/events/index.md) — the rendered form of the event log.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why most failures are events, not exceptions.
- [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) — the primitive for fork-and-replay-in-isolation debugging.
- [`replay`](https://docs.activegraph.ai/concepts/replay/index.md) — the strict-vs-permissive modes and what each one catches.
- [Error catalog](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) — per-error recovery prose. Every error message links into the catalog.
- [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) — the production-facing operator guide; this page is the developer-facing complement.

# Multi-run scripts

A common pattern when scripting against the framework: run a goal, inspect the result, run another goal in a fresh `Runtime` against a fresh `Graph`. Tests do this through the autouse `clear_registry()` fixture in `tests/conftest.py`; user scripts hit the same shape when they iterate on a hypothesis ("what would happen with this seed event vs that one") inside one process.

The wrinkle: the framework's `@behavior` decorators populate a global registry on module import. `clear_registry()` empties it for isolation between runs. After the first clear, the second `Runtime(graph)` finds the registry empty and runs no behaviors — because the modules whose decorators populated it are already imported, so re-importing them is a no-op.

v1.0.1 ships two small additions that make this pattern explicit:

- `clear_registry()` returns the list of behaviors it cleared.
- `register(behavior_obj)` appends a behavior back into the global registry.

Capture once, re-register per run:

```python
from activegraph import Graph, Runtime, behavior, clear_registry, register


@behavior(name="extract_claims", on=["document.created"])
def extract_claims(event, graph, ctx):
    ...


@behavior(name="check_contradictions", on=["claim.created"])
def check_contradictions(event, graph, ctx):
    ...


# Capture the registry once at module top, right after the
# decorators have run.
REGISTERED_BEHAVIORS = clear_registry()


def run_one(seed_documents: list[dict]) -> Graph:
    for b in REGISTERED_BEHAVIORS:
        register(b)
    graph = Graph()
    for doc in seed_documents:
        graph.add_object("document", doc)
    rt = Runtime(graph)
    rt.run_until_idle()
    clear_registry()
    return graph


# Now scripts can iterate on hypotheses without stale-registry surprises:
g1 = run_one([{"title": "Q3 update", "body": "..."}])
g2 = run_one([{"title": "Q4 update", "body": "..."}])
g3 = run_one([{"title": "Annual report", "body": "..."}])
```

The same pattern works for `@relation_behavior` and `@llm_behavior` — `clear_registry()` returns every kind of registered behavior in registration order, and `register()` accepts any of them.

## Why the captured list rather than re-importing the module

Importing a module a second time runs no decorator code — Python caches the module after the first import. To re-populate the registry from a re-import you'd have to `del sys.modules[...]` and re-`import`, which is fragile and slow once the module imports dozens of types and constants.

Capturing the list once and re-registering is the same shape that the framework's own test conftest uses (the autouse `clear_registry()` fixture relies on test-module re-imports being no-ops; the registry stays empty between cases because each test that needs behaviors defines them inline).

## When NOT to use this pattern

If you only need one `Runtime` per Python process — the usual shape for a long-running agent process, a CLI command, or a single notebook cell — you don't need any of this. The decorators populated the registry once at import; the single `Runtime(graph)` picks them up; you're done.

The multi-run pattern is for scripts that iterate. Hypothesis sweeps, A/B comparisons in one process, batch jobs that want per-input graph isolation without per-input process startup.

## See also

- [Fork-and-diff to compare alternative hypotheses](https://docs.activegraph.ai/cookbook/common-patterns/#fork-and-diff-to-compare-alternative-hypotheses) — when the second run should branch from the first's state rather than start from scratch, fork instead.
- [Debugging](https://docs.activegraph.ai/cookbook/debugging/index.md) — when a run misbehaves, the trace is the first thing to read.

# Migration from v0.7

This page is the runbook for upgrading runs and code from v0.7 through to v1.0. Each milestone added surface; backward compatibility was preserved throughout (CONTRACT v0.7 #22 / v0.8

# 14 / v0.9 #21), so the upgrades are additive — your existing

behaviors keep working as you adopt new primitives.

Three milestones span the upgrade path: **v0.8** added the Postgres store, store URLs, migration, and the observability surface; **v0.9** added the pack format and shipped the Diligence reference pack; **v1.0** is the adoption-surface milestone (this release) — the per-error catalog, the doc site, the quickstart, and the gates.

The order below is the order to apply the changes. Skip steps that don't apply.

## 1. Upgrade the activegraph package

```bash
pip install --upgrade activegraph
```

`activegraph[all]` pulls in the optional extras (anthropic, psycopg, prometheus_client, pydantic). For a minimal install, just `activegraph` — see [`missing-optional-dependency`](https://docs.activegraph.ai/reference/errors/missing-optional-dependency/index.md) for the per-feature extras.

## 2. Migrate the store schema

The store schema_version advanced across milestones. v1.0 expects schema_version `'1'`; runs written by older builds carry their own. Mismatched schemas raise [`schema-version-mismatch`](https://docs.activegraph.ai/reference/errors/schema-version-mismatch/index.md) at open time.

To migrate runs forward:

```bash
activegraph migrate --from sqlite:///old.db --to sqlite:///new.db
```

The migration is transaction-per-run, idempotent, one-directional (CONTRACT v0.8 #5). Each run migrates in a single transaction; a failed run leaves the destination unchanged for that run, and re-running picks up where it left off.

If the source has corrupted event payloads, add `--skip-corrupted` to recover the readable subset (CONTRACT v1.0 PR-C):

```bash
activegraph migrate --from sqlite:///old.db --to sqlite:///new.db \
    --skip-corrupted
```

The skipped event ids appear in the per-run report.

## 3. Adopt connection URLs (v0.7 → v0.8)

v0.7 store construction took a path argument:

```python
# v0.7:
rt = Runtime(graph, persist_to="/path/to/run.db")
```

v0.8 added connection URLs as the canonical addressing form, with the path form preserved as shorthand for SQLite:

```python
# v0.8+:
rt = Runtime(graph, persist_to="/path/to/run.db")           # still works
rt = Runtime(graph, store=SQLiteEventStore("/path/to/run.db"))
rt = Runtime(graph, store=PostgresEventStore("postgres://host/db"))
```

URLs are required for the CLI and for cross-store operations (`activegraph migrate`, `activegraph inspect <url>`). See [`invalid-store-url-error`](https://docs.activegraph.ai/reference/errors/invalid-store-url-error/index.md) for the URL grammar.

If your code passes a bare filesystem path to a CLI command (an easy v0.7-era habit), the CLI rejects it with `InvalidStoreURL` naming the corrected URL. The fix is `sqlite:///<your-path>` or `sqlite:////<absolute-path>` (note the slash count).

## 4. Adopt the pack format (v0.8 → v0.9)

v0.9 introduced packs. If your v0.7/v0.8 code declared behaviors, tools, and object types globally with `@behavior` / `@tool`, it keeps working — packs are additive. Loading a pack adds its behaviors and tools to the runtime alongside the global ones.

To author a pack from existing v0.7/v0.8 code, see [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md). The shipped [Diligence pack](https://docs.activegraph.ai/cookbook/packs/diligence.md) is the reference example.

Two things to know if you're loading third-party packs:

- **Pack name conflicts.** Two loaded packs claiming the same canonical symbol raises [`pack-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md). Rename one pack or load them in separate runtimes.
- **Pack version pinning.** A runtime holds at most one version of any pack; loading a different version raises [`pack-version-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-version-conflict-error/index.md).

## 5. Adopt the v1.0 error hierarchy (v0.9 → v1.0)

Every exception the framework raises now inherits from `ActiveGraphError`. The v1.0 hierarchy preserves builtin lineage through multi-inheritance, so existing `except ValueError` / `except KeyError` / `except TypeError` clauses keep working:

```python
# v0.9 — these patterns still work in v1.0:
try:
    store.get_event(event_id)
except KeyError:
    ...

try:
    graph.add_object("claim", bad_data)
except ValueError:
    ...
```

The v1.0 hierarchy adds richer catches:

```python
# v1.0 — broader catches with structured context:
try:
    rt = Runtime.load(url, run_id=rid)
except activegraph.StorageError as e:
    log(e.what_failed, e.how_to_fix, e.context)
except activegraph.ActiveGraphError as e:
    log(e.what_failed, e.how_to_fix)
```

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the hierarchy and the events-not-exceptions principle.

## 6. Adopt the v1.0 CLI follow-ons

v1.0 added five operator-facing CLI flags that the error messages reference in their recovery prose:

- `activegraph inspect <run> --event <event-id>`
- `activegraph inspect <run> --behaviors`
- `activegraph inspect <run> --pack-version`
- `activegraph fork <run> --at-event <evt> --set <key>=<value>`
- `activegraph fork <run> --at-event <evt> --record`
- `activegraph migrate --from <src> --to <dst> --skip-corrupted`

If your operator runbooks reference older flag combinations, the new flags are additive — old commands keep working. See the [CLI reference](https://docs.activegraph.ai/cookbook/reference/cli/index.md) for the full surface.

## 7. Adopt structured logging (v0.7 → v0.8)

v0.8 added structured logging with a documented schema. If your v0.7 deployment was reading the trace stream from stderr, the v0.8 schema is richer and JSON-shaped; opt in via:

```python
from activegraph import configure_logging
configure_logging(level="INFO", json_output=True)
```

The structured schema is documented under [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md). The legacy text logs still emit when `json_output=False`.

## 8. Adopt the metrics protocol (v0.7 → v0.8)

v0.8 added a three-method `Metrics` protocol with two shipped backends (NoOp by default, Prometheus opt-in). Existing code without metrics keeps working — `NoOpMetrics` is the default, so no surface changes if you don't opt in. To enable Prometheus:

```bash
pip install 'activegraph[prometheus]'
```

```python
from activegraph import PrometheusMetrics
rt = Runtime(graph, metrics=PrometheusMetrics())
```

See [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) for the metric names and the operator contract (CONTRACT v0.8 #9).

## Backward compatibility — what's guaranteed

Every v0–v0.9 test passes unchanged in v1.0 (CONTRACT v1.0 #9). The only deliberately-changed user-visible surface is two trace snapshot files (`llm_trace.txt`, `tool_trace.txt`) that gained the `[trace.flags]` rollup header in v0.9.1 — operator-visible but additive, not removing.

For specific compatibility questions, the per-milestone CONTRACT sections enumerate the back-compat clauses:

- v0.7 #22 — v0/v0.5/v0.6 tests pass; trace snapshots stay byte-identical except for the `prompt_normalized=true` flag
- v0.8 #14 — v0–v0.7 tests pass; library APIs unchanged
- v0.9 #21 — v0–v0.8 tests pass; pack loading is opt-in
- v1.0 #9 — all 384 v0–v0.9 tests pass through every v1.0 PR

If something that worked in v0.7–v0.9 doesn't work in v1.0, that's a bug — file an issue at [GitHub Issues](https://github.com/yoheinakajima/activegraph/issues).

## What's related

- [`schema-version-mismatch`](https://docs.activegraph.ai/reference/errors/schema-version-mismatch/index.md) — the error this page is forward-referenced from.
- [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) — the v0.8+ operator surface in detail.
- [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md) — the v0.9 pack format reference.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — the v1.0 hierarchy and the events-not-exceptions principle.
# Reference

# Command-line reference

The `activegraph` CLI is a thin wrapper around library APIs (CONTRACT v0.8 #12). Every operation it performs is also available programmatically; the CLI is the convenient form for operators, scripts, and CI.

## Invocation

```text
activegraph <subcommand> [<args>] [<options>]
```

Every subcommand supports `--help` for the autogenerated flag documentation (`activegraph inspect --help`). Top-level `activegraph --help` lists every subcommand; `activegraph --version` prints the installed version.

## Exit codes (CONTRACT v0.8 #13)

| Code | Constant             | Meaning                                                                                                                                                                              |
| ---- | -------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| 0    | `EXIT_OK`            | Subcommand succeeded.                                                                                                                                                                |
| 1    | `EXIT_GENERIC_ERROR` | An unhandled error occurred, or a per-run report contains failures (`migrate` sets this when any run failed).                                                                        |
| 2    | `EXIT_USAGE_ERROR`   | Invalid arguments (click's default — bad flags, missing required options, unparseable values).                                                                                       |
| 3    | `EXIT_NOT_FOUND`     | A named resource doesn't exist (store, run, event id, behavior name).                                                                                                                |
| 4    | `EXIT_CORRUPTION`    | The store contains data the framework can't decode (caught via [`CorruptedEventPayloadError`](https://docs.activegraph.ai/reference/errors/corrupted-event-payload-error/index.md)). |
| 5    | `EXIT_DIVERGENCE`    | A strict-mode replay diverged from the recorded log (caught via [`ReplayDivergenceError`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md)).           |

This table is the single source of truth. The [operating guide](https://docs.activegraph.ai/guides/operating-in-production/index.md) and several error pages reference it.

## Connection URLs

Every CLI command that opens a store takes a URL. Two schemes are supported:

```text
sqlite:///relative/path/to/run.db        # three slashes — relative
sqlite:////absolute/path/to/run.db       # four slashes — absolute
postgres://user:pass@host:port/dbname    # also accepted: postgresql://
```

Bare filesystem paths are **rejected** with a clear message naming the corrected URL — see [`invalid-store-url-error`](https://docs.activegraph.ai/reference/errors/invalid-store-url-error/index.md). The framework refuses to silently coerce because guessing wrong would open an unintended store.

## `--json` convention

Every subcommand below supports `--json` for machine-readable output. The JSON shapes are documented per-subcommand; the shapes are stable within a major version (CONTRACT v0.8 #12). Text output is for human scanning; JSON is for scripts and log aggregators.

______________________________________________________________________

## `inspect`

Print a status snapshot of a run.

```text
activegraph inspect <url> [--run-id <run>] [--tail N] [--json]
                          [--event <evt-id>] [--behaviors] [--pack-version]
```

| Flag             | Meaning                                                                 |
| ---------------- | ----------------------------------------------------------------------- |
| `<url>`          | Store URL. Required positional.                                         |
| `--run-id <run>` | Run to inspect. Defaults to the most recent run in the store.           |
| `--tail N`       | Recent events to include. Default 20.                                   |
| `--json`         | Machine-readable output.                                                |
| `--event <id>`   | Print one event's full payload by id.                                   |
| `--behaviors`    | Print only the registered-behaviors section.                            |
| `--pack-version` | Print every `pack.loaded` event (name, version, prompt content hashes). |

The three selectors (`--event`, `--behaviors`, `--pack-version`) are mutually exclusive — they're focused queries, not filters on the full status. Combining them is a usage error (exit 2).

**JSON shape (default)**: `{run_id, state, queue_depth, events_processed, frame, budget, registered_behaviors, recent_events}`. With a selector, the shape narrows to that selector's payload (one event, one behaviors list, one packs list).

Exits: 0 on success, 3 if the store / run / event id doesn't exist, 2 on bad selector combination.

______________________________________________________________________

## `replay`

Rebuild the graph from a run's event log without firing behaviors.

```text
activegraph replay <url> --run-id <run> [--json]
```

| Flag             | Meaning                         |
| ---------------- | ------------------------------- |
| `<url>`          | Store URL. Required positional. |
| `--run-id <run>` | Run to replay. Required.        |
| `--json`         | Machine-readable output.        |

`replay` runs in permissive mode — it doesn't compare the re-emitted stream against the recording, so it can't fire [`replay-divergence-error`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md). For strict-mode replay, use the `replay_strict=True` argument on `Runtime.load` programmatically.

**JSON shape**: `{run_id, events, objects, relations}`.

Exits: 0 on success, 3 if the store / run doesn't exist.

______________________________________________________________________

## `fork`

Create a new run by copying events from a parent run up to and including `--at-event`.

The `--set` flag is part of the v1.1 release

`--set <pack>.<setting>=<value>` is documented in CONTRACT v1.0 and shown in the signature below, but the implementation lands in v1.1 (see [CONTRACT v1.1 #1](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md#v11-1-cli-flags-specd-but-not-implemented)). Until then, use the Python-API form in [Fork with a pack-setting override (v1.0 — Python API)](https://docs.activegraph.ai/cookbook/common-patterns/#fork-with-a-pack-setting-override-v10-python-api) for fork-with-override workflows. Other flags below (`--at-event`, `--label`, `--to`, `--record`, `--json`) are available now.

```text
activegraph fork <url> --run-id <parent> --at-event <evt> \
                      [--label <text>] [--to <dest-url>]
                      [--set <key>=<value>] [--record] [--json]
```

| Flag                  | Meaning                                                                                                                                                                                                                                                                                                          |
| --------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `<url>`               | Source store URL. Required positional.                                                                                                                                                                                                                                                                           |
| `--run-id <parent>`   | Parent run to fork from. Required.                                                                                                                                                                                                                                                                               |
| `--at-event <evt>`    | Event id at which to fork (inclusive). Required.                                                                                                                                                                                                                                                                 |
| `--label <text>`      | Optional human-readable label for the new run.                                                                                                                                                                                                                                                                   |
| `--to <dest-url>`     | Destination store URL. Defaults to the source store (cross-store fork is not supported in v1.0; use `migrate` first).                                                                                                                                                                                            |
| `--set <key>=<value>` | Override a pack setting in the fork. Pack-settings-only; dotted-path `<pack>.<setting>=<value>`; multiple `--set` flags compose; unknown keys fail loud with a registration-time error (see the [errors catalog](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) for the family). |
| `--record`            | Mark the fork as a re-recording. The new run accepts new cache entries instead of strict-checking against the parent's prompt hashes.                                                                                                                                                                            |
| `--json`              | Machine-readable output.                                                                                                                                                                                                                                                                                         |

Fork is the durable parallel-context primitive. The shared-lineage model and the `--set` semantics are documented in [`concepts/forking`](https://docs.activegraph.ai/concepts/forking/index.md). The [fork-and-diff cookbook pattern](https://docs.activegraph.ai/cookbook/common-patterns/index.md) shows the common workflow.

**JSON shape**: `{parent_run_id, new_run_id, at_event, label, events_copied, recording?}` (`recording: true` when `--record` was passed).

Exits: 0 on success, 3 if the parent run or `--at-event` id doesn't exist, 2 if `--to` names a different store from the source (cross-store fork not supported).

______________________________________________________________________

## `diff`

Compare two runs in the same store.

```text
activegraph diff <url> --run-a <run> --run-b <run> [--json]
```

| Flag            | Meaning                         |
| --------------- | ------------------------------- |
| `<url>`         | Store URL. Required positional. |
| `--run-a <run>` | Left-hand run. Required.        |
| `--run-b <run>` | Right-hand run. Required.       |
| `--json`        | Machine-readable output.        |

The output is the trio that matters for fork-and-diff workflows: shared events, parent-only events, fork-only events, plus the list of divergent objects and relations (objects that exist in both runs but with different state).

**JSON shape**: `{run_a, run_b, shared_events, parent_only_events, fork_only_events, divergent_objects, divergent_relations}`.

Exits: 0 on success, 3 if either run doesn't exist.

______________________________________________________________________

## `export-trace`

Export a run's trace as `text` or `jsonl`.

```text
activegraph export-trace <url> --run-id <run> [--format text|jsonl] [--out <path>]
```

| Flag             | Meaning                                                                           |
| ---------------- | --------------------------------------------------------------------------------- |
| `<url>`          | Store URL. Required positional.                                                   |
| `--run-id <run>` | Run to export. Required.                                                          |
| `--format`       | `text` (default; the human-scannable trace) or `jsonl` (one event JSON per line). |
| `--out <path>`   | Destination file. Defaults to stdout.                                             |

`jsonl` format is the canonical handoff to log aggregators and event-processing pipelines. Every event lands as one JSON object per line with the full payload.

Exits: 0 on success, 3 if the store / run doesn't exist.

______________________________________________________________________

## `migrate`

Copy runs from a source store to a destination store.

```text
activegraph migrate --from <src-url> --to <dst-url> \
                    [--run-id <run>...] [--skip-corrupted] [--json]
```

| Flag               | Meaning                                                                                                                                                                                 |
| ------------------ | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `--from <src>`     | Source store URL. Required.                                                                                                                                                             |
| `--to <dst>`       | Destination store URL. Required.                                                                                                                                                        |
| `--run-id <run>`   | Migrate only these run(s). Repeat the flag to specify multiple. Defaults to all runs in the source.                                                                                     |
| `--skip-corrupted` | Skip events whose payload fails JSON decode instead of failing the run. Skipped event ids appear in the per-run report. The destination run is **partial** — the operator is on notice. |
| `--json`           | Machine-readable output.                                                                                                                                                                |

Migration is **transaction-per-run, idempotent, one-directional** (CONTRACT v0.8 #5). Each run writes in a single destination transaction; a failure mid-run rolls back that run's destination state. Writes use `INSERT ... ON CONFLICT DO NOTHING` on `(id, run_id)` so re-running after a failure is safe.

`--skip-corrupted` is the recovery primitive for runs containing [`CorruptedEventPayloadError`](https://docs.activegraph.ai/reference/errors/corrupted-event-payload-error/index.md). Without the flag, a corrupted event in any run causes that run to fail (other runs migrate normally). With the flag, the corrupted events are recorded in `skipped_events` on the per-run report and the rest of the run migrates.

**JSON shape**: `{source_url, dest_url, runs: [{run_id, status, events_migrated, error?, skipped_events?}]}`. `status` is `"ok"` or `"failed"`.

Exits: 0 if every run's status is `"ok"`; 1 if any run failed; 2 on a bad source/destination URL.

______________________________________________________________________

## `pack new`

Scaffold a new pack package skeleton.

```text
activegraph pack new <name> [--output-dir <path>]
```

| Flag                  | Meaning                                                                                              |
| --------------------- | ---------------------------------------------------------------------------------------------------- |
| `<name>`              | Pack name. Required positional. Becomes the Python module name and the `Pack(name=...)` declaration. |
| `--output-dir <path>` | Directory under which to create the package. Defaults to the current directory.                      |

Generates `pyproject.toml`, the Python package with stubs for object types, behaviors, tools, settings, an example prompt, a smoke test, and a README. After `cd <name>` and `pip install -e .`, the pack is discoverable via `activegraph pack list` and loadable via `Runtime.load_pack`.

The [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md) guide is the companion reference.

Exits: 0 on success, 2 on bad arguments.

______________________________________________________________________

## `pack list`

List installed packs discovered via the `activegraph.packs` entry-point group.

```text
activegraph pack list
```

Prints one pack per line: name, version, distribution name. The underlying mechanism is the same `discover()` API programmatically — see [`load-by-name`](https://docs.activegraph.ai/reference/api/packs/#activegraph.load_by_name) in the API reference.

Exits: 0 always (an empty list is not an error).

______________________________________________________________________

## `quickstart`

> **Placeholder.** The `quickstart` subcommand is documented when the command ships in the v1.0-rc1 work. The shape is locked by the transcript at `examples/quickstart_session.txt` — fixture-backed demo of the Diligence pack with a "what just happened" prose section, plus an `--interactive` mode for writing a first behavior.

______________________________________________________________________

## What's related

- [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) — the deployed-runtime operator guide. References the EXIT_CODES table here.
- [Debugging](https://docs.activegraph.ai/cookbook/debugging/index.md) — the diagnostic walkthrough. Uses every selector flag on `inspect`, plus `fork`, `diff`, and `export-trace`.
- [Migration from v0.7](https://docs.activegraph.ai/cookbook/migration-from-v0-7/index.md) — the upgrade runbook. Uses `migrate` and references the URL conventions here.
- [`forking`](https://docs.activegraph.ai/concepts/forking/index.md) — the conceptual model for the `fork` command.
- [`replay`](https://docs.activegraph.ai/concepts/replay/index.md) — the conceptual model for the `replay` command and the strict-vs-permissive distinction.
- [`invalid-store-url-error`](https://docs.activegraph.ai/reference/errors/invalid-store-url-error/index.md) — what fires when a URL is malformed.

# LLM providers

Active Graph ships two concrete `LLMProvider` implementations. Both expose identical Protocol surface — `complete()`, `estimate_cost()`, `count_tokens()` — so a runtime swapping one for the other doesn't reshape any call site. Choose by the model family you want; everything else is the same.

```python
from activegraph import Graph, Runtime
from activegraph.llm import AnthropicProvider, OpenAIProvider

rt = Runtime(Graph(), llm_provider=AnthropicProvider())  # or:
rt = Runtime(Graph(), llm_provider=OpenAIProvider())
```

## Installing

Pick one of three extras. They install cleanly and don't conflict.

```bash
pip install "activegraph[anthropic]"   # AnthropicProvider only
pip install "activegraph[openai]"      # OpenAIProvider only
pip install "activegraph[llm]"         # both providers
```

The `[openai]` extra also pulls in `tiktoken` so client-side token counting is accurate; see the count_tokens row below for what happens when tiktoken is missing.

## API keys

Both providers read their API key from the environment, never from code or a checked-in config:

```bash
export ANTHROPIC_API_KEY='...'
export OPENAI_API_KEY='...'
```

Override the env-var name via the `api_key_env=` constructor kwarg if you need a different one (per-environment key rotation, for example).

## Default model resolution

Each provider declares a `default_model` — the model name the runtime uses when an `@llm_behavior` doesn't pin one explicitly:

```python
@llm_behavior(name="extractor", output_schema=Claim)
def extractor(event, graph, ctx, llm_output):
    ...
```

With `AnthropicProvider()` this resolves to `"claude-sonnet-4-5"`; with `OpenAIProvider()` it resolves to `"gpt-4o-mini"`. The runtime stamps the resolved name onto the behavior at registration time (inside `Runtime(...)`'s first registry materialization), so swapping providers is a one-line change:

```python
rt = Runtime(Graph(), llm_provider=OpenAIProvider())  # gpt-4o-mini
rt = Runtime(Graph(), llm_provider=AnthropicProvider())  # claude-sonnet-4-5
```

Pass `model="..."` on the decorator to override:

```python
@llm_behavior(name="extractor", output_schema=Claim, model="gpt-4o")
def extractor(event, graph, ctx, llm_output):
    ...
```

## Cross-provider model-name validation

When a behavior pins `model="..."` explicitly, the runtime checks the name against each shipped provider's `recognizes_model()`:

| Provider            | Recognized prefixes         |
| ------------------- | --------------------------- |
| `AnthropicProvider` | `claude-`                   |
| `OpenAIProvider`    | `gpt-`, `o1-`, `o3-`, `o4-` |

If the configured provider doesn't recognize the name but a *different* shipped provider does, the runtime raises `InvalidRuntimeConfiguration` at registration time with a structured error naming both providers. This catches the most common shape of provider-swap misconfiguration — an `@llm_behavior` copied from an Anthropic example into an OpenAI-configured runtime — before the first network call, instead of letting the provider 404 silently.

Names no shipped provider recognizes (custom deployments, OpenAI fine-tunes like `ft:gpt-4o-mini:org::id`, internal naming conventions) pass through silently. The validation is permissive by design: only *recognized* cross-provider mismatches fire.

## Side-by-side

| Aspect                                                     | `AnthropicProvider`                                                                                                                                                                                                                                   | `OpenAIProvider`                                                                                                                                                             |
| ---------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `default_model` (used when `@llm_behavior` omits `model=`) | `"claude-sonnet-4-5"`                                                                                                                                                                                                                                 | `"gpt-4o-mini"`                                                                                                                                                              |
| Recognized model families (per `recognizes_model()`)       | `claude-*`                                                                                                                                                                                                                                            | `gpt-*`, `o1-*`, `o3-*`, `o4-*`                                                                                                                                              |
| API key env                                                | `ANTHROPIC_API_KEY`                                                                                                                                                                                                                                   | `OPENAI_API_KEY`                                                                                                                                                             |
| SDK                                                        | `anthropic>=0.40`                                                                                                                                                                                                                                     | `openai>=1.0`                                                                                                                                                                |
| Structured output                                          | Instruction-based: schema + example instance embedded in the system prompt by [`build_system_prompt`](https://docs.activegraph.ai/reference/api/index.md); provider parses JSON out of the response via the shared `parse_structured_response` helper | Same path. Native `response_format={"type":"json_schema",...}` mode is a v1.1 candidate                                                                                      |
| `count_tokens()`                                           | Server-side via `messages.count_tokens` (1 roundtrip per call when `budget.max_cost_usd` is set and no cache hit)                                                                                                                                     | Client-side via `tiktoken` when available; char/4 heuristic fallback with a one-time debug log if tiktoken is missing                                                        |
| Tool use                                                   | Supported (`Tool.to_definition()` emits Anthropic shape)                                                                                                                                                                                              | **Not supported in v1.0.1.** A non-empty `tools=` raises `LLMBehaviorError(reason="llm.network_error")` with a v1.1 pointer. Tool-shape translation is a scheduled v1.1 item |
| Exception mapping                                          | `llm.rate_limited` on 429-shaped errors; `llm.network_error` for everything else (timeouts, connection errors, **auth failures**)                                                                                                                     | Same mapping                                                                                                                                                                 |
| Pricing                                                    | Family-prefix lookup; override with `pricing=` kwarg                                                                                                                                                                                                  | Family-prefix lookup; override with `pricing=` kwarg                                                                                                                         |

## Mixing with [`RecordedLLMProvider`](https://docs.activegraph.ai/reference/api/index.md)

The fixture-backed provider is provider-agnostic: fixtures are keyed by prompt-content hash, and the model name (`claude-…` or `gpt-…`) is part of the hash input. Fixtures recorded against one provider replay against `RecordedLLMProvider` regardless of which live provider you switch to next.

```python
from activegraph.llm import RecordingLLMProvider, OpenAIProvider

inner = OpenAIProvider()
provider = RecordingLLMProvider(inner, fixtures_dir="tests/fixtures/llm")
```

`RecordingLLMProvider` wraps either concrete provider the same way. Record once against a live key, commit the fixtures, run tests against `RecordedLLMProvider` thereafter.

## Writing a custom provider

`LLMProvider` is a runtime-checkable `Protocol`. Any class with the three methods is a provider — no inheritance required, no registration step:

```python
from decimal import Decimal
from activegraph.llm import LLMMessage, LLMResponse, LLMProvider

class MyProvider:
    default_model = "my-model-name"   # v1.0.2 #1 — used when @llm_behavior omits model=

    def complete(self, *, system, messages, model, max_tokens,
                 temperature, top_p, output_schema, timeout_seconds,
                 tools=None) -> LLMResponse:
        ...

    def estimate_cost(self, *, input_tokens, output_tokens, model) -> Decimal:
        ...

    def count_tokens(self, *, system, messages, model) -> int:
        ...

    def recognizes_model(self, name: str) -> bool:  # v1.0.2 #1
        return name.startswith("my-")

assert isinstance(MyProvider(), LLMProvider)
```

`default_model` and `recognizes_model` are additive (v1.0.2 #1). Custom providers that pre-date v1.0.2 and omit them keep working at the three core call sites — they just require an explicit `model=` on every `@llm_behavior` and don't participate in cross-provider validation.

If your provider exposes the framework's instruction-based structured-output path (most do), reuse `parse_structured_response(text, schema)` from `activegraph.llm.parsing` for byte-identical error semantics with the shipped providers — same `llm.parse_error` and `llm.schema_violation` reason codes for the same response shapes.

See [CONTRACT v1.0.1 #5](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md) for the provider-commitment surface: which methods are stable, which behaviors are provider-dependent (`count_tokens`), and which capabilities are explicitly v1.1 (tool use for OpenAI, native structured-output modes).

# Errors reference catalog

Every exception the framework raises has a dedicated reference page. The error message in the runtime ends with a `More:` link to its page; you should rarely need to visit this catalog directly.

If you arrived here from an error message, follow the link the message printed. If you're browsing — start with [ReplayDivergenceError](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) (the voice reference for the catalog) or [UnsupportedPatternError](https://docs.activegraph.ai/reference/errors/unsupported-pattern-error/index.md) (the authoring guide for pattern subscriptions).

The hierarchy itself is documented at [Concepts: Failure model](https://docs.activegraph.ai/concepts/failure-model/index.md).

## By category

The seven category bases match the [`ActiveGraphError` hierarchy](https://docs.activegraph.ai/reference/api/errors/index.md):

### ReplayError

- [ReplayDivergenceError](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md)

### PatternError

- [UnsupportedPatternError](https://docs.activegraph.ai/reference/errors/unsupported-pattern-error/index.md)
- [InvalidActivateAfter](https://docs.activegraph.ai/reference/errors/invalid-activate-after/index.md)

### StorageError

- [CorruptedEventPayloadError](https://docs.activegraph.ai/reference/errors/corrupted-event-payload-error/index.md)
- [DuplicateEventError](https://docs.activegraph.ai/reference/errors/duplicate-event-error/index.md)
- [EventNotFoundError](https://docs.activegraph.ai/reference/errors/event-not-found-error/index.md)
- [InvalidStoreURL](https://docs.activegraph.ai/reference/errors/invalid-store-url-error/index.md)
- [NonSerializableEventError](https://docs.activegraph.ai/reference/errors/non-serializable-event-error/index.md)
- [SchemaVersionMismatch](https://docs.activegraph.ai/reference/errors/schema-version-mismatch/index.md)

### ExecutionError

- [LLMBehaviorError](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md)
- [ToolError](https://docs.activegraph.ai/reference/errors/tool-error/index.md)
- [UnknownToolError](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md)
- [ApprovalNotFoundError](https://docs.activegraph.ai/reference/errors/approval-not-found-error/index.md)

### ConfigurationError

- [MissingProviderError](https://docs.activegraph.ai/reference/errors/missing-provider-error/index.md)
- [MissingToolError](https://docs.activegraph.ai/reference/errors/missing-tool-error/index.md)
- [MissingOptionalDependency](https://docs.activegraph.ai/reference/errors/missing-optional-dependency/index.md)
- [InvalidToolRegistration](https://docs.activegraph.ai/reference/errors/invalid-tool-registration/index.md)
- [InvalidRuntimeConfiguration](https://docs.activegraph.ai/reference/errors/invalid-runtime-configuration/index.md)
- [InvalidArgumentType](https://docs.activegraph.ai/reference/errors/invalid-argument-type/index.md)
- [RuntimeContextRequiredError](https://docs.activegraph.ai/reference/errors/runtime-context-required-error/index.md)
- [InvalidPatchLifecycleState](https://docs.activegraph.ai/reference/errors/invalid-patch-lifecycle-state/index.md)

### RegistrationError

- [BehaviorNotFoundError](https://docs.activegraph.ai/reference/errors/behavior-not-found-error/index.md)
- [AmbiguousBehaviorError](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md)
- [ToolNotFoundError](https://docs.activegraph.ai/reference/errors/tool-not-found-error/index.md)
- [AmbiguousToolError](https://docs.activegraph.ai/reference/errors/ambiguous-tool-error/index.md)
- [IncompatibleRuntimeState](https://docs.activegraph.ai/reference/errors/incompatible-runtime-state/index.md)

### PackError

- [PackNotFoundError](https://docs.activegraph.ai/reference/errors/pack-not-found-error/index.md)
- [PackConflictError](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md)
- [PackVersionConflictError](https://docs.activegraph.ai/reference/errors/pack-version-conflict-error/index.md)
- [PackSchemaViolation](https://docs.activegraph.ai/reference/errors/pack-schema-violation/index.md)

### Internal (framework-bug voice)

- [InternalEvaluatorError](https://docs.activegraph.ai/reference/errors/internal-evaluator-error/index.md)

## What's related

- [Concepts: Failure model](https://docs.activegraph.ai/concepts/failure-model/index.md) — the hierarchy and the events-not-exceptions principle.
- [API reference: Errors](https://docs.activegraph.ai/reference/api/errors/index.md) — the class hierarchy rendered from docstrings.
- [Cookbook: Debugging](https://docs.activegraph.ai/cookbook/debugging/index.md) — diagnostic workflows that build on the per-error catalog.

# AmbiguousBehaviorError

A short behavior name (`researcher`) resolves to behaviors in more than one loaded pack. The runtime can't pick one without an explicit choice, so it refuses the lookup and asks for the canonical pack-prefixed form.

This page is the anchor for the framework's **canonical-strict / lookup-lenient** name resolution rule. The other lookup errors ([`behavior-not-found`](https://docs.activegraph.ai/reference/errors/behavior-not-found-error/index.md), [`tool-not-found`](https://docs.activegraph.ai/reference/errors/tool-not-found-error/index.md), [`ambiguous-tool`](https://docs.activegraph.ai/reference/errors/ambiguous-tool-error/index.md)) link here for the rule statement.

## Quick fix

Use the canonical form `<pack_name>.<behavior_name>`:

```python
# Instead of:
b = rt.get_behavior("researcher")        # ambiguous if two packs declare it

# Use the canonical form:
b = rt.get_behavior("diligence.researcher")
```

The error message names which packs collided and shows the canonical form using one of them as a copy-paste example. The fix is one edit at the call site.

## The resolution rule (canonical strict, lookup lenient)

The framework resolves behavior names against the runtime registry using two precedence rules, locked in CONTRACT v0.9 #8:

1. **Canonical names are strict.** A name containing a dot (`diligence.researcher`) addresses exactly one symbol — the behavior declared under that fully-qualified name in the loaded pack `diligence`. If no such symbol exists, the lookup raises [`behavior-not-found`](https://docs.activegraph.ai/reference/errors/behavior-not-found-error/index.md).
1. **Short names are lenient.** A name without a dot (`researcher`) resolves to a canonical name *only when the resolution is unambiguous*. If one pack declares `diligence.researcher` and nothing else uses the short name `researcher`, the lookup succeeds. If two packs both declare a `researcher`, the lookup refuses with `AmbiguousBehaviorError`.

The rule keeps the convenient single-pack case ergonomic while making the multi-pack case explicit. Behaviors registered globally (not from a pack) follow the same rule using their declared name as both canonical and short.

The same rule applies to tool name resolution — see [`ambiguous-tool`](https://docs.activegraph.ai/reference/errors/ambiguous-tool-error/index.md).

## How to diagnose

The error message names every pack that collided on the short name:

```text
AmbiguousBehaviorError: behavior name 'researcher' is ambiguous
across loaded packs

What failed:
  The short behavior name 'researcher' resolves to behaviors in
  more than one loaded pack: 'diligence', 'research'.
```

From code:

```python
try:
    b = rt.get_behavior("researcher")
except AmbiguousBehaviorError as e:
    print(e.name)         # 'researcher'
    print(e.packs)        # ('diligence', 'research')
```

To see all canonical behavior names the runtime knows about:

```bash
activegraph inspect <store> --behaviors
```

## When does this fire

At `rt.get_behavior(name)` and equivalent lookups (e.g., the runtime's internal `_lookup_behavior_by_name` paths). It does NOT fire during trigger dispatch — when an event fires and matches both packs' behaviors, the framework registers them separately and both fire on a matching event. The error is for explicit by-name access, not for trigger dispatch.

## Why the framework refuses to continue

Picking one pack silently would route dispatch by load order, which would change behavior on a re-arrangement of imports. Picking neither would be a no-op that the caller couldn't distinguish from "no behavior matched at all." The runtime refuses the lookup and asks for the canonical name — explicit beats either silent failure.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`behavior-not-found-error`](https://docs.activegraph.ai/reference/errors/behavior-not-found-error/index.md) — the sibling for "no behavior under any name." Uses the same resolution rule defined on this page.
- [`ambiguous-tool-error`](https://docs.activegraph.ai/reference/errors/ambiguous-tool-error/index.md) — the symmetric case for tool names. Same rule, same recovery.
- [`pack-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md) — fires at load time when two packs declare the same *canonical* name. This page is the load-time companion to that.

# AmbiguousToolError

A short tool name resolves to tools in more than one loaded pack. The runtime can't pick one without an explicit choice — picking silently would let an `@llm_behavior` call the wrong pack's tool, with a potentially-different input/output schema.

Symmetric with [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) — same canonical-strict / lookup-lenient resolution rule (the rule is defined in detail on that page; this page applies it to tools).

## Quick fix

Use the canonical form `<pack_name>.<tool_name>`:

```python
# Instead of:
t = rt.get_tool("fetch_docs")        # ambiguous if two packs declare it

# Use the canonical form:
t = rt.get_tool("diligence.fetch_docs")
```

If you want an `@llm_behavior` to use a specific pack's version, list the canonical name in the `tools=[...]` argument:

```python
@llm_behavior(
    name="researcher",
    tools=[
        "diligence.fetch_docs",   # canonical — unambiguous
        "diligence.fetch_filings",
    ],
    ...
)
```

The error message names which packs collided and shows the canonical form using one of them as a copy-paste example.

## How to diagnose

The error names the conflicting packs:

```text
AmbiguousToolError: tool name 'fetch_docs' is ambiguous across
loaded packs

What failed:
  The short tool name 'fetch_docs' resolves to tools in more than
  one loaded pack: 'diligence', 'research'.
```

From code:

```python
try:
    t = rt.get_tool("fetch_docs")
except AmbiguousToolError as e:
    print(e.name)    # 'fetch_docs'
    print(e.packs)   # ('diligence', 'research')
```

`activegraph inspect <store> --behaviors` also lists registered tools alongside the behaviors that declare them.

## When does this fire

At `rt.get_tool(name)` lookups, and at `@llm_behavior` registration when the behavior's `tools=[...]` lists a short name that resolves ambiguously. The check is the same as the behavior-name check — short names are lenient unless they're ambiguous.

It does NOT fire during an LLM tool-call. The LLM-call path uses the canonical name the behavior declared, which was already resolved at registration time. If the LLM tries to call a tool the behavior didn't declare, you get [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) instead.

## Why the framework refuses to continue

Tool canonical names are unique across loaded packs for the same reason behavior names are: silent dispatch routing would let an `@llm_behavior` call the wrong pack's tool, with a potentially- different input/output schema. Refusing the lookup surfaces the conflict at registration time rather than producing wrong-shape data at runtime.

See [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) for the canonical statement of the resolution rule. See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) — the sibling for behavior names. Canonical statement of the canonical-strict / lookup-lenient resolution rule lives there.
- [`tool-not-found-error`](https://docs.activegraph.ai/reference/errors/tool-not-found-error/index.md) — the sibling for "no tool under any name." Uses the same resolution rule.
- [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) — fires at LLM-call time when the LLM asks for a tool the behavior didn't declare; distinct from this page's lookup-time ambiguity.

# ApprovalNotFoundError

`runtime.approve(approval_id)` was called with an id that doesn't match any pending approval in this runtime. The framework refuses to no-op an unknown id — that would silently corrupt the approval audit trail.

Multi-inherits `LookupError` for back-compat: code that does `except LookupError` around the approval API continues to work.

## Quick fix

List currently-pending approvals and confirm the id matches:

```python
for pa in rt.pending_approvals():
    print(pa.id, pa.kind, pa.object_type, pa.reason)
```

The id you pass to `rt.approve(...)` has to be exactly one of those ids. The most common causes when it isn't:

- **The id was already approved or denied.** Approval is one-shot — each id is consumed when `approve()` succeeds or `reject()` denies it. Re-using an id after the first call always fires this error.
- **The runtime instance is fresh.** Approvals don't carry across `Runtime.load`. If you saved a run, restarted, and tried to approve an id from before the restart, the new runtime has no record of it.
- **Typo in the id.** The error message names the id you passed; the context dict also names how many approvals are currently pending, so a "currently 2 pending; none match" message is the operator signal that you have the right runtime but the wrong id.

The diligence demo's `step_2_approval_demo` (in `examples/diligence_real_run.py`) shows the canonical pattern: enumerate, then approve by id.

## How to diagnose

The error message names both the offending id and the pending count:

```text
ApprovalNotFoundError: no pending approval named 'approval_999'

What failed:
  runtime.approve('approval_999') could not find a pending approval
  with that id.
    There are currently 2 pending approvals; none of them match
    'approval_999'.
```

From code:

```python
try:
    rt.approve(approval_id)
except ApprovalNotFoundError as e:
    print(e.approval_id)
    print(e.pending_count)
```

If `pending_count` is `0` and you expected approvals, either no behavior has called `ctx.propose_object` under a gating policy yet, or the runtime is fresh after a `Runtime.load` and pending approvals weren't preserved.

## When does this fire

Only at `runtime.approve()`. The check happens twice in the implementation — once when no pack state exists (so no approvals can exist), and once after walking the pending-approvals list without finding a match. Either path produces the same error with appropriate `pending_count` context.

## Why the framework refuses to continue

Approval ids are generated by the runtime when a behavior calls `ctx.propose_object` under a gating policy (`memo_approval`, `risk_approval`, etc.). Each id is unique to its runtime instance and is consumed when `approve()` succeeds. A miss could mean the id is stale, the runtime is fresh, or the id is a typo. The framework refuses to no-op rather than guess — silently doing nothing on an unknown id would mean an operator could approve an already-applied or non-existent action without noticing the call didn't do anything, and the audit trail would lie about what was approved.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`concepts/policies`](https://docs.activegraph.ai/concepts/policies/index.md) — the policy mechanism that produces pending approvals. Covers the `ctx.propose_object` → `approval.proposed` → `runtime.approve()` → `approval.granted` lifecycle.
- `examples/diligence_real_run.py:step_2_approval_demo` — the canonical enumerate-then-approve pattern, runnable.

# BehaviorNotFoundError

`rt.get_behavior(name)` couldn't resolve the name to a registered behavior. The framework refuses to fall back to a fuzzy match or a no-op because wrong-behavior dispatch would silently corrupt the audit trail.

Multi-inherits `LookupError` for back-compat — code that does `except LookupError` around behavior lookups continues to work.

The name resolution rule (canonical strict, lookup lenient) is defined on [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) and applies here.

## Quick fix

Check the spelling and the pack:

```python
# List all registered behavior names:
status = rt.status(recent=0)
for b in status.registered_behaviors:
    print(b.name)
```

Or from the CLI:

```bash
activegraph inspect <store> --behaviors
```

Common causes when the name is right:

- **The behavior comes from a pack that isn't loaded.** Load the pack:

  ```python
  rt.load_pack(my_pack, settings=...)
  ```

- **The behavior is in user code that hasn't imported yet.** The `@behavior` decorator registers the behavior at module-import time. If the module containing the decorator runs after `Runtime(...)` is constructed, the registry will be empty. Import the module first.

- **You used a short name when a canonical name was needed.** If the behavior comes from a pack, use the fully-qualified form:

  ```python
  rt.get_behavior("diligence.researcher")   # not just "researcher"
  ```

  See [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) for the resolution rule.

## How to diagnose

The error names the offending name and the registered behaviors:

```text
BehaviorNotFoundError: no behavior named 'extract_claims' is loaded

What failed:
  rt.get_behavior('extract_claims') could not resolve the name to a
  registered behavior.
    registered: 'diligence.researcher', 'diligence.memo_synthesizer'
```

From code:

```python
try:
    b = rt.get_behavior("extract_claims")
except BehaviorNotFoundError as e:
    print(e.name)         # 'extract_claims'
    print(e.registered)   # the registered names
    print(e.context["pack_state"])  # True if any pack is loaded
```

If `registered` is empty, no behaviors are registered at all — likely an import ordering problem (the `@behavior` decorators haven't run).

If `pack_state` is `True` but the name doesn't appear in `registered`, the name might exist as a short name that resolves to a different canonical — try the canonical form.

## When does this fire

At `rt.get_behavior(name)` and equivalent lookups. The runtime's trigger dispatch path doesn't go through this — events fire behaviors that subscribed to their type, regardless of name lookup. This error is for explicit by-name access (the operator CLI's `inspect --behavior <name>`, programmatic introspection, test fixtures referencing behaviors directly).

## Why the framework refuses to continue

Behaviors are addressable by their declared name. The lookup is strict — the runtime refuses to fall back to a fuzzy match or a no-op because a wrong-behavior dispatch would silently corrupt the audit trail. Behaviors live either in the global registry (decorated with `@behavior` or `@llm_behavior` at module load) or in a loaded pack; if neither has the name, the lookup misses.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) — fires when a short name resolves to two different behaviors. Defines the canonical-strict / lookup-lenient rule that applies here.
- [`tool-not-found-error`](https://docs.activegraph.ai/reference/errors/tool-not-found-error/index.md) — the symmetric case for tool lookups.
- [`missing-provider-error`](https://docs.activegraph.ai/reference/errors/missing-provider-error/index.md) / [`missing-tool-error`](https://docs.activegraph.ai/reference/errors/missing-tool-error/index.md) — the construction-time variants for when an `@llm_behavior` declares dependencies that aren't registered.

# CorruptedEventPayloadError

A stored event's payload bytes don't parse as JSON. The framework refuses to silently skip the row — that would make the replay contract unverifiable, and the next fork or diff would lie about what happened. The fix is recovery (partial migration that skips the corrupt rows) or repair (manual edit of the offending row, if you have the original payload elsewhere).

This is distinct from [`NonSerializableEventError`](https://docs.activegraph.ai/reference/errors/non-serializable-event-error/index.md), which fires at *encode time* when a Python value can't be made into JSON. Corrupted payload fires at *decode time* when the bytes on disk can't be made into a Python value.

## Quick fix

```bash
# Recover the readable subset of the run. The destination run is
# partial — corrupted events are skipped, the rest are migrated.
# The skipped event ids appear in the per-run report.
activegraph migrate --from <src> --to <new-dst> --skip-corrupted

# To see surrounding events before deciding:
activegraph inspect <store> --tail 50
```

The `--skip-corrupted` flag walks the run row-by-row, decoding each event individually. Rows that fail JSON decode are recorded in `skipped_events` on the per-run report. Rows around the corruption that decode cleanly migrate to the destination.

If you have the original payload elsewhere (a previous run, a backup, a log), open the source store directly with `sqlite3` or `psql` and repair the row in place — preferable to skip-and-lose when the data is recoverable.

If the corruption is intrinsic and the run isn't worth recovering, re-run from the original goal in a fresh store. The store is append-only; partial corruption does not propagate backward in time.

## How to diagnose

The error message body shows the parser's error location and a preview of the corrupted payload:

```text
What failed:
  While reading a stored event payload, the JSON parser failed at
  line 1, column 24:
    Expecting value
    payload preview: '{"goal": "x", "broken":'
```

`column` is the position in the corrupted JSON where the parser gave up. `preview` is the first 64 bytes of the row's payload column.

From Python:

```python
try:
    rt = Runtime.load(url, run_id=run_id)
except CorruptedEventPayloadError as e:
    print(e.context["line"], e.context["column"])
    print(e.context["preview"])
    print(e.context["underlying_msg"])
```

To see which event id failed, look at the events near the failure point with `activegraph inspect <store> --tail 50` — the corrupted row will be the one immediately after the last readable event.

## When does this fire

At load time, whenever the store reads a row whose payload column doesn't parse as JSON. `Runtime.load`, `iter_events`, `activegraph inspect`, and `activegraph migrate` all trigger this if they hit a bad row. `activegraph migrate --skip-corrupted` is the only operation that catches the error per-row and continues; every other operation propagates it.

## Why the framework refuses to continue

The store persists every event payload as JSON so the audit trail is human-inspectable and round-trips through any JSON-aware tool. A row that doesn't parse means either the bytes on disk are corrupted, the store schema is mismatched (someone wrote a non-JSON format here), or an out-of-band edit damaged the file. Silently skipping the row would make the replay contract unverifiable — the next fork or diff would behave as if the event never happened, and the audit trail wouldn't record that anything went wrong.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`NonSerializableEventError`](https://docs.activegraph.ai/reference/errors/non-serializable-event-error/index.md) — the encode-time sibling. Fires when a Python value can't be written to JSON in the first place.
- [`SchemaVersionMismatch`](https://docs.activegraph.ai/reference/errors/schema-version-mismatch/index.md) — fires when the store opens cleanly but the schema_version meta row doesn't match this build. Distinct from corruption.
- `activegraph migrate --skip-corrupted` in the [CLI reference](https://docs.activegraph.ai/reference/errors/cli/index.md).

# DuplicateEventError

A store append failed because an event with the same id already exists in the run. Event ids must be unique per run — duplicates would silently reroute references that downstream code depends on, corrupting the audit trail.

In normal use, this never fires. The runtime's id generator (IDGen) is monotonic; events constructed through `graph.add_object`, `graph.emit`, etc. always have fresh ids. The error is almost always a test-fixture problem: hand-constructed events with fixed ids, plus state left over from a previous test.

Multi-inherits `ValueError` for back-compat: user code that does `except ValueError` around appends continues to work.

## Quick fix

If you're in test code:

```python
# Use IDGen to generate ids:
from activegraph import IDGen
ids = IDGen()
event = Event(id=ids.event(), type="my.event", payload={}, timestamp=...)

# Or call clear_registry() / construct a fresh Graph between tests:
from activegraph import clear_registry, Graph
clear_registry()
graph = Graph()
```

If you genuinely need fixed event ids in a fixture (e.g., for snapshot tests), ensure each test gets a fresh store rather than sharing state:

```python
@pytest.fixture
def fresh_store():
    return InMemoryEventStore(run_id="run_test")
```

If this fires in production code, you've found a bug — the runtime should never produce a duplicate id. File an issue with the run id and the event id that collided.

## How to diagnose

The error message names the offending event id and the run:

```text
DuplicateEventError: duplicate event id: evt_001

What failed:
  An event with id 'evt_001' already exists in this in-memory store.
  Appends are id-unique.
```

From code:

```python
try:
    store.append(event)
except DuplicateEventError as e:
    print(e.context["event_id"])
    print(e.context["run_id"])
```

If the collision is in a test, check whether the test's setup tears down state from the previous test — `pytest`'s function-scoped fixtures are the canonical pattern; module-scoped or session-scoped fixtures that hold an `InMemoryEventStore` will accumulate events across tests and produce duplicates on the second run of any test that constructs the same id.

## When does this fire

At `store.append()` only. Iteration, lookup, and read operations can't produce duplicates — they're append-side only.

The check is a constant-time lookup against the store's id index, so it adds no measurable cost to a clean append.

## Why the framework refuses to continue

Event ids are the addressing primitive for the entire framework. Behaviors reference events by id, the replay cache keys on them, the causal chain walks them. A duplicate id would silently reroute one of those references, with the second-added event shadowing the first or vice versa depending on store implementation. Either way, the audit trail would record two events as one.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`EventNotFoundError`](https://docs.activegraph.ai/reference/errors/event-not-found-error/index.md) — the sibling for the opposite failure: a lookup for an id that doesn't exist.
- `activegraph.IDGen` — the canonical id generator. Use it instead of hand-constructing ids in test fixtures unless you need fixed ids for a specific reason.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# EventNotFoundError

A store lookup asked for an event id that doesn't exist in the run. The framework refuses to return a default — that would silently corrupt any downstream fork, replay, or causal-chain walk.

Multi-inherits `KeyError` for back-compat: user code that does `except KeyError` around store lookups continues to work, and code that wants the richer context can `except EventNotFoundError` instead.

## Quick fix

Check the event id against what's actually in the run:

```bash
activegraph inspect <store-url> --run-id <run> --tail 100
```

The id you passed is in the error message's `What failed:` section; compare against the tail. The most common causes:

- **Typo in a hand-typed id** — `evt_42` vs `evt_042`. The error message uses `repr()` formatting so leading zeros and quote characters are visible.
- **Referencing an id from a different run** — event ids are unique per run; an id valid in one run isn't in another.
- **Run truncated by an earlier fork** — fork copies events up to and including `--at-event`; events after the cut don't appear in the forked run.

## How to diagnose

The error message names the operation that triggered it (lookup, `iter_events(after=)`, `iter_events(until=)`, `truncate_after`, fork cut), the event id, and the run id. From code:

```python
try:
    event = store.get_event(event_id)
except EventNotFoundError as e:
    print(e.context["event_id"])
    print(e.context["run_id"])
    print(e.context["driver"])  # 'sqlite' | 'postgres' | 'memory'
    print(e.context["operation"])  # 'fork' or absent for direct lookups
```

The fork operation has its own variant — when `activegraph fork` names an `--at-event` that doesn't exist in the parent run, the error's `operation` context is `"fork"` and the recovery prose points at the parent run's tail rather than the destination run.

## When does this fire

Any operation that addresses an event by id:

- `store.get_event(event_id)`
- `store.iter_events(after=event_id)` or `until=event_id`
- `store.truncate_after(event_id)`
- `activegraph fork <run> --at-event <event_id>`
- `activegraph inspect <run> --event <event_id>` (returns `EXIT_NOT_FOUND` at the CLI level rather than raising, but the underlying lookup is the same)

The check runs against the run's event index. A run with no events returns this error for any non-empty id; an empty id returns it for any operation.

## Why the framework refuses to continue

Event ids are the addressing primitive for the entire framework. Behaviors reference events by id, the replay cache keys on them, the causal-chain walk traverses them. A lookup against an unknown id is a bug in the caller; returning a default (None, empty list, no-op) would silently reroute downstream code that depends on the event existing.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`DuplicateEventError`](https://docs.activegraph.ai/reference/errors/duplicate-event-error/index.md) — the sibling for the opposite failure: an event id that already exists.
- `activegraph inspect --tail` in the [CLI reference](https://docs.activegraph.ai/reference/errors/cli/index.md) — the canonical command for listing valid ids in a run.
- `activegraph fork` — the most common operation that triggers `EventNotFoundError` with `operation="fork"` when the `--at-event` id isn't in the parent.

# IncompatibleRuntimeState

An operation requires a runtime state that isn't satisfied — either a state that must be set but isn't, or a state that mustn't be set but is. Two sites currently raise this: `runtime.fork()` requires a SQLite-backed runtime, and `graph.attach_store()` requires no store already attached.

This is part of the three-page Configuration cluster, along with [`invalid-runtime-configuration`](https://docs.activegraph.ai/reference/errors/invalid-runtime-configuration/index.md) and [`invalid-argument-type`](https://docs.activegraph.ai/reference/errors/invalid-argument-type/index.md). All three fire at construction or operation time when the runtime's setup doesn't match what the operation needs.

## Quick fix

The recovery depends on which site fired the error. The summary line names the operation and the current state.

### fork() requires SQLite

```bash
# Migrate the run to a SQLite store first, then fork:
activegraph migrate --from <current-url> --to sqlite:///fork-source.db
activegraph fork sqlite:///fork-source.db --run-id <run> --at-event <evt>
```

`fork` uses SQLite-specific transactional copy primitives (CONTRACT v0.8 #5). Postgres-native forking is a known v1.1 follow-on — file an issue if you need it for a production workflow.

### attach_store when one is already attached

```python
# Construct a new Graph rather than re-attaching:
fresh = Graph()
fresh.attach_store(new_store)

# Or, to copy the existing run to a new store, use migration:
# activegraph migrate --from <old-url> --to <new-url>
```

A Graph's store attaches at most once per lifetime. Re-attaching would split the event log across two stores (subsequent events going to the new one, earlier events stuck in the old) or require a copy operation that isn't an attach. Migration is the right primitive for moving a run between stores.

## How to diagnose

The error names the operation and the current runtime state:

```text
IncompatibleRuntimeState: runtime.fork() requires a SQLite-backed
runtime (current: PostgresEventStore)
```

From code:

```python
try:
    rt.fork(at_event=evt_id, label="...")
except IncompatibleRuntimeState as e:
    print(e.context["current_store_kind"])   # 'PostgresEventStore'
```

The `context` dict carries the current state so test code or operator scripts can branch on it before invoking the operation.

## When does this fire

At the operation itself — `runtime.fork()`, `graph.attach_store()` — not at runtime construction. The runtime constructs fine in both cases; the constraint is on what the runtime can do, not what it can be.

Multi-inherits `RuntimeError` for back-compat with code that catches the builtin around runtime operations.

## Why the framework refuses to continue

Both raise sites protect runtime invariants that, if violated, would corrupt the audit trail:

- Fork on non-SQLite would either skip the operation silently (no fork happens) or attempt a copy via a primitive the store doesn't support (partial copy that mixes runs).
- Re-attaching a store would split a single run's event log across two stores, making replay see only half.

Refusing the operation is the framework's way of asking the operator to pick the right primitive (migrate, fresh Graph) instead of discovering the data corruption later.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`invalid-runtime-configuration`](https://docs.activegraph.ai/reference/errors/invalid-runtime-configuration/index.md) — sibling for argument-shape problems at construction (persist_to vs store, save_state path conflicts, recent\<0).
- [`invalid-argument-type`](https://docs.activegraph.ai/reference/errors/invalid-argument-type/index.md) — sibling for wrong-type values at construction (e.g., PostgresEventStore target).
- `activegraph migrate` in the [CLI reference](https://docs.activegraph.ai/reference/errors/cli/index.md) — the canonical primitive for moving a run across stores.

# InternalEvaluatorError

**This is a framework bug, not a problem with your code.** A framework-internal evaluator received input it doesn't recognize — specifically, a comparison operator or AST node that the parser produced (or that user code injected via an AST shortcut) but the evaluator's dispatch table doesn't handle.

In normal use, this never fires. The framework's parsers produce a closed set of operators and AST nodes; the evaluators handle every member of that set. An unrecognized one means the parser and the evaluator are out of sync, or external code constructed an AST that bypassed the parser. Either way, the runtime would have to silently mis-evaluate the input to continue — which would corrupt the audit trail in a way you'd discover much later.

## Quick fix: file an issue

The error message includes the framework version, the internal location (module:function), and the offending input. Copy the body into a new GitHub Issue:

```text
https://github.com/yoheinakajima/activegraph/issues/new
```

Include the pattern, filter, or AST that triggered the error if possible. The error's `context` dict carries everything the issue template needs:

```python
try:
    ...
except InternalEvaluatorError as e:
    print(e.context["framework_version"])
    print(e.context["internal_error_location"])
    # Plus per-site context, e.g.:
    print(e.context.get("operator"))         # patterns.py / graph.py
    print(e.context.get("ast_node_type"))    # patterns.py only
```

## Immediate workaround

Until the bug is fixed, the framework can't proceed past the offending evaluation. Two paths work for most cases:

- **Simplify the pattern or filter that triggered it.** If the error fires on a complex pattern subscription, try a shorter version that matches the same intent. The framework supports a closed subset; an evaluator failure usually means the AST contains a node the supported subset doesn't include.
- **Catch and continue.** If the offending evaluation is non-critical (a view filter that can be skipped, an optional pattern subscription), catch `InternalEvaluatorError` at the call site and skip the evaluation. Code that does `except ValueError` around view operations continues to work — `InternalEvaluatorError` multi-inherits `ValueError` for back-compat.

## How to diagnose

The error's `internal_error_location` field names which evaluator fired:

- `activegraph/runtime/patterns.py:_eval_where (unknown comparison operator)` — the WHERE evaluator hit an unsupported operator while evaluating a pattern subscription's WHERE clause.
- `activegraph/runtime/patterns.py:_eval_where (unrecognized AST node)` — the WHERE evaluator hit an AST node type it doesn't handle.
- `activegraph/core/graph.py:evaluate_where` — the view-filter evaluator hit an unsupported operator while evaluating a view's WHERE filter.

All three sites use the shared `internal_bug_fields` helper so the message shape and context-dict keys are identical. A GitHub Issue filed from any of them arrives with the same metadata for triage.

## Why the framework refuses to continue

The operator table (`_OPS`) and the AST node set are closed and produced by the framework's parsers. An unrecognized operator or node means either the parser drifted from the evaluator (a framework-internal inconsistency) or the AST was constructed externally (bypassing the parser's validation). Both would produce silent mis-evaluation if the runtime continued — the WHERE filter would silently match or mismatch input it wasn't supposed to, and the audit trail wouldn't record that anything went wrong.

The framework refuses to evaluate rather than risk it. This is the invariant-protection stance applied to the framework's internal state, not just user input — see [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — the invariant-protection principle. This page is the framework applying that principle to its own internals.
- [GitHub Issues](https://github.com/yoheinakajima/activegraph/issues/new) — file framework-bug reports here. The error message carries everything the issue needs.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# InvalidActivateAfter

A behavior decorator passed an unparseable or out-of-range value to `activate_after`. The scheduler refuses values that don't denote a positive integer event count; wall-clock units are deliberately out of scope (CONTRACT v0.7 #13).

Multi-inherits `ValueError` for back-compat — code that catches `ValueError` around behavior registration continues to work.

## Quick fix

Pass a positive integer event count:

```python
# Accepted:
@behavior(name="...", activate_after=5)
@behavior(name="...", activate_after="5")
@behavior(name="...", activate_after="5 events")
@behavior(name="...", activate_after="5 event")

# Refused (raises this error):
@behavior(name="...", activate_after=0)            # must be >= 1
@behavior(name="...", activate_after=-1)           # must be >= 1
@behavior(name="...", activate_after=True)         # bool not int
@behavior(name="...", activate_after="5 seconds")  # wall-clock
@behavior(name="...", activate_after="five")       # unparseable
```

## How to diagnose

The error's `kind` field discriminates which validation rule fired:

```text
InvalidActivateAfter: activate_after='5 seconds' is invalid
(wall-clock unit)
```

From code:

```python
try:
    @behavior(activate_after="5 seconds")
    def my_behavior(...):
        ...
except InvalidActivateAfter as e:
    print(e.spec)   # '5 seconds'
    print(e.kind)   # 'wall-clock unit' | 'unparseable string' | etc.
```

The five `kind` discriminators each have their own recovery prose inline in the error message — bool-not-int, wall-clock unit, unparseable string, wrong-type, must-be->=1.

## Why wall-clock units are refused

`activate_after` schedules a behavior to fire N events after its triggering event. The runtime evaluates the schedule against the event log, not against wall-clock time, so replay produces identical timing (CONTRACT v0.7 #13). Wall-clock units would let scheduling depend on real time, which would make replay non-deterministic.

If you genuinely need wall-clock scheduling, file an issue — the v1+ contract leaves room for it behind a separate primitive.

## When does this fire

At behavior registration — the `@behavior` / `@llm_behavior` decorator runs at module import time and the scheduler parses `activate_after` then. Misconfiguration surfaces before any goal runs.

## What's related

- [Writing behaviors](https://docs.activegraph.ai/reference/guides/writing-behaviors.md) — the canonical reference for the `@behavior` decorator including `activate_after`.

# InvalidArgumentType

A value passed to a constructor or method has the wrong type. Used when the framework's contract is type-based — currently one site: `PostgresEventStore`'s target argument, which accepts a URL string, a `psycopg.Connection`, or a `psycopg_pool.ConnectionPool` and refuses anything else.

Part of the three-page Configuration cluster, alongside [`invalid-runtime-configuration`](https://docs.activegraph.ai/reference/errors/invalid-runtime-configuration/index.md) and [`incompatible-runtime-state`](https://docs.activegraph.ai/reference/errors/incompatible-runtime-state/index.md).

Multi-inherits `TypeError` for back-compat — code that catches the builtin around constructor argument validation continues to work.

## Quick fix

Pass one of the accepted target types:

```python
from activegraph.store.postgres import PostgresEventStore

# URL string:
PostgresEventStore("postgres://host/dbname", run_id="...")

# Borrowed Connection (the store doesn't take ownership):
PostgresEventStore(my_psycopg_connection, run_id="...")

# ConnectionPool (the store checks out per operation):
PostgresEventStore(my_connection_pool, run_id="...")
```

If you have a SQLAlchemy engine or another abstraction, extract a raw `psycopg.Connection` from it and pass that — the framework doesn't wrap higher-level abstractions because their connection lifecycle differs from psycopg's.

## How to diagnose

The error names the offending value and its Python type:

```text
InvalidArgumentType: PostgresEventStore target has wrong type
(got int)

What failed:
  PostgresEventStore was constructed with a target of type int:
    value: 42
    type:  int
  Accepted types are: a `postgres://...` URL string, a
  `psycopg.Connection`, or a `psycopg_pool.ConnectionPool`.
```

From code:

```python
try:
    store = PostgresEventStore(some_value, run_id="...")
except InvalidArgumentType as e:
    print(e.context["type"])   # 'int'
    print(e.context["repr"])   # the repr of the value (truncated)
```

If the type is unexpected, check whether you imported the right module — accidentally importing `from sqlalchemy import Connection` instead of `from psycopg import Connection` is the canonical mistake.

## When does this fire

At `PostgresEventStore(target, run_id=...)` construction, before any connection attempt. The check is the first thing the constructor does after stashing the run_id.

A bad target never reaches the driver — the framework rejects it at the Python boundary, so the error is a clean type mismatch with no network or DB activity.

## Why the framework refuses to continue

The constructor branches on the target's type — strings open a fresh connection, `Connection` instances are borrowed without ownership, `ConnectionPool` instances are checked out per operation. An unknown type has no defined connection lifecycle, and a fuzzy match (e.g., duck-typing on `cursor()`) would silently leak connections or double-close them.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`invalid-runtime-configuration`](https://docs.activegraph.ai/reference/errors/invalid-runtime-configuration/index.md) — sibling for argument-shape problems (conflicting kwargs, out-of-range, missing required).
- [`incompatible-runtime-state`](https://docs.activegraph.ai/reference/errors/incompatible-runtime-state/index.md) — sibling for state invariants violated at operation time.
- [`missing-optional-dependency`](https://docs.activegraph.ai/reference/errors/missing-optional-dependency/index.md) — fires before this error in the Postgres-without-psycopg path (the import fails first); both relate to PostgresEventStore construction.

# InvalidPatchLifecycleState

`graph.apply_patch(patch_id)` was called on a patch that isn't in the `'proposed'` state. Patches are one-shot: a proposed patch becomes `'applied'` (success) or `'rejected'` (refusal) exactly once. Re-applying an already-applied patch — or applying one that's been rejected — would either emit a duplicate `patch.applied` event (breaking replay) or contradict an explicit refusal.

This is an **exception**, not an event, because the caller has made a mistake the caller can fix at the call site — see [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the events-not-exceptions principle and why patch-lifecycle violations land on the exception side of the line.

## Quick fix

Check the patch's status before applying:

```python
patch = graph.get_patch(patch_id)
if patch.status == "proposed":
    graph.apply_patch(patch_id)
else:
    # Already applied, rejected, or in another terminal state.
    # Whatever you were going to do, the patch already did it (or
    # explicitly didn't). Don't re-apply.
    pass
```

If you genuinely need to apply a new mutation, propose a new patch — patches aren't re-used:

```python
new_patch = graph.propose_patch(target, op="patch", diff={...})
graph.apply_patch(new_patch.id)
```

## The patch lifecycle (three sentences)

A patch is created in `'proposed'` state by `graph.propose_patch`. `graph.apply_patch` transitions it to `'applied'` and emits a `patch.applied` event; `graph.reject_patch` transitions it to `'rejected'` and emits `patch.rejected`. Both transitions are terminal — a patch leaves `'proposed'` exactly once.

For the full lifecycle including optimistic concurrency on object versions and the policy-gating semantics, see [`concepts/patches`](https://docs.activegraph.ai/concepts/patches/index.md).

## How to diagnose

The error names the patch id and its current status:

```text
InvalidPatchLifecycleState: patch patch_017 is 'applied', not 'proposed'
```

From code:

```python
try:
    graph.apply_patch(patch_id)
except InvalidPatchLifecycleState as e:
    print(e.patch_id)        # 'patch_017'
    print(e.current_status)  # 'applied' | 'rejected'
```

To see what happened to the patch:

```bash
activegraph inspect <store> --event <patch.applied or patch.rejected event id>
```

The status transition is in the event log — every `patch.applied` and `patch.rejected` event names the patch it transitioned, so the audit trail shows when and why the patch left `'proposed'`.

## When does this fire

At `graph.apply_patch(patch_id)`, after the patch is fetched and its current status is read. The check is the second thing `apply_patch` does (after the patch-exists check that raises `KeyError` if the id is unknown), so misuse is caught early.

The error never fires from `propose_patch`, `reject_patch`, or `get_patch` — those are read-only or transition-initiating, not transition-completing.

## Why the framework refuses to continue

Patches are one-shot. A `'proposed'` patch becomes `'applied'` (success) or `'rejected'` (refusal) exactly once. Re-applying an already-applied patch would emit a duplicate `patch.applied` event, which would break the replay contract — replay would produce a different event stream than the original run. The framework refuses re-application rather than emit the duplicate.

This is why the error is an exception and not an event: the caller can fix it (check status, propose a new patch). It's not a non-fatal stop the runtime should record and continue past. See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`concepts/patches`](https://docs.activegraph.ai/concepts/patches/index.md) — the canonical patch lifecycle reference. Optimistic concurrency, policy gating, the apply/reject transitions.
- [`runtime-context-required-error`](https://docs.activegraph.ai/reference/errors/runtime-context-required-error/index.md) — the sibling ExecutionError for "the caller is using the primitive outside its intended context."
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why patch lifecycle violations are exceptions, not events.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# InvalidRuntimeConfiguration

A construction-time or method-call argument is invalid — conflicting kwargs, missing required argument, out-of-range value. This is the catch-all for argument-shape problems at the runtime's API surface.

Part of the three-page Configuration cluster, alongside [`invalid-argument-type`](https://docs.activegraph.ai/reference/errors/invalid-argument-type/index.md) (wrong-type values at construction) and [`incompatible-runtime-state`](https://docs.activegraph.ai/reference/errors/incompatible-runtime-state/index.md) (operations requiring specific runtime state).

Multi-inherits `ValueError` for back-compat — code that catches the builtin around runtime construction or method calls continues to work.

## Quick fix

The error message names the specific misconfiguration. Four sites currently raise this; the recovery is one of:

### Conflicting `persist_to` and `store`

```python
# Pass exactly one — they're alternative ways to attach storage:
rt = Runtime(graph, persist_to="/path/to/run.db")
# or:
rt = Runtime(graph, store=SQLiteEventStore("/path/to/run.db"))
```

`persist_to=` is shorthand for "open a SQLite store at this path." `store=` is the explicit form for any EventStore. Both at once would force the runtime to pick one or merge, and silent precedence rules would surface as bugs the first time an operator switched stores.

### `recent < 0` in `status()`

```python
rt.status(recent=20)   # last 20 events
rt.status(recent=0)    # totals only, no recent_events
```

For every event, read `rt.graph.events` directly rather than passing a large `recent`.

### `save_state(path=X)` when already attached to Y

```python
# To flush the attached store:
rt.save_state()

# To move the run to a different store:
# activegraph migrate --from sqlite:///<attached> --to sqlite:///<new>
```

`save_state` flushes whatever store is attached; it can't redirect mid-run. Migration is the right primitive for moving a run.

### `save_state()` without `path=` and no attached store

```python
# Either attach at construction:
rt = Runtime(graph, persist_to="/path/to/run.db")
rt.run_goal("...")
rt.save_state()

# Or pass path explicitly:
rt.save_state(path="/path/to/run.db")
```

For ephemeral runs that shouldn't persist, omit `save_state()` — the in-memory graph is the run's lifetime.

## How to diagnose

The summary line names the operation and the misconfiguration:

```text
InvalidRuntimeConfiguration: Runtime(...) was passed both
`persist_to=` and `store=`
```

From code:

```python
try:
    rt = Runtime(graph, persist_to="...", store=...)
except InvalidRuntimeConfiguration as e:
    print(str(e))   # full structured message with the fix
```

Each raise site has its own per-call-site recovery prose in the error body — the doc page groups them by shape because the recoveries don't share a generic pattern. Read the error message itself for the specific fix.

## When does this fire

At the operation that received the bad argument:

- `Runtime(...)` construction (conflicting kwargs)
- `rt.status(recent=N)` (out-of-range)
- `rt.save_state(path=X)` (path conflict with attached store)
- `rt.save_state()` (missing required path when no store attached)

The check runs synchronously at the call site, so the failure is where the misconfiguration is.

## Why the framework refuses to continue

Each of the four raise sites protects a different invariant:

- Conflicting `persist_to`/`store` would force a silent precedence choice that changes which store gets the events.
- Negative `recent` has no defined semantics.
- Path conflict in `save_state` would split the event log across two stores.
- Missing path with no store would silently default to a temp file, losing the run on process exit.

All four would corrupt the audit trail or silently lose data. The runtime refuses and asks for an explicit choice.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`invalid-argument-type`](https://docs.activegraph.ai/reference/errors/invalid-argument-type/index.md) — sibling for wrong-type values (e.g., PostgresEventStore target).
- [`incompatible-runtime-state`](https://docs.activegraph.ai/reference/errors/incompatible-runtime-state/index.md) — sibling for state invariants violated at operation time (fork on non-SQLite, attach_store when attached).
- `activegraph migrate` in the [CLI reference](https://docs.activegraph.ai/reference/errors/cli/index.md) — the primitive for the save_state-path-conflict recovery.

# InvalidStoreURL

A store URL is missing a scheme, has an unsupported scheme, or is otherwise malformed. The framework refuses to silently coerce the URL to a default scheme — that could open an unintended store and corrupt the audit trail.

The error message always shows the exact corrected URL, so the fix is usually a copy-paste.

Existing v0.8+ behavior preserved

`InvalidStoreURL` has been the public URL-parse error since v0.8. v1.0 re-parents it as `InvalidStoreURL(StorageError, ValueError)` — multi-inheritance preserves `except ValueError` while adding `except ActiveGraphError` and `except StorageError` as broader catch options. Existing code keeps working unchanged.

## Quick fix

```bash
# SQLite file (note the slash count — three for relative, four for absolute):
activegraph inspect sqlite:///relative/path.db
activegraph inspect sqlite:////absolute/path.db

# Postgres database:
activegraph inspect postgres://host/dbname
activegraph inspect postgres://user:pass@host:port/dbname
```

If the bare path was already a filesystem path (the most common mistake), the error message includes the exact `sqlite:///<that-path>` to copy.

## How to diagnose

The error message names the offending URL and the specific shape problem (no scheme, no path, no host, unsupported scheme). From Python:

```python
try:
    rt = Runtime.load(url, run_id=run_id)
except InvalidStoreURL as e:
    print(e.context["url"])  # the URL that was rejected
    print(str(e))            # the structured message with the fix
```

The four shapes the error distinguishes:

- **No scheme** — `/tmp/run.db` instead of `sqlite:////tmp/run.db`. Most common operator mistake.
- **SQLite URL with no path** — `sqlite:///`. Easy to hit if the path is computed from a missing env var.
- **Postgres URL with no host or database** — `postgres://`. Same shape, missing the hostname.
- **Unsupported scheme** — `mysql://`, `redis://`, etc. The framework supports `sqlite` and `postgres` (also accepted: `postgresql`); other backends are not in v1.0.

## When does this fire

At any operation that opens a store from a URL: `Runtime.load`, `Runtime(graph, persist_to=...)`, `activegraph inspect <url>`, `activegraph migrate --from <url>`, `open_store(url, run_id)`.

The check runs at parse time, before any connection attempt. A malformed URL never hits the driver — the parser catches it first.

## Why the framework refuses to continue

The framework addresses stores by URL everywhere (runtime, CLI, library) so the same string can be passed around without ambiguity about which driver opens it. A malformed URL is refused at parse time rather than silently coerced to a default scheme; guessing wrong would either corrupt the audit trail or open an unintended store. The operator who types `activegraph inspect run.db` should see "use `sqlite:///run.db`", not a confusing parse error from psycopg.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) — the canonical reference for store URLs in deployed runtimes.
- Custom backends — the `EventStore` protocol in `activegraph/store/base.py` is the extension point. Other databases are not in v1.0; v1.1+ may add Postgres-native fork primitives and other drivers.

# InvalidToolRegistration

A value passed to `Runtime(tools=[...])` isn't a `Tool` instance. The most common cause is forgetting the `@tool` decorator and passing the bare function. Fires at Runtime construction; the check fails fast before any behavior runs.

Multi-inherits `TypeError` for back-compat — code that catches `TypeError` around runtime construction continues to work.

## Quick fix

Decorate the function with `@tool`, and pass the decorated object:

```python
from activegraph.tools import tool

@tool(name="my_tool", input_schema=MyInput, output_schema=MyOutput)
def my_tool(args, ctx):
    ...

rt = Runtime(graph, tools=[my_tool])
```

If the function was already decorated, confirm you're passing the decorator's return value (the wrapped `Tool`), not the original function. A common mistake is passing the unwrapped name from a module that exports both:

```python
# Wrong: passing the bare function
from my_tools_module import my_tool_function   # not the @tool object
rt = Runtime(graph, tools=[my_tool_function])

# Right: passing the @tool-wrapped Tool instance
from my_tools_module import my_tool   # the @tool-decorated symbol
rt = Runtime(graph, tools=[my_tool])
```

## How to diagnose

The error names the offending value and its type:

```text
InvalidToolRegistration: tool registration value is not a Tool
instance (got function)

What failed:
  Runtime(tools=[...]) was given a value that isn't a Tool instance:
    value: <function bare_function at 0x...>
    type:  function
```

From code:

```python
try:
    rt = Runtime(graph, tools=[some_value])
except InvalidToolRegistration as e:
    print(e.value)              # the offending value
    print(type(e.value).__name__)
```

If the type is `function`, the most likely cause is the missing `@tool` decorator. If the type is something else (a `dict`, a class, a `Protocol`), something deeper is wrong with the value you're passing.

## When does this fire

At `Runtime(...)` construction, while building the tool registry. Each value in `tools=[...]` is checked individually; the first non-`Tool` value triggers the error and the rest aren't checked (the runtime construction can't proceed).

## Why the framework refuses to continue

The `Tool` wrapper carries the tool's declared name, input schema, output schema, timeout, and deterministic flag. Registering a bare function would skip those declarations and the runtime could not validate calls into the tool — schema-violating calls would reach the body and produce wrong-shape data. The check fails fast at construction.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`missing-tool-error`](https://docs.activegraph.ai/reference/errors/missing-tool-error/index.md) — fires when an `@llm_behavior` declares a tool name that isn't in the registry. Different shape: declared name missing vs. wrong-type value registered.
- [Writing tools](https://docs.activegraph.ai/reference/guides/writing-tools.md) — the canonical reference for the `@tool` decorator and the Tool wrapper.

# LLMBehaviorError

An `@llm_behavior` failed during a goal run. The provider returned something the framework can't use (couldn't parse, didn't match the declared schema, no fixture for the prompt), or the call itself failed (rate limit, network).

The error you see is a **carrier** — the runtime catches it inside the behavior dispatch and emits a `behavior.failed` event with the same `reason` and `payload_extras`. Downstream behaviors subscribed to `behavior.failed` can react. The exception only surfaces to your code if you're calling the behavior directly (rare; most code runs through `runtime.run_goal()` and reads the trace).

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for why behavior failures are events, not exceptions you have to catch.

## Quick fix by category

Group the reason codes by what you do about them — the framework distinguishes ~5 reasons but the recovery shapes cluster.

### Failures you can't fix in code: retry

`llm.network_error`, `llm.rate_limited`. The provider is briefly unavailable. The right pattern is a **retry behavior** that subscribes to `behavior.failed` and re-fires the work with backoff:

```python
@behavior(
    name="llm_retry",
    on=["behavior.failed"],
    where={
        "behavior": "your.behavior.name",
        "reason": ["llm.network_error", "llm.rate_limited"],
    },
)
def llm_retry(event, graph, ctx):
    attempt = (event.payload.get("attempt") or 0) + 1
    if attempt > 3:
        return
    graph.emit("retry.requested", {
        "for_event": event.payload["triggering_event_id"],
        "attempt": attempt,
    })
```

Retries are first-class graph citizens (CONTRACT v0.6 #13), not buried in framework middleware. You see every retry in the trace and can fork from any of them.

### Failures from your prompt: tighten the prompt

`llm.parse_error`, `llm.schema_violation`. The provider returned something, but it wasn't valid JSON or didn't match the behavior's `output_schema`. Tighten the prompt so the model produces the right shape:

- Lower `temperature` if available; reduce sampling variance.
- Add an explicit example of the expected JSON in the prompt.
- Tighten the Pydantic schema to reject ambiguous shapes earlier (e.g., `Literal[...]` instead of `str` for enum-shaped fields).

The full provider response is in the `behavior.failed` event's `payload_extras`:

```bash
activegraph inspect <store> --event <behavior.failed-id>
```

#### `llm.schema_violation` — "the model returned the schema, not an instance"

A specific shape of `llm.schema_violation` worth naming: the provider response is the JSON Schema definition itself, echoed back verbatim, rather than an instance that conforms to the schema. In `payload_extras["raw_response"]` the symptom is unmistakeable — top-level keys like `"properties"`, `"type": "object"`, and `"$defs"` appear where the schema's actual fields should appear:

```json
{
  "type": "object",
  "properties": {"claims": {"type": "array", ...}},
  "required": ["claims"]
}
```

instead of:

```json
{"claims": [{"speaker": "...", "statement": "...", "confidence": 0.9}]}
```

**Root cause:** the model treated the schema definition shown in the prompt as the requested output shape rather than as a contract its output should satisfy. Smaller / older / non-tool-trained models hit this more often.

**Fix in v1.0.1+ (automatic):** the runtime now assembles the system prompt with both the schema AND a synthesized example instance, plus explicit "return an INSTANCE, not the schema" language. Most schema-echo failures stop firing without any code change on your end.

**If it still fires in v1.0.1+:** the schema is too abstract for the auto-derived example to be useful (deeply nested generics, large `anyOf` unions, schemas with no `properties` at the top level). In those cases, override the prompt assembly with a `prompt_template=` that bakes in a real example from your domain:

```python
@llm_behavior(
    output_schema=ClaimList,
    prompt_template=(
        "{system}\n\n"
        "{view}\n\n"
        "Example response (this is the shape — substitute real values):\n"
        '{{"claims": [{{"speaker": "CFO", "statement": "Revenue grew 28%.", '
        '"confidence": 0.92}}]}}\n\n'
        "{event}\n\n"
        "{instruction}"
    ),
)
def extract_claims(event, graph, ctx, llm_output): ...
```

See `@llm_behavior`'s `prompt_template=` docstring for what each placeholder contains. If the model never recovers even with a real example, switch to a tool-trained model (the small models that echo schemas back rarely come from the tool-trained families).

### Failures from fork/replay: re-record

`llm.fixture_missing`. You're running against `RecordedLLMProvider` and the prompt's hash doesn't match any recorded response. Either the prompt changed since the fixtures were recorded or this is a new prompt that was never recorded.

```bash
# Re-record live, then run again against the recorded provider:
ANTHROPIC_API_KEY=... python your_script.py   # records as it runs
```

This is the same fix as [`ReplayDivergenceError`](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md)'s prompt_hash mismatch — the cache contract is the same on both sides.

## How to diagnose

The reason code is in the error's `.reason` attribute and in the emitted event's payload:

```python
try:
    rt.run_goal("...")
except LLMBehaviorError as e:
    print(e.reason)            # 'llm.parse_error', etc.
    print(e.payload_extras)    # full provider response, raw text, etc.
```

In the trace, look for the `behavior.failed` event the runtime emitted in your behalf:

```text
[behavior.failed]   evt_NNN  your.behavior  reason=llm.parse_error
```

The recovery flow always starts there. The error's `More:` link points at this page; the trace event points at the behavior that fired the carrier.

## When does this fire

Inside an `@llm_behavior` wrapper, after the provider returned (or raised) and before the behavior body's output is merged back into the graph. The framework catches it, emits `behavior.failed`, and moves on — the goal run doesn't halt. The exception only escapes to your code if you're invoking the behavior outside of `runtime.run_goal()` / `run_until_idle()`.

## Why the framework refuses to continue (the behavior, not the run)

The runtime treats LLM failures as graph-level events because LLM behavior is inherently flaky and "halt the entire goal on first provider hiccup" is the wrong default for long-running agentic work. The failure is captured in the audit trail with full context (reason, payload_extras, behavior name, triggering event); downstream code subscribes if it wants to react, ignores if it doesn't.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle and [`tool-error`](https://docs.activegraph.ai/reference/errors/tool-error/index.md) for the sibling on the tool side.

## What's related

- [`tool-error`](https://docs.activegraph.ai/reference/errors/tool-error/index.md) — if the failure came from the tool side rather than the LLM side, see here. The carrier shape is symmetric.
- [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) — for the registration-time variant: an LLM behavior declared a tool that isn't registered.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why `behavior.failed` is an event rather than an escaped exception.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# MissingOptionalDependency

A subsystem you tried to use requires an optional Python package that isn't installed. The framework keeps optional subsystems off the default install path so a minimal install stays small — each subsystem declares its dependency and fires this error when the subsystem is actually used.

**This is not a "your code is wrong" error.** Your installation is incomplete for the feature you're trying to use. The recovery is a `pip install`, not a code change.

## Quick fix

The error message names the missing package and the `pip install` line that fixes it:

```text
MissingOptionalDependency: PostgresEventStore requires the
'psycopg' Python package

How to fix:
  Install the optional dependency:
      pip install 'activegraph[postgres]'
```

## The optional-extras list

Three subsystems require optional packages, declared as installable extras in `pyproject.toml`:

| Extra                     | Provides                                  | Required package        |
| ------------------------- | ----------------------------------------- | ----------------------- |
| `activegraph[llm]`        | LLM behaviors (pack format requires this) | `anthropic`, `pydantic` |
| `activegraph[postgres]`   | `PostgresEventStore`                      | `psycopg>=3.1`          |
| `activegraph[prometheus]` | `PrometheusMetrics`                       | `prometheus_client`     |
| `activegraph[all]`        | All of the above                          | (everything)            |

A minimal install (just `pip install activegraph`) includes the core runtime, the SQLite store, and the in-memory observability backend. The optional extras keep their dependencies off the critical path for users who don't need them.

## How to diagnose

The error message names the package, the feature, and the extras group:

```python
try:
    store = PostgresEventStore("postgres://...", run_id=...)
except MissingOptionalDependency as e:
    print(e.package)   # 'psycopg'
    print(e.feature)   # 'PostgresEventStore'
    print(e.extras)    # 'postgres'
```

Multi-inherits `ImportError` for back-compat — code that catches `ImportError` around optional-dep imports continues to work.

## When does this fire

At the first construction or call into the subsystem. The check runs lazily, on the import inside the subsystem's lazy-import path:

- `PostgresEventStore(...)` first construction → `psycopg` import
- `PrometheusMetrics(...)` first construction → `prometheus_client` import
- `import activegraph.packs` (or any pack-related import) → `pydantic` import (pack format depends on Pydantic models)

A bare `pip install activegraph` followed by an `import activegraph` won't fire any of these — the error only surfaces when you actually use the subsystem.

## Why the framework refuses to continue

Each optional subsystem declares its dependency explicitly so the missing-dep error fires at the boundary, not later inside the subsystem with a confusing `AttributeError` or import error from a nested module. The structured error names the install line so recovery is a one-command operation.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) — the canonical reference for choosing extras in a deployed runtime.
- [`invalid-store-url-error`](https://docs.activegraph.ai/reference/errors/invalid-store-url-error/index.md) — fires before this error in the Postgres-without-psycopg path; the URL parses fine, then the store tries to import psycopg.

# MissingProviderError

You constructed a `Runtime` with `@llm_behavior`-decorated behaviors in the registry, but didn't pass an `llm_provider=` argument. The framework refuses to dispatch LLM behaviors without a provider — silently no-op'ing them would produce events that claim to depend on an LLM call that never happened.

Fires at Runtime **construction time**, not at first LLM call. The validation runs once when the runtime initializes.

## Quick fix

Pass an `llm_provider=` to the Runtime constructor:

```python
from activegraph import Runtime, Graph
from activegraph.llm.anthropic import AnthropicProvider

rt = Runtime(
    Graph(),
    llm_provider=AnthropicProvider(),
)
```

For offline replay or tests, use a recorded or scripted provider instead of a live one:

```python
from activegraph.llm.recorded import RecordedLLMProvider

rt = Runtime(
    Graph(),
    llm_provider=RecordedLLMProvider(fixture_dir="path/to/fixtures"),
)
```

## How to diagnose

The error names which `@llm_behavior` triggered the check:

```text
MissingProviderError: no LLM provider configured for @llm_behavior

What failed:
  An LLM-backed behavior ('diligence.researcher') was registered,
  but Runtime(...) was constructed without an `llm_provider=`
  argument.
```

From code:

```python
try:
    rt = Runtime(Graph(), behaviors=[...])
except MissingProviderError as e:
    print(e.behavior_name)   # 'diligence.researcher'
```

If `behavior_name` is unset, no specific behavior was named — the runtime found at least one `@llm_behavior` in its registry but didn't track which one in the check.

## When does this fire

At `Runtime(...)` construction, after the behavior registry is populated and before any goal runs. The check walks the registry once and fails fast if any `@llm_behavior` is present without a provider.

This is deliberate (CONTRACT v0.6 #21): silently no-op'ing the behavior at first LLM call would produce events that claim to depend on an LLM call that never happened. Failing at construction makes the misconfiguration immediately visible.

## Why the framework refuses to continue

`@llm_behavior` dispatches LLM calls through the provider attached to the runtime at construction. Failing loud at registration rather than at first invocation is the v0.6 contract — silently no-op'ing the behavior would corrupt the audit trail (behaviors fire and produce events; a missing provider would produce events that claim to depend on an LLM call that never happened).

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`missing-tool-error`](https://docs.activegraph.ai/reference/errors/missing-tool-error/index.md) — the symmetric registration-time error for tools. Fires when an `@llm_behavior` declares a tool name that isn't registered.
- [`missing-optional-dependency`](https://docs.activegraph.ai/reference/errors/missing-optional-dependency/index.md) — fires when `AnthropicProvider()` itself can't construct because the `anthropic` SDK isn't installed.
- [`llm-behavior-error`](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md) — the runtime-time carrier for when a configured provider's call fails.

# MissingToolError

An `@llm_behavior` declares a tool name that the runtime's tool registry doesn't have. The framework refuses to start the runtime rather than discover the missing tool at first LLM-call time — the behavior would either produce [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) on every invocation (cost without progress) or silently drop the call.

Fires at Runtime **construction time**. The check runs once when the behavior registers; misconfiguration fails before any LLM call burns budget.

## Quick fix

Register the missing tool with the runtime:

```python
from activegraph import Runtime, Graph
from activegraph.tools import tool

@tool(name="web_search", input_schema=..., output_schema=...)
def web_search(args, ctx):
    ...

rt = Runtime(
    Graph(),
    llm_provider=...,
    tools=[web_search],   # ← register here
)
```

If the tool comes from a pack, load the pack instead:

```python
rt.load_pack(my_pack, settings=...)
```

For pack-scoped tools, the canonical name (`pack_name.tool_name`) in the `@llm_behavior`'s `tools=[...]` ensures the right tool resolves even when multiple packs are loaded — see [`ambiguous-tool`](https://docs.activegraph.ai/reference/errors/ambiguous-tool-error/index.md).

## How to diagnose

The error names the missing tool, the behavior that declared it, and the tools that *are* registered:

```text
MissingToolError: no tool named 'web_search' is registered

What failed:
  @llm_behavior declares the tool 'web_search' on @llm_behavior
  'diligence.researcher', but the Runtime's tool registry has no
  tool by that name.
    registered tools: 'diligence.fetch_company_docs',
                      'diligence.fetch_filings'
```

From code:

```python
try:
    rt = Runtime(Graph(), llm_provider=..., tools=[...])
except MissingToolError as e:
    print(e.tool_name)        # 'web_search'
    print(e.behavior_name)    # 'diligence.researcher'
    print(e.registered)       # what's registered
```

Compare `registered` against the behavior's declared tools — the gap is your missing registration.

## When does this fire

At `Runtime(...)` construction, after the behavior registry and tool registry are both populated, before any goal runs. The check walks every `@llm_behavior`'s `tools=[...]` and verifies each name resolves in the tool registry.

The runtime-time variant — when the LLM asks for a tool the behavior didn't declare — is [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md), not this. The distinction is the same one called out on the `unknown-tool-error` page: registration-time mismatch (this page) vs LLM-call-time mismatch (other page).

## Why the framework refuses to continue

`@llm_behavior` validates its declared tools at startup so a misconfiguration fails before any LLM call burns budget. A missing tool at LLM-call time would either produce `UnknownToolError` on every invocation (cost without progress) or silently drop the call (which would corrupt the audit trail). Validation at registration prevents both.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) — the runtime-time variant. Fires at LLM-call time when the LLM asks for a tool not in the behavior's declared set. Read both if you're debugging LLM-tool plumbing.
- [`missing-provider-error`](https://docs.activegraph.ai/reference/errors/missing-provider-error/index.md) — the symmetric registration-time error for LLM providers.
- [`tool-not-found-error`](https://docs.activegraph.ai/reference/errors/tool-not-found-error/index.md) — fires at explicit `rt.get_tool(name)` lookups (operator-side, not registration-side).

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# NonSerializableEventError

A value in an event payload can't be JSON-encoded. The framework refuses to silently pickle or drop it at emit time, because either choice would corrupt the audit trail. The fix is to convert the value to a JSON primitive before emitting.

This is the *encode-time* sibling of [`CorruptedEventPayloadError`](https://docs.activegraph.ai/reference/errors/corrupted-event-payload-error/index.md), which fires at decode time on bytes that won't parse.

Existing v0.5+ behavior preserved

This error has been in the framework since v0.5 as a plain `TypeError` subclass. v1.0 re-parents it as `NonSerializableEventError(StorageError, TypeError)` — multi-inheritance preserves `except TypeError` while adding `except ActiveGraphError` and `except StorageError` as broader catch options. Existing code keeps working unchanged.

## Quick fix

The error message names the offending field path and its Python type. Convert the value to a JSON primitive at the emit site:

```python
# Pydantic model:
payload[field] = model.model_dump()

# dataclass:
import dataclasses
payload[field] = dataclasses.asdict(value)

# Custom object:
payload[field] = str(value)
```

If the type genuinely should serialize through the framework, add an adapter clause to `_default` in `activegraph/store/serde.py`. `Decimal` (→ string) and `datetime` (→ ISO 8601) are precedents.

## How to diagnose

The error message walks the payload to identify the first non-serializable field:

```text
What failed:
  While encoding an event payload for the store, the value at
  'nested.value' (type Custom) could not be JSON-encoded.
    underlying: object of type Custom is not JSON-serializable
```

`nested.value` is a dotted path into the payload dict; `Custom` is the Python class. From code catching the exception:

```python
try:
    graph.emit("my.event", payload)
except NonSerializableEventError as e:
    print(e.context["path"])  # 'nested.value'
    print(e.context["type"])  # 'Custom'
```

If the path is `<root>`, the top-level payload itself isn't a dict; if it ends with `[N]`, the offending value is a list element at index N.

## When does this fire

At `graph.emit()`, `graph.add_object()`, `graph.patch_object()`, and any other operation that constructs an event whose payload passes through `encode_payload`. The encoding runs synchronously before the event lands in the store, so the failure is at the call site that produced the bad payload — not at some later replay time.

This is deliberate (CONTRACT v0.5 #4): the failure is locatable. A behavior emitting a malformed payload sees the exception at its own emit call, with a stack trace pointing into the behavior body.

## Why the framework refuses to continue

The store persists events as JSON so the audit trail is human-inspectable. Custom Python types serialize through a strict adapter (`Decimal` → string, `datetime` → ISO 8601, `set` → sorted list); anything else is refused at emit time rather than silently pickled or dropped. A silently-dropped event would corrupt the replay contract; a pickled value would make the audit trail unreadable to anything but a Python process.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`CorruptedEventPayloadError`](https://docs.activegraph.ai/reference/errors/corrupted-event-payload-error/index.md) — the decode-time sibling. Fires when stored JSON bytes don't parse.
- `activegraph/store/serde.py` — the canonical adapter clauses. Add new types there if they should serialize framework-wide.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# PackConflictError

Two loaded packs claim the same canonical identifier — a behavior name, a tool name, an object type, or a relation type. The framework refuses the second `load_pack` rather than silently routing dispatch one way or the other. Pre-mutation: the failed load leaves the runtime exactly as it was.

The error message names the conflicting symbol, the pack that declared it first, and the pack that tried to declare it second.

## Quick fix

Three concrete actions, listed in order of "least invasive":

```python
# 1. Don't load both packs in the same runtime. Pick one.
rt.load_pack(diligence_pack, settings=DiligenceSettings(...))
# (skip rt.load_pack(research_pack, ...))

# 2. Rename one pack. Copy its source, change the Pack(name=...)
# declaration, re-install under the new name. The behaviors are
# then under a different canonical prefix.

# 3. If both behaviors need to fire, run them in separate Runtimes
# and emit events that chain across.
```

The error message names which kind of symbol conflicted (`behavior`, `tool`, `object_type`, `relation_type`) and the canonical name of the symbol. The `kind` and `canonical` keys in `.context` carry the same information programmatically.

## How to diagnose

The error names both pack owners — the existing one and the one attempting to register:

```text
PackConflictError: behavior name conflict: 'diligence.researcher'
declared by both pack 'diligence' and pack 'research'
```

From code:

```python
try:
    rt.load_pack(research_pack, settings=...)
except PackConflictError as e:
    print(e.context["kind"])               # 'behavior' | 'tool' | ...
    print(e.context["canonical"])          # the full qualified name
    print(e.context["owner_pack"])         # the pack that has it
    print(e.context["conflicting_pack"])   # the pack you tried to load
```

To list what each loaded pack actually provides:

```bash
activegraph inspect <store> --pack-version
```

## When does this fire

At `runtime.load_pack` time. The check runs pre-mutation: every declared symbol is checked against the runtime's existing registry before any state changes. A failed load means the runtime is unchanged — you can call `load_pack` again with a different pack without first cleaning up.

## Why the framework refuses to continue

Canonical names in the runtime registry are unique across loaded packs. Two packs claiming the same canonical name would silently route dispatch one way or the other depending on pack-load order; the runtime refuses the load instead so the conflict is visible and the operator decides which pack to keep.

The pre-mutation check is part of the contract — a `load_pack` that fails halfway and leaves the runtime in a mixed state would be harder to recover from than a refused load (CONTRACT v0.9 #6).

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`pack-version-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-version-conflict-error/index.md) — the sibling for the same-pack-two-versions case.
- [`pack-not-found-error`](https://docs.activegraph.ai/reference/errors/pack-not-found-error/index.md) — the registration-time sibling for "the pack doesn't exist at all."
- [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md) — the canonical pack format reference; useful when you need to rename a pack to resolve a conflict.

# PackNotFoundError

`activegraph.packs.load_by_name(name)` searched the `activegraph.packs` entry-point group and found no pack with that name. Either the pack isn't installed, the pack's entry-point group is wrong, or the name is a typo.

This is the third member of the pack-lifecycle cluster, along with [`pack-conflict`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md) (two packs claim the same symbol) and [`pack-version-conflict`](https://docs.activegraph.ai/reference/errors/pack-version-conflict-error/index.md) (same pack, different versions). A developer hitting one of the three might be one step away from hitting another.

## Quick fix

Confirm the pack is installed:

```bash
pip show <pack-distribution-name>
```

List currently-discovered packs:

```python
from activegraph.packs import discover
print([p.name for p in discover()])
```

If the pack is installed but not discovered, its `pyproject.toml` should declare an entry point under the `activegraph.packs` group:

```toml
[project.entry-points."activegraph.packs"]
your_pack = "your_pack_module:pack"
```

The `your_pack_module:pack` form names the Python module where the `Pack` instance lives. Common mistake: pointing the entry point at the module without naming the `pack` attribute.

## How to diagnose

The error message lists what's currently installed:

```text
PackNotFoundError: no installed pack named 'diligence'

What failed:
  activegraph.packs.load_by_name('diligence') searched the
  `activegraph.packs` entry-point group and found no pack with
  that name.
    installed: 'research', 'memory'
```

From code:

```python
try:
    p = load_by_name("diligence")
except PackNotFoundError as e:
    print(e.name)        # 'diligence'
    print(e.installed)   # ('research', 'memory')
```

If `installed` includes the name but `load_by_name` still fails, the entry point is misconfigured — check that the module imports without error and that the named attribute is a `Pack` instance.

## When does this fire

At `activegraph.packs.load_by_name(name)`. Other pack-loading paths (`rt.load_pack(pack_instance, ...)`) take a `Pack` directly and don't go through entry-point discovery, so they don't fire this error.

The CLI's `activegraph quickstart` and similar commands that discover packs by name will surface this if their named pack isn't installed; the error message points at the install command.

## Why the framework refuses to continue

Packs register via Python entry points so the framework can discover them without import-side-effect cost. A missing pack means either the install didn't happen, the entry-point declaration is wrong, or the name is a typo. The runtime refuses to guess — a pack name that doesn't resolve is operator-visible and the recovery is documented.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`pack-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md) — fires when two loaded packs declare the same canonical symbol.
- [`pack-version-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-version-conflict-error/index.md) — fires when the runtime already holds a different version of the same pack.
- [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md) — the canonical pack-format reference, including the `pyproject.toml` entry-point declaration shape.

# PackSchemaViolation

Data passed to `graph.add_object` or `graph.add_relation` doesn't match the schema declared by a loaded pack. The framework validates every add against the pack's declared types so downstream behaviors and pattern matches can rely on the shape — a malformed add would silently corrupt views and patterns that depend on the declared fields.

This is the lone runtime-shape leaf under [`PackError`](https://docs.activegraph.ai/concepts/failure-model/index.md). It fires at add-time, after pack load — distinct from [`pack-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md) and [`pack-version-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-version-conflict-error/index.md), which fire at pack-load time.

Multi-inherits `ValueError` for back-compat — code that catches the builtin around `graph.add_object` / `add_relation` continues to work.

## Quick fix by shape

The error message names the specific violation inline (the offending type, the offending data, and the pack that declared the schema). The three call shapes that produce this error each have a different fix:

### Object data doesn't match the declared schema

The most common case. Recovery is fixing the dict shape — usually a missing required field, a wrong field type, or an extra field a strict schema rejects.

```python
# Inspect the pack's declared schema for the object type:
from activegraph.packs.diligence import pack as p
schema = next(ot for ot in p.object_types if ot.name == "claim").schema
print(schema.model_json_schema())

# Then adjust the data to match.
graph.add_object("claim", {
    "text": "...",
    "confidence": 0.85,    # the missing field, now included
})
```

### Relation source isn't an allowed type

The pack declared which object types can sit on the source side of a relation type. The fix is either passing a source of an allowed type or declaring the new type on the relation:

```python
# Pass a source of an allowed type (the error names which are allowed).
graph.add_relation(claim_id, evidence_id, "supports")

# Or, if the constraint is wrong, relax the relation type's
# declaration in the pack:
RelationType(
    name="supports",
    source_types=["claim", "memo"],   # ← add the new source type
    target_types=["evidence"],
)
```

### Relation target isn't an allowed type

Symmetric with source — same fix on the other endpoint. The error names which types are allowed on the target side.

## Multi-pack note

The error message names the pack that declared the violated schema. In a multi-pack runtime, the constraint that fired might not come from your own pack — check the named pack's declaration, not just the one you're working in. The factory methods carry `pack_name` through the `context` dict for programmatic introspection:

```python
try:
    graph.add_object("claim", data)
except PackSchemaViolation as e:
    print(e.context["pack"])         # which pack declared the schema
    print(e.context["object_type"])  # or relation_type, with "side"
```

## How to diagnose

The error names the offending type and the validation detail:

```text
PackSchemaViolation: object_type 'claim': schema validation failed

What failed:
  `graph.add_object('claim', data=...)` was rejected because the
  data did not match the pack's declared schema for 'claim'
  (declared by pack 'diligence').

  Validation error:
    1 validation error for Claim
    confidence: Field required ...
```

From code:

```python
try:
    graph.add_object("claim", data)
except PackSchemaViolation as e:
    print(e.context["object_type"])         # 'claim'
    print(e.context["pack"])                # 'diligence'
    print(e.context["validation_error"])    # the full Pydantic error
```

For relation violations, the `context` includes `relation_type`, the offending type, the allowed list, and `side` ("source" or "target").

## When does this fire

At `graph.add_object` and `graph.add_relation`, after a pack with a declared schema for the affected type has loaded. The check is post-load: objects created before the pack was loaded aren't retroactively validated (CONTRACT v0.9 #5 — the load-order asymmetry).

The check runs synchronously at the add site. The mutation never lands if validation fails — the graph is unchanged after the exception fires.

## Why the framework refuses to continue

Packs declare object schemas to constrain what shape of data can flow into objects of that type. The runtime validates every `add_object` against the schema so downstream behaviors can rely on the shape — a malformed add would silently corrupt views and pattern matches that depend on the declared fields. Relation type constraints serve the same purpose on the structural side: out-of-spec relations would cause pattern matches to silently miss or misfire.

Refusing the add is the framework's way of asking the caller to fix the data or the schema, not to discover the wrong-shape data later when a downstream behavior fires on it.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`pack-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md) — fires at pack load time when two packs declare the same canonical symbol; distinct from this page's runtime-shape errors.
- [`pack-version-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-version-conflict-error/index.md) — load-time sibling for same-pack-two-versions.
- [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md) — the canonical reference for declaring `ObjectType` and `RelationType` schemas in a pack.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# PackVersionConflictError

The runtime already holds a version of the pack you're trying to load, and the versions don't match. A runtime can hold at most one version of any pack — two versions would compete for the same canonical names in the registry, and `pack.behavior_name` would resolve differently depending on dispatch order.

Pre-mutation: the failed load leaves the runtime exactly as it was. The currently-loaded version stays.

## Quick fix

Pick one version:

```python
# Keep the loaded version — don't load the new one. The runtime
# stays where it is.

# Or, swap versions by constructing a fresh Runtime:
rt = Runtime(Graph(), llm_provider=...)
rt.load_pack(new_version_of_pack, settings=...)
```

`load_pack` doesn't support in-place version swapping. If you need to migrate state, the canonical path is `activegraph migrate` to a fresh store, then load the new pack version against it.

If you genuinely need both versions in the same process — to compare behaviors side-by-side, for instance — copy one pack and rename it. A copy with `Pack(name="research_v2", ...)` has a distinct canonical namespace from the original `research`, and both can load together. Same workaround as [`pack-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md).

## How to diagnose

The error names both versions:

```text
PackVersionConflictError: pack 'diligence': already loaded version
'0.1.0', attempted to load version '0.2.0'
```

From code:

```python
try:
    rt.load_pack(new_pack, settings=...)
except PackVersionConflictError as e:
    print(e.context["pack"])               # 'diligence'
    print(e.context["loaded_version"])     # '0.1.0'
    print(e.context["attempted_version"])  # '0.2.0'
```

## When does this fire

At `runtime.load_pack` time, before the second pack's symbols are registered. The check happens early — the version mismatch is detected as soon as the runtime sees a pack with the same name as one already loaded. Idempotency: loading the *same* version twice is a no-op (CONTRACT v0.9 #6); only a version change triggers this error.

## Why the framework refuses to continue

A runtime can hold at most one version of any pack. Two versions would compete for the same canonical names in the registry — `pack.behavior_name` would resolve differently depending on dispatch order, which would silently corrupt the audit trail (a behavior fire recorded with one version's prompt hash could replay against the other version's prompt and silently produce different output).

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle and CONTRACT v0.9 #6 for the idempotency-by- `(name, version)` rule that makes this check necessary.

## What's related

- [`pack-conflict-error`](https://docs.activegraph.ai/reference/errors/pack-conflict-error/index.md) — the sibling for the different-packs-same-symbol case (two different packs both declaring `diligence.researcher`, for example).
- [`pack-not-found-error`](https://docs.activegraph.ai/reference/errors/pack-not-found-error/index.md) — the registration-time sibling for "the pack doesn't exist at all."
- `activegraph migrate` in the [CLI reference](https://docs.activegraph.ai/reference/errors/cli/index.md) — the canonical path when you need to migrate state across versions.

# ReplayDivergenceError

The replay or fork you just ran produced an event stream that doesn't match the recorded log. The framework refuses to silently use a stale result — that would break the audit trail the cache is designed to preserve. The fix depends on which kind of divergence fired; the error message itself names the kind in its summary line and the recovery command is below.

## Quick fix by kind

### Prompt hash mismatch

The recorded prompt hash for an `llm.requested` event doesn't match what the live re-run produced. Something changed in an LLM behavior: its code, a prompt template, a system message, or a tool's input arguments.

```bash
# If the change was intentional (you edited a behavior or template),
# re-record from the divergence point:
activegraph fork <parent-run> --at-event <event-id> --record

# If the change was unintentional, diff your code against the
# recorded run's pack version and revert:
activegraph inspect <parent-run> --pack-version
```

To see the full recorded prompt for the offending event:

```bash
activegraph inspect <parent-run> --event <event-id>
```

The recorded prompt hash and the live hash are both in the error message body so you can confirm at a glance whether the prompt is the same modulo whitespace or genuinely different.

### Event-type mismatch

At the pinned event id, the live re-run produced a different event type than recorded. The behavior graph took a different branch — usually because a behavior's `where` filter, a pattern subscription, or a conditional `graph.emit` changed since the recording.

```bash
# Identify the behavior that produced the recorded event:
activegraph inspect <parent-run> --event <event-id>

# Then diff that behavior against your current source. If the change
# was intentional, fork with --record to refresh the recording.
```

### Length mismatch (short live or extra live)

Either the live re-run finished before the recorded log did (a behavior that used to fire no longer fires), or the live re-run produced an event the recorded log doesn't have (a new behavior was added, or a pattern subscription was loosened).

```bash
# Compare what's currently registered against what fired in the
# recorded run:
activegraph inspect <parent-run> --behaviors

# Re-record from the divergence point if the new behavior set is
# intentional:
activegraph fork <parent-run> --at-event <event-id> --record
```

## How to diagnose deeper

If the quick fixes above haven't isolated the cause, three diagnostic commands cover most cases:

```bash
# Pack versions loaded in the recorded run (and their prompt content
# hashes — these are what the replay cache compares against):
activegraph inspect <parent-run> --pack-version

# Full payload of the offending event:
activegraph inspect <parent-run> --event <event-id>

# Tail of events near the divergence point to see what came before:
activegraph inspect <parent-run> --tail 50
```

The error message's `.context` dict carries the event id, the kind discriminator, and both expected and actual values. Code catching the exception can read these directly:

```python
try:
    rt = Runtime.load(url, run_id=parent_run, replay_strict=True)
except ReplayDivergenceError as e:
    print(e.event_id, e.kind, e.expected, e.actual)
```

`e.kind` is one of `"prompt_hash_mismatch"`, `"type_mismatch"`, or `"length_mismatch"` — the same discriminator the quick-fix sections above are organized around.

## When does this fire

Replay reconstructs a run by re-applying its event log. The framework offers two modes:

- **Permissive replay** (`replay_strict=False`, the default for `Runtime.load`). Events are re-emitted from the log; the runtime trusts the recording. `ReplayDivergenceError` never fires here.
- **Strict replay** (`replay_strict=True`). Behaviors re-fire against the recorded seed and the framework compares the live event stream against the recorded one. Any drift fires this error pinned to the first divergent event id.

The fork primitive runs strict by default — a fork's value is its shared lineage with its parent, and shared lineage requires the early events to replay identically. The error fires most often during fork operations after behavior or prompt edits, which is the workflow it was designed for.

## Why the framework refuses to continue

The replay cache keys on the full prompt hash (for LLM behaviors) and on the event type stream (for everything else). A cache hit that silently substituted a different recorded response under a different prompt would corrupt the audit trail — the trace would claim a specific LLM call produced a specific response when the recorded response came from different input. Replay determinism is a property the cache exists to preserve, not a constraint the cache fights against.

This is the same invariant-protection stance the framework takes everywhere: see [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- **[failure-model](https://docs.activegraph.ai/concepts/failure-model/index.md)** — why the framework prefers exceptions over silent substitutions in cases like this.
- **[forking](https://docs.activegraph.ai/concepts/forking/index.md)** — the operation ReplayDivergenceError fires most often during.
- **[replay](https://docs.activegraph.ai/concepts/replay/index.md)** — the broader replay model and the two modes (permissive vs strict).
- **`activegraph fork --record`** in the [CLI reference](https://docs.activegraph.ai/reference/errors/cli/index.md) — the canonical recovery command.

# RuntimeContextRequiredError

`ctx.propose_object` (or another `ctx` method that requires the runtime) was called from a behavior whose context isn't bound to a runtime. This usually means a developer is testing a behavior in isolation, or has lifted code out of a behavior into a helper that gets called from somewhere a runtime-bound ctx isn't available.

This is an **exception**, not an event, because the caller can fix it at the call site — see [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the events-not-exceptions principle. A behavior calling a runtime-only method without a runtime is a misuse the framework catches; it's not a non-fatal stop the audit trail should record and continue past.

## Quick fix

Drive the behavior through a real Runtime — the `ctx` is built from the runtime's context factory and bound automatically:

```python
from activegraph import Runtime, Graph

rt = Runtime(Graph(), llm_provider=...)
rt.run_goal("...")   # behaviors fire with ctx bound to rt
```

In a test, the canonical pattern is to construct a real Runtime rather than mock the `ctx` directly:

```python
def test_my_behavior():
    rt = Runtime(Graph(), llm_provider=RecordedLLMProvider(...))
    rt.run_goal("trigger event")
    # Assert on rt.graph state, behavior.failed events, etc.
```

If the test really needs to bypass the runtime — for instance to test a behavior's pure logic without firing it through a goal — mock the policy gate (`ctx.propose_object` returns a fake id, or the behavior path that calls it is mocked out) so `propose_object` isn't reached at all. Don't mock the ctx and call a real runtime-bound method on it.

## How to diagnose

The error names the offending ctx method:

```text
RuntimeContextRequiredError: ctx.propose_object requires a
runtime-bound context

What failed:
  A behavior called ctx.propose_object on a BehaviorGraph context
  that was constructed without a Runtime — likely a test fixture
  that stubbed the graph without going through Runtime.
```

From code:

```python
try:
    behavior(event, graph, ctx)
except RuntimeContextRequiredError as e:
    print(e.method)   # 'ctx.propose_object'
```

If the error fires in test code, the most likely cause is a stub graph or mocked ctx; check the test fixture's setup.

If it fires in production code, the most likely cause is a helper function refactored out of a behavior body that's now being called from a place where the ctx isn't a runtime-bound one — for instance, a setup hook that runs before `run_goal` or a CLI command that builds objects directly.

## When does this fire

At any `ctx`-method call that requires the runtime (currently `propose_object`; v1.1 may add others). The check runs at the top of the method, before any side effect.

The runtime constructs a runtime-bound `ctx` automatically for every behavior fire. The error fires only when a behavior body runs through a code path the runtime didn't initiate — test fixtures, isolated invocations, helper functions called from non-behavior contexts.

## Why the framework refuses to continue

`ctx.propose_object` writes to the runtime's pending-approvals queue and emits an `approval.proposed` event. Without a runtime, neither side effect can happen, and a no-op would silently break the policy gate the behavior depends on — the audit trail would show no proposal and the operator would have no record to approve.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for when the framework prefers exceptions over silent no-ops. This is the canonical example: a misuse the caller can fix at the call site, where silently doing nothing would corrupt the audit trail.

## What's related

- [`invalid-patch-lifecycle-state`](https://docs.activegraph.ai/reference/errors/invalid-patch-lifecycle-state/index.md) — the sibling ExecutionError for "the caller misused a runtime primitive." Both fire mid-execution, both are caller-correctable.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why these are exceptions, not events.
- [`concepts/policies`](https://docs.activegraph.ai/concepts/policies/index.md) — the approval lifecycle that `ctx.propose_object` participates in.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# SchemaVersionMismatch

The store you opened was written by a different activegraph build. The runtime refuses to read a store with a different `schema_version` rather than risk silent data loss — a newer framework might interpret columns differently than the writer did, and an older framework might drop fields it doesn't recognize.

This fires on store open. The store file is intact; it's just schema-incompatible with the activegraph version you're running.

## Quick fix

One of three actions:

```bash
# 1. Install the activegraph version that wrote the store. The error
#    message names the recorded schema_version; check CHANGELOG.md
#    for which version shipped it.

# 2. Migrate runs from the old store to a fresh store written by
#    this build. The destination has the current schema.
activegraph migrate --from sqlite:///old.db --to sqlite:///new.db

# 3. If the store is empty or expendable, delete and start fresh.
rm old.db
```

The error message includes both the found and expected versions plus the activegraph version in the body, so you can match the schema_version against the changelog without separate inspection.

## How to diagnose

The error message names both versions and the driver:

```text
What failed:
  The SQLite store records schema_version='99' in its meta table,
  but activegraph 0.9.1 expects schema_version='1'.
```

From Python:

```python
try:
    rt = Runtime.load(url, run_id=run_id)
except SchemaVersionMismatch as e:
    print(e.context["found_version"])     # the store's schema_version
    print(e.context["expected_version"])  # what this build expects
    print(e.context["activegraph_version"])
    print(e.context["driver"])            # "sqlite" | "postgres"
```

The store file itself is readable with the schema_version's source build — no data is lost. Migration moves runs across schema versions without modifying the source.

## When does this fire

Whenever a store opens via `Runtime.load`, `activegraph inspect`, `activegraph migrate` (source side), or any other operation that calls `_ensure_schema`. The check runs once per store-open, against the meta table's recorded `schema_version`.

A fresh store auto-populates `schema_version` from the current build, so this error never fires on a store this build created. It fires only when reading a store that another build wrote.

## Why the framework refuses to continue

The store file format evolves with the framework. Mismatched schemas could mean column types changed, new required fields were added, or old fields were dropped — silently reading the store would either produce wrong-shape Python objects or drop fields the writer considered important. Either way, the audit trail would be corrupted in a way the operator wouldn't notice until later.

The framework refuses the open and asks the operator to choose: upgrade, migrate, or discard. All three are explicit, all three preserve the audit trail.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`CorruptedEventPayloadError`](https://docs.activegraph.ai/reference/errors/corrupted-event-payload-error/index.md) — fires when a row's payload bytes don't parse, distinct from schema mismatch.
- `activegraph migrate` in the [CLI reference](https://docs.activegraph.ai/reference/errors/cli/index.md) — the canonical recovery path when you can't or don't want to switch activegraph versions.
- [Migration from v0.7](https://docs.activegraph.ai/cookbook/migration-from-v0-7/index.md) — the cross-version migration runbook when schema_version differs across milestones.

# ToolError

A tool invocation failed mid-execution. The tool body raised, timed out, hit a network error, returned data that didn't match its declared output schema, or hit the runtime's tool budget.

Like [`LLMBehaviorError`](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md), this is a **carrier**. The runtime catches it inside the tool dispatch, emits a `tool.responded` event with `error.reason` set, and the calling behavior reads the structured failure from that event. The exception only escapes to your code if you're invoking the tool outside of an LLM behavior's tool-loop.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## Quick fix by category

The error message names the specific `reason` (e.g., `tool.timeout`) and gives per-reason recovery prose inline. The doc page groups those reasons by what you do about them.

### Failures you can't fix in code: retry

`tool.timeout`, `tool.network_error`. The tool reached an external system that was briefly slow or unavailable. Write a retry behavior on `behavior.failed`:

```python
@behavior(
    name="tool_retry",
    on=["behavior.failed"],
    where={
        "reason": ["tool.timeout", "tool.network_error"],
    },
)
def tool_retry(event, graph, ctx):
    attempt = (event.payload.get("attempt") or 0) + 1
    if attempt > 3:
        return
    graph.emit("retry.requested", {
        "for_event": event.payload["triggering_event_id"],
        "attempt": attempt,
    })
```

Retries are graph citizens (CONTRACT v0.6 #13), not framework middleware. Every retry appears in the trace.

### Failures from your inputs: tighten the call site

`tool.invalid_input`. The LLM (or your code) called the tool with arguments that didn't match its declared Pydantic input schema. Two fixes:

- If the LLM is producing the args, tighten the prompt with an example of correctly-shaped input.
- If your schema is too strict, relax the relevant field (e.g., a required field that's actually optional in practice).

### Failures from the tool author: fix the tool

`tool.invalid_output`. The tool body returned a value that didn't match its declared Pydantic output schema. This is a bug in the tool, not in the caller — fix the tool body to return data matching the schema, or relax the schema if the actual return shape is right.

`tool.execution_error` is the catch-all for unexpected exceptions inside the tool body. The original exception type and traceback are preserved in `payload_extras` for diagnosis:

```python
try:
    rt.run_goal("...")
except ToolError as e:
    print(e.reason)             # 'tool.execution_error'
    print(e.payload_extras)     # {'exception_type': '...', ...}
```

### Failures from fork/replay: re-record

`tool.fixture_missing`. You're running against `RecordedTool` and the live arguments don't match any recorded invocation. Re-record from a clean run with the current arguments, same flow as the LLM case.

### Failures from budget: recalibrate

`budget.tool_calls_exhausted`, `budget.cost_exhausted`. The runtime hit its declared limit before the behavior finished. Either raise the budget on the next run or accept the partial result the trace records.

## How to diagnose

The reason code is on the exception and in the `tool.responded` event the runtime emits in your behalf:

```text
[tool.responded]    evt_NNN  your.behavior  tool=your_tool error=tool.timeout
```

The full `payload_extras` includes whatever the tool body recorded before failing (input args, partial output, original exception trace for execution errors). Inspect it directly:

```bash
activegraph inspect <store> --event <tool.responded-id>
```

## When does this fire

Inside an `@tool`-decorated body, or in the runtime's tool-loop when validating inputs/outputs against the declared schemas. The runtime catches and emits a `tool.responded` event with the error, then the calling LLM behavior continues — usually the LLM sees the failure in the conversation and decides what to do next, or the calling behavior body itself reads the error and branches.

## Why the framework refuses to continue (the tool, not the run)

Tools that reach external systems will fail intermittently. Halting the goal run on first tool failure would make the framework brittle in exactly the case it was designed for (long-running, multi-LLM, multi-tool agentic work). The structured failure in the event log is the right surface: the LLM can see it, retry behaviors can react, the audit trail records what happened.

## What's related

- [`LLMBehaviorError`](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md) — the sibling on the LLM side of an LLM behavior's call/response loop. The carrier shape is symmetric.
- [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) — fires when the LLM asks for a tool the behavior didn't declare. Distinct from ToolError, which fires when a declared tool fails to execute.
- [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why `tool.responded` carries failures as structured events rather than escaping exceptions.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# ToolNotFoundError

`rt.get_tool(name)` couldn't resolve the name to a registered tool. The framework refuses to fall back to a fuzzy match because the tool's input/output schema is part of the contract — invoking the wrong tool with the right name would silently produce wrong-shape data.

Multi-inherits `LookupError` for back-compat — code that does `except LookupError` around tool lookups continues to work.

The name resolution rule (canonical strict, lookup lenient) is defined on [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) and applies symmetrically to tool names. See also [`ambiguous-tool-error`](https://docs.activegraph.ai/reference/errors/ambiguous-tool-error/index.md).

## Quick fix

Check the spelling and the pack:

```python
# Tools are exposed alongside behaviors:
status = rt.status(recent=0)
# Inspect the runtime's tool registry directly:
print(list(rt.tool_registry.keys()))
```

Common causes:

- **The tool's pack isn't loaded.** Load it:

  ```python
  rt.load_pack(my_pack, settings=...)
  ```

- **The `@tool` decorator hasn't run.** Tools register at module-import time. Import the module before constructing the Runtime, or pass the tool explicitly via `Runtime(tools=[...])`.

- **You used a short name when canonical was needed.** If the tool comes from a pack, use `pack_name.tool_name`. See [`ambiguous-behavior-error`](https://docs.activegraph.ai/reference/errors/ambiguous-behavior-error/index.md) for the resolution rule.

## How to diagnose

The error names the offending name and the registered tools:

```text
ToolNotFoundError: no tool named 'fetch_pdfs' is loaded

What failed:
  rt.get_tool('fetch_pdfs') could not resolve the name to a
  registered tool.
    registered: 'diligence.fetch_company_docs',
                'diligence.fetch_filings'
```

From code:

```python
try:
    t = rt.get_tool("fetch_pdfs")
except ToolNotFoundError as e:
    print(e.name)         # 'fetch_pdfs'
    print(e.registered)   # registered tool names
```

If `registered` is empty, no tools are registered at all — likely an import ordering problem.

## When does this fire

At `rt.get_tool(name)` and equivalent lookups. The runtime's LLM-tool-loop has its own path that uses the canonical name resolved at `@llm_behavior` registration; if the LLM asks for a tool the behavior didn't declare, you get [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) instead — different error, different recovery.

## Why the framework refuses to continue

Tools are addressable by their declared name. The runtime refuses to fall back to a fuzzy match because the tool's input/output schema is part of the contract — a fuzzy match could invoke a tool with a different schema than the caller expected, silently producing wrong-shape data. The Pydantic validation that runs on every tool call would fail with a different (less helpful) error, or worse, would pass on coincidentally-compatible shapes.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`ambiguous-tool-error`](https://docs.activegraph.ai/reference/errors/ambiguous-tool-error/index.md) — fires when a short name resolves to two different tools across loaded packs.
- [`behavior-not-found-error`](https://docs.activegraph.ai/reference/errors/behavior-not-found-error/index.md) — the symmetric case for behavior lookups.
- [`missing-tool-error`](https://docs.activegraph.ai/reference/errors/missing-tool-error/index.md) — the construction-time variant for when an `@llm_behavior` declares a tool that isn't registered.
- [`unknown-tool-error`](https://docs.activegraph.ai/reference/errors/unknown-tool-error/index.md) — the runtime variant for when an LLM asks for an undeclared tool inside a declared behavior.

# UnknownToolError

**This is not a tool execution failure.** An LLM response asked to invoke a tool whose name isn't declared on the calling `@llm_behavior`'s `tools=[...]` argument. The runtime refuses any tool call that isn't in the declared set — an undeclared tool could perform side effects the behavior's audit trail doesn't account for, which would break replay determinism.

If you're looking for the error that fires when a declared tool fails to execute, see [`tool-error`](https://docs.activegraph.ai/reference/errors/tool-error/index.md) instead.

The runtime catches `UnknownToolError` inside the tool-loop and surfaces it as `behavior.failed reason="tool.unknown_tool"`.

## Quick fix

Two paths depending on whether the LLM should be calling the tool:

### The LLM should be calling this tool — declare it

Add the missing tool to the behavior's `tools=[...]`:

```python
@llm_behavior(
    name="diligence.researcher",
    tools=[
        fetch_company_docs,
        fetch_filings,
        web_search,          # ← add this
    ],
    ...
)
```

Confirm the tool itself is registered with `@tool` (or passed via `Runtime(tools=[...])`). If the tool's name doesn't appear in `activegraph inspect <run> --behaviors`, the registration didn't take — usually a missing import.

### The LLM shouldn't be calling this tool — tighten the prompt

If the model is consistently asking for an undeclared tool, the prompt is implying capabilities the behavior doesn't have. Be explicit about which tools are available:

```python
description=(
    "Research a company using ONLY the declared tools. "
    "If a question requires capabilities outside this set, return a "
    "structured 'unanswered' response and let downstream behaviors "
    "handle it. Do not invent tool calls."
)
```

## How to diagnose

The error message names three things — the tool requested, the behavior that triggered the call, and the tools the behavior declared:

```text
What failed:
  An LLM response asked to invoke a tool that the calling behavior
  did not declare.
    tool requested: 'web_search'
    declared on behavior 'diligence.researcher':
      'diligence.fetch_company_docs', 'diligence.fetch_filings'
```

From code:

```python
try:
    rt.run_goal("...")
except UnknownToolError as e:
    print(e.tool_name)       # 'web_search'
    print(e.behavior_name)   # 'diligence.researcher'
    print(e.declared_tools)  # ('diligence.fetch_company_docs', ...)
```

The same fields appear in the `behavior.failed` event's `payload_extras` for downstream code that subscribes to the event.

## Registration-time vs runtime — the distinction matters

Two related errors gate the tool surface:

- [`MissingToolError`](https://docs.activegraph.ai/reference/errors/missing-tool-error/index.md) fires at **runtime startup**. An `@llm_behavior` declares a tool name that the Runtime's tool registry doesn't have. The check runs once at Runtime construction so the misconfiguration fails before any LLM call burns budget.
- **`UnknownToolError` (this page)** fires at **LLM-call time**. The declared tools are all registered, but the LLM asked for one that's not in the declared set for *this specific behavior*.

Read the error message's `tool requested` field carefully: if it names a tool you intend the behavior to call, you've hit `MissingToolError` shape — the tool isn't registered. If it names a tool you don't expect the LLM to ever call, the prompt is the problem.

## When does this fire

During an LLM behavior's tool-loop, when the provider returns a tool call whose `name` isn't in the behavior's declared `tools`. The runtime emits `behavior.failed reason="tool.unknown_tool"` and the goal continues — other behaviors keep firing, the LLM behavior itself doesn't retry automatically (a retry behavior subscribing to `behavior.failed` is the canonical pattern; see [`tool-error`](https://docs.activegraph.ai/reference/errors/tool-error/index.md)).

## Why the framework refuses to continue

`@llm_behavior` declares the exact set of tools the wrapped behavior is allowed to invoke. The runtime refuses any other tool call rather than silently execute it — an undeclared tool could perform side effects the behavior's audit trail doesn't account for, and re-running the behavior in replay would either produce a different event stream (if the undeclared tool wasn't called) or invoke an effect the recorded run didn't have.

See [`failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) for the broader principle.

## What's related

- [`tool-error`](https://docs.activegraph.ai/reference/errors/tool-error/index.md) — fires when a declared tool fails to execute. The distinction is on this page above; bookmark both if you're debugging LLM-driven tool calls.
- [`missing-tool-error`](https://docs.activegraph.ai/reference/errors/missing-tool-error/index.md) — the registration-time variant. Fires at Runtime construction when an `@llm_behavior` declares a tool that isn't registered.
- [`llm-behavior-error`](https://docs.activegraph.ai/reference/errors/llm-behavior-error/index.md) — the LLM-side failure carrier; covers the parse/schema/network reasons.

______________________________________________________________________

See [Observing failures in caller code](https://docs.activegraph.ai/concepts/failure-model/#observing-failures-in-caller-code) for `Runtime.errors` and the `BehaviorFailure` shape.

# UnsupportedPatternError

The pattern you passed to `@behavior(pattern=...)` uses syntax outside the v0.7 Cypher subset. The parser refused it at behavior-registration time — long before any match runs — so the misconfiguration surfaces before the runtime starts firing behaviors.

The error message names which feature the parser refused (in the summary line) and the per-feature workaround (in the body). The two shapes are **refused features** (a recognized Cypher feature the subset deliberately excludes) and **syntax errors** (the pattern didn't parse at all).

## Quick fix by kind

### Refused feature

The error message's `What failed:` section names the feature (`OR`, `OPTIONAL MATCH`, `variable-length path syntax (-[*]-)`, etc.), and the `How to fix:` section gives the specific workaround for that feature.

The general pattern: **when the subset refuses a feature, the workaround is usually to register multiple behaviors instead of one clever pattern.**

- `OR` in WHERE → register two behaviors, one per branch.
- `OPTIONAL MATCH` → register a second behavior whose pattern is the optional sub-pattern.
- Variable-length paths → register N behaviors, one per length.
- `CREATE` / `MERGE` / `SET` / `DELETE` / `DETACH` → patterns don't mutate; do the mutation in the behavior body.
- `RETURN` / `WITH` / extra `MATCH` → patterns observe; compose pipelines as chained behaviors via emitted events.

The error message itself has the specific recipe for the feature you hit. The full set is in [`concepts/patterns.md`](https://docs.activegraph.ai/concepts/patterns/#what-the-subset-deliberately-refuses).

### Syntax error

The parser couldn't tokenize or parse the pattern. The error message's `What failed:` includes the offending token and its position, and `How to fix:` points at the documented grammar.

```text
UnsupportedPatternError: pattern does not parse: unexpected character at position 17

What failed:
  While parsing the pattern: unexpected character at position 17.
    at: '@'
```

Common causes:

- **Missing relationship type.** Use `-[:type]->`, not `-[]->`. Relationships always require an explicit type in the v0.7 subset.
- **Missing arrow direction.** Use `-[:type]->` or `<-[:type]-`; undirected relationships are refused.
- **Unbalanced brackets.** `(a:type {prop: value` without a closing brace produces a parse error at the next token.
- **Reserved keyword as an identifier.** The forbidden-keywords list is enforced at tokenization; using one as a variable name fires this error.

## How to diagnose

If the error message's `at:` field doesn't make the cause obvious, print the pattern around the position:

```python
import activegraph
pattern = "your pattern here"
try:
    from activegraph.runtime.patterns import parse
    parse(pattern)
except activegraph.UnsupportedPatternError as e:
    print(f"pattern: {pattern}")
    print(f"position: {e.at!r}")
    print(f"context: {e.context}")
```

`e.at` is the offending token; `e.context` carries the same information the error message body includes.

## When does this fire

At behavior registration. The parser validates the pattern when `@behavior(pattern=...)` (or `@llm_behavior(pattern=...)`) runs at import time. Once a behavior is registered, its pattern is locked — the runtime never re-parses, so the error fires before the runtime loop ever starts.

This is a deliberate v0.7 choice (CONTRACT v0.7 #9): patterns are compiled once, at registration. A pattern that takes too long to parse, or that uses unsupported syntax, fails the developer's import rather than the production run.

## Why the framework refuses

The subset is small on purpose. A fuzzy superset of Cypher would let patterns *appear* to match input they did not actually match. Two patterns differing only in a refused feature would silently produce different match sets at runtime, and the audit trail would not record which pattern actually matched what. The subset is the contract that makes pattern subscriptions trustworthy.

For the broader principle, see [`concepts/failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md). For the locked subset and the rationale per feature, see [`concepts/patterns`](https://docs.activegraph.ai/concepts/patterns/index.md).

## What's related

- [`concepts/patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) — the canonical reference for what the subset supports and what it refuses, with workaround patterns per refusal.
- [`concepts/failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md) — why the framework prefers "refuse loudly at registration" to "match fuzzily at runtime."

# API reference

The API reference is auto-generated from docstrings in the `activegraph` package. Symbols are organized by topical module — runtime, graph, behaviors, tools, store, packs, errors, observability — each as a separate page navigated from the sidebar.

Two conventions apply across the reference:

- **Public surface only.** Symbols listed in `activegraph.__all__` and the pack-level `__all__`s appear here. Internal symbols (those starting with `_`, or those not re-exported) are not in the reference; treat them as implementation details.
- **No source dumps.** The reference renders the API contract, not the implementation. Readers who want source go to [GitHub](https://github.com/yoheinakajima/activegraph).

The framework's docstrings are mostly free-form Markdown rather than structured (Google/NumPy/Sphinx). The renderer falls back to raw Markdown for prose-only docstrings; structured sections (`Args:`, `Returns:`, `Raises:`, `Examples:`) render as labeled blocks where they appear.

## Topical reference

- [Runtime](https://docs.activegraph.ai/reference/api/runtime/index.md) — the runtime loop, frames, budget, status.
- [Graph](https://docs.activegraph.ai/reference/api/graph/index.md) — graph and its primitives (objects, relations, patches, views, events).
- [Behaviors](https://docs.activegraph.ai/reference/api/behaviors/index.md) — the behavior decorators and base classes.
- [Tools](https://docs.activegraph.ai/reference/api/tools/index.md) — the `@tool` decorator and tool primitives.
- [Store](https://docs.activegraph.ai/reference/api/store/index.md) — event stores (in-memory, SQLite, Postgres), URL parsing, migration.
- [Packs](https://docs.activegraph.ai/reference/api/packs/index.md) — the pack format primitives.
- [Errors](https://docs.activegraph.ai/reference/api/errors/index.md) — the `ActiveGraphError` hierarchy.
- [Observability](https://docs.activegraph.ai/reference/api/observability/index.md) — the metrics protocol and shipped backends.
- [Diligence pack](https://docs.activegraph.ai/reference/api/packs/diligence/index.md) — the v0.9 reference pack.

## Docstring coverage

A coverage audit against CONTRACT v1.0 #C2's tier model (100% on public surface, 80% on second ring) is regenerated by `scripts/audit_docstrings.py`. The current state is in [`COVERAGE_REPORT.md`](https://docs.activegraph.ai/reference/api/COVERAGE_REPORT/index.md); the docstring-gate CI commit consumes it as a checklist.

# Runtime

The runtime loop. Constructed with a `Graph`, an optional set of behaviors, an optional LLM provider, an optional budget, and an optional store. Drives goal runs to completion and persists state through the attached store.

For the conceptual model, see [`concepts/graph`](https://docs.activegraph.ai/concepts/graph/index.md) and [`concepts/behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md).

## `errors`

Accumulated `behavior.failed` events as structured tuples.

v1.0.3 #3. Reads from `self.graph._events` on each access — the events are the source of truth and this property is a projection. No caching, no listener registration, no new state. Callers can inspect failures programmatically without reaching into `graph._events` or parsing payload dicts.

Each :class:`BehaviorFailure` carries five operationally useful fields plus the underlying `behavior.failed` event id for callers that want to re-read the full payload (e.g., traceback, LLM payload extras).

## `status(recent=20)`

Frozen snapshot of the runtime. CONTRACT v0.8 #11.

Cheap to call. No graph traversal beyond a tail-slice of the event log. Returns immutable data; mutating any field raises.

`recent` controls the length of the `recent_events` tail. The CLI's `inspect --tail N` passes through.

## `load_pack(pack, settings=None)`

Load a pack into the runtime.

Returns True on first load, False if the same `(name, version)` was already loaded (CONTRACT v0.9 #6 idempotency). Raises `PackVersionConflictError` for name-match-version-mismatch and `PackConflictError` for any contributor name collision. Pre-mutation: a failed load leaves the runtime exactly as it was.

## `loaded_packs()`

List of currently-loaded packs.

## `get_behavior(name)`

Look up a registered behavior by canonical or short name.

Short names resolve when unambiguous (load-time conflict check guarantees this invariant). Raises `LookupError` if not found or `ValueError` if ambiguous. CONTRACT v0.9 #8.

## `get_tool(name)`

Look up a registered tool by canonical or short name.

Same resolution rule as `get_behavior`. CONTRACT v0.9 #8 / #9.

## `pending_approvals()`

List of currently-pending approvals (in creation order).

## `approve(approval_id, approved_by=None)`

Materialize a pending approval. Returns the new object id.

Raises `LookupError` if `approval_id` is not pending. Emits an `approval.granted` event followed by the deferred `object.created`.

## `save_state(path=None)`

Persist the event log.

- With a store already attached: flush (no path needed). If `path` is given it must match the attached store's path.
- Without a store: late-bind a SQLite store at `path` and append all in-memory events to it (CONTRACT v0.5 #5). Returns the path the events were written to.

## `load(path, run_id=None, *, behaviors=None, frame=None, policy=None, budget=None, seed=0, replay_strict=False, llm_provider=None, replay_llm_cache=False, tools=None, replay_tool_cache=False, replay_reinvoke_deterministic=False, metrics=None)`

Open `path`, choose a run, replay its events, return a Runtime wired to continue from where the log left off.

If `run_id` is None, loads the most recently appended-to run (CONTRACT v0.5 #6).

`replay_strict=True` re-fires behaviors from the recorded seed events and compares the resulting event-type stream (id, type) to the log. KNOWN LIMITATION (v0.5): payload-only drift is not detected; see CONTRACT v0.5 #7. Tightens in v0.6 with LLMs.

v0.8: `path` accepts a URL (sqlite:///... or postgres://...) in addition to a bare SQLite path. Backward-compatible.

## `fork(at_event, label=None, *, behaviors=None, llm_provider=None, replay_llm_cache=False, tools=None, replay_tool_cache=False, replay_reinvoke_deterministic=False)`

Branch this run at `at_event` into an independent new run.

Requires a SQLite store. Copies events from the parent's log up to and including `at_event` into a fresh `run_id`, replays them into a new Graph, then returns a Runtime that operates on that Graph. Forks-of-forks work the same way (CONTRACT v0.5 #9).

## `cost_remaining(prospective_cost)`

Would `prospective_cost` push us past the ceiling? Returns True if it's safe to spend, False if it would exceed.

Per-graph monotonic ID generator. Not thread-safe (single-threaded loop).

## `reseed_from_events(events)`

Set counters past the highest id seen in `events`.

Used after replay so subsequent `object()/event()/...` continue monotonically from where the loaded log ended. Forks call this too, which is why two forks at the same point produce IDs that diverge identically (decision #12 — fine because the IDs live in different runs).

## Clocks

Real wall-clock UTC. ISO 8601 second precision, Z suffix.

Bases: `Clock`

Always returns the same timestamp. For tests and snapshots.

Bases: `Clock`

Monotonically advances by `step` seconds on every call. For tests that care about ordering but don't want wall-clock noise.

## Logging + registry helpers

Configure the activegraph logger hierarchy.

Idempotent: repeated calls replace the existing handler rather than stacking. Returns the activegraph root logger.

Parameters:

| Name               | Type                               | Description                                                                                              | Default                       |
| ------------------ | ---------------------------------- | -------------------------------------------------------------------------------------------------------- | ----------------------------- |
| `level`            | \`str                              | int\`                                                                                                    | numeric or string level name. |
| `json_output`      | `bool`                             | True for the documented JSON-line format; False for the stdlib default (one human-readable line).        | `True`                        |
| `stream`           | `Any`                              | where to write. Defaults to stderr (the logging default).                                                | `None`                        |
| `payload_redactor` | `Optional[Callable[[dict], dict]]` | optional callable(dict) -> dict applied to any payload before it's added to a log record's extra fields. | `None`                        |

Snapshot of the global behavior registry (a shallow copy).

Empty the global behavior registry and return what was cleared.

Tests that need isolation between cases call this in a fixture; the return value is the list of removed behaviors in registration order, so multi-run scripts can capture them once and re-register via :func:`register` on each subsequent run without re-importing the modules whose `@behavior` decorators populated the registry in the first place. See the *Multi-run scripts* cookbook recipe.

v1.0.1: the return value is new. v1.0 returned `None`; callers that ignored the return still work unchanged.

Append an already-constructed behavior to the global registry.

The decorators (:func:`behavior`, :func:`relation_behavior`, :func:`llm_behavior`) register on definition; this function exists for the case where definition and registration are decoupled — most commonly, multi-run scripts that call :func:`clear_registry` between runs and need to re-populate the registry without re-importing the decorator-bearing modules:

.. code-block:: python

```text
from activegraph import clear_registry, register

cleared = clear_registry()        # capture before the first run
rt1 = Runtime(graph1); rt1.run_goal("first")

for b in cleared:                 # restore for the next run
    register(b)
rt2 = Runtime(graph2); rt2.run_goal("second")
```

See the *Multi-run scripts* cookbook recipe.

v1.0.1: new. v1.0 required reaching into the private `_REGISTRY` list — the user-test gate surfaced that as a rough edge.

# Graph

The graph and its primitives — objects, relations, patches, views, and events. The graph is a projection of the event log; mutations go through events. For the conceptual model see [`concepts/graph`](https://docs.activegraph.ai/concepts/graph/index.md).

Event-sourced graph. The log is truth; objects/relations are projection.

## `relations(source=None, target=None, type=None)`

Return relations filtered by `source`, `target`, and/or `type`.

v1.0.4 #1: the canonical filter API on `Graph`. Decomposes the v0 `get_relations(object_id=, direction=)` axis into separate `source` and `target` slots so the call reads the way users already write it (matches `docs/concepts/graph.md`). Filter kwargs compose by AND; calling with no kwargs returns every relation. `Graph.get_relations(object_id=, type=, direction=)` stays as a backward-compatible alias.

## `objects(type=None, where=None)`

Return objects matching `type` and/or `where`.

v1.0.3 #1: the canonical query API on `Graph`, mirroring `View.objects(type=...)` so call sites read the same inside and outside behaviors. `Graph.query(object_type=...)` is kept as a backward-compatible alias.

## `query(object_type=None, where=None)`

Backward-compatible alias for :meth:`objects`. v1.0.3 #1.

New code should use `graph.objects(type=...)` — the kwarg `type` matches :meth:`View.objects` so the call reads the same in and out of behaviors.

## `attach_store(store)`

Wire an EventStore as the durability sink. Idempotent on the same store. Calling with a *different* store after events exist is an error — events would be persisted in two places and you'd lose history.

## `emit(event)`

Append to log, project, persist (if attached), notify. CONTRACT #2.

## `patch_object(target, updates, *, actor='system', caused_by=None, frame_id=None, rationale=None, evidence=None, llm_request_event_id=None, tool_request_event_ids=None)`

Auto-apply shortcut: build patch, version-check, emit applied/rejected.

## Primitives

## Diffs

# Behaviors

The behavior decorators and base classes. For the conceptual model see [`concepts/behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md); for the activation rules and the determinism contract see [`concepts/patterns`](https://docs.activegraph.ai/concepts/patterns/index.md) and [`concepts/failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md).

## Decorators

Decorate a function as an event-driven behavior.

v0.7 additions (both keyword-only):

- `pattern=`: a Cypher subset pattern string. When set, the behavior fires only when the pattern matches the post-event graph state. Combined with `on=` both conditions must hold (CONTRACT v0.7 #11). Matches are exposed as `ctx.matches`.
- `activate_after=`: int event count or "N events". Delays invocation by N events; re-checks `where=` at fire time (CONTRACT v0.7 #13).

Decorate a function as an LLM-driven behavior.

The decorated function's signature is `(event, graph, ctx, llm_output) -> None`. The runtime assembles the prompt, calls the provider, parses the structured output, and only then invokes the handler with the parsed result. Failures flow as `behavior.failed` events with a `reason` from CONTRACT v0.6 #11 (`llm.parse_error`, `llm.schema_violation`, `llm.network_error`, `llm.rate_limited`, `budget.cost_exhausted`).

Keyword-only on purpose — `@llm_behavior` carries enough parameters that positional binding would be a footgun.

`model=` is optional (v1.0.2 #1). Omitted, the runtime resolves it to the configured provider's `default_model` at registration time — `"claude-sonnet-4-5"` for `AnthropicProvider`, `"gpt-4o-mini"` for `OpenAIProvider`. Passing an explicit model string still works byte-identically; the runtime additionally validates the name against the configured provider's `recognizes_model()` and raises :class:`InvalidRuntimeConfiguration` at registration time if the name belongs to a different shipped provider's family (e.g. `model="gpt-4o-mini"` on a runtime configured with `AnthropicProvider`). Names no shipped provider recognizes (custom or fine-tuned models) pass through silently.

`prompt_template=` is the only escape hatch from the runtime-assembled prompt. It is a `str.format`-style template that receives four placeholders:

- `{system}` — the system block: behavior name, frame goal and constraints, role description, and (when `output_schema=` is set) the schema with an example instance.
- `{view}` — the scoped graph view: objects, relations, and recent events, rendered as Markdown (format locked per CONTRACT v0.6 #13).
- `{event}` — the triggering event as id, type, actor, and pretty-printed JSON payload with volatile keys stripped.
- `{instruction}` — the one-sentence task derived from `creates=` and `output_schema=`.

The four placeholders carry the same runtime-assembled content whether or not a template is set; the template only re-arranges them. Omitting `prompt_template=` (the default) uses the runtime's canonical layout.

Decorate a function as a relation behavior — fires once per matching edge.

v0.7: also accepts `pattern=` and `activate_after=` per CONTRACT v0.7 #8 / #11 / #13.

## Base classes

Bases: `Behavior`

A behavior whose body is an LLM call.

The runtime owns prompt assembly, cache lookup, the provider call, event emission, and structured-output parsing. The developer's `handler` is invoked only after the LLM has responded (or a cached response was found) and the output was successfully parsed.

CONTRACT v0.7: `tools=` declares a list of `Tool` objects (or strings naming globally-registered tools) the LLM can call during the turn loop. The runtime orchestrates the loop; the handler receives only the final structured output, never raw tool calls.

## `build_prompt(event, graph, *, frame=None)`

Assemble the prompt that would be sent for this event.

CONTRACT v0.6 #20 — public so a developer can inspect prompts without making an API call. Reproducible (pure over inputs); cheap (no I/O).

# Tools

The `@tool` decorator and tool primitives. For the conceptual model and the LLM-tool-loop interaction see [`concepts/behaviors`](https://docs.activegraph.ai/concepts/behaviors/index.md) and the [Writing tools](https://docs.activegraph.ai/reference/guides/writing-tools.md) guide.

## Decorator + base

Register a function as a Tool.

The decorated function's signature is `(args: input_schema, ctx: ToolContext) -> output_schema`. The runtime validates `args` against `input_schema` before invocation and validates the return value against `output_schema` after.

Keyword-only on purpose — too many fields for safe positional binding.

## `to_definition()`

Provider-facing tool definition.

Sent in the `tools=` parameter to `LLMProvider.complete()`. Anthropic and OpenAI both accept a similar shape; the provider translates if needed.

## Registry helpers

# Store

Event stores, URL parsing, and migration. For the conceptual model see [`concepts/graph`](https://docs.activegraph.ai/concepts/graph/index.md) (graph as projection of the event log) and [`concepts/replay`](https://docs.activegraph.ai/concepts/replay/index.md).

## Stores

Bases: `Protocol`

Append-only per-run event log. CONTRACT v0.5 #2.

Per-run view onto a SQLite-backed event log.

Direct construction expects an explicit `path` and `run_id`. For most cases prefer `Runtime(graph, persist_to=...)`, which opens the store, mints a `run_id` if needed, and wires it onto the runtime — the v1.0.1 user-test surfaced that constructing `SQLiteEventStore` by hand is a low-frequency operator path, not the happy path.

## `fork_run(path, *, parent_run_id, new_run_id, at_event_id, label, created_at)`

Copy events from parent_run_id up to and including at_event_id into new_run_id (CONTRACT v0.5 #11: copy rows, no row-sharing).

Returns the number of events copied.

## URL parsing + helpers

Open a store for `run_id` at `url`. Returns an EventStore.

This is the single entry point the runtime and CLI use to open a store from a URL. Drivers are imported lazily so the Postgres dependency stays optional.

Parse a store URL, or raise InvalidStoreURL with a helpful message.

## Migration

Copy every run (or a subset) from `source_url` into `dest_url`.

Parameters:

| Name             | Type                                             | Description                                                                                                                                                                                                                               | Default    |
| ---------------- | ------------------------------------------------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------- |
| `source_url`     | `str`                                            | e.g. sqlite:///dev.db                                                                                                                                                                                                                     | *required* |
| `dest_url`       | `str`                                            | e.g. postgres://localhost/prod                                                                                                                                                                                                            | *required* |
| `only_run_ids`   | `Optional[list[str]]`                            | if given, migrate only these runs.                                                                                                                                                                                                        | `None`     |
| `on_progress`    | `Optional[Callable[[MigrationRunReport], None]]` | called after each run finishes (success or failure) with the MigrationRunReport for that run.                                                                                                                                             | `None`     |
| `skip_corrupted` | `bool`                                           | if True, rows whose payload fails JSON decode are skipped (not migrated, not failing the run). The skipped event ids appear in the per-run report's skipped_events. The resulting destination run is partial — the operator is on notice. | `False`    |

Returns:

| Type              | Description                                             |
| ----------------- | ------------------------------------------------------- |
| `MigrationReport` | A MigrationReport. The overall operation is considered  |
| `MigrationReport` | successful iff every run's status is "ok" or "skipped". |

# Packs

The pack format primitives. For the conceptual model and the authoring workflow see the [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md) guide.

The shipped reference pack is [Diligence](https://docs.activegraph.ai/reference/api/packs/diligence/index.md).

## Pack declaration

A frozen bundle of pack contents.

Equality and hashing are by (name, version) — NOT by deep field comparison. Behaviors and tools are dataclasses (not hashable); full structural equality would not work and isn't what users care about. The identity that matters is "is this the same pack name and version" — that's what idempotent loading hinges on.

## `prompt_manifest()`

The `pack.loaded` payload's `prompts` block. Maps prompt name to {"version", "hash"}. CONTRACT v0.9 #10.

A typed object the pack contributes.

`schema` is a Pydantic `BaseModel` subclass. When the pack is loaded, `graph.add_object(name, data=...)` validates against it. Validation applies only to objects created AFTER the pack loads (CONTRACT v0.9 #5).

A typed relation the pack contributes.

`source_types` and `target_types` are tuples of object type names; empty means "any".

A policy declared by a pack.

`requires_approval`: tuple of object type names whose `add_object` is gated until `runtime.approve(...)` is called.

A versioned, content-hashed prompt.

`version` is the declared human-readable version (for changelogs and operator messages). `content_hash` is the SHA-256 of the body truncated to 16 hex chars; this is the **replay contract** (the hash, not the version — see CONTRACT v0.9 #10).

Bases: `BaseModel`

For packs with no configurable settings. Pydantic model so it matches the rest of the settings API.

## Discovery

Enumerate installed packs via the `activegraph.packs` entry point group. Cached per process; call `clear_discovery_cache()` to force a re-scan.

Find a discovered pack by name. Raises `LookupError` if not found.

Reset the cached entry-point scan. Tests that install packages dynamically need to call this; normal usage does not.

Scan a directory of `*.md` prompt files with TOML frontmatter.

Each file MUST start with:

```text
---
version = "1.0.0"
name = "optional_name"   # defaults to filename without .md
---
<body>
```

Returns a tuple of `PackPrompt` sorted by name. Content hash is computed over the body (everything after the second `---` line and one separating newline), exactly as it will appear at runtime.

Errors

- missing/malformed frontmatter -> PackPromptLoadError
- missing required `version` field -> PackPromptLoadError
- duplicate prompt name -> PackPromptLoadError
- I/O failure -> PackPromptLoadError

A pack discovered via Python entry points but not yet loaded.

## Approvals + policies

An object creation that's gated behind a policy approval.

The `id` is unique within the runtime instance and is reused as the eventual object id once approved. `kind` is "object" in v0.9; the field exists so v1.0 can extend it to relations or patches without breaking the API.

# Errors

The `ActiveGraphError` hierarchy plus the cross-cutting helpers. For the format spec and the events-not-exceptions principle see [`concepts/failure-model`](https://docs.activegraph.ai/concepts/failure-model/index.md). For per-error recovery prose see the [error reference](https://docs.activegraph.ai/reference/errors/replay-divergence-error/index.md) catalog (one page per leaf).

## Hierarchy root

Bases: `Exception`

Root of every framework error. CONTRACT v1.0 #4.

Subclasses construct by passing a one-line `summary` plus the three structured fields (`what_failed`, `why`, `how_to_fix`) and any error-specific context. `__str__` produces the locked format; the structured fields stay accessible programmatically for tools that want to render errors differently (a doc-site error catalog, a machine-readable failure log, etc.).

## `__init__(summary_or_message, *, what_failed=None, why=None, how_to_fix=None, context=None)`

Two construction modes during the v1.0 transition:

- **Structured** (the v1.0 target): pass `summary` plus the three named fields. `__str__` produces the locked format.
- **Legacy**: pass a single positional message. Used by error leaves that have not yet migrated under the v1.0 PR series. `__str__` returns the message verbatim — format-noncompliant but valid Python, so existing raises keep working.

PR-A converts the ReplayError leaves (the reference category). PR-B through PR-F convert the rest one PR at a time. The legacy branch goes away once every leaf is migrated; until then this gateway is the bridge.

## `is_structured()`

True when the three structured fields are populated. Used by the format snapshot tests and the docs catalog to filter out leaves that have not yet migrated to the v1.0 format.

## Category bases

Bases: `ActiveGraphError`

Runtime construction problems: invalid budget, malformed store URL, missing required configuration. Fires before any work runs.

Bases: `ActiveGraphError`

Behavior, tool, or pack registration problems: conflicts at registration time, version mismatches, missing providers, unknown tools. Fires at registration / pack-load time.

Bases: `ActiveGraphError`

Runtime execution problems: behavior failures, budget exhausted, tool failures during a goal run. Named ExecutionError (not RuntimeError) because Python already has builtin `RuntimeError` and shadowing the builtin produces confusing stack traces.

Bases: `ActiveGraphError`

Replay and fork problems: cache hash mismatches, type-stream divergence between recorded and re-run event logs. Fires only during replay / fork; never during a fresh run.

Bases: `ActiveGraphError`

Persistence problems: failed writes, malformed event payloads on deserialize, schema version mismatches.

Bases: `ActiveGraphError`

Pattern subscription problems: invalid Cypher syntax, unsupported features, malformed WHERE clauses at registration time.

Bases: `ActiveGraphError`

Pack-specific problems at runtime (not registration): schema violations on add_object after pack load, pack-state inconsistencies. Registration-time pack errors live under :class:`RegistrationError` instead.

## Cross-cutting

Bases: `RegistrationError`, `ImportError`

A subsystem requires an optional Python package that isn't installed.

Used by the Postgres store, the Prometheus metrics backend, and the Pack format (which requires Pydantic). Multi-inherits :class:`ImportError` so user code that catches the builtin around optional-dep imports continues to work. v1.0 PR-E.

Construct with the missing package name and the activegraph extras name that bundles it; the structured message walks the user through the install line.

## Replay

Bases: `ReplayError`

Raised when a replay (`replay_strict=True`) or a fork produces an event stream that does not match the recorded log.

`event_id` pins the first divergence point so an operator can jump directly to it. `expected` and `actual` describe what was recorded vs. what the live re-run produced; one is `None` when the re-run finished early or produced an extra event with no recorded counterpart.

## Pattern

Bases: `PatternError`, `SyntaxError`

Pattern uses syntax outside the v0.7 Cypher subset.

Multi-inherits :class:`SyntaxError` so existing user code that catches `SyntaxError` around pattern compilation keeps working. The v1.0 structured-format superclass is :class:`PatternError`, which is itself an :class:`ActiveGraphError`.

Construct via :meth:`refused_feature` or :meth:`syntax_error` — the factory class methods produce the canonical voice for the two failure modes (a refused-but-recognized Cypher feature vs. a parser-level syntax error). Direct construction with the structured fields is supported for one-off cases.

## `refused_feature(*, feature, workaround, at=None, why=None)`

The canonical case: a recognized Cypher feature that the v0.7 subset deliberately refuses, with a documented workaround.

Use for OR, OPTIONAL MATCH, variable-length paths, undirected relationships, WITH, RETURN, CREATE, MERGE, etc. The `feature` argument is included in the summary verbatim, so the substring is the same string an operator would search a log for.

## `syntax_error(*, what, at=None, expected=None, got=None)`

Parser-level error: the pattern does not parse at all (vs. parses-but-uses-refused-feature). Recovery points the developer at the offending token / position.

## Storage

Bases: `StorageError`, `TypeError`

Raised at emit-time when a payload value cannot be JSON-encoded.

Multi-inherits :class:`TypeError` so user code that does `except TypeError` around emit/append calls keeps working. Distinct from :class:`CorruptedEventPayloadError` — this is the encode-side failure (Python value cannot be made into JSON); that one is the decode-side failure (JSON bytes cannot be made into a Python value).

Bases: `StorageError`, `ValueError`

Raised when a URL is missing a scheme, has an unsupported scheme, or is otherwise malformed.

Multi-inherits :class:`ValueError` so user code that catches `ValueError` around URL parsing keeps working. The message always points the user at a concrete fix — bare paths get `sqlite:///<that path>`, unsupported schemes get the list of supported ones.

Bases: `StorageError`

The store's recorded `schema_version` doesn't match what this activegraph build expects.

Fires on store open. The store file is intact; it was just written by a different (older or newer) activegraph build. Recovery is one of three things: upgrade activegraph, downgrade the store via migration, or migrate the run to a fresh store with the current build.

Bases: `StorageError`, `KeyError`

An event id wasn't found in the run's event log.

Multi-inherits :class:`KeyError` so user code that does `except KeyError` around store lookups keeps working. Fires from every `store.get_event(event_id)` and from the fork primitive when `--at-event` names a missing id.

Bases: `StorageError`, `ValueError`

Two events with the same id were appended to the same run.

Multi-inherits :class:`ValueError` for back-compat with user code catching ValueError around appends. Fires only on programmer error: the runtime's id generator is monotonic so duplicates shouldn't arise in normal use. Common cause: hand-constructing events with fixed ids in a test fixture.

Bases: `StorageError`

A stored event payload couldn't be decoded as JSON.

Fires at load-time when a row's payload column contains invalid JSON. Distinct from :class:`NonSerializableEventError`, which fires at emit-time when a Python value can't be encoded to JSON. Corruption-on-load means the bytes on disk don't parse — a different failure mode requiring a different recovery.

## Execution

Bases: `ExecutionError`, `Exception`

Structured failure from inside an @llm_behavior wrapper.

The runtime's `_invoke` catch reads `reason` and `payload_extras` off this exception and includes them in the emitted `behavior.failed` event. Other exception types fall through to the existing CONTRACT v0.6 #13 path unchanged.

Constructor signature `(reason, message, *, payload_extras=)` is preserved from v0.6 so the ~8 internal raise sites in providers do not change. The structured-format fields are auto-derived from `reason` via the per-reason prose table above.

Bases: `ExecutionError`, `Exception`

Structured failure from inside a tool invocation.

`reason` must be one of the v0.7 codes:

tool.timeout, tool.network_error, tool.invalid_input, tool.invalid_output, tool.execution_error, tool.unknown_tool, tool.fixture_missing, budget.tool_calls_exhausted, budget.cost_exhausted.

Constructor signature `(reason, message, *, payload_extras=)` is preserved from v0.7 so the internal raise sites in tool bodies do not change. The structured-format fields are auto-derived from `reason` via `_TOOL_REASON_PROSE`.

Bases: `ExecutionError`, `RuntimeError`

Raised when an LLM response calls a tool the behavior didn't declare.

The runtime catches it during the LLM tool-loop and surfaces it as `behavior.failed reason="tool.unknown_tool"`. Multi-inherits RuntimeError so user code that catches RuntimeError around runtime operations continues to work.

Bases: `ExecutionError`, `LookupError`

Pending-approval lookup miss.

Fires from `runtime.approve(approval_id)` when the id doesn't refer to any currently-pending approval. Multi-inherits :class:`LookupError` so user code that does `except LookupError` around the approval API continues to work.

Bases: `ExecutionError`, `RuntimeError`

`ctx.propose_object` (or another ctx method that requires the runtime) was called from a behavior whose context isn't bound to a runtime.

Fires at execution time, inside a running behavior — the caller is the behavior body that invoked the ctx method, not the framework's construction code. Multi-inherits :class:`RuntimeError` for back-compat.

Bases: `ExecutionError`, `ValueError`

`graph.apply_patch(patch_id)` was called on a patch that isn't in `"proposed"` state.

Patches go through `proposed → applied` (success) or `proposed → rejected` (policy or behavior rejection). Calling `apply_patch` on an already-applied or already-rejected patch is a programmer error in the calling code path; the framework refuses rather than re-apply or silently no-op. Multi-inherits :class:`ValueError`.

Bases: `ExecutionError`, `ValueError`

A framework-internal evaluator received input it does not recognize. Should not fire in normal use — the framework's parsers produce a closed set of operators / AST nodes, and an unrecognized one means either drift between parser and evaluator or external AST construction that bypassed the parser.

Multi-inherits :class:`ValueError` so user code that catches the builtin around view operations continues to work. Used by `activegraph/core/graph.py` for the view-filter evaluator; the pattern-subscription evaluator's two internal-bug raises stay as :class:`UnsupportedPatternError` (the natural category) but use the same :func:`activegraph.errors.internal_bug_fields` helper so the prose is uniform across all three sites.

## Registration

Bases: `RegistrationError`, `RuntimeError`

Raised when an @llm_behavior is invoked but no LLM provider is wired on the Runtime.

Fires at registration / startup, not at every invocation — the runtime validates the configuration once. Multi-inherits :class:`RuntimeError` for back-compat with user code catching the builtin around runtime construction.

Bases: `RegistrationError`, `RuntimeError`

An `@llm_behavior` declares a tool name the runtime cannot find in its tool registry at startup.

Fires at construction time, not at LLM-call time — the runtime validates the declared tools once when the behavior registers. Multi-inherits :class:`RuntimeError` for back-compat.

Bases: `RegistrationError`, `LookupError`

`runtime.get_behavior(name)` could not resolve the name to a registered behavior.

Multi-inherits :class:`LookupError` so user code that catches the builtin around behavior lookups continues to work.

Bases: `RegistrationError`, `ValueError`

A short behavior name resolves to more than one loaded pack.

Fires only when both packs declare a behavior under the same short name. The user is asked to disambiguate by using the canonical `pack_name.behavior_name` form. CONTRACT v0.9 #8.

Bases: `RegistrationError`, `LookupError`

`runtime.get_tool(name)` could not resolve the name to a registered tool.

Symmetric with :class:`BehaviorNotFoundError`. Multi-inherits :class:`LookupError` for back-compat.

Bases: `RegistrationError`, `ValueError`

A short tool name resolves to more than one loaded pack.

Symmetric with :class:`AmbiguousBehaviorError`. CONTRACT v0.9 #9.

Bases: `RegistrationError`, `ValueError`

`activate_after=` on a @behavior / @llm_behavior decorator was passed an unparseable or out-of-range value.

Multi-inherits :class:`ValueError` for back-compat with user code catching the builtin around behavior registration.

Bases: `RegistrationError`, `TypeError`

A value passed to `Runtime(tools=[...])` is not a Tool instance.

Common cause: the developer passed a bare function instead of one decorated with `@tool`. Multi-inherits :class:`TypeError` for back-compat.

Bases: `RegistrationError`, `LookupError`

`activegraph.packs.load_by_name(name)` could not find an installed pack with that name in the entry-point registry.

Multi-inherits :class:`LookupError` for back-compat with user code catching the builtin around pack discovery.

Bases: `RegistrationError`, `PackError`

Two loaded packs conflict on a declared identifier.

Raised at `runtime.load_pack` time. Pre-mutation: a failed `load_pack` call leaves the runtime exactly as it was.

Bases: `RegistrationError`, `PackError`

Same pack name loaded with two different versions.

A runtime cannot hold two versions of the same pack. Pre-mutation, same as `PackConflictError`.

## Pack

Bases: `PackError`, `ValueError`

`graph.add_object` or `graph.add_relation` data failed schema validation against a loaded pack's declared type.

Runtime-shape error (fires at add_object / add_relation, not at pack load), so stays under PackError only — this is the lone runtime-shape leaf in the PackError category. Subclass of :class:`ValueError` so user code catching the builtin around graph mutations continues to work.

v1.0 PR-G migrated to structured format. Three call sites are served by three factory class methods (object validation, relation source-type, relation target-type) so the recovery prose can be specific to each shape. Direct construction with the structured fields is also supported.

## `for_object(*, object_type, validation_error, pack_name=None)`

Object data failed the pack's declared schema validation.

## `for_relation_source(*, relation_type, source_type, allowed, pack_name=None)`

Relation source object type isn't in the allowed list.

## `for_relation_target(*, relation_type, target_type, allowed, pack_name=None)`

Relation target object type isn't in the allowed list.

Bases: `RegistrationError`, `PackError`

A `Pack(...)` constructor argument failed validation.

Raised at construction time, not at load time. Covers things like duplicate behavior names, an invalid pack name, an unhashable settings_schema, etc. Multi-inherits RegistrationError (v1.0 PR-E) and PackError (v0.9 base).

Bases: `RegistrationError`, `PackError`

`runtime.load_pack(pack)` called without `settings=` for a pack whose `settings_schema` doesn't accept no-arg construction.

Bases: `RegistrationError`, `PackError`

A prompt file is malformed, missing required frontmatter, or unreadable. Pack registration time.

## Configuration

Bases: `ConfigurationError`, `ValueError`

Caller-provided configuration is invalid (conflicting arguments, missing required argument, out-of-range value).

Multi-inherits :class:`ValueError` so user code that catches the builtin around runtime construction or method calls keeps working.

Construct with a one-line `summary` plus the three structured fields. The recovery prose is per-call-site, not table-driven — each configuration mistake has a different fix.

Bases: `ConfigurationError`, `TypeError`

A value passed to a constructor or method has the wrong type.

Multi-inherits :class:`TypeError`. Used when the framework's contract is type-based (e.g., :class:`PostgresEventStore` accepts a URL string, a psycopg.Connection, or a psycopg_pool.ConnectionPool — anything else is refused at construction).

Bases: `ConfigurationError`, `RuntimeError`

An operation requires a runtime state that isn't satisfied — either a state that must be set but isn't, or a state that mustn't be set but is.

Examples: `runtime.fork()` requires a SQLite-backed runtime; `graph.attach_store()` requires no existing store. Multi-inherits :class:`RuntimeError` for back-compat.

# Observability

The metrics protocol and shipped backends. For the conceptual model see the [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/index.md) guide.

## Metrics protocol

Bases: `Protocol`

Three methods, all best-effort, all non-throwing.

Implementations MUST tolerate unknown metric names. Unknown tag keys are also accepted; cardinality discipline is the caller's job.

## Backends

Default Metrics implementation. Does nothing.

Three method bodies, each a single `return`. The runtime is fully functional with NoOpMetrics. Profile-checked for zero allocation pressure under steady load.

Drop-in Metrics implementation backed by prometheus_client.

Instruments are lazy. Tag keys for an instrument are fixed by the first observation; subsequent observations with a different key set raise (prometheus_client behavior). This matches the standard metric list's fixed tag schemas.

# Diligence pack

The v0.9 reference pack. Investment diligence: claims, evidence, contradictions, risks, memos. Three behaviors are LLM-backed; fixtures ship with the pack for reproducible demos.

For pack-authoring guidance using this pack as the reference see [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/index.md). A dedicated pack overview page lands post-merge alongside other pack documentation.

## Settings

Bases: `BaseModel`

Configuration for the Diligence pack.

Accessed by behaviors in three forms (CONTRACT v0.9 #7):

1. Typed parameter injection (primary): `def claim_extractor(event, graph, ctx, out, *, settings: DiligenceSettings): ...`
1. `ctx.settings.confidence_threshold_for_review`
1. `ctx.pack_settings("diligence")` for cross-pack lookups.

## Pack declaration
# Optional

______________________________________________________________________

# This page mirrors the canonical CHANGELOG.md at the repo root via

# pymdownx.snippets. Edit the root file, not this page.

______________________________________________________________________

# Changelog

All notable changes to **activegraph** are documented here.

The format follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/); versioning follows [Semantic Versioning](https://semver.org/spec/v2.0.0.html). Per-version migration notes reference the [Migration from v0.7](https://docs.activegraph.ai/cookbook/migration-from-v0-7/) cookbook, the canonical runbook for upgrading runs and code across milestones.

The doc site mirrors this file at [Changelog](https://docs.activegraph.ai/about/changelog/) via the mkdocs snippet plugin — edit `CHANGELOG.md` at the repo root.

## [Unreleased]

Nothing yet. v1.1 scope is tracked in `v1.1-plan.md` (consolidated by the post-v1.0.3 contract review). v1.0.4 surfaced two additional v1.1 candidates: C-3 (lock the failure-routing convention for eval-time pattern failures and `ReplayDivergenceError`) and I-4 (cross-link `replay-divergence-error.md` to replay/fixture documentation). v1.0.5 surfaced two more: content negotiation on the docs host (serving `text/markdown` per-page complementing the static `/llms.txt` and `/llms-full.txt` files), and an editorial doc-readability pass (front-loaded page summaries, terminology normalization). v1.0.5.post1 surfaced three more: CLA / DCO decision (Apache 2.0 §5's implicit grant covers today; revisit if contribution volume or enterprise legal review makes the ceremony concrete); `CODE_OF_CONDUCT.md` paired with a contact channel (Contributor Covenant v2.1 is the standard text; the missing piece is the contact inbox); and the relaxation of the issues-first contribution policy (currently a pre-launch posture; revisit based on actual contribution patterns observed during v1.0.x's public window). v1.0.5.post2 surfaced two more: the `object.patched` event-name drift in `docs/concepts/events.md` (the page lists a framework event the code does not emit — fix the doc or fix the code, design call deferred); and a dedicated reason-code taxonomy reference (the closed `behavior.failed` / `tool.responded` `reason=` vocabulary is documented only across the per-error pages; a single enumeration would mirror the event-type listing v1.0.5.post2 ships).

## [v1.0.5.post2] — 2026-05-20

Type-system concepts page. A maintainer-driven doc-gap review found that the framework's type system is documented across four concepts pages (`graph.md`, `events.md`, `relations.md`, `patches.md`) plus the pack-authoring guide, with no single page answering the question a new reader arrives with: *what types are framework-defined, what types are developer-defined, and how do they compose?* In particular, "are there framework base object types?" — the answer is no — was reachable only by assembling fragments from three pages.

v1.0.5.post2 ships one new concepts page. No framework code changes. No new public API. No reshape of any locked decision below v1.0.5.post2.

### The single finding

v1.0.5.post2 #1 — A type-system concepts page lands at `docs/concepts/type-system.md`, slotted into the Concepts nav between Graph and Events. The page commits to four claims: (a) object types are developer-defined strings; the framework ships zero base object types; (b) relation types follow the same model; (c) event types are framework-defined, with the complete enumerated set in named families (lifecycle / graph mutations / behavior dispatch / patterns / LLM / tools / patches / approvals / pack lifecycle); (d) patch lifecycle states (`proposed | applied | rejected`) are framework-defined.

### Added

- **`docs/concepts/type-system.md`** (v1.0.5.post2 #1). The new page. Sections: the framework-defined layer (event types, fully enumerated); the developer-defined layer (object types); the developer-defined layer (relation types); how the three layers compose; patch lifecycle states; designing an ontology; the Diligence pack ontology as the worked example. The page makes the framework's "no base object types" stance explicit because users from typed-schema backgrounds (databases, Pydantic, GraphQL, Protobuf) arrive expecting a schema-definition step. The deep-research-agent user-test finding ("object types should be nouns describing their role in the pipeline, not just data bags") is the surfacing path for the ontology design guidance.

### Changed

- **`README.md`**: new `## The type system at a glance` section between `Concepts at a glance` and `A small example`. Three short beats — event types are fixed (full enumerated list inline), object and relation types are yours (no central schema, no registration), patch states are fixed — plus a pointer to the new concepts page. The bridge between the primitive index and the example: a reader who has skimmed the twelve primitives now knows the vocabulary the example uses (`object.created`, the custom `task.completed`, the `task` and `depends_on` strings) before they hit the code. The section calls out the `task.completed` custom-event-type usage in the example explicitly so the framework-vs-application event distinction lands before the code rather than after it.
- **`mkdocs.yml`**: nav adds `Type system: concepts/type-system.md` to the Concepts section between Graph and Events; the `mkdocs-llmstxt` plugin's `sections.Concepts` list adds the matching entry with its one-sentence description. The `tests/test_llms_txt.py` gate from v1.0.5 #1 picks up the new page automatically at build time (the generated `/llms.txt` and `/llms-full.txt` regenerate from `mkdocs.yml` + the `docs/` source on every build, so the new page appears in both).
- **`pyproject.toml`**: `version` bumps `1.0.5.post1` → `1.0.5.post2`.
- **`activegraph/__init__.py`**: `__version__` tracks the bump (the `test_version_sync` gate asserts it matches `pyproject.toml` byte-identically).
- **`tests/snapshots/errors/{internal_bug__pattern_unknown_op, internal_bug__graph_view_unknown_op,schema_version_mismatch}.txt`**: rebaselined the embedded version string from `1.0.5.post1` to `1.0.5.post2` (same three snapshots v1.0.5.post1 rebaselined for the same reason — they render `activegraph.__version__` inline per the internal-bug context format and the schema-version-mismatch context format).

### CONTRACT amendments

- **`v1.0.5.post2` milestone added** with one numbered finding (`v1.0.5.post2 #1`) and a "deliberately does NOT touch" section. Single-finding milestone — same scope discipline as v1.0.1 / v1.0.2 / v1.0.2.post1 / v1.0.5 / v1.0.5.post1. Appended as a new section after v1.0.5.post1 per Standing Rule §1.

### v1.1 backlog (filed in `v1.1-plan.md`)

- **`object.patched` event-name drift in `docs/concepts/events.md`.** The page's "Object mutations" family lists `object.patched`, but the code emits `patch.applied` for the direct `graph.patch_object(...)` shortcut and never `object.patched`. Either correct the doc (drop the name or rename to `patch.applied`) or add the event in code; the fix shape depends on whether direct mutations were intended to be distinguishable from patch applies. Surfaced during the v1.0.5.post2 diagnosis; out of scope for a "no concepts-page reshape" docs-only release.
- **Reason-code taxonomy as a dedicated concepts page.** The `behavior.failed` / `tool.responded` `reason=` field carries a closed taxonomy (`llm.network_error`, `llm.parse_error`, `tool.unknown_tool`, `tool.max_turns_exhausted`, …) documented only across the per-error pages. A dedicated reference (or `failure-model.md` expansion) could enumerate the codes the way v1.0.5.post2 #1 enumerates the event types.

### Migration from v1.0.5.post1

Forward-compatible. No code changes required. The runtime API, public surface, and CI gates are unchanged. The doc site grows by one page; existing pages stay byte-identical. `/llms.txt` and `/llms-full.txt` regenerate automatically on the next `mkdocs build` (the v1.0.5 #1 structural-drift guarantee).

```bash
pip install --upgrade activegraph==1.0.5.post2
```

- Every v1.0.5.post1 surface (LICENSE, NOTICE, CONTRIBUTING.md, issue templates, license metadata) stays byte-identical.
- Every v1.0.5 surface (`/llms.txt`, `/llms-full.txt`, `mkdocs-llmstxt` plugin, `tests/test_llms_txt.py`) stays byte-identical apart from the new entry the plugin picks up from `mkdocs.yml`.
- Every v1.0.4 / v1.0.3 / v1.0.2.post1 / v1.0.2 / v1.0.1 / v1.0 surface stays byte-identical.

## [v1.0.5.post1] — 2026-05-19

Pre-launch foundation pass before the repository is flipped public. Three coupled deliverables: the framework's license switches from MIT to Apache 2.0; a `CONTRIBUTING.md` lands with an issues-first contribution policy; the `.github/ISSUE_TEMPLATE/` surface lands with three structured templates plus a `config.yml` that disables blank issues. Coupled because shipping any one without the others would leave a public repository in a half-stated posture.

No framework code changes. No new public API. No reshape of any locked decision below v1.0.5. The release surface is repo-root metadata (`LICENSE`, `NOTICE`, `pyproject.toml`'s license field, `README.md`'s license section), contributor-facing prose (`CONTRIBUTING.md`), and the GitHub issue-template surface (`.github/ISSUE_TEMPLATE/`). The framing — post-release patch between a numbered milestone and the next — matches the v1.0.2.post1 precedent (CONTRACT v1.0.4 #6's appended archeology section is the canonical example of how a `.postN` release lands in CONTRACT under Standing Rule §1).

### The finding (v1.0.5.post1 #1)

v1.0.5.post1 #1 — Active Graph is licensed under Apache 2.0 from v1.0.5.post1 forward. Three reasons named in the CONTRACT amendment: explicit patent grant (§3 of the license, which MIT does not provide; load-bearing for a framework whose primitives — event-sourced reactive graph, relation behaviors, binding-moment validation, pack format — are themselves the contribution surface); institutional standard for foundation-shaped projects (ASF, CNCF, LF AI; matches enterprise legal-review calibration); legal precision on trademark / contribution / NOTICE boundaries (§§6, 5, 4(d); MIT leaves these implicit). The previous declared license was MIT, recorded in `pyproject.toml` and `README.md` through twelve milestones but never accompanied by a `LICENSE` file at the repo root — this release is the first to ship the canonical license text.

### Added

- **`LICENSE`** (v1.0.5.post1 #1). Canonical Apache 2.0 text from `https://www.apache.org/licenses/LICENSE-2.0.txt` prefixed with a single-line `Copyright 2026 Yohei Nakajima` header. The body below the header is byte-identical to the Apache Foundation's canonical plain-text version.
- **`NOTICE`** (v1.0.5.post1 #1). The Apache 2.0 §4(d) attribution pair. Two lines: project name (`Active Graph`) and copyright line (`Copyright 2026 Yohei Nakajima`). Downstream redistributors preserve NOTICE per §4(d).
- **`CONTRIBUTING.md`** (v1.0.5.post1 #1). Issues-first contribution policy for the framework's early public phase. Issues are open; code PRs are maintainer-only with an issue-first discussion gate; documentation PRs may be opened directly. Names the policy as a pre-launch posture (not a permanent stance) with the relaxation criteria stated. Includes the explicit Apache 2.0 §5 inbound-equals-outbound statement. Names three out-of-scope items mirroring the CONTRACT amendment's "deliberately does NOT touch" section: CLA / DCO decision, `CODE_OF_CONDUCT.md` paired with a contact channel, broader contributor surface.
- **`.github/ISSUE_TEMPLATE/bug_report.md`**, **`feature_request.md`**, **`question.md`** (v1.0.5.post1 #1). Three structured templates prompting for the information that makes a triage pass deterministic — minimal reproduction (bugs), problem statement and current workaround (feature requests), what-tried and what-expected (questions). Each template heads with a one-line pointer to `docs.activegraph.ai` and `CONTRIBUTING.md`. Pre-labels each issue (`bug`, `enhancement`, `question`) so issue-list filters work without manual triage.
- **`.github/ISSUE_TEMPLATE/config.yml`** (v1.0.5.post1 #1). Disables `blank_issues_enabled` and adds two `contact_links` pointing at the docs site and `CONTRIBUTING.md`. Forces every issue through one of the three templates.
- **`tests/test_license.py`** (v1.0.5.post1 #1). Standing Rule §2 gate anchored on the contract boundary ("Active Graph is licensed under Apache 2.0 from v1.0.5.post1 forward"). Six assertions covering the five surfaces the claim binds: LICENSE carries the Apache canonical heading plus the §3 patent-grant section header; NOTICE carries the project name and copyright line; pyproject.toml's license field reads SPDX `Apache-2.0`; no `License ::` classifier remains in pyproject; README's license section names Apache 2.0 and points at LICENSE; sanity check on tomllib availability.

### Changed

- **`pyproject.toml`**: `[project].license` switches from `{ text = "MIT" }` to the SPDX string `"Apache-2.0"` per PEP
- Adds `[project].license-files = ["LICENSE", "NOTICE"]` declaring the carried metadata. Drops the `License :: OSI Approved :: MIT License` classifier — PEP 639 forbids `License ::` classifiers when the SPDX form is used. Bumps `build-system.requires` from `setuptools>=68` to `setuptools>=77.0.3`, the minimum that supports PEP 639's SPDX license metadata.
- **`README.md`**: the `## License` section now reads "Active Graph is licensed under the Apache License 2.0" with pointers to LICENSE and NOTICE. The `## Contributing` section now points at `CONTRIBUTING.md` and names the issues-first policy as the pre-launch posture.
- **`activegraph/__init__.py`**: `__version__` tracks the bump to `"1.0.5.post1"` (the `test_version_sync` gate asserts it matches `pyproject.toml` byte-identically).
- **`tests/snapshots/errors/{internal_bug__pattern_unknown_op, internal_bug__graph_view_unknown_op,schema_version_mismatch}.txt`**: rebaselined the embedded version string from `1.0.5` to `1.0.5.post1` (these three snapshots render `activegraph.__version__` inline per the internal-bug context format and the schema-version-mismatch context format).

### CONTRACT amendments

- **`v1.0.5.post1` milestone added** with one numbered finding (`v1.0.5.post1 #1`) and a "deliberately does NOT touch" section. Single-finding milestone — same scope discipline as v1.0.1 / v1.0.2 / v1.0.2.post1 / v1.0.5. Appended as a new section between v1.0.5 and v1.1 per Standing Rule §1 (the v1.0.2.post1-via-v1.0.4-#6 retroactive archeology section is the precedent for how a `.postN` release lands).

### v1.1 backlog (filed in `v1.1-plan.md`)

- **CLA / DCO decision.** Apache 2.0 §5's implicit grant covers the contract today; if contribution volume grows past maintainer-review bandwidth or if enterprise legal desks request the ceremony, decide between a CLA (with a signing workflow) and a DCO (the `Signed-off-by:` discipline). v1.1 work, not v1.0.5.post1.
- **`CODE_OF_CONDUCT.md` paired with a contact channel.** Contributor Covenant v2.1 is the standard text; the missing piece is the contact channel for reports. v1.1 picks both up together — shipping the document without the inbox would publish a hollow reporting commitment.
- **Relax the issues-first contribution policy.** Today's maintainer-only-code-PRs posture is explicitly pre-launch; v1.1 owns the decision to broaden it based on observed contribution patterns during v1.0.x's public window.

### Migration from v1.0.5

Forward-compatible. No code changes required. The runtime API, public surface, doc-site content, and CI gates are unchanged.

```bash
pip install --upgrade activegraph==1.0.5.post1
```

PyPI metadata changes for v1.0.5.post1 onward: `License-Expression: Apache-2.0` (PEP 639 SPDX form); `License-File: LICENSE`, `License-File: NOTICE` (the carried metadata files); the `License :: OSI Approved :: MIT License` classifier is removed. Redistributors should update their license-tracking metadata accordingly.

- Every v1.0.5 surface (`/llms.txt`, `/llms-full.txt`, `mkdocs-llmstxt` plugin, `tests/test_llms_txt.py`) stays byte-identical.
- Every v1.0.4 / v1.0.3 / v1.0.2.post1 / v1.0.2 / v1.0.1 / v1.0 surface stays byte-identical.

## [v1.0.5] — 2026-05-19

AI-readable docs via `llms.txt` support. The v1.0.4 external user-test surfaced that most evaluators of Active Graph in 2026 reach the doc site through AI coding assistants (Claude Code, Cursor, Replit) rather than browsers — and that mkdocs-rendered HTML wraps content in navigation chrome that those agents spend tokens unwrapping. The dominant convention for machine-readable docs is [llms.txt](https://llmstxt.org/) (Howard, 2024), adopted by Stripe, Vercel, Anthropic's docs, Nuxt, and many others.

v1.0.5 ships both files at the doc-site root, generated at build time from the existing `docs/` markdown source. No abstraction changes, no new runtime capability, no source-markdown changes. The release is docs + build-infrastructure only.

### The single finding

v1.0.5 #1 — /llms.txt (structured markdown index, ~96 lines) and /llms-full.txt (concatenated full content, ~110K tokens) at the docs.activegraph.ai site root, generated by the `mkdocs-llmstxt` plugin inside `mkdocs build`. Drift prevention is structural — no hand-maintained `llms.txt` lives in the repository, so both files cannot drift from the source markdown they are generated from.

### Added

- **`https://docs.activegraph.ai/llms.txt`** (v1.0.5 #1). Structured markdown index with `# Active Graph` H1, blockquote summary, and H2 sections (Quickstart, Concepts, Guides, Cookbook, Reference, Optional) listing every doc page with a one-sentence description per curated entry. Sized for AI tools that support llms.txt-aware fetching.
- **`https://docs.activegraph.ai/llms-full.txt`** (v1.0.5 #1). Concatenated markdown of every doc page in reading order, sized for large-context-window AI ingestion (~110K tokens, under the 200K target). The "everything in one file" reference for AI tools that prefer comprehensive corpora.
- **`mkdocs-llmstxt>=0.2`** added to `pyproject.toml`'s `[docs]` extra. Same maintainer as `mkdocstrings`; the workflow auto-syncs because `.github/workflows/docs.yml` installs `.[docs]`.
- **`tests/test_llms_txt.py`** — Standing Rule §2 gate anchored on the v1.0.5 #1 contract claim. Six assertions: both files exist; `llms.txt` has the H1 + blockquote + at least 4 H2 sections; `llms.txt` references the three nav-anchor pages named in the amendment (concepts/graph, quickstart, at least one cookbook page); `llms-full.txt` carries the H1 plus a distinctive marker phrase from the quickstart body. Marked `@pytest.mark.slow`; runs in the `docs.yml` workflow after `mkdocs build`.

### Changed

- **`mkdocs.yml`**: adds the `llmstxt` plugin block with `markdown_description`, `full_output: llms-full.txt`, and a `sections:` map mirroring the existing `nav:` 1:1.
- **`.github/workflows/docs.yml`**: adds the `pytest -m slow tests/test_llms_txt.py` verification step after `mkdocs build`. Build cost: ~5 seconds on top of the existing ~5-second mkdocs build.
- **`README.md`**: short note under `## Documentation` pointing AI agents at `/llms.txt` and `/llms-full.txt`. Frames the audience so human readers understand why both URLs exist alongside the rendered site rather than replacing it.

### CONTRACT amendments

- **v1.0.5 milestone added** with one numbered finding (`v1.0.5 #1`) and a "deliberately does NOT touch" section. Single-finding milestone — same scope discipline as v1.0.1 / v1.0.2 / v1.0.2.post1 (each release small, independently reviewable, and free of unrelated cleanup).

### v1.1 backlog (filed in `v1.1-plan.md`)

- **Content negotiation on the docs host.** A worker / edge function returning `text/markdown` for `Accept: text/markdown` requests, complementing the static `/llms.txt` and `/llms-full.txt` files. The static-files approach handles the index-and-bulk case; content negotiation handles the per-page case. Requires docs-host infrastructure GitHub Pages does not support natively.
- **Editorial doc-readability pass.** Front-load page summaries (so the first paragraph stands alone as the page abstract), tighten cross-references, normalize terminology across the per-error catalog. Open-ended editorial work, orthogonal to v1.0.5's mechanical file-generation scope.

### Migration from v1.0.4

Forward-compatible. No code changes required. The new files appear automatically at the doc-site root on the next deploy.

```bash
pip install --upgrade activegraph==1.0.5
```

- Every v1.0.4 surface (`Graph.relations`, `Graph.get_relations` alias, the failure-model footers on the 10 per-error pages, CONTRACT review-overlay markers) stays byte-identical.
- Every v1.0.3 surface (Graph.objects, Runtime.errors, BehaviorFailure, LLMMessage.tool_calls, WARNING log) stays byte-identical.
- Every v1.0.2 / v1.0.2.post1 surface (LLMProvider.default_model, recognizes_model, both-binding-moments validation) stays byte-identical.

### Provider non-promises in v1.0.5

Inheriting v1.0.4 / v1.0.3 / v1.0.2 / v1.0.1 #5 (c). Specifically unchanged in this release:

- `LLMProvider` Protocol stays at v1.0.2 #1's widened shape.
- The closed CONTRACT v0.6 #11 reason taxonomy is unchanged.
- The `behavior.failed` event payload, the WARNING log format, and the `BehaviorFailure` shape stay byte-identical to v1.0.3 through v1.0.4.

## [v1.0.4] — 2026-05-19

Pre-launch foundation cleanup absorbing six small findings from the post-v1.0.3 contract review (see `CONTRACT-review-findings.md` §5). No abstraction changes, no new runtime capability, no new CI gate. Three documentation corrections, one additive API method, one test addition, one CONTRACT archeology restoration.

The release also operationalizes the two Standing Rules adopted by the contract review banner: §1 (amendments append, never modify) shaped every v1.0.4 commit; §2 (tests anchor on the contract boundary, not the implementation's path) shaped the new tests for

# 1 and #4.

### The six findings

v1.0.4 #1 — graph.relations(source=, target=, type=) canonical filter API (mirrors v1.0.3 #1's graph.objects fix) v1.0.4 #2 — per-error-page footer pointing at failure-model.md's "Observing failures in caller code" (10 pages) v1.0.4 #3 — WARNING-log vs BehaviorFailure field-name divergence documented in failure-model.md v1.0.4 #4 — boundary-anchored test for \_requeue_unfired zero-subscriber carve-out (Standing Rule §2 shape) v1.0.4 #5 — review-overlay markers at v0 #11, v0 #16, v0.8 #19 for stale forward-pointer prose v1.0.4 #6 — appended ### v1.0.2.post1 section to CONTRACT under v1.0.2 #1, restoring archeology that v1.0.2.post1's in-place revision destroyed

### Added

- **`Graph.relations(source=None, target=None, type=None) -> list[Relation]` (v1.0.4 #1).** Canonical filter API on `Graph`. Three kwargs compose by AND; no-kwargs returns every relation; the source/target decomposition replaces the asymmetric `direction="outgoing"|"incoming"|"both"` axis on the alias. Eight filter combinations (each row of the table in CONTRACT v1.0.4 #1) are the contract claim; each is covered by a dedicated test in `tests/test_graph.py`.

Implementation note: the method is a direct projection over `self._relations.values()`, not a wrapper over `get_relations`. The underlying loop is six lines, and routing through the alias would obscure the per-row contract claim that the tests anchor on. The duplication trade is small; the readability win is the point.

- **`tests/test_requeue_unfired.py::test_zero_subscriber_event_ids_are_absent_from_requeue_set_on_load` (v1.0.4 #4).** Boundary-anchored sibling to the existing `queue_depth == 0` test. Asserts directly on the requeue set (`rt._queue._q`) rather than the implementation's symptom. Locks the v0.5 #8 carve-out at the contract boundary the amendment names.

### Changed

- **`Graph.get_relations(...)` (v1.0.4 #1).** Kept as a backward-compatible alias for `Graph.relations`. No deprecation warning in v1.0.4; v1.1's Theme A (Graph/View harmonization) owns the deprecation decision.

### Documentation

- **`docs/concepts/graph.md`** (v1.0.4 #1). The long-broken line-43 reference `graph.relations(source=claim_id)` now resolves to the new method. Three canonical-form examples (`source=`, `target=`, `type=`) plus one line on the `get_relations` alias.
- **`docs/concepts/failure-model.md`** (v1.0.4 #3). Adds one paragraph in the "Observing failures in caller code" section naming the intentional field-name divergence between the WARNING log (`error_type` / `error_message` — v0.8 #6 schema) and `BehaviorFailure` (`exception_type` / `message` — Python convention). Values are identical; only the names differ.
- **`docs/reference/errors/*.md`** (v1.0.4 #2). Adds the fixed one-line footer pointing at `failure-model.md#observing-failures-in-caller-code` to the 10 per-error pages whose error classes route through `behavior.failed` (identified by tracing each error class's raise sites through the runtime emission path, not by guessing from page titles). The 21 pages whose errors are raised at decoration / registration / setup / lookup time do NOT get the footer.

### CONTRACT amendments

- **v1.0.4 milestone added** with six amendments and a "deliberately does NOT touch" section.
- **`### v1.0.2.post1` subsection appended** as §(e)-equivalent under v1.0.2 #1 (v1.0.4 #6). Documents the validation-boundary correction (lazy-at-first-run → both binding moments), the `_live.py` `weakref.WeakSet` mechanism, and cites `tests/test_llm_default_model.py` Section (g) as the canonical Standing Rule §2 model.
- **Top-of-v1.0.2-#1 breadcrumb** updated in place to point at the now-existing post1 section (the single in-place edit Standing Rule §1 permits in v1.0.4, explicitly authorized by the contract review).
- **Three review-overlay markers added in place** at v0 #11, v0 #16, v0.8 #19 (v1.0.4 #5). Original prose preserved verbatim; each overlay is bracketed `[review overlay 2026-05-19: …]` so the layer boundary is explicit and greppable by date.

### v1.1 backlog (filed in `v1.1-plan.md`)

- **C-3 — Lock failure-routing for eval-time pattern failures.** Surfaced during v1.0.4 #2's audit. Most dispatch-time errors route through `behavior.failed`; `ReplayDivergenceError` and `UnsupportedPatternError` eval-time raises deliberately escape. v1.1 should either keep the asymmetry and document the carve-out criterion in CONTRACT, or route every dispatch-time error uniformly. The current state is neither documented nor locked.
- **I-4 — Cross-link `replay-divergence-error.md` to replay/fixture documentation.** The standard v1.0.4 #2 footer doesn't apply to this page because the error deliberately escapes rather than emitting `behavior.failed`. A different cross-link is needed; phrasing depends on how C-3 resolves.

### Migration from v1.0.3

Forward-compatible. No code changes required.

```bash
pip install --upgrade activegraph==1.0.4
```

- `graph.get_relations(object_id=, type=, direction=)` keeps working byte-identically; new code uses `graph.relations(source=, target=, type=)`.
- All v1.0.3 surfaces (Graph.objects, Runtime.errors, BehaviorFailure, LLMMessage.tool_calls, WARNING log) are unchanged.
- All v1.0.2 / v1.0.2.post1 surfaces (LLMProvider.default_model, recognizes_model, both-binding-moments validation) are unchanged.

### Provider non-promises in v1.0.4

Inheriting v1.0.3 / v1.0.2 / v1.0.1 #5 (c). Specifically unchanged in this release:

- `LLMProvider` Protocol stays at v1.0.2 #1's widened shape; no further additions.
- The closed CONTRACT v0.6 #11 reason taxonomy is unchanged.
- The `behavior.failed` event payload, the WARNING log format, and the `BehaviorFailure` shape stay byte-identical to v1.0.3.

## [v1.0.3] — 2026-05-19

Comprehensive response to two user-test reports. Four findings span the framework's user-facing API surface, the largest single release since v1.0.1. The framing — patch release on the adoption-surface milestone, one commit per finding for independent review — matches v1.0.1 / v1.0.2. The two prior user-test findings carried forward into this release (`output_schema=` UX and silent `behavior.failed` UX) are addressed in #2 and #3.

### The four findings

v1.0.3 #1 — graph.objects(type=...) as canonical query API v1.0.3 #2 — @llm_behavior(output_schema=) strict-validates at decoration time (dict-form filed as v1.1 candidate) v1.0.3 #3 — WARNING log + Runtime.errors property for behavior.failed v1.0.3 #4 — multi-turn tool-use messages carry full content blocks

### Added

- **`Graph.objects(type=..., where=...)` (v1.0.3 #1).** Canonical query API on `Graph`, mirroring `View.objects(type=...)` so call sites read the same inside and outside behaviors. External users who reached for the natural form previously hit `AttributeError`; the docs even showed the call. `Graph.query(object_type=...)` stays as a backward-compatible alias — no deprecation in v1.0.3.
- **`Runtime.errors -> list[BehaviorFailure]` (v1.0.3 #3).** A read-only property projecting `behavior.failed` events from the graph's event log into structured named-tuples. Five fields per failure (`behavior`, `event_id`, `reason`, `exception_type`, `message`) plus `failed_event_id` for callers that want the full payload. The events stay the source of truth; the property is a view.
- **`BehaviorFailure` (v1.0.3 #3).** New `NamedTuple` exported from the top-level `activegraph` namespace. Distinct from Python's builtin `RuntimeError` — the name was chosen so it doesn't shadow.
- **`LLMMessage.tool_calls` (v1.0.3 #4).** Additive field (`Optional[tuple[ToolCall, ...]]`, default `None`) carrying the originating `ToolCall` objects on assistant messages that triggered tool_use. The provider adapter reconstructs the wire-format content blocks from this field.
- **`doc_url` in the structured log schema (v1.0.3 #3).** The JSON log formatter now emits `doc_url` (when present). The `behavior.failed` WARNING log carries the URL pointing at the reason's class-level documentation page so operators tailing logs can click through.

### Changed

- **`@llm_behavior(output_schema=)` strict-validates at decoration time (v1.0.3 #2).** Passing anything that isn't `None` or a Pydantic `BaseModel` subclass raises `TypeError` from the decorator with a structured message that names the actual type passed and inlines a copy-pasteable code example of the correct form. Previously, a JSON-schema dict raised a `TypeError` internally and the runtime caught it as a generic exception, producing a `behavior.failed` event with reason `llm.schema_violation` and no diagnostic naming the cause. Dict-form `output_schema=` support is a v1.1 candidate.
- **`Runtime._emit_behavior_failed` emits a WARNING log line (v1.0.3 #3).** Every `behavior.failed` emission produces exactly one log line at `WARNING` level on the `activegraph.runtime` logger. The line carries `behavior`, `event_id`, `reason`, `error_type`, `error_message`, and `doc_url`. The function- and relation-behavior exception handlers now route through this centralized emitter rather than calling `_emit_lifecycle` directly, removing a duplicate `ERROR` log on the function path. Users opt out via standard Python logging configuration.
- **Multi-turn tool-use message construction (v1.0.3 #4).** When the LLM returns `tool_use` blocks, the runtime appends an `LLMMessage(role="assistant", content=raw_text or "", tool_calls=tuple(response_tool_calls))` to the message history, not just the raw text. The Anthropic provider adapter's `_message_to_anthropic` reconstructs the wire-format content blocks (text + tool_use) on the way out. Single-turn flows and zero-tool assistant messages keep their byte-identical wire serialization. Hashing-stability invariant: `LLMMessage.to_dict()` only emits the `tool_calls` key when non-None, so existing single-turn fixture prompt hashes are unchanged.

### Fixed

- **First user-test report, finding A** — `graph.objects(type="x")` `AttributeError`'d when called on a `Graph` instance. Users who'd been writing `ctx.view.objects(type="x")` inside behaviors hit the gap immediately when trying the equivalent outside a behavior. Fixed by adding `Graph.objects` as the canonical form. See "Added: `Graph.objects(...)`".
- **First user-test report, finding B** — `@llm_behavior(output_schema={"type": "object", ...})` silently produced zero results: the dict raised `TypeError` internally, the runtime emitted `behavior.failed` with `reason="llm.schema_violation"`, and the diagnostic carried no hint that the user had passed a dict instead of a Pydantic class. Fixed by failing at the `@llm_behavior(...)` line with a structured message + code example. See "Changed: `@llm_behavior(output_schema=)` strict-validates".
- **First user-test report, finding C** — `runtime.run_goal()` returned cleanly with zero results when behaviors failed, leaving users no signal short of inspecting `graph._events`. Fixed by emitting a WARNING log line and exposing `Runtime.errors`. See "Added: `Runtime.errors`" and "Changed: `_emit_behavior_failed` emits a WARNING log line".
- **Second user-test report, finding D** — multi-turn tool-use exchanges through the Vertex AI proxy returned HTTP 400 because the runtime appended only `raw_text` to the message history after a tool_use turn, dropping the tool_use blocks Anthropic's spec requires for matching subsequent tool_result blocks. Fixed by carrying `ToolCall` objects on the assistant message and reconstructing the content blocks in `_message_to_anthropic`. See "Added: `LLMMessage.tool_calls`" and "Changed: multi-turn tool-use message construction".

### Examples

No example changes in this release. The bundled Diligence pack and the BabyAGI example continue to run unchanged against the new surfaces; their tool-using behaviors exercise v1.0.3 #4's fix through the existing fixture path.

### Documentation

- **`docs/concepts/graph.md`** already showed `graph.objects(type="claim")` as the canonical form; v1.0.3 #1 makes that documentation match the implementation. No prose change required.
- **`docs/concepts/failure-model.md`** gains an "Observing failures in caller code" section documenting the WARNING log line and the `Runtime.errors` property as the two user-facing surfaces (v1.0.3 #3).
- **`docs/reference/`** picks up the new `BehaviorFailure` NamedTuple shape and the `Runtime.errors` property.

### Migration from v1.0.2.post1

Forward-compatible. No code changes required.

```bash
pip install --upgrade activegraph==1.0.3
```

- `graph.query(object_type=...)` keeps working byte-identically; new code uses `graph.objects(type=...)`.
- `@llm_behavior(output_schema=SomeBaseModel)` keeps working byte-identically. Callers passing a JSON-schema dict will see a `TypeError` at the decorator line instead of a silent `behavior.failed` at first LLM call — the error names what they passed and shows the correct form.
- `runtime.run_goal()` keeps working byte-identically; the new WARNING log line is opt-out via stdlib logging configuration. Code that inspected `graph._events` for `behavior.failed` continues to work; the new `Runtime.errors` property is the ergonomic alternative.
- Multi-turn tool-use exchanges send extra content blocks on the wire now. Direct Anthropic API access keeps working; Vertex-AI-proxy users stop hitting HTTP 400.

### Provider non-promises in v1.0.3

Inheriting v1.0.2 / v1.0.1 #5 (c). Specifically unchanged:

- `LLMProvider.complete()` / `estimate_cost()` / `count_tokens()` signatures stay locked at v0.6 #3 / v0.7. v1.0.2's additive members (`default_model`, `recognizes_model`) are unchanged.
- The closed CONTRACT v0.6 #11 reason taxonomy is unchanged.
- OpenAI tool-use translation stays a v1.1 candidate per v1.0.1 #5 (c) clause 2.
- The pack format, the exception hierarchy, and the failure model are unchanged.

## [v1.0.2.post1] — 2026-05-19

Post-release fix to v1.0.2. The CONTRACT amendment landed in v1.0.2 promised registration-time validation; an external spot-check discovered the implementation was firing the validation lazily, at first `run_goal()` / `run_until_idle()` / `run_until()` via `_ensure_registry()` rather than at registration time. The validation logic and error message were correct — the boundary was wrong.

This post-release moves the validation to **both binding moments** so cross-provider model mismatches fail fast at setup time:

1. **`Runtime(graph, llm_provider=...)` construction** — runs the bulk validation pass against whatever is already registered (global registry, explicit `behaviors=[...]`, or pack-loaded).
1. **`register()` / `@llm_behavior` decoration** — when one or more Runtimes are alive, the freshly-registered behavior is checked against each live Runtime's provider via a `weakref.WeakSet`. The WeakSet auto-cleans on GC; no `Runtime.close()` is added.

The lazy path inside `_ensure_registry()` stays in place as a defensive double-check for code paths that bypass both binding moments (currently: pack behaviors registered after Runtime construction via `load_pack`). Pack-load-time validation is filed as a v1.1 candidate if friction surfaces.

The CONTRACT v1.0.2 #1 (b) wording is clarified to match — see that section for the locked decision.

### Changed

- **Validation boundary corrected.** No public-API change; the error message is byte-identical to v1.0.2. The difference is *when* it fires: at `Runtime(...)` construction or at `@llm_behavior` / `register()` time, instead of at first `run_goal()`.
- **`Runtime.__init__` order updated.** Bulk validation runs before the Runtime self-registers in the live-set, so a Runtime whose construction fails validation stays out of the WeakSet. This prevents pytest exception-traceback strong-refs on failed-construction `self` from polluting subsequent `@llm_behavior` validation passes during a test session. In production it's a no-op (failed-construction Runtimes go out of scope and are GC'd promptly anyway), but the invariant — "only successfully-constructed Runtimes participate in validation" — is worth keeping clean.
- **`tests/conftest.py`** clears the live-Runtime WeakSet in the autouse `_isolate_registry` fixture for the same pytest-pollution reason. The WeakSet auto-cleans on GC in production; only test isolation needs the explicit clear.

### Added

- **`activegraph/runtime/_live.py`** — new module owning the live- Runtime WeakSet, the `track_runtime()` hook, and the single-behavior cross-provider validator that both binding moments invoke. The validator is factored out of v1.0.2's `_resolve_and_validate_llm_models` so the check itself lives in one place; the bulk function and the new decorator-path call site both delegate.

### Fixed

- **External spot-check finding** — `Runtime(graph, llm_provider=...)` no longer returns successfully when the registry contains a cross-provider model mismatch. The `@llm_behavior` decorator no longer adds a conflicting behavior to the registry when a Runtime with an incompatible provider is already alive. Same diagnostic message; earlier boundary.

### Migration from v1.0.2

No code changes required. The error fires at an earlier point in the program's execution — what used to surface at `rt.run_goal(...)` now surfaces at `Runtime(...)` or at `@llm_behavior` decoration. Code that was already catching `InvalidRuntimeConfiguration` around `run_goal` should move the `try`/`except` to the relevant binding moment, or wrap the whole setup block.

## [v1.0.2] — 2026-05-19

Patch release addressing the most urgent of three findings from the second-round external user-test. The framing — patch release based on user-test findings — matches v1.0.1's; the scope is narrower (one finding, not four, plus no provider-expansion work). The other two findings need design consideration and are tracked for v1.0.3 / v1.1, not folded into this release.

### The finding (v1.0.1 #5 credibility hit)

v1.0.1 #5 shipped `OpenAIProvider` and locked the provider- commitment contract: same Protocol surface, swap one for the other without reshaping any `@llm_behavior`. The second-round user-test exercised exactly that swap and surfaced a silent default-model mismatch: `@llm_behavior(...)` without an explicit `model=` inherited the decorator's hardcoded default `"claude-sonnet-4-5"` — an Anthropic-family name. With `Runtime(graph, llm_provider=OpenAIProvider())`, that name went verbatim to OpenAI's `chat.completions.create` and produced an HTTP 404 with no hint that the cross-provider mismatch was the cause. The `behavior.failed` event carried the provider's verbatim 404 prose; diagnosis required inspecting the decorator, tracing the default, and recognizing the model-family conflict.

This directly undermined v1.0.1's provider-agnostic claim. v1.0.2 makes the default provider-aware and validates explicit model names at registration time.

### Added

- **`LLMProvider.default_model` attribute (additive Protocol widening, CONTRACT v1.0.2 #1 (a)).** Each shipped provider declares a default model:

| Provider            | `default_model`       |
| ------------------- | --------------------- |
| `AnthropicProvider` | `"claude-sonnet-4-5"` |
| `OpenAIProvider`    | `"gpt-4o-mini"`       |

`@llm_behavior(...)` with no `model=` argument now resolves to the configured provider's `default_model` at registration time, via `Runtime(graph, llm_provider=...)._ensure_registry()`. The resolved name is stamped onto the `LLMBehavior` instance so `behavior.build_prompt(...)` sees the concrete model in its hash inputs.

- **`LLMProvider.recognizes_model(name) -> bool` method (additive, CONTRACT v1.0.2 #1 (b)).** Returns True when `name` belongs to a model family the provider serves. Shipped providers: `AnthropicProvider` recognizes `claude-*`; `OpenAIProvider` recognizes `gpt-*`, `o1-*`, `o3-*`, `o4-*`.

### Changed

- **`@llm_behavior(model=...)` default changed from `"claude-sonnet-4-5"` to `None`.** Existing call sites passing `model="..."` explicitly stay byte-identical. Call sites that omitted `model=` previously inherited `"claude-sonnet-4-5"` via the decorator default; they now inherit the same string when the configured provider is `AnthropicProvider` (whose `default_model` is `"claude-sonnet-4-5"`), and the *provider- appropriate* default (`"gpt-4o-mini"`) when the configured provider is `OpenAIProvider`. Custom providers that don't declare a `default_model` retain the v1.0.1 hardcoded fallback for backward compat. CONTRACT v1.0.2 #1 (c).
- **`LLMBehavior.model` field type changed from `str` to `Optional[str]`.** v1.0.1 instances pickle/load cleanly (string values still load); freshly-decorated v1.0.2 behaviors carry `None` until a Runtime resolves a provider default. CONTRACT v1.0.2 #1 (c).
- **Registration-time cross-provider validation.** When a behavior pins `model=` explicitly and the configured provider doesn't recognize the name, the runtime checks each shipped provider's `recognizes_model()` and raises `InvalidRuntimeConfiguration` if a *different* shipped provider claims the name. The error is structured per the v1.0 format — `what_failed` / `why` / `how_to_fix` — and names both providers plus the way out (swap the provider, or use the configured provider's default). Permissive by default: unknown names (custom deployments, fine-tunes like `ft:gpt-4o-mini:org::id`) pass through silently. CONTRACT v1.0.2 #1 (b).
- **`docs/reference/llm-providers.md`** gains a "Default model resolution" section, a "Cross-provider model-name validation" section, and updated rows for `default_model` and recognized prefixes in the side-by-side table. The "Writing a custom provider" example now shows the optional `default_model` + `recognizes_model` members and notes they are additive.

### Fixed

- **Second external user-test, finding 1** — `@llm_behavior` with no `model=` argument silently used an Anthropic-family default, producing HTTP 404 at first LLM call when the configured provider was `OpenAIProvider`. The diagnostic message on `behavior.failed` did not name the cause. See "Added: `LLMProvider.default_model`" and "Changed: registration-time cross-provider validation" above.

### Examples

- **`examples/babyagi.py`** simplified to drop its per-provider model table. The `@llm_behavior` definitions omit `model=` and let `Runtime`'s provider resolution pick the right default. Switching `--provider` between `anthropic` and `openai` is a one-line change in the example; nothing else needs to change.

### Provider non-promises in v1.0.2

Inheriting the v1.0.1 #5 (c) clauses. Specifically *unchanged* in v1.0.2:

- `LLMProvider.complete()` / `estimate_cost()` / `count_tokens()` signatures stay locked at v0.6 #3 / v0.7. v1.0.2 widens the Protocol additively with two members; it does not reshape the three core methods.
- The closed CONTRACT v0.6 #11 reason taxonomy is unchanged. The cross-provider mismatch raises a `ConfigurationError` subclass at registration time, not a new behavior-failure reason code.
- The v1.0.1 #2 prompt-assembly shape is unchanged. Same schema
- example instance + "instance not schema" language.

### Migration from v1.0.1

Additive. Forward-compatible:

```bash
pip install --upgrade activegraph==1.0.2
```

- `@llm_behavior(model="...")` call sites that pinned an explicit string keep working byte-identically.
- `@llm_behavior(...)` call sites that omitted `model=` now inherit the configured provider's `default_model`. For `AnthropicProvider`, that's the same `"claude-sonnet-4-5"` the v1.0.1 decorator default produced. For `OpenAIProvider`, the default changes from the v1.0.1 silent-Anthropic-name to `"gpt-4o-mini"`, fixing the v1.0.2 finding.
- Custom providers that don't declare `default_model` continue to use the v1.0.1 hardcoded fallback. Custom providers that want the v1.0.2 default-resolution behavior add a `default_model: str = "..."` class attribute (and optionally a `recognizes_model()` method to participate in cross-provider validation).
- `LLMProvider` Protocol gains two additive members. Existing custom-provider classes that don't implement them still pass `isinstance(p, LLMProvider)` checks at the three core methods.

## [v1.0.1] — 2026-05-19

The first-external-user-test patch plus the OpenAI provider expansion. v1.0 final shipped on 2026-05-18; the first developer outside the maintainer's loop ran the install / quickstart / tutorial path on the day-of, and three small UX findings surfaced before v1.0.1 publish. All three fit the "X is confusing" shape on HANDOFF.md's user-test heuristic (none "X doesn't compose with Y the way I expected" — architectural shape held).

v1.0.1 also closes an implicit adoption-surface gap the user-test didn't surface but readers feel: the framework shipped a single concrete `LLMProvider` (`AnthropicProvider`), making the provider-agnostic claim read as theoretical. v1.0.1 #5 ships `OpenAIProvider` with surface parity and locks in the provider-commitment contract.

No CONTRACT amendments to v1.0's own decisions, no public-API renames, no new runtime capability. CONTRACT v1.0.1 records the four user-test fixes plus the provider-expansion decision; this entry is the shipping changelog.

### Added

- **`activegraph.register(behavior_obj)`** — public function for appending an already-constructed behavior to the global registry. Pairs with `clear_registry()` for multi-run scripts that capture the registry once and re-register per run, replacing the v1.0 pattern of reaching into the private `activegraph.behaviors.decorators._REGISTRY` list. Validates the argument is a `Behavior` / `RelationBehavior` / `LLMBehavior` instance and raises `TypeError` otherwise. CONTRACT v1.0.1 #1.
- **`docs/cookbook/multi-run-scripts.md`** — new cookbook recipe covering the capture-once-re-register-per-run pattern, when to use it (hypothesis sweeps, A/B comparisons inside one process, batch jobs that want per-input graph isolation without per-input process startup), and when not (single-runtime scripts don't need any of this). Wired into the mkdocs nav under the Cookbook section. CONTRACT v1.0.1 #1.
- **`activegraph.llm.prompt.example_instance_from_schema`** — new helper that walks a JSON Schema and produces a deterministic placeholder instance. Used by `build_system_prompt` to render an example alongside the schema in the LLM system prompt; exported for tests and for prompt-debugging tools. CONTRACT v1.0.1 #2.

### Changed

- **`@llm_behavior(output_schema=...)` system prompt now embeds an example instance and explicit "instance, not schema" language.** The first external user-test surfaced a failure mode v1.0's prompt-assembly didn't anticipate: some models echo the JSON Schema definition back as their response instead of an instance that conforms to it. The framework refused with `llm.schema_violation`, the user had to reverse-engineer the cause from the raw response. v1.0.1 changes the system-prompt schema block to three parts — the schema (unchanged), a synthesized example instance, and explicit "Return an INSTANCE that conforms to this schema, NOT the schema itself" language. `build_instruction` (the user-message task sentence) also gains "NOT the schema definition itself" so the framing appears in two places. The example generator handles `type`, `properties`, `items`, `enum`, `const`, `anyOf`/`oneOf` (picks the non-null variant), and `$ref` to `$defs`/`definitions`; unrecognized shapes fall back to `null`. The synthesized example is deterministic across runs so the prompt-hash cache key stays stable. CONTRACT v1.0.1 #2. See the expanded [`llm-behavior-error`](https://docs.activegraph.ai/errors/llm-behavior-error/) reference page for the failure mode and the `prompt_template=` override pattern when the auto-derived example isn't useful.
- **`SQLiteEventStore()` constructor error points at the higher- level `Runtime(graph, persist_to=...)` API.** v1.0 raised a bare Python `TypeError: missing 1 required positional argument: 'run_id'`; the user-test reader had to first look up "what is a run_id" before they could decide how to recover. v1.0.1 hand-raises a TypeError with a structured hint:

```text
SQLiteEventStore requires a run_id. For most cases, use
Runtime(graph, persist_to='path/to/trace.sqlite') instead,
which handles run_id automatically. If you need a per-run
handle (migration, conformance test, trace inspection), pass
both explicitly: SQLiteEventStore('path/to/trace.sqlite',
run_id='run_...').
```

The signature change (`Optional[str] = None` for both args) is internal — every existing caller passes both args positionally or by keyword. CONTRACT v1.0.1 #3.

- **`clear_registry()` returns the cleared list.** v1.0 returned `None`; v1.0.1 returns `list[Behavior | RelationBehavior]` in registration order. Callers that ignored the return value still work unchanged. The shape pairs with the new `register()` for the multi-run pattern. CONTRACT v1.0.1 #1.
- **`@llm_behavior` decorator docstring names what each `prompt_template=` placeholder contains.** v1.0 documented the four placeholders by name (`{system}`, `{view}`, `{event}`, `{instruction}`) but didn't say what each one rendered to. The v1.0.1 doc-site entry for the schema-echo failure mode points readers at `prompt_template=` as a fallback for schemas the auto-example can't render usefully, so the decorator docstring grows a four-bullet list naming the content of each placeholder. Concrete enough to compose a custom template without first opening `activegraph/llm/prompt.py`. CONTRACT v1.0.1 #4.

### Fixed

- **First external user-test, finding 1** — multi-run scripts had no public-API path to re-populate the registry after `clear_registry()`. See "Added: `activegraph.register`" and "Changed: `clear_registry()` returns the cleared list" above.
- **First external user-test, finding 2** — models occasionally returned the JSON Schema definition as their response instead of an instance, triggering `llm.schema_violation`. See "Changed: `@llm_behavior(output_schema=...)` system prompt now embeds an example instance" above.
- **First external user-test, finding 3** — `SQLiteEventStore()` with missing args produced a bare Python `TypeError` instead of hinting at the higher-level `persist_to=` API. See "Changed: `SQLiteEventStore()` constructor error" above.

### Provider expansion

- **`activegraph.llm.OpenAIProvider`** — second concrete `LLMProvider` with surface parity to `AnthropicProvider`. Same three Protocol methods (`complete`, `estimate_cost`, `count_tokens`), same lazy-SDK + env-var loading shape, same family-prefix pricing table, same structured-output path through the framework's instruction-based prompt assembly. A runtime swapping `AnthropicProvider()` for `OpenAIProvider()` doesn't reshape any `@llm_behavior` definition. CONTRACT v1.0.1 #5.
- **`activegraph.llm.parsing.parse_structured_response`** — JSON-extraction-then-Pydantic-validate helper extracted from `AnthropicProvider`. Both shipped providers (and any future provider that uses the framework's instruction-based path) import it directly, producing byte-identical `llm.parse_error` and `llm.schema_violation` reason codes for byte-identical responses. The extraction preserved Anthropic's behavior exactly; all 9 existing `test_llm_anthropic.py` tests pass unchanged. CONTRACT v1.0.1 #5.
- **`pyproject.toml` extras follow a three-pattern shape.** `[llm]` pulls every shipped provider's SDK (`anthropic>=0.40`, `openai>=1.0`, `tiktoken>=0.7`). `[anthropic]` and `[openai]` aliases install one provider at a time for cost-conscious production deployments. `[all]` rolls up everything from `[llm]` plus persistence and metrics extras. CONTRACT v1.0.1 #5 (b).
- **`docs/reference/llm-providers.md`** — new reference page documenting both providers side-by-side: install commands, API key env vars, the symmetric Protocol surface, the asymmetric details (`count_tokens` server-side vs client-side, tool-use support gap, native structured-output mode deferral), and a "writing a custom provider" section pointing at `parse_structured_response` for error-semantics parity. Wired into the mkdocs nav under Reference. CONTRACT v1.0.1 #5.

### Provider non-promises in v1.0.1 (per CONTRACT v1.0.1 #5 (c))

Documented as contract clauses rather than discovered as user friction later — same discipline as v1.0's honesty section.

- **Token counting is provider-dependent.** Anthropic uses `messages.count_tokens` server-side; OpenAI uses `tiktoken` client-side when available and a `chars / 4` heuristic when not, with a one-time debug log on first heuristic call. Operators gating on `budget.max_cost_usd` should install `tiktoken` (via `[openai]` or `[llm]`) for accurate accounting.
- **Tool use is Anthropic-only in v1.0.1.** `OpenAIProvider` accepts the `tools=` kwarg for Protocol compatibility but raises `LLMBehaviorError(reason="llm.network_error")` with a v1.1 pointer when the list is non-empty. Tool-shape translation in `Tool.to_definition()` is filed under v1.1 #7-and-beyond.
- **Native structured-output modes are v1.1 candidates.** Both providers use the instruction-based path that v1.0.1 #2's example-instance work feeds. OpenAI's `response_format={"type":"json_schema",...}` and analogous future modes stay v1.1 — they diverge providers' latency profiles, cache-key semantics, and error paths in ways that warrant their own decision.
- **No new reason codes.** The closed CONTRACT v0.6 #11 taxonomy is unchanged. OpenAI auth failures land in `llm.network_error` with the exception message preserved verbatim, same as Anthropic for the same failure mode.

### Examples

- **`examples/babyagi.py`** with companion `examples/babyagi/README.md` — BabyAGI's autonomous agent loop (Nakajima 2023) rebuilt as three reactive behaviors over a shared graph. The minimal-loop counterpart to the Diligence pack's domain-rich example: same conceptual lineage as the framework's launch essays, runnable end-to-end against either provider, traces to `traces/babyagi-<timestamp>.sqlite`. The v1.0.1 public `register()` API replaces v1.0's `_REGISTRY` workaround. A `--provider {anthropic,openai}` CLI flag exercises the new symmetric surface — same example, same loop, swap the provider with one argument.

### Migration from v1.0

Additive. The changes are forward-compatible:

```bash
pip install --upgrade activegraph==1.0.1
```

- `clear_registry()` now returns a list; v1.0 callers that ignored the return continue to work unchanged.
- `register()` is new; nothing existing calls it.
- `SQLiteEventStore("/path", run_id="r")` (the v1.0 supported shape) keeps working; the new error fires only on missing-arg call sites, which are by construction unmigrated.
- LLM prompt-hash values change because the system prompt got longer. No on-disk LLM fixtures exist in the framework's tests, so no replay-divergence risk for in-tree code; user code that saved fixtures from a v1.0 live run will see a fresh `llm.fixture_missing` against v1.0.1 prompts and needs a re-record pass. Same shape as any v0.6+ prompt-assembly change; see [`llm-behavior-error`](https://docs.activegraph.ai/errors/llm-behavior-error/)'s re-record recipe.
- `OpenAIProvider` is new; install via `pip install "activegraph[openai]"` or `pip install "activegraph[llm]"`. Existing `[llm]` extra users pick up `openai` and `tiktoken` automatically on upgrade alongside the existing `anthropic` SDK.

## [v1.0] — 2026-05-18

v1.0 final. The lighter-weight verification pass against v1.0-rc3 ran the same seven-check shape as the rc2 lighter pass and produced six clean passes plus one partial finding on Check 6 (the tutorial's step 7 fork-and-diff snippet undersold its own output). The B2 fix's core promise — fork-and-diff runs without an API key against bundled fixtures — held intact. Scope = v1.0-rc3 + the Check 6 tutorial fix + a README "Concepts at a glance" section bridging the README and the doc site for evaluators.

No runtime capability changes; no public-API renames; no CONTRACT amendment.

### Changed

- **Tutorial step 7 fork-and-diff snippet** emits its own next-step guidance instead of a bare `forked: <run-id>` line. The shipped rc3 snippet ran cleanly end-to-end but its terminal output was one anticlimactic line, leaving a first-time reader to scroll past it and notice the "Then diff:" CLI block on their own. The snippet now prints the exact `activegraph diff` command for the fork it just created (parameterized from the same constants defined at the top of the snippet), and the transition prose before the CLI block names it explicitly. The diff itself produces 61 divergent objects and 49 divergent relations — a substantive output that the rc3 snippet was hiding behind a prose-only handoff. The voice test: a first-time reader running `python fork_and_diff.py` cold now sees both the fork creation and the exact next command, with no ambiguity about whether they need to do something else.
- **README adds a "Concepts at a glance" section** between "What you get" and "A small example." Twelve primitives — graph, events, behaviors, relations, patches, views, frames, policies, patterns, replay, forking, failure model — each with a one-line "what + why" and a link to the concept page. The section is evaluator-facing: it lets a reader scan the framework's conceptual primitives from the GitHub repo page without first clicking through to the doc site. Mirrors the `docs/concepts/*.md` navigation 1:1; complements but does not duplicate "What you get" (which is feature-oriented; the new section is primitive-oriented).
- **Deploy-verification workflow gains a `pull_request` trigger.** Discovered at v1.0 final merge time: the gate (CONTRACT v1.1 #9) ran on push-to-main and cron but not on PR events, so the required-status-check rule on branch protection had nothing to match against on the PR. A `workflow_dispatch` run reported under a different context name and didn't satisfy the rule. Adding `pull_request:` to the workflow's `on:` triggers lets the check report alongside the other CI gates on every PR; the existing push and cron triggers continue to run unchanged. Config-only change; the gate's logic is untouched.

### Fixed

- **Check 6 user-test finding** (rc3 lighter pass). See the Tutorial step 7 entry above. The runtime artifact didn't change; the tutorial prose and the snippet's terminal output changed.

### Migration from v1.0-rc3

Additive. No code changes required. Existing v1.0-rc3 installs should:

```bash
pip install --upgrade activegraph==1.0.0
```

## [v1.0-rc3 amendment — docs-build fix] — 2026-05-18

Post-rc3-merge, pre-rc3-tag follow-on. The v1.1 #9 deploy- verification gate's first run on `main` after the rc3 merge caught a silent failure that had been dwelling since the doc-site phase: `mkdocs.yml` declared the `mkdocstrings` plugin for API-reference auto-generation (added in commit `b533dd4` during the doc-site phase), but `.github/workflows/docs.yml`'s hardcoded install step only pulled `mkdocs` + `mkdocs-material`. Every docs build through every doc-site PR failed with `Config value 'plugins': The "mkdocstrings" plugin is not installed`, the deploy job never had an artifact to upload, and the `has_pages: false` finding from the rc3 #2 investigation was an effect of this — not just the externally-owned Pages-enable step. No version bump; this is a follow-on to v1.0-rc3, not a new rc.

### The gate did its job

CONTRACT v1.1 #9 (deploy-verification) was designed to catch the class "internal CI ships green, the external artifact is broken, nobody notices for months." On its first real run, it caught the build failure that v1.0 had been carrying silently since the doc- site phase. The gate's red signal on `main` post-rc3-merge was correctly the discipline call to action, not noise.

### Fixed

- **`mkdocstrings[python]` now installed by the docs workflow.** Root cause: hardcoded `pip install mkdocs mkdocs-material` in `.github/workflows/docs.yml` drifted from `mkdocs.yml`'s `plugins:` block. Audit-then-fix discipline (same shape as rc3's wheel-completeness audit): enumerated every plugin and every markdown extension in `mkdocs.yml`, cross-checked against the workflow install. Single gap: `mkdocstrings` (+ its `[python]` handler). The `pymdownx.*` extensions are covered transitively by `mkdocs-material` (verified via fresh-venv install).

### Changed

- **`pyproject.toml` gains a `docs` optional-dependency extra.** Lists `mkdocs`, `mkdocs-material`, `mkdocstrings[python]`. The docs workflow now installs from this extra (`pip install -e ".[docs]"`) instead of hardcoding the dep list. Adding a new mkdocs plugin in the future updates one place — the same pattern every other workflow already follows (`types.yml`, `docstrings.yml`, `wheel-completeness.yml`, and `deploy-verification.yml` all install from pyproject).

### Audit-decision: no CONTRACT v1.1 #10

Considered: a generalized "declared-but-not-installed" audit gate in the same shape as v1.1 #8 and v1.1 #9. Cross-checked all six workflows' install steps. `docs.yml` was the **only** workflow whose install step hardcoded deps that could drift from a separate config file (`mkdocs.yml`). Every other workflow installs from pyproject extras (auto-synced) or installs only its own tool, so the failure mode "config declares X, install doesn't include X" is structurally impossible for them. Joining `docs.yml` to that pattern institutes the prevention; an audit gate would be mechanism without a target. Filed as a one-off, not a class.

### Externally owned (unchanged from rc3)

The two operational steps named in the rc3 entry above still gate the v1.1 #9 deploy-verification check from passing: enable GitHub Pages on the repo, configure DNS for `docs.activegraph.ai`. With this build-fix landed, the docs workflow can now produce a deployable artifact for the first time since the doc-site phase shipped — the artifact is the precondition that the externally- owned Pages-enable step then publishes.

## [v1.0-rc3] — 2026-05-18

The lighter-user-test-findings milestone. Two findings from the [CONTRACT v1.0 #C4](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md) lighter pass against rc2 addressed; both surfaced discipline gaps that ship with new v1.1 CI gates institutionalizing the verification layer that caught them. No runtime capability changes; no public-API renames.

### Added

- **Wheel-completeness CI gate** (`.github/workflows/wheel-completeness.yml` + `tests/test_wheel_completeness.py`). CONTRACT v1.1 #8. Builds the wheel via `python -m build`, installs it into a fresh venv (NOT editable), runs `activegraph quickstart` against the installed wheel, fails if any runtime data file is missing. Catches the class of bug that's structurally invisible to source-tree tests and editable installs. Marked `slow`; CI invokes via `pytest -m slow tests/test_wheel_completeness.py`. To be configured as a required status check on main per CONTRACT v1.1 #8 implementation scope.
- **Deploy-verification CI gate** (`.github/workflows/deploy-verification.yml` + `tests/test_doc_site_reachable.py`). CONTRACT v1.1 #9. Fetches `DOCS_BASE_URL` + 4 known-good page paths, asserts HTTP 200 and that the response body contains `Active Graph` (the mkdocs `site_name`). Failure-mode design distinguishes DNS failure, HTTP 404, and content mismatch — each fails with a message that names the operational step to fix it. The HTTP-reachability complement to `tests/test_doc_links.py`, which is source-tree- scoped only. Runs on push to main + daily cron (catches drift if the site goes down without a code change). Required for merge once GitHub Pages is enabled.

### Changed

- **B3 fix: `prompts/*.md` ship in the wheel.** The v1.0-rc2 user- test gate surfaced that `pip install activegraph==1.0.0rc2` followed by `activegraph quickstart` crashed with `PackPromptLoadError: prompts directory does not exist`. Root cause: `pyproject.toml` declared no `[tool.setuptools.package-data]` block, so setuptools' default behavior (ship only `.py` files) omitted the 4 `.md` prompt files. Audit confirmed exactly 4 non-`.py` files in `activegraph/`, all in `packs/diligence/prompts/`. Fix is a single key/glob; rc3 #1 commit. The wheel-completeness gate above is the enforcement layer that prevents this class from recurring.
- **Domain cutover: `docs.activegraph.dev` → `docs.activegraph.ai`.** CONTRACT v1.0 #C6 amended (rc3 amendment block in CONTRACT.md). Primary domain switched to the already-owned `docs.activegraph.ai`. `.dev` becomes a redirect-source the maintainer registers and configures separately (externally owned). The codebase holds exactly one primary; the constant `DOCS_BASE_URL` is the single swap point. Affected files: `activegraph/errors.py` (constant), `docs/CNAME`, `mkdocs.yml` (`site_url`), `README.md` (5 link refs), `CHANGELOG.md` (11 link refs), `HANDOFF.md`, `docs/about/publishing.md`, `examples/quickstart_session.txt`, `.github/workflows/docs.yml` (comments). Error snapshots rebaselined via `UPDATE_SNAPSHOTS=1` — 58 files. The cutover pattern worked as designed: one constant change propagates to every URL through `f"{DOCS_BASE_URL}/..."` interpolation. `tests/test_doc_links.py` continues to recognize the .dev URL form so historical CHANGELOG entries linking to .dev still pass the source-presence check.

### Externally owned (B4 findings; the gate is shipped, the

operational steps are yours):

- **GitHub Pages must be enabled on the repo.** Per rc3 step-4 investigation: `has_pages: false` is the smoking-gun finding; the doc site at any URL 404s because there's nothing to serve. Fix: Settings → Pages → Source: GitHub Actions. Precondition: the repo must be public (free plan) or on GitHub Pro/Team /Enterprise (paid). Until this lands, the v1.1 #9 deploy- verification gate is red on every CI run; that's the correct signal.
- **DNS for `docs.activegraph.ai` must be configured.** CNAME record pointing at `yoheinakajima.github.io` (the GitHub Pages default subdomain). The v1.1 #9 gate's failure message names this when DNS is the missing piece.
- **`docs.activegraph.dev` redirect.** Register the .dev domain and configure it to 301/302-redirect to `docs.activegraph.ai`. Not strictly required for the v1.1 #9 gate to pass (gate only checks .ai), but needed for historical CHANGELOG entries' .dev links to keep resolving for users.

### Boundary shift (CONTRACT v1.1 framing):

The two new gates each move the user-test boundary outward by one layer.

- **v1.1 #8 (wheel-completeness):** after this gate lands, the lighter user-test verifies the PyPI artifact (CDN, upload, distribution) — not the wheel itself.
- **v1.1 #9 (deploy-verification):** after this gate lands, the lighter user-test verifies the published-domain experience (does the README link land on a page that reads well? does navigation feel right?) — not the basic reachability question of "does this URL return 200."

Same rc1-vs-rc2 discipline pattern noted in the CONTRACT entries: each rc surfaces a finding that's structurally invisible to the prior layer's CI; each rc institutionalizes the verification layer that caught it.

## [v1.0-rc2] — 2026-05-18

The user-test-findings milestone. Five findings from the [CONTRACT v1.0 #C4](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md) gate addressed; one was a latent runtime state-machine bug since v0.5. No new runtime capability; no public-API renames.

### Added

- **PyPI publish workflow** (`.github/workflows/publish.yml`). Tag-push trigger matching `v*` triggers `python -m build` then upload via PyPI trusted publishing (OIDC-based). Documented in [Publishing a release](https://docs.activegraph.ai/about/publishing/). Externally owned per [CONTRACT v1.0 #C8](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md) — the agent ships the workflow, the maintainer runs the publish.
- **Tutorial-snippet CI test** (`tests/test_tutorial_snippets.py`). Subprocess-runs the tutorial's step 7 fork snippet end-to-end against the bundled fixtures; asserts exit 0 and idempotency on re-run. Tactical down-payment on [CONTRACT v1.1 #2 expansion](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md) (spec-vs-impl drift gate for Python doc snippets).
- **`_requeue_unfired` regression test** (`tests/test_requeue_unfired.py`). Locks the C3 regression vector: `Runtime.load` on a cleanly-drained saved run produces `queue_depth == 0`.

### Changed

- **`_requeue_unfired` uses `runtime.idle` as the high-water mark.** Latent bug since CONTRACT v0.5 #8: the function relied on the false reverse-implication "no `behavior.started` references this event id ⟹ event was still in the queue." Events with **zero** subscribed behaviors are popped-and-discarded with no `behavior.started` emitted, so they were falsely requeued on every `Runtime.load`. The fix uses the last `runtime.idle` event as the high-water mark (the runtime emits `runtime.idle` only after the queue empties); only events after the last idle are candidates for requeue. `runtime.budget_exhausted` is explicitly NOT a drain marker — using it would break budget-bounded pause-and-resume.
- **Tutorial step 3 and quickstart prose** distinguish the provider layer (where the fixture provider produces responses) from the runtime's replay cache layer (where `cache_hit=true` legitimately appears under strict-replay loads or `Runtime.fork()` in-process). Pre-rc2 prose conflated the two. The conflation was originally in the v1.0 spec at `examples/quickstart_session.txt`; the spec is updated with a header drift-note documenting the two-layer reality.
- **Tutorial step 7 fork snippet** uses `RecordedDiligenceProvider(companies=THREE_COMPANIES)` as the fork's `llm_provider=`. Matches the parent run's provider; preserves the "no API key required" tutorial pitch. The snippet also includes a tutorial-only cleanup-on-collision branch so it's re-runnable without manual DB surgery.
- **`_prepare_interactive_subdir` collision prompt** re-prompts on unrecognized input. Pre-rc2 behavior fell through to the suffix branch on any input that wasn't `o` or `q`, which swallowed typeahead from the next prompt. Mirrors the existing iteration-loop pattern at `run_interactive_mode`.

### Deprecated

Nothing. Backward compatibility holds — all v0–v1.0-rc1 tests pass.

### Removed

Nothing user-facing.

### Fixed

- **`Runtime.load(...).status().queue_depth` reads 0 on a freshly loaded cleanly-drained run.** Was a non-zero false count of events that had been popped-and-discarded during the original run. See the Changed entry for `_requeue_unfired` above.
- **`activegraph --version` reports the correct version.** Was stuck at `0.9.1` through v1.0-rc1's release (the version-sync gate validated internal consistency but not correspondence with the git tag). The v1.1 #6 version-tag-correspondence gate closes this gap in v1.1.
- **Tutorial step 7 fork snippet runs end-to-end against bundled fixtures** with no API key, matching the rest of the quickstart's "no API key required" pitch.
- **`cache_hit=true` claim** in the tutorial and quickstart "what just happened" prose: the claim was wrong for initial fixture-mode runs (the runtime's replay cache only fires on strict-replay or in-process fork). Prose corrected; two-layer vocabulary lands cleanly for first-time readers.

### Migration from v1.0-rc1

Additive. No code changes required. Existing v1.0-rc1 installs should:

```bash
pip install --upgrade activegraph==1.0.0rc2
```

Existing saved runs (`*.db` files) load with `queue_depth == 0` correctly post-upgrade — the C3 fix is in the load path, not the storage format.

### Known follow-ons (v1.1 scope)

In addition to the v1.0-rc1 v1.1 backlog (carried forward):

- **CONTRACT v1.1 #2 expansion** — spec-vs-impl drift gate covers executable Python snippets in docs, not just CLI flags. v1.0-rc2 ships the tactical step 7 test; v1.1 generalizes.
- **CONTRACT v1.1 #5** — `Runtime.load` auto-provider ergonomics. The rc2 fix for B2 passes `RecordedDiligenceProvider` explicitly; the v1.1 design question is whether `Runtime.load` should infer a provider from the run's recorded events or from a pack-manifest declaration. New runtime capability, banned in v1.0.
- **CONTRACT v1.1 #6** — version-tag-correspondence CI gate. Existing version-sync gate validates `__version__` matches `pyproject.toml`; the v1.1 gate adds correspondence with the current annotated tag in tagged-release CI runs.
- **CONTRACT v1.1 #7 backlog item: fork cache pre-population symmetry.** `Runtime.fork(at_event=...)` in-process pre-populates the LLM cache from the parent's events; the persistent shape (`SQLiteEventStore.fork_run` then `Runtime.load`) does not. The two paths should be symmetric. New runtime behavior, banned in v1.0.

## [v1.0-rc1] — 2026-05-18

The adoption-surface milestone. No new runtime capability; the contract is "a new user can install, run, understand, debug, and extend the framework without reading source code."

### Added

- **Error hierarchy rewrite.** Every exception now inherits from `ActiveGraphError` and carries structured `what_failed`, `how_to_fix`, and `context` fields. Seven category bases (`ConfigurationError`, `RegistrationError`, `ExecutionError`, `ReplayError`, `StorageError`, `PatternError`, `PackError`) with 33 leaves. Built-in lineage preserved via multi-inheritance — `except ValueError`/`except KeyError` clauses still work.
- **Per-error reference catalog.** Every error message ends with a `More:` link to a dedicated page documenting when it fires, why, how to diagnose, and how to fix. Catalog at [docs.activegraph.ai/reference/errors](https://docs.activegraph.ai/reference/errors/replay-divergence-error/).
- **Documentation site at [docs.activegraph.ai](https://docs.activegraph.ai/)**: concepts pages for every primitive (graph, events, behaviors, relations, patches, views, frames, policies, patterns, replay, forking, failure model); guides; cookbook (common patterns, debugging, migration); CLI reference; API reference via mkdocstrings.
- **`activegraph quickstart` CLI command.** Bundled Diligence demo in fixture mode (byte-deterministic, no API key, ~20 seconds); `--interactive` mode walks the user through writing their first behavior.
- **10-minute tutorial at [docs.activegraph.ai/quickstart](https://docs.activegraph.ai/quickstart/).** Install → run → write a behavior → save and inspect → fork and diff. Seven steps; every example runs.
- **CI gates on the public surface.** Version-sync gate (`pyproject.toml` ↔ `activegraph.__version__`), broken-link gate for the doc site, mypy `--strict` gate on the [`__all__` allowlist](https://github.com/yoheinakajima/activegraph/blob/main/docs/reference/api/TYPE_REPORT.md) (22/38 modules clean at baseline), docstring coverage gate ([Ring 0 92/100 not-missing, Ring 1 at 84.7%](https://github.com/yoheinakajima/activegraph/blob/main/docs/reference/api/COVERAGE_REPORT.md); exemption list in [`docstring_gaps.toml`](https://github.com/yoheinakajima/activegraph/blob/main/docstring_gaps.toml)).
- **CLI follow-on flags** (referenced from error messages' recovery prose): `inspect --event <id>`, `inspect --behaviors`, `inspect --pack-version`, `migrate --skip-corrupted`, `fork --record`.

### Changed

- **README trimmed** from 1275 lines to ~190. The doc site is now the canonical reference; the README is the conversion funnel (30-second pitch → install → `activegraph quickstart` → tutorial).
- **Error messages structured.** Every framework-raised exception exposes `what_failed` (one line), `how_to_fix` (actionable prose), and `context` (structured detail) on the exception instance. Plain `str(exc)` renders all three.
- **Trace printer formats `pack.loaded`** (was previously falling through to the generic event renderer).

### Deprecated

Nothing. Backward compatibility holds — all v0–v0.9 tests pass.

### Removed

Nothing user-facing. Internal: a handful of dead code paths surfaced during the error-rewrite audits were removed.

### Fixed

- `pack.loaded` trace formatting (was missing despite being spec'd in CONTRACT v0.9 #25).
- Several inconsistent error categories — see CONTRACT v1.0 PR-F audit findings for the cross-category reclassifications.

### Migration from v0.9.1

Additive. See [Migration from v0.7 § 5–6](https://docs.activegraph.ai/cookbook/migration-from-v0-7/#5-adopt-the-v10-error-hierarchy-v09--v10):

```python
# v1.0 — broader catches with structured context:
try:
    rt = Runtime.load(url, run_id=rid)
except activegraph.StorageError as e:
    log(e.what_failed, e.how_to_fix, e.context)
except activegraph.ActiveGraphError as e:
    log(e.what_failed, e.how_to_fix)
```

Existing `except ValueError`/`except KeyError`/`except TypeError` clauses keep working — multi-inheritance preserves builtin lineage.

### Known follow-ons (v1.1 scope)

- `fork --set <pack>.<key>=<value>` for cheap fork-with-override experiments (CONTRACT v1.1 #1; canonical Python-API recipe at [Cookbook § Fork with a pack-setting override](https://docs.activegraph.ai/cookbook/common-patterns/#fork-with-a-pack-setting-override-v10-python-api)).
- `inspect --memo` and `inspect --search` (CONTRACT v1.1 #1).
- Type-completeness burndown — close the 16 dirty allowlist modules (CONTRACT v1.1 #3, [`TYPE_REPORT.md`](https://github.com/yoheinakajima/activegraph/blob/main/docs/reference/api/TYPE_REPORT.md)).
- Docstring-completeness burndown — close the 8 missing Ring 0 exemptions and upgrade one-liners to full (CONTRACT v1.1 #4, [`COVERAGE_REPORT.md`](https://github.com/yoheinakajima/activegraph/blob/main/docs/reference/api/COVERAGE_REPORT.md)).
- Spec-vs-impl drift gate for CLI flags (CONTRACT v1.1 #2).

## [v0.9.1] — 2026-05-17

Operator-visible quality-of-life fixes between v0.9 and v1.0.

### Added

- `[trace.flags]` rollup header at the top of every trace block with `prompt_normalized=true|false` so operators can see at a glance whether a run used the v0.7+ normalized-prompt format.

### Changed

- Approval-demo output is now granular (per-object) rather than batched, so operators can see which approval the runtime is waiting on.

### Migration from v0.9

None — additive trace and demo improvements; no API changes.

## [v0.9] — 2026-05-16

The **pack format** milestone. A pack bundles object types, behaviors, tools, prompts, and policies for a specific domain.

### Added

- `Pack` dataclass: frozen, equality by `(name, version)`.
- Pack-aware decorators (`activegraph.packs.behavior`, `llm_behavior`, `relation_behavior`, `tool`) with no global registry side effects.
- `runtime.load_pack(pack, settings=...)` — idempotent; conflicts (object type, relation type, behavior name, tool name, policy name) raise `PackConflictError` before any state mutation; version mismatch raises `PackVersionConflictError`.
- Object type schemas enforced via Pydantic at `graph.add_object`; relation type validation at `graph.add_relation`.
- Namespace prefixing: canonical strict (`diligence.claim_extractor`); short-name lookups lenient.
- Three settings access forms: typed parameter injection (primary), `ctx.settings`, `ctx.pack_settings(name)`.
- Prompt loader: TOML frontmatter; content-hashed via SHA-256 truncated to 16 hex chars; hash (not version) is the replay contract.
- Discovery via Python entry points (`activegraph.packs`); `discover()`, `load_by_name()`, `clear_discovery_cache()`.
- `activegraph pack new <name>` scaffolding command.
- `activegraph pack list` to enumerate installed packs.
- `activegraph.packs.diligence` — production-quality reference pack: 8 object types, 6 relation types, 7 behaviors, 3 tools, 2 policies, 4 prompts, recorded fixtures for 3 companies, end-to-end demo at [`examples/diligence_real_run.py`](https://github.com/yoheinakajima/activegraph/blob/main/examples/diligence_real_run.py).
- Pack authoring guide at [Authoring packs](https://docs.activegraph.ai/guides/authoring-packs/).

### Changed

- **Python floor raised to 3.11** (uses stdlib `tomllib`).
- **`pydantic>=2` is now a hard dependency** (was opt-in via `[llm]`). The pack format's object-type schemas and settings models require it.
- `click>=8,<9` becomes a hard dependency (CLI is always available).

### Migration from v0.8

Additive. See [Migration from v0.7 § 4](https://docs.activegraph.ai/cookbook/migration-from-v0-7/#4-adopt-the-pack-format-v08--v09). Global decorators (`@behavior`, `@tool`) keep working alongside loaded packs; the pack format is opt-in for new code.

Python 3.10 users must upgrade to 3.11+ before installing v0.9.

## [v0.8] — 2026-05-16

The **operator surface** milestone. Hardens the boundary between the framework and the world it runs in.

### Added

- `PostgresEventStore` behind the same `EventStore` protocol as SQLite (Postgres 16+; `pip install activegraph[postgres]`).
- Connection-URL addressing everywhere (`sqlite:///relative`, `sqlite:////absolute`, `postgres://...`).
- `activegraph migrate --from <url> --to <url>` — transaction-per-run, idempotent, one-directional.
- Structured JSON logging via `configure_logging(json_output=True)` with a documented schema (the operator contract).
- `Metrics` protocol (three methods: counter, histogram, gauge) with `NoOpMetrics` default and reference `PrometheusMetrics` backend.
- `runtime.status(recent=N)` — frozen `RuntimeStatus` dataclass for introspection.
- `activegraph` CLI: `inspect`, `replay`, `fork`, `diff`, `export-trace`, `migrate`. CLI exit codes documented as contract.
- Operator guide at [Operating in production](https://docs.activegraph.ai/guides/operating-in-production/).

### Migration from v0.7

Additive. See [Migration from v0.7 § 3, 7, 8](https://docs.activegraph.ai/cookbook/migration-from-v0-7/#3-adopt-connection-urls-v07--v08). Old SQLite path arguments (`persist_to="/path/to.db"`) keep working; URLs are required for CLI and cross-store operations.

## [v0.7] — 2026-05-16

The **tools and advanced matching** milestone.

### Added

- `@tool` decorator: tools as first-class primitives with input schema, output schema, determinism flag, cost, timeout.
- LLM ↔ tool turn loop owned by the runtime; multi-turn until the model returns a non-tool response or `max_tool_turns` hits.
- `tool.requested` / `tool.responded` event pair; replay cache separate from the LLM cache.
- `RecordedToolProvider` + `RecordingToolProvider` for tests.
- Two reference tools: `web_fetch`, `graph_query` (factory-based for graph read access).
- Cypher-subset pattern subscriptions via `pattern=` on `@behavior` / `@llm_behavior`. Compile-time strict; the unsupported tokens raise `UnsupportedPatternError` naming the offending token.
- Negation via `NOT EXISTS { ... }`.
- Temporal predicates: `activate_after=N` events (event-count, not wall-clock — keeps replay deterministic).
- Tool budgets (`max_tool_calls`) + cost-sharing with LLM (`max_cost_usd` covers both).
- Causal-chain walk crosses tool boundaries via `tool_request_event_id` provenance.

### Changed

- Prompt assembly normalized — every prompt is content-hashed via the canonical form; the `prompt_normalized=true` flag appears in the v0.9.1 trace rollup for runs using this format.

### Migration from v0.6

Additive. v0.6 LLM behaviors continue to work without `tools=` declarations.

## [v0.6] — 2026-05-16

The **LLM integration** milestone.

### Added

- `@llm_behavior` decorator with structured output parsing (Pydantic schema).
- Frame-aware prompt construction: system prompt assembled from frame goal + constraints + behavior description + output-schema reminder, in a fixed order.
- `llm.requested` / `llm.responded` event pair with model, full prompt+params, prompt hash, estimated cost, deterministic flag, cache-hit flag.
- `AnthropicProvider` reference implementation (reads `ANTHROPIC_API_KEY`; never from code).
- `RecordedLLMProvider` + `RecordingLLMProvider` for tests (fixtures keyed by SHA-256 of prompt+params canonical form).
- Cost accounting: Decimal-precise `max_cost_usd` budget; pre-call estimate via `count_tokens`; post-call actual cost from provider's `usage`.
- Structured failure reasons (`llm.network_error`, `llm.rate_limited`, `llm.parse_error`, `llm.schema_violation`, `llm.fixture_missing`, `budget.cost_exhausted`).

### Migration from v0.5

Additive. LLM behaviors are opt-in; non-LLM runs unaffected. New optional dependency `activegraph[llm]` (anthropic SDK).

## [v0.5] — 2026-05-16

The **resumability** milestone. The event log becomes the source of truth.

### Added

- Full event log persistence via the `EventStore` protocol; SQLite reference backend with schema version pinned from day one in a `meta` table.
- `Runtime.load(url, run_id=...)` — open, pick a run, replay, return runtime ready to continue.
- Strict-replay mode (`replay_strict=True`) — re-executes behaviors and fires `ReplayDivergenceError` on mismatch.
- Fork (`runtime.fork(at_event=...)`) — new run, copies parent's event log up to the cutoff (inclusive), independent log thereafter.
- Structural diff (`parent.diff(other)`) — shared / parent-only / fork-only event partitions; divergent objects and relations.
- Multiple runs per file; ULID `run_id`s; provenance carries `run_id`.
- Unfired-event re-queue on load (events emitted but never popped return to the queue on resume).

### Migration from v0

Additive. v0 in-memory runs continue to work without `persist_to=`. New optional dependency `activegraph[sqlite]` (stdlib — no extra packages needed).

## [v0] — 2026-05-16

The **core runtime**.

### Added

- In-memory `Graph` with typed objects, typed relations, and an append-only event log.
- Function-based (`@behavior`) and class-based (subclass `Behavior`) behaviors.
- Relation behaviors (`@relation_behavior`) — coordination logic on edges.
- Event-type subscriptions with predicate filters (`where=`).
- Patch system with optimistic concurrency (version-keyed apply; rejected patches surface as `patch.rejected` events).
- Views with type/depth/recent-events scoping.
- Frames (mission context per run) and policies (per-behavior capability declarations).
- Trace printer (`runtime.print_trace()`); causal-chain query (`runtime.trace.causal_chain(object_id=...)`).
- Budgets (`max_events`, `max_behavior_calls`, `max_seconds`, `max_depth`, etc.) — runtime stops cleanly when hit; resumable.

### Migration from before v0

There is no before-v0.

______________________________________________________________________

The graph is the world. Behaviors are physics. The trace is the proof.

# Publishing a release

This page documents the release-publish workflow. It exists so the maintainer can run a release with confidence and so the next session (human or agent) picking up release work has a reproducible procedure.

The framework's tag-and-publish flow is **`.github/workflows/publish.yml`**. It triggers on a git tag push matching `v*` and uploads the built sdist + wheel to PyPI via trusted publishing. Per [CONTRACT v1.0 #C8](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md), PyPI publishing is externally owned — the workflow is the artifact the framework ships; the maintainer runs the actual publish by pushing the tag.

## Before the first publish

The workflow uses PyPI **trusted publishing** (OIDC-based, no long-lived API tokens). Trusted publishing requires one-time setup on the PyPI side; do this once before tagging the first release.

1. Create the `activegraph` project on PyPI by uploading any test release manually (the canonical first-time-publish bootstrap). A pre-release tag like `v0.0.0a1` is safe — it doesn't conflict with the real version sequence and reserves the project name. This step needs a one-time PyPI API token; revoke it after.
1. Configure trusted publishing on the project page. Visit <https://pypi.org/manage/account/publishing/> and add a publisher with:
1. **Owner:** `yoheinakajima`
1. **Repository:** `activegraph`
1. **Workflow:** `publish.yml`
1. **Environment:** `pypi`
1. Confirm the workflow's `publish` job references the same environment (`environment: name: pypi`). The repository's GitHub Environments → `pypi` settings don't need additional protection rules; the OIDC token PyPI accepts is scoped to the environment + workflow combination.

After this setup, no PyPI credentials live in repository secrets. Future maintainers tag-and-push; PyPI accepts the OIDC token issued by GitHub at workflow runtime.

## Triggering a release

1. Update the version constants. Both must match:

1. `activegraph/__version__` in `activegraph/__init__.py`

1. `version` in `pyproject.toml` The version-sync test (`tests/test_version_sync.py`) verifies both match before any release work merges to main. The forthcoming version-tag-correspondence gate ([CONTRACT v1.1 #6](https://github.com/yoheinakajima/activegraph/blob/main/CONTRACT.md)) will also verify they match the current tag in tagged-release CI.

1. Update the CHANGELOG. Add a section for the new version with "Added / Changed / Deprecated / Removed / Fixed / Migration" subsections. Follow the existing v1.0-rc2 entry's shape.

1. Merge the version-bump + CHANGELOG PR to `main`.

1. Tag and push from `main`. The tag string uses the human-facing form (`v1.0-rc2`, `v1.0`, `v1.0.1`); the `version` strings in `__init__.py` and `pyproject.toml` use the PEP 440 normalized form (`1.0.0rc2`, `1.0.0`, `1.0.1`).

   ```bash
   git checkout main
   git pull
   git tag -a v1.0-rc2 -m "v1.0-rc2"
   git push origin v1.0-rc2
   ```

1. Watch the publish workflow on GitHub Actions. It builds the sdist + wheel, uploads them to PyPI via trusted publishing, and exits 0 on success. Total runtime ~2 minutes.

## After the publish

Verify the package is installable from a fresh environment. The test of record:

```bash
python -m venv /tmp/verify-publish
source /tmp/verify-publish/bin/activate
pip install activegraph==1.0.0rc2
activegraph --version    # should print 1.0.0rc2
activegraph quickstart   # should run end-to-end against fixtures
deactivate
rm -rf /tmp/verify-publish
```

If `pip install` fails, the most likely causes are:

- **PyPI hasn't indexed the package yet** (transient; usually \<1 minute, occasionally up to 5). Retry after a brief wait.
- **The wheel build failed to include a required file.** Check `MANIFEST.in` and the `[tool.setuptools.package-data]` section of `pyproject.toml` if the import surface is missing data files.
- **Version mismatch.** The PyPI version string is the `pyproject.toml` form, not the tag form — `1.0.0rc2`, not `v1.0-rc2`. The tag triggers the workflow; the package version is what `pip install` asks for.

After verification, update any external references (release notes on GitHub Releases, social posts, the documentation site's homepage if relevant). The doc site rebuilds automatically on push to `main`; no separate publish step is needed there.

## Rolling back

PyPI does not allow re-uploading the same version once published — this is intentional ([PEP 600](https://peps.python.org/pep-0600/)). If a release ships broken:

1. Yank the broken version on PyPI (visible to existing installs; blocks new ones from resolving to that version unless explicitly pinned).
1. Bump the version (`1.0.0rc2` → `1.0.0rc3` or `1.0.0` → `1.0.1`) and re-tag.

The yank-and-bump path is documented because the framework's [invariant-protection voice](https://docs.activegraph.ai/concepts/failure-model/) extends to release ops: the next maintainer needs to know what to do, not guess.

## Why trusted publishing

The trade-off vs. API-token publishing:

- **Trusted publishing** uses GitHub's OIDC issuer; PyPI verifies the token at workflow runtime against the pre-configured publisher (owner + repo + workflow + environment). No long-lived secrets; the OIDC token is short-lived and scoped to the workflow run.
- **API-token publishing** stores a long-lived PyPI API token as a GitHub Actions secret. Simpler initial setup (one secret, no PyPI-side trusted-publisher config), but the token has full upload permission for the project as long as it exists, and rotation is a manual chore.

For an externally-owned-publish project where the agent loop ships the workflow but the maintainer runs the tag, trusted publishing is the right default. The setup cost is once per project; the rotation cost is zero.