Self-hosting

How chit develops chit, which mode to use, and the discipline that keeps autonomous runs honest

chit builds chit with its own loop. This is the operating guide: which mode to use, what the orchestrator still owns, and the discipline that keeps autonomous runs honest. The loop itself lives above chit (manifests are static DAGs and cannot loop); see supervised convergence for why.

The two modes

Both modes share one shape: an implementer writes a slice, a reviewer checks it, and a human checkpoints. They differ in who implements.

Supervised. The Claude Code chat implements with its full tools and context; a per-scope Codex advisor reviews each round and inspects the git diff. The chat owns the loop and the checkpoint. Reach for this on nuanced, cross-package, or exploratory slices, where the chat's reasoning and live tools earn their keep.
Autonomous (converge). The chat sets a task; chit runs both agents, looping to convergence (examples/converge.json). Drive it MCP-native from the chat with chit_start then chit_next per iteration, or from a terminal with chit converge. The chat does not implement and does not babysit each handoff. Reach for this on well-scoped, self-contained slices, and to keep building the self-hosting habit.

The mode is a per-slice choice, not a default. When in doubt on a gnarly change, supervised produces cleaner results faster; on a tight, well-specified change, autonomous is the better demonstration and offloads the writing.

Roles are assigned in the chit, not fixed to a vendor

The autonomous loop is two roles, an implementer and a reviewer, declared as participants in the manifest. The bundled examples/converge.json pairs a write-capable Claude implementer with a read-only Codex reviewer, and that default is a good one (an independent model reviewing catches what a same-model check misses). But the pairing is not baked into the runtime: each participant names its own agent (claude or codex) and its own permissions.filesystem, and the implement / review steps call participants by name. Swap them and a write-capable Codex implements while a read-only Claude reviews. The permission you grant a participant, not its vendor, decides whether it can write. Only the enforcement mechanism is vendor-specific: a read_only Codex runs under an OS sandbox (--sandbox read-only), while a read_only Claude runs under --permission-mode plan, which is a permission boundary inside Claude rather than an OS sandbox. The bundled manifest's instructions and output text speak of "the implementer" and "the reviewer," so a swapped manifest still produces correctly labeled output; update the prose only if you want vendor-specific wording. The loop driver requires only that the steps are named implement and review, and that the reviewer emits the structured verdict block the manifest asks for.
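The permission-to-enforcement mapping described above can be pictured as a small lookup. This is a minimal sketch, not chit's adapter code: the two flag sets are the ones quoted in the text, while the function name `enforcement_flags`, the table name, and the `read_write` value used for the write-capable participant are hypothetical placeholders.

```python
from typing import Dict, List

# Flags quoted in the text; the lookup table itself is an illustrative assumption.
READ_ONLY_FLAGS: Dict[str, List[str]] = {
    "codex": ["--sandbox", "read-only"],      # OS sandbox
    "claude": ["--permission-mode", "plan"],  # permission boundary inside Claude
}


def enforcement_flags(agent: str, filesystem: str) -> List[str]:
    """Return the extra CLI flags that enforce a read-only participant (hypothetical helper)."""
    if agent not in READ_ONLY_FLAGS:
        raise ValueError(f"unknown agent: {agent!r}")
    if filesystem == "read_only":
        return list(READ_ONLY_FLAGS[agent])
    # Any other permission value: this sketch adds no restricting flag.
    return []


# A swapped pairing: Codex implements, Claude reviews.
# "read_write" is a placeholder; the text only shows the "read_only" value.
participants = {
    "implementer": {"agent": "codex", "permissions": {"filesystem": "read_write"}},
    "reviewer": {"agent": "claude", "permissions": {"filesystem": "read_only"}},
}

for name, p in participants.items():
    print(name, enforcement_flags(p["agent"], p["permissions"]["filesystem"]))
```

The point of the sketch is that the vendor selects only which flag applies; whether a participant may write comes from the permission alone.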

When several chits share the same reviewer or implementer, lift that profile into a named role in ~/.config/chit/config.json instead of repeating it. A role carries the instructions, session, and permissions, and optionally a default agent:

```
{
  "roles": {
    "reviewer": {
      "agent": "codex",
      "instructions": "Review the diff skeptically.",
      "session": "per_scope",
      "permissions": { "filesystem": "read_only" }
    }
  }
}
```

The manifest participant becomes { "role": "reviewer" } and can still override any field inline (a different agent, write permission). The vendor-neutral point above holds either way: the role now carries the persona, but the permission you grant the participant, not the role's vendor, still decides whether it can write.
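One way to picture how a role reference and its inline overrides combine is sketched below. It assumes a shallow merge in which any inline field replaces the role's field of the same name; the text does not say how chit merges nested fields such as permissions, and the function name `resolve_participant` is hypothetical.

```python
from typing import Any, Dict


def resolve_participant(
    participant: Dict[str, Any], roles: Dict[str, Dict[str, Any]]
) -> Dict[str, Any]:
    """Expand {"role": id, ...overrides} into a full participant profile (sketch)."""
    role_id = participant.get("role")
    if role_id is None:
        return dict(participant)  # fully inline participant, nothing to expand
    if role_id not in roles:
        raise KeyError(f"unknown role: {role_id!r}")
    resolved = dict(roles[role_id])  # start from the role's fields
    overrides = {k: v for k, v in participant.items() if k != "role"}
    resolved.update(overrides)  # inline fields win (shallow merge: an assumption)
    return resolved


# The role from the config example above.
roles = {
    "reviewer": {
        "agent": "codex",
        "instructions": "Review the diff skeptically.",
        "session": "per_scope",
        "permissions": {"filesystem": "read_only"},
    }
}

plain = resolve_participant({"role": "reviewer"}, roles)
swapped = resolve_participant({"role": "reviewer", "agent": "claude"}, roles)
print(plain["agent"], swapped["agent"], swapped["permissions"])
```

In the swapped case the persona and the read-only permission still come from the role; only the agent changes.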

The orchestrator's job (both modes)

The chat is always the orchestrator. It owns three things the loop does not:

Sequencing and the human checkpoint. One slice at a time; stop and hand back on a block, an ambiguous product decision, or anything outward-facing.
The final gates. The reviewer runs read-only and usually cannot even run the tests (its sandbox blocks the temp dirs the suite needs). So the orchestrator runs them itself, every slice, before merge: the full test suite, a live smoke of the real behavior, typecheck, the linter, the browser-safety check when core changed, and a scan for banned characters. A reviewer proceed is necessary, not sufficient.
The push. Never push without explicit human permission, and only after the gates pass.

Treat the reviewer as an independent second opinion, not ground truth: verify each finding against the code before acting on it. Codex reviews (not a second Claude) precisely because an independent model catches what a same-model check misses.

Running an autonomous slice

Branch into a worktree so a wedged or abandoned run stays isolated.
Drive the loop. MCP-native from the chat (the primary path): chit_start with the task, scope, and the worktree as cwd, then chit_next once per iteration, checkpointing between. Or from a terminal:
```
chit converge --task "<the slice>" --scope <stable-id> --cwd <worktree>
```
Either way it loops implement and review until the reviewer's verdict settles the run. A proceed converges only when its verification passed: the reviewer reports structured checks, and a proceed with no passing checks stops as needs-decision for you to judge. A block stops the loop. Anything else revises and retries up to the budget (default 3). An unparseable verdict is treated as block, never as an implicit proceed. The loop is recorded under chit's state dir (keyed by repo, not in the working tree), and each iteration is audited by default.

To make that verification ground truth instead of the reviewer's word, declare policy.requiredChecks in the manifest (or pass required_checks to chit_start over MCP): chit then runs those commands itself after a proceed review, as argv with no shell, and only a clean pass converges. A failure sends the loop back to revise; a check that cannot run stops as needs-decision for you. See Execution policy.
Inspect what happened: chit_trace (or read the loop log) lists each iteration's auditRef. Open a transcript with chit_audit_show over MCP (pass that audit_ref), or with chit audit show <run-id> from the terminal (the CLI takes the audit run id that chit audit list prints). The transcript carries the prompts, outputs, live adapter events, usage, and the recorded per-participant config.
Run the final gates yourself (see above). The reviewer could not.
Checkpoint with the human. Push only on explicit approval.
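The verdict rules in step 2 amount to a small decision table, sketched here. Only proceed, block, needs-decision, and the default budget of 3 come from the text. The function name `next_state`, the other state labels, the 1-based iteration count, and the handling of a proceed with mixed check results (treated as needs-decision) are assumptions of this sketch.

```python
from typing import List, Optional


def next_state(
    verdict: Optional[str],
    checks_passed: List[bool],
    iteration: int,
    budget: int = 3,
) -> str:
    """Map one review result to the loop's next state (illustrative sketch).

    verdict is None when the reviewer's verdict block could not be parsed.
    iteration is the 1-based number of the iteration that just finished.
    """
    if verdict is None:
        verdict = "block"  # unparseable is treated as block, never as proceed
    if verdict == "proceed":
        # Converge only on verified proceeds. Mixed results are not specified
        # in the text; this sketch sends them to the human as well.
        if checks_passed and all(checks_passed):
            return "converged"
        return "needs-decision"
    if verdict == "block":
        return "blocked"
    # Anything else: revise and retry while budget remains.
    return "revise" if iteration < budget else "budget-exhausted"


print(next_state("proceed", [True, True], iteration=1))
print(next_state("proceed", [], iteration=1))
print(next_state(None, [True], iteration=2))
print(next_state("revise", [], iteration=3))
```

Note that the unparseable case is normalized before any other branch, so a malformed reviewer reply can never fall through to convergence.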

chit_start and chit_next drive the loop; chit_status and chit_trace inspect it, and chit_cancel stops it. chit_start and chit_next also run a non-loop manifest as a one-shot DAG pass when you want a plain run rather than the implement/review loop.
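The required-checks behavior described earlier (commands run as argv with no shell, where a failure revises and a check that cannot run stops for a human) can be sketched with the standard library. This is an illustration of the policy, not chit's runner: the function name `run_required_checks`, the returned labels other than needs-decision, and the choice to treat any launch error as "cannot run" are assumptions.

```python
import subprocess
import sys
from typing import Sequence


def run_required_checks(checks: Sequence[Sequence[str]], cwd: str = ".") -> str:
    """Run each check as an argv list with no shell; return the loop outcome (sketch)."""
    for argv in checks:
        try:
            # A list argv with shell=False means no shell parsing or expansion.
            result = subprocess.run(
                list(argv), cwd=cwd, shell=False, capture_output=True
            )
        except OSError:
            # The command could not be started (missing binary, bad cwd, ...).
            return "needs-decision"
        if result.returncode != 0:
            return "revise"  # a failing check sends the loop back to revise
    return "converged"  # only a clean pass of every check converges


# Stand-in checks that use the current interpreter instead of real project commands.
passing = [[sys.executable, "-c", "raise SystemExit(0)"]]
failing = [[sys.executable, "-c", "raise SystemExit(1)"]]

print(run_required_checks(passing))
print(run_required_checks(failing))
```

Passing argv lists rather than command strings is what keeps shell metacharacters in a check from being interpreted.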

Running unattended: jobs and batches

Foreground (one chit_next per iteration) keeps you in the loop on every round. When a slice is well-specified enough to leave alone, the same loop runs unattended:

Background. chit_start with mode: "background" launches the loop in a detached worker against a git worktree and returns immediately. The worker survives an MCP reconnect. Check on it with chit_status, stop it with chit_cancel (intent-first: it records the cancel, then signals the worker), and read each iteration's receipt with the audit tools. One task, no babysitting.
Batch. chit_batch_start runs several loop tasks in parallel, one git worktree per task, as background runs. (For a single unattended task, use chit_start with mode: "background"; don't launch several of those in one repo, since they share the working tree and collide. Batch exists precisely to isolate each task.) You hand it a reviewed task graph; each task declares the files it will touch (claimedPaths, so claim-overlapping tasks serialize rather than race) and optional dependencies. Dependencies are a launch gate, not integration: a task launches only once its dependencies reach review_ready, but its worktree branches from the batch base and does not contain their changes, so a task never sees another task's diff (merging is yours). To change the agent pairing, select a vetted recipe on the task or batch, or point manifestPath at your own converge manifest with the pairing swapped (participants inline, or referencing reusable roles defined in config). There is no daemon: progress happens only when you call chit_batch_advance, and chit_batch_status is read-only (follow its nextAction, not the per-task status, to drive the batch). The deliverable is a set of reviewable worktree branches; chit never auto-merges. If you lose a batch id, chit_batch_list recovers it. Retire the worktrees with chit_batch_cleanup (dry-run by default; it never deletes receipts).
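The batch launch rules (dependencies gate the launch, and tasks with overlapping claims serialize) can be sketched as a single scheduling pass. claimedPaths and review_ready come from the text. The `dependencies` field name, the `pending` and `running` status labels, the function name `launchable`, and the treatment of overlap as exact path equality are assumptions made for illustration.

```python
from typing import Dict, List, Set


def launchable(tasks: Dict[str, dict], status: Dict[str, str]) -> List[str]:
    """Return ids of pending tasks that may launch in this pass (sketch)."""
    # Paths held by tasks that are already running.
    claimed: Set[str] = set()
    for task_id, state in status.items():
        if state == "running":
            claimed |= set(tasks[task_id]["claimedPaths"])

    ready: List[str] = []
    for task_id, task in tasks.items():
        if status[task_id] != "pending":
            continue
        # Launch gate: every dependency must have reached review_ready.
        deps = task.get("dependencies", [])
        if any(status[d] != "review_ready" for d in deps):
            continue
        paths = set(task["claimedPaths"])
        if paths & claimed:
            continue  # overlapping claim: wait rather than race
        claimed |= paths  # also serialize against tasks picked in this pass
        ready.append(task_id)
    return ready


# Hypothetical task graph and paths.
tasks = {
    "a": {"claimedPaths": ["pkg/a"]},
    "b": {"claimedPaths": ["pkg/a", "pkg/b"]},
    "c": {"claimedPaths": ["docs/guide.md"], "dependencies": ["a"]},
}

print(launchable(tasks, {"a": "pending", "b": "pending", "c": "pending"}))
print(launchable(tasks, {"a": "review_ready", "b": "pending", "c": "pending"}))
```

In the first pass only a launches, because b overlaps its claim and c waits on it. The gate affects launch order only: c's worktree would still branch from the batch base and would not contain a's changes.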

The orchestrator's job does not change in either mode: you still run the final gates yourself and push only on explicit approval. Unattended means chit does the writing and reviewing without you watching each round, not that the work merges itself.

Discipline that bites

MCP server staleness. The chit MCP server is a persistent process; it keeps running the adapter and runtime code it loaded at startup. After changing an adapter or the runtime, reconnect the server before expecting any MCP-driven run to reflect the change. The reviewer reads files from disk, so reviews see your latest edits; the adapter and runtime code driving the run stay stale until you reconnect.
Config: agents, roles, and recipes. Config is layered: built-in codex and claude, then ~/.config/chit/config.json (global), then the repo's chit.config.json. Both files hold agents (model, reasoning effort, and timeouts on named agents; built-in ids cannot be redefined), roles (reusable participant profiles a manifest references by id), and recipes (vetted manifest references with safe runtime defaults). A later layer replaces an agent, role, or recipe by id, whole. The repo file cannot set env or strictMcp; those belong in the global config. A repo recipe's manifestPath must stay inside the repo. A manifest references agents or roles; plans and batches may select recipes. chit show and chit audit show report the effective config, and chit doctor reports which layer defined each agent and recipe.
Worktrees. Run converge in a worktree; clean it up when the slice lands. Both the audit transcript and the loop log live in the local state dir (keyed by repo), so they survive even after the worktree is removed.
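The layering rule from the config item above (a later layer replaces an entry by id, whole) is easy to misread as a field-by-field merge, so a sketch may help. It models only the global and repo layers and the three sections named in the text; the function name `merge_layers` is hypothetical, and rejecting a redefined built-in agent with an error is an assumption about how that rule surfaces. The env and strictMcp restriction on the repo file is left out.

```python
from typing import Any, Dict, List

BUILT_IN_AGENTS = {"codex", "claude"}
SECTIONS = ("agents", "roles", "recipes")


def merge_layers(layers: List[Dict[str, Any]]) -> Dict[str, Dict[str, Any]]:
    """Combine config layers in order, e.g. [global, repo] (illustrative sketch)."""
    effective: Dict[str, Dict[str, Any]] = {section: {} for section in SECTIONS}
    for layer in layers:
        for section in SECTIONS:
            for entry_id, entry in layer.get(section, {}).items():
                if section == "agents" and entry_id in BUILT_IN_AGENTS:
                    # Built-in ids cannot be redefined; raising is an assumption.
                    raise ValueError(f"cannot redefine built-in agent {entry_id!r}")
                # Whole replacement by id: no field-level merge with earlier layers.
                effective[section][entry_id] = dict(entry)
    return effective


global_cfg = {
    "roles": {
        "reviewer": {
            "agent": "codex",
            "instructions": "Review the diff skeptically.",
            "permissions": {"filesystem": "read_only"},
        }
    }
}
repo_cfg = {"roles": {"reviewer": {"agent": "claude"}}}

effective = merge_layers([global_cfg, repo_cfg])
print(effective["roles"]["reviewer"])
```

Under this reading the repo's reviewer role drops the global instructions and permissions entirely, so a repo layer that overrides a role needs to restate every field it still wants.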

Pointers

Supervised convergence: the supervised pattern.
examples/converge.json: the autonomous loop manifest.
Audit log: reading transcripts with chit audit.
MCP surface: the run tools and the one invariant.