Concepts

The three loops and the artifact graph that drive every contractor change.

contractor's model has two moving parts: an artifact graph that structures each change, and three nested loops that actually move work through it.

The artifact graph

Every change — a blueprint — progresses through artifacts in dependency order. The shipped contractor-base schema defines four:

proposal ──┬── requirements ──┬── tasks
            └── design ────────┘

Proposal — Why the change exists and what it does, in prose.
Requirements — Behavioral specs, organized by capability. Uses RFC 2119 keywords (SHALL / MUST / MAY) and Given-When-Then scenarios so the acceptance criteria are explicit.
Design — Technical decisions and architecture: what you'll build, what you won't, and the trade-offs.
Tasks — Implementation checklist, grouped into numbered sections with trackable - [ ] checkboxes.

Artifacts are blocked until their dependencies complete, ready when they can be written, and done once their files exist on disk. requirements and design are siblings — both depend on the proposal, neither depends on the other — so they can be authored in parallel. tasks waits for both.

This graph lives in a schema. Each blueprint records its schema in its own .contractor.yaml:

# contractor/blueprints/<name>/.contractor.yaml
schema: contractor-base

The shipped default is contractor-base. A lighter-weight alternative, contractor-lite, ships with two artifacts (brief → tasks) for small changes that don't need the full ceremony. See Schemas for the full picture.

The three loops

Three loops run at different time scales, each with a distinct purpose.

Inner loop — inside a task-group session

This is where development actually happens. Inside a single task-group, the implement agent writes code, picks an appropriate test command from the repo's testing partial, runs it, observes output, fixes failures, and iterates until the tests are green — before it marks checkboxes or commits.

write → run verify → observe → fix → repeat → mark [x]

Two things matter about this loop:

Failures come from the test runner or the compiler, not from LLM self-review. The agent iterates against real execution signal.
Repo-specific conventions live in contractor/partials/testing.md, authored by humans and inlined directly into the implement agent's prompt on every run. That's where you document the default test command, monorepo subset variants, flaky suites, and test layout. See Partials.

When the group's tests pass and every checkbox is marked, the agent commits and exits. The pipeline enforces one commit per group: exiting without a new commit is treated as failure.

Outer loop — blueprint lifecycle across sessions

This is the human-governed arc of a whole change: spec authoring, parallel review by fresh agents, and archival. Each phase is a slash command or a pipeline step.

propose → implement → review → close
   │          │          │        │
   │          │          │        └─ merge branch, archive blueprint,
   │          │          │           merge requirements into source of truth
   │          │          │
   │          │          └─ four parallel agents (reuse, quality,
   │          │             efficiency, blueprint compliance), fix issues
   │          │
   │          └─ task-group agents run the inner loop per group,
   │             commit per group when verify is green
   │
   └─ create artifacts: proposal → requirements + design → tasks

Gates between phases pause for user confirmation. You approve them from the CLI or the dashboard.

Middle loop — cross-session memory

Agents notice things they can't (or shouldn't) act on mid-task: pre-existing code issues outside scope, undocumented conventions, task descriptions that didn't match reality, workflow friction. These are observations, written to contractor/.observations/<change>/<phase>.json.

agent phases ─────────────────► retro
   │                               │
   │  RETRO_TAIL appended to       │  reads .observations/
   │  strategy templates;          │  presents items to user,
   │  agents write observations    │  applies code fixes and
   │  to .observations/            │  rule updates, cleans up
   │                               │
   └───────── next session ────────┘

Between sessions, /contractor:retro presents the accumulated observations and the user decides which to apply, convert into rules, or dismiss. Retro is deliberately user-driven — the agent does not self-modify the repo based on its own observations.

How pipelines advance work

A pipeline chains phases into an automated sequence. The default pipeline is implement → review → gate → close. You run one with contractor run or from the TUI dashboard. See Pipelines for step kinds, custom pipelines, and resume semantics.

Inside the implement phase, the runner inspects tasks.md and expands it into one sub-step per incomplete task-group. Each sub-step is a fresh agent invocation scoped to that single group — the task-group-implement strategy. The pipeline enforces commit-per-group so each group is a reviewable unit in git history.

Review runs four review agents in parallel — reuse, quality, efficiency, and blueprint compliance — against the full branch diff. Their findings surface as a single consolidated report for you to act on.

Close merges the branch back to main, archives the blueprint into contractor/archive/, and folds its requirements/<capability>/spec.md deltas into the repo-wide source of truth at contractor/requirements/.

Each blueprint runs in its own git worktree so multiple changes can be in flight without stepping on each other.

Directory layout

After contractor repo install:

contractor/
├── config.yaml              # Project configuration (pipelines, scopes)
├── partials/                # User-authored agent prompt fragments
│   ├── testing.md           # Repo-specific test guidance
│   └── prepare.md           # Repo-specific worktree-prepare guidance
├── requirements/            # Accumulated requirements (source of truth)
│   └── <capability>/spec.md
├── blueprints/              # Active blueprints
│   └── <name>/
│       ├── .contractor.yaml # Blueprint metadata (schema, scope, base, …)
│       ├── proposal.md
│       ├── requirements/<capability>/spec.md
│       ├── design.md
│       └── tasks.md
├── .worktrees/              # Git worktrees (gitignored)
├── .observations/           # Agent observations (gitignored)
└── archive/                 # Completed blueprints

Run state (runs, events, gates, known repos) lives in a single global SQLite database at ~/.contractor/contractor.db, shared across every repo on the machine. Runtime logs and pipeline step state live under ~/.contractor/projects/<encoded-repo>/{logs,runs}/, keyed by the main repo root so every worktree collapses onto the same directory.

Where to go next

Schemas — the artifacts/phases/pipeline shape, plus contractor-base vs contractor-lite.
Pipelines — step kinds, custom pipelines, resume.
Strategies — the five named strategies and which phases use them.
Partials — the contractor/partials/ mechanism.
Worktrees — the worktree-per-blueprint model and the synthetic prepare step.
Gates — gate steps, preapproval, and dynamically injected gates.

On This Page

The artifact graph The three loops Inner loop — inside a task-group session Outer loop — blueprint lifecycle across sessions Middle loop — cross-session memory How pipelines advance work Directory layout Where to go next