Thesis

Problem

Most "agents" are UI over record retrieval. They fetch context then hand work back to humans. Value stalls at the point of execution.

Architecture beats features. Durable advantage comes from systems design that combines memory, tools, policies, and verification.
Vertical depth beats horizontal breadth. Industry language, incentives, and compliance create moats that general copilots cannot cross.
Execution closes the loop. Agents must initiate actions, not just summarize. The loop is perceive → decide → act → verify → learn.
Shared infrastructure compounds. Common rails for security, observability, commerce, and data contracts reduce time to market across studios.
Operator networks accelerate truth. Domain operators plus model engineers produce higher signal and lower CAC than isolated R&D.

Goal: move work forward safely.

Tools: CRUD in SaaS, RPA, retrieval, generation, payments
Transactions: idempotency keys, rollback, human-in-the-loop gates
Verification: invariants, unit checks, reconciliation against source-of-truth

1. Task specificity increases success probability. Narrow verbs outperform vague intents.

2. Tool determinism enables trust. The agent should know what success looks like before it acts.

3. State awareness prevents loops. Plans must check current system state before execution.

4. Tight feedback compounds learning. Every action produces a labeled outcome that updates policy and memory.

5. Human cadence creates leverage. Insert approvals only where risk or ambiguity is high.

Language: each industry has stable jargon, forms, and flows that improve retrieval and planning.
Incentives: agents tuned to how value is realized in the vertical select better actions.
Integrations: a few deep systems beat a thousand shallow OAuths.
Regulatory tailwinds: compliance scaffolding is reusable and defensible.

Fieldwork first: shadow operators. Map the canonical workflow. Identify three verbs that produce money or time.
Thin wedge: ship a single agent that owns one verb end-to-end.
Close the loop: instrument verification and reconciliation on day one.
Scale by adjacency: add neighbors in the workflow graph once the core loop is reliable.