The AI invents requirements
You asked for a simple endpoint. The agent decided you also need PostgreSQL, Redis, Kubernetes, and microservices. Nobody asked.
Atomic Spec turns unpredictable AI into a governed assembly line — eight Prime Directives, four HITL checkpoints, atomic task files, and 24 specialized subagents.
Atomic Spec exists because one AI-instructions file isn't enough. Soft guidance drifts. Gates don't.
You asked for a simple endpoint. The agent decided you also need PostgreSQL, Redis, Kubernetes, and microservices. Nobody asked.
The agent reads your whole repo, drowns in it, and ships a 500-line function that does everything and nothing well.
"Where did this code come from? Which requirement does it satisfy?" Silence. Good luck debugging or extending it.
Every component built. Nothing connected. Missing routes. Broken nav. "Feature complete" — and nothing works end-to-end.
The Solution
A governance framework that turns the AI from an unpredictable collaborator into a system that ships consistent, traceable code — feature after feature.
Every AI decision is bound by a constitution. Article IX hardcodes the eight Prime Directives — non-negotiable rules every command enforces.
The AI proposes. Humans approve. At four critical decision points, the system STOPS until you sign off.
Instead of one monolithic tasks.md, every feature explodes into isolated T-XXX files. The agent reads only the current task — by design.
18 Knowledge Stations encode domain expertise as gate criteria. The AI stops guessing — it follows your engineering standard, station by station.
Station 03Standard
ICP, JTBD, wedge, competitor pain mining — raw requirement capture.
Why it matters
Skip Station 03 and you ship technical debt on day one. Without this context, the AI produces code that runs and can't be maintained.
Generalist AI pretends to know every domain. Our specialists actually do. Subagents are discovered by scanning YAML frontmatter and matched to your feature's keywords — not hard-coded.
Backend
REST conventions, tenant isolation, API schemas.
Database
Schema normalization, indexing strategies.
Frontend
Component composition, state management.
Security
OWASP controls, vulnerability analysis.
DevOps
CI/CD pipelines, containerization.
Frontend
User flows, accessibility (WCAG).
Billing
Stripe and LemonSqueezy integration patterns.
AI/ML
RAG systems, prompt engineering.
Plus 13 more waiting in .specify/subagents/ —
and you can drop in your own.
Most AI coding fails because the model makes silent decisions. HITL checkpoints flip that. The agent proposes, then HALTS. You review. You approve. Only then does it build.
Conventional wisdom says AI needs more context. We disagree. Too much context causes drift. During implementation, the agent is architecturally prevented from reading anything outside the current task file — by design.
# Task: Create the User model
# AI reads ONLY T-010-create-user-model.md
# Result: clean, focused implementation
class User(Base):
id = Column(UUID, primary_key=True)
email = Column(String, unique=True)
password_hash = Column(String)
# Roles added in T-015
# Analytics added in T-020
# Billing added in T-025"Why forbid plan.md? Because if the AI sees 'future admin panel', it will try to build it today. Pinning forces the agent to build only what this task needs — and nothing else."
Discovery, requirements, gates 03–05
Architecture, research, registry sync
Atomic decomposition, embedded context
Context-Pinned execution, verification commands
Note: Reject at any HITL checkpoint and the workflow loops back to the previous phase automatically. No corrupted state. No half-built features.