Adversarial multi-agent system · ML / AI engineering

A.R.C.A

The adversarial agentic system that actually ships code.

59 specialized agents. 3 pipelines. 111 ADRs. Zero AI slop tolerated.

View on GitHub Read the ADRs

Specialized Agents

51 Opus · 4 Sonnet · 4 Haiku

148

Skills Catalog

Loaded surgically per task

111

Architecture Decisions

Nygard ADRs

Pipelines

ML (14 cycles) · HTB (6 phases) · ART (9 phases)

Bash Hooks

Wired across 14 lifecycle events

30+

MCP Servers

Engram, GitHub, HTB, Exa, Obsidian, +25

What A.R.C.A does

Seven pillars that turn a single AI coding assistant into a regulated multi-agent engineering team. Pure configuration — no runtime, no server, no container.

59 Specialized Agents

Each with its own model assignment (Opus for reasoning, Sonnet for implementation), specific role, blocking gate and audit responsibility. From @data-validator to @ai-redteam-orchestrator.

Adversarial Gate Chain

Producer → @math-critic → @debt-detector → @code-critic → @chief-architect. No code reaches main without explicit sign-off. 65 bash hooks enforce the chain at PreToolUse / PostToolUse level.

3 Pipelines (ML · HTB · ART)

ML: C1→C14 (14 cycles, 65 phases). HTB: F0→F7 (CVE-first CTF with MCP integration). ART: R0→R8 (AI Red Teaming, MITRE ATLAS + OWASP LLM Top 10:2025, ADR-081).

111 Documented Decisions

Every architectural choice ships as a Nygard ADR: context, alternatives weighed, consequences. Covers architecture, security, governance, pipelines and the meta-system itself.

148-Skill Catalog

OWASP security, ML engineering, DevOps, RAG patterns, agent orchestration, HTB CTF methodology, AI red teaming, prompt engineering. @skill-router selects ≤3 per task.

Honest Disclosure

Inverted-style canary tests assert known bugs still occur, so a future change cannot silently mask a structural limit. 19 AI-slop signals detected pre-merge. Zero hidden state.

Dynamic Orchestration

A 4th mode for the long tail no fixed pipeline covers. @architect-ai proposes a bespoke agent DAG — which subagents, in what order, with which adversarial critics per node — for Adrian's approval before any agent executes. Two-layer gate enforcement: a schema-floor validator rejects under-gated proposals at declaration, a runtime hook hard-blocks at invocation. /orchestrate · ADR-089.

Quick look

A.R.C.A lives inside Claude Code’s native extension points: markdown agent definitions, bash hooks, JSON settings. Clone the repo, run install, work.

Cloneinstall.shVerify

arca.install

# 1. clone
$ git clone https://github.com/infantesromeroadrian/arca-claude-code
$ cd arca-claude-code

# 2. install — symlinks settings + hooks + skills into ~/.claude
$ ./install.sh --skills --hooks --mcp

# 3. verify the gate chain
$ claude --debug --agent code-critic 'review this'
  ↳ @math-critic
    ↳ @debt-detector
      ↳ @code-critic
        ↳ @chief-architect

The gate chain

Four adversarial agents stand between producer code and main. Each one blocks. Bypass leaves an audit trail.

Stage 01

@math-critic

Mathematical correctness

Validates loss functions, gradients, numerical stability, attention scaling, sampling strategies. Blocking on producer code from @ml-engineer / @dl-engineer / @ai-engineer in C3, C5, C6, C8.

Stage 02

@debt-detector

Inline mechanic

Catches unused imports / functions, TODOs without tickets, logical duplication (>3 sites), cyclomatic complexity over 10. Blocking on producer output in C6 and C8.

Stage 03

@code-critic

Terminal gate

19 isolated AI-slop signals. Audits architecture, strict security adherence, runtime performance, deep maintainability limits. No code is final without explicit approval.

Stage 04

@chief-architect

Pre-deploy seal

C10 deploy gate. Cross-checks that @code-critic, @math-critic, @model-evaluator, @tester and @ai-red-teamer signed off. Sin firma, no hay deploy.

Built on

The stack underneath. Each one earned its place with a documented ADR.

Claude Opus 4.8Anthropic SDKMCPLangGraphLangChainEngramOllamaQwen 2.5 7BObsidianExcalidrawGitHubVercelNext.js 15Tailwind v4PostgreSQLRTX 2000 Ada

The 14-cycle pipeline

From Discovery to Sunset across 65 phases. Each cycle has a blocking gate at its exit — owned by a specific agent.

Discovery

@project-planner

Data

@data-validator

Feature

@math-critic

Design

@architect-ai

POC

@math-critic

Build

@code-critic

MLOps

@mlops-engineer

Quality

@model-evaluator

Pre-Prod

@deployment

C10

Deploy

@chief-architect

C11

Post-Deploy

@deployment

C12

Monitoring

@monitoring

C13

Governance

@mlops-engineer

C14

Sunset

@mlops-engineer

4th mode · /orchestrate

ML, HTB and ART are the three fixed pipelines. For the long tail no fixed pipeline covers, @architect-ai proposes a bespoke agent DAG — chosen subagents, their order, and the adversarial critics gating each node — for Adrian’s approval before any agent executes. A schema-floor validator rejects under-gated proposals at declaration; a runtime hook hard-blocks at invocation. ADR-089.

A.R.C.A — navigating the architecture map

Architecture decisions

111 numbered ADRs. Every architectural choice carries context, alternatives weighed and consequences. Below: 19 milestone records.

The adversarial agentic system that actually ships code.

⟩What A.R.C.A does

59 Specialized Agents

Adversarial Gate Chain

3 Pipelines (ML · HTB · ART)

111 Documented Decisions

148-Skill Catalog

Honest Disclosure

Dynamic Orchestration

⟩Quick look

⟩The gate chain

@math-critic

@debt-detector

@code-critic

@chief-architect

⟩Built on

⟩The 14-cycle pipeline

⟩Architecture decisions

⟩The 9 mortal sins

⟩Stack

⟩Take it for a spin

What A.R.C.A does

Quick look

The gate chain

Built on

The 14-cycle pipeline

Architecture decisions

The 9 mortal sins

Stack

Take it for a spin