Home
All servicesWeb DesignSEOAI ConsultingAgent OS
Law FirmsAboutBlogContactBook a Call →
← Back to AI Consulting
A Brand75 Framework

Agent OS — the operating system for autonomous AI agent teams.

Service-based businesses don’t need more software. They need a workforce. Agent OS is the framework we use to design, build, and deploy AI agents that run real operations — sales, marketing, ops — without supervision.

11
Agents in production
5
Lifecycle phases
6
Core operating rules
24/7
Autonomous operation
What is Agent OS

A framework for building AI teams that actually run your business.

Agent OS — the Agent OS — is how Brand75 designs autonomous AI workers that go beyond chatbots. Each agent has identity, capabilities, rules, knowledge, and a heartbeat. Together they form an organization with reporting lines, accountability, and verifiable output.

It’s not a product. It’s a methodology, proven across our own agency operations and now packaged for service businesses ready to scale without hiring.

Anatomy of every agent

IdentitySOUL.md
CapabilitiesTOOLS.md
Operating RulesAGENTS.md
Knowledge/knowledge/
HeartbeatHEARTBEAT.md
Reporting5-field format
The Five Components

Anatomy of an AI agent.

Every agent in an Agent OS team is built from the same five primitives.

I
SOUL.md

Identity

Who the agent is, what scope they own, and where their authority ends.

  • Name, role, reporting line
  • Personality and tone of voice
  • Domain of authority — what they decide vs. escalate
  • Hard boundaries — what they never touch
  • Self-reported model identity
C
TOOLS.md

Capabilities

The exact tools, paths, and commands the agent can use to act on the world.

  • Real file paths, no abstractions
  • API endpoints with required scopes
  • CLI commands and arguments
  • MCP tools with use cases
  • Capped under 10K characters to avoid context overflow
R
AGENTS.md

Operating Rules

The constraints, reporting format, and verification protocol every agent must follow.

  • Verify-before-reporting protocol
  • 5-field structured report format
  • Status taxonomy (STARTED, COMPLETED, BLOCKED, etc.)
  • Escalation paths and thresholds
  • Anti-narration rule (no filler updates)
K
/knowledge/

Knowledge Base

Domain-specific reference material the agent loads on demand.

  • Brand voice, ICP, positioning docs
  • Service-specific playbooks
  • Tool guides and integration notes
  • Past decisions and rationale
  • Indexed for fast retrieval
H
HEARTBEAT.md

Heartbeat

The recurring tasks that keep the agent moving without being asked.

  • Cron-driven check-ins
  • Per-platform monitoring cadence
  • Weekly review rituals
  • Auto-baselines after activation
  • Escalation triggers when thresholds breach
Lifecycle

Five phases from idea to autonomous worker.

No agent goes live without clearing every phase.

1

Design

Define role & scope

2

Build

Author the files

3

Validate

Pre-flight & spawn

4

Activate

Announce & baseline

5

Monitor

48h supervised watch

Phase 1 — Strategy Layer

Design

Strategy defines the agent before a single file is written. We answer: what business outcome does this agent own, what's the smallest scope that delivers it, and what would success look like in 30 days?

  • Draft role description and reporting line
  • Identify upstream and downstream agents
  • Write SOUL.md, TOOLS.md skeleton, knowledge index
  • Approve scope with the human owner before building
Phase 2 — Execution Layer

Build

Execution turns the design into real files in real directories. Templates make the bulk fast; the unique parts get hand-tuned.

  • Create agent directory structure
  • Write SOUL.md, TOOLS.md, AGENTS.md, HEARTBEAT.md
  • Sync runtime + workspace copies
  • Register the agent in core config
  • Wire up cron jobs and webhooks
Phase 3 — Pre-Flight

Validate

No agent goes live without passing every check. We spawn it in isolation, run a test task, and verify outputs match the report.

  • Schema validation on every .md file
  • Spawn test in sandbox
  • Run a single representative task
  • Verify report matches actual output
  • Accountability pass — PASS required to proceed
Phase 4 — Go Live

Activate

Activation is announcement plus baseline. The team is told the new agent exists, what they own, and how to reach them. Baselines get captured for future drift detection.

  • Announce in team channel
  • Establish performance baselines
  • Open the heartbeat schedule
  • Hand off first real production task
Phase 5 — Supervised Operation

Monitor

The first 48 hours are watched closely. The orchestrator reviews every report. After two clean days, the agent moves to autonomous status.

  • 48-hour heightened review window
  • Real production tasks only — no synthetic
  • Daily audit on all completions
  • Promote to autonomous after 48h clean run
Real Team in Production

An Agent OS team running today.

This is the actual agent org chart powering Brand75 operations. Pick a route below to see how work moves through the team — or click any agent for their role.

Inbound lead → routed → outreach drafted → approved
Orchestrator
Atlas
Routes work, owns context
Strategy / CEO
Ryan
Verification & truth
CFO / Finance
Alex
Money & spend
CMO / Marketing
Koa
Marketing org owner
COO / Operations
Aura
Systems & pipelines
Sales
Nova
Pipeline & outreach
VibeCoder
Kai
Web & product builds
Copywriter
Muse
Brand voice & copy
Scout
Scout
Research & recon
Social Media
Nalu
Multi-platform monitor
Citations
Iris
Brand intelligence
Builder
Leo
Infra & automation
Scraper
Echo
Data ingestion
Critic (peer to Atlas)
Sage
Quality gate on every public output
Orchestrator

Atlas

The router. Receives every inbound task, identifies which agent owns it, delegates with full context, and reports back to the human owner. Atlas never does the work — Atlas makes sure the work goes to the right hands.

Scope · Owns: routing, context, escalation. Delegates: everything else.
Strategy / CEO

Ryan

The accountability layer. Verifies that what an agent reported actually happened — by reading files, hitting APIs, checking artifacts. Nothing reaches the owner as "done" until Ryan returns PASS.

Scope · Owns: verification, weekly task board sweep, strategic review.
CFO / Finance

Alex

Money and spend awareness. Alex tracks what the agent team costs to run, what each loop is worth in revenue or saved hours, and where the unit economics need to change.

Scope · Owns: token spend, model cost, ROI per loop, budget guardrails.
CMO / Marketing

Koa

Marketing organization owner. Manages the creative team beneath, owns the editorial calendar, and reports marketing performance up the chain.

Scope · Owns: brand voice, content strategy, social presence, the marketing roster.
COO / Operations

Aura

Systems and pipelines. Aura makes sure the infrastructure that other agents depend on stays up — crons, webhooks, integrations, deployments.

Scope · Owns: ops infra, automation pipelines, the ops roster.
Sales

Nova

Pipeline and outreach. Nova qualifies inbound leads, drafts personalized outreach, and keeps the CRM honest.

Scope · Owns: pipeline hygiene, outreach drafts, qualification.
VibeCoder

Kai

Builds product surfaces — landing pages, web tools, lightweight apps. The digital craftsperson on the team.

Scope · Owns: web builds, page implementations, prototype tooling.
Copywriter

Muse

Brand voice keeper. Every piece of customer-facing copy goes through Muse before it ships.

Scope · Owns: copy, brand voice consistency, headline craft.
Scout

Scout

Research and reconnaissance. Scout investigates prospects, competitors, and topics on demand so the creative team works from facts instead of guesses.

Scope · Owns: prospect research, competitive scans, topical recon.
Social Media

Nalu

Multi-platform monitoring and posting. Watches YouTube, TikTok, X, and Instagram for mentions and trends, surfacing the relevant signal to Koa.

Scope · Owns: social listening, post scheduling, engagement summaries.
Citations / Brand Intelligence

Iris

Tracks where the brand and clients get cited across the web and inside AI engines. The eyes-on-search-results agent.

Scope · Owns: citation monitoring, brand visibility tracking, AI engine appearance.
Builder

Leo

Infrastructure and automation. Leo builds the plumbing — workflow flows, scripts, cron jobs — that the other agents run on top of.

Scope · Owns: workflow automation, script authoring, infra hardening.
Scraper / Data

Echo

Data ingestion. Pulls structured data from sources the other agents need — APIs, sheets, scrapes — and lands it where the team expects.

Scope · Owns: data pipelines, scraping, ingestion validation.
Critic (peer to Atlas)

Sage

The quality gate. Sage runs on a different model than primary and scores output against an LLM-as-judge rubric before anything reaches a public surface. Procedural memory only writes when Sage returns 0.7+.

Scope · Owns: pre-publish review, eval scoring, memory gating.
Embedded in every agent

Six rules that make autonomy safe.

These rules ship with every Agent OS agent. They’re the difference between an LLM that hallucinates and an agent you can trust to act unsupervised.

RULE 01

Verify Before Reporting

An agent never claims a task is done. It re-reads the file, runs the check, queries the API, then reports. Trust comes from evidence.

RULE 02

Anti-Narration

No "I’m going to start by…" filler. Agents lead with action and report results. The diff is the proof, not the prose.

RULE 03

Experience Logging

Every non-trivial run writes a structured log entry. Future runs read it. The team learns instead of repeating mistakes.

RULE 04

5-Field Report Format

task_id · status · result · blockers · next_step. Five fields, every time. Parsing is easy and escalation is automatic.

RULE 05

Status Taxonomy

STARTED · COMPLETED · BLOCKED · ESCALATED · DELEGATED. Five states, no ambiguity. The orchestrator routes based on these alone.

RULE 06

Operating Model Lock

Each agent declares its own model identity. Routing changes propagate to the agent's self-report. No drift, no surprises.

When things go wrong

Six failure modes — and how Agent OS catches them.

Autonomous systems fail. The framework’s job is to make every failure visible, fast.

Failure ModeHow It Shows UpThe Fix
Hallucinated completionAgent reports COMPLETED but artifact is missingVerify-before-report + Ryan audit pass
Context overflowTOOLS.md or AGENTS.md exceeds 10K charsAuto-cap with overflow into PROTOCOLS.md
Silent rate-limitModel 429s mid-task, agent hangsHealth monitor failover to fallback chain
Scope creepAgent acts outside SOUL.md authorityHard boundaries in identity + escalation
Stale knowledgeMemory says X exists, X was renamedVerify-before-recommend on every memory hit
Cascading failureOne agent breaks, queue backs upHeartbeat health checks + isolated retries
Built and Running

The systems behind the team.

Agent OS isn’t a slide deck — the agents above run on real infrastructure that’s already in production.

Approval Manager

live · gates production actions

Any irreversible action — production deploys, payments, outbound posts, data destruction — is held in a pending queue with a Discord ping. Nothing ships until the human owner approves. A separate daily reconciliation job audits the upstream system independently, so bypassed approvals get caught after the fact.

Observability

live · Langfuse self-hosted

Every agent session is shipped as a trace with tool calls, fallbacks, and per-axis quality scores. Engineering view for debugging, executive view for cost, quality view for the critic. Sidecar pattern — no runtime patches — so it survives every upgrade.

Three-Layer Memory

live · episodic · semantic · procedural

What happened (sessions), what is known (durable facts), and how to do things (recipes). Every entry is tagged by source so agent inferences never get retrieved as ground truth. A pre-compaction watcher rescues durable facts before they fall off the context window.

Critic Gate (Sage)

live · different model from primary

Sage scores every user-facing output against an LLM-as-judge rubric — factual accuracy, citation accuracy, completeness, source quality, tool efficiency. New recipes only enter procedural memory if Sage clears them at 0.7 or higher.

Eval Suite

live · grows from real failures

Each agent has a seed task set scored per rubric axis. Production traces that fail the critic become new eval cases. The suite gets stricter the longer the team runs — drift gets caught, not absorbed.

Version Bundle

live · daily 3 AM PT snapshot

Code, prompts, durable memory, and auth profiles snapshot together every night with 14-day retention. Rollback is holistic — a bad prompt change doesn’t outlive itself. Recovery from any single day is one command.

Why this works

Principles, not magic.

Agent OS works because it borrows from how real organizations are built — clear roles, written rules, accountability loops — and translates them into structures an LLM can actually follow.

Roles, not prompts

Each agent owns a domain end-to-end. Prompts vanish on cold start; roles persist across sessions, models, and weeks.

Files, not memory

The team's truth lives in version-controlled files. Memory drifts. Files don't.

Verifiable autonomy

Every agent action produces a check the next agent (or human) can audit. Trust scales because evidence scales.

Composable structure

Add a new agent in days, not weeks. The framework is the same. The role description changes.

Human-on-the-loop

The owner approves scope, reviews escalations, and shapes strategy. Agents handle the work, not the judgment.

Built for service businesses

Agencies, law firms, consultancies. Anywhere repeatable knowledge work eats founder hours, Agent OS gives that time back.

Ready to build your team?

Brand75 designs and deploys Agent OS agent teams for service businesses. We start with a single agent, prove it pays for itself, then scale.

Start a conversation →