Autonomous AI Operations · Beta

Autonomous AI That
Runs Your Operations

Self-hosted autonomous AI operations platform. A primary agent — you name it; it works like an autonomous chief of staff — monitors your channels, drives a real authenticated browser, delegates to a swarm of specialist peers, and runs the back office 24/7 across Telegram, WhatsApp, Discord, Slack, Email and its own SwarmAI Messenger app — all under hardware-key auth + MFA.

PS>powershell -c "irm https://northpeak.app/install.ps1 | iex"

Installs Node 22+, pnpm 10+, git, then clones the repo and links the global swarmai command. Idempotent · honours SWARMAI_HOME / SWARMAI_REPO env vars · full install guide

28
Dashboard Modules
200+
Tools
15
LLM Providers
6
Channels
1,200+
Tests
24/7
Standby
Capabilities

Everything Your Operations Agent Needs

A 28-module desktop dashboard, grouped into thematic areas. Every feature ships with tests, audit logs, and master-gated approvals where mutation is involved.

Peer-bus · live graph

Live Peer-Bus Swarm

Watch the whole organisation think in real time. The orchestrator agent sits at the centre of a constellation of department peers — security, finance, strategy, legal, tech, ops — with ask / reply / assign edges animating as work flows between them. Ring, Shells, Sectors and Bubbles views.

peer meshaskreplyassignlive edges
Desktop app · team chat

SwarmAI Messenger

A dedicated desktop messenger where your team and the AI share one contact list. Human↔human chats cost zero tokens; the AI contact is clearly badged “uses tokens”. Markdown, files & images, reactions, reply / edit / delete, voice notes, link unfurl and full-text search — and the agent can register & invite teammates itself.

electronvoice notesreactionsFTSinvite-gated
Learning loop · bandit routing

ML Observatory

The swarm learns from its own outcomes. A Thompson-sampling bandit ranks models on a 7-signal reward, a model registry promotes shadow→active, and built-in heads cover sentiment, triage, tool-failure prediction and anomaly detection — with a fine-tune studio for custom scorers and LoRA datasets.

banditreward modelsentimentanomaliesfine-tune
Autonomy · self-improvement sweep

Playtime & Self-Evolving Core

In idle time the agent runs supervised “playtime” sweeps — proposing new triggers, flows and skills from what it observed — while a sealed, sandboxed self-modification engine can draft, test and (on approval) promote brand-new tools. Inert by default; you decide what ships.

playtime sweepself-evolvingsandboxedmaster-gated
Token & dollar telemetry

Cost & Budget Control

Every token and dollar is captured per category, agent, provider, model and tier — charted over time with drill-down and burn-down, and capped by budgets with no-cap or hard-limit modes. Autonomy you can actually afford and audit.

per-categorybudgetsburn-downCSV / JSON
Office docs · interactive .ipg

Document & Artifact Studio

The agent produces real deliverables — Word, PowerPoint, Excel and PDF with letterheads, charts and brand templates — rendered inline as artifact cards. It also emits .ipg Interactive Portable Graphics: live, self-contained HTML dashboards you can open in the IPG Viewer.

docxpptxxlsxpdf.ipg
Vertical packs · Hub marketplace

Domain Verticals & Hub

Specialise the swarm for a vertical — agriculture, security ops, sales — with installable domain packs, then extend it from the Hub marketplace of tools, channels, skills and MCP servers. Author your own tools and MCP servers right from chat.

domain packshubmcp authoringtools authoring
Phase 11 · ~17 tools

Conversational Configuration

Configure the platform by chatting. Schedule cron jobs, enroll channels, edit personas, define triggers, manage masters — no YAML hand-editing required.

cron.addchannel.enrollpersona.edittrigger.create
Phase 7

Background Tasks

Fire-and-forget peer dispatch via peer_ask_background. SQLite-backed registry, multi-channel notification fan-out (Telegram/WhatsApp/Discord/Slack DM on completion), and main-session wake-up when results land.

queuedrunningcompletedcancelled
Phase 13

Real Browser Control

Native Chrome / Edge extension drives your already-authenticated browser. No Playwright, no headless puppeteer — just DOM access plus opt-in screenshots from the tab you're actually logged into.

dom.readclicktypescreenshot (opt-in)
6 channels

Multi-Channel Messaging

Talk to your agent from anywhere. Telegram, WhatsApp Cloud API, WhatsApp Personal (free QR scan via Baileys), Discord, Slack, plus the dashboard. Every channel uses DM-pairing — the agent only responds to authorised contacts.

telegramwhatsappdiscordslackdashboard
Tiered Model Tree · health-ranked routing

Fifteen LLM Providers

OpenRouter, Anthropic, OpenAI, Gemini, Bedrock, GitHub Copilot, Ollama (local), MiniMax, Qwen, Kimi, the Claude Code / Codex / Gemini CLI agents, and any OpenAI-compatible endpoint. The Model Tree routes per task class (heavy / average / simple) with per-provider quotas, circuit breakers and configurable failover.

openrouteranthropicopenaigeminibedrockcopilotollamacli-agents
Phase 6

Session Replay & Time Travel

Scrub the timeline of any past conversation, branch from any turn into a new session, and export to JSON or NDJSON. Diff between branches to see exactly what changed.

replaybranchdiffexport
Hardware key · MFA · Scopes

Enterprise-Grade Auth

ed25519 hardware key (CLI sign or WebAuthn / Touch ID / Windows Hello / YubiKey). TOTP MFA with 10 single-use recovery codes. Granular scopes. Bearer tokens with silent rotation at 50% TTL.

ed25519webauthntotpscopes
Phase 12

4-Level Emergency Stop

Pause work fast: soft (drain), cancel-all (running tasks), freeze (no new work), kill (immediate halt). Triggered by slash commands or natural language — "stop everything now" works.

/soft/cancel-all/freeze/kill
4 baseline + 8 optional layers

Self-Healing

LLM-call retries (rate limit, context overflow, malformed JSON), process supervisor (heartbeat, circuit breaker), workspace file integrity (LEDGER seal chain), capability-gap detection — plus eight Owner-selectable advanced strategies.

healingsupervisorintegrityautonomy
Phases 1A + 3

Tauri Desktop App

Native Windows / macOS / Linux shell built on Tauri 2. System tray, auto-updater scaffold, three themes (Midnight Forge / Twilight Command / Daylight Brief). Falls back to a PWA in the browser.

tauri 2trayupdaterthemes
Phase 8

Always-on Daemon

One command (swarmai setup --install-daemon) installs a native service unit. Windows uses NSSM, macOS launchd (user agent), Linux systemd (user unit — no root required).

nssmlaunchdsystemd
HMAC-sealed

Tamper-Evident Audit Ledger

Every operation appended to LEDGER.md with a chained HMAC seal. Workspace integrity check verifies the chain on boot — any tampering is detected immediately.

append-onlyhmac chainverifiable
Memory default · Redis optional

Distributed Rate Limiter

Per-IP, per-token, per-endpoint limits. In-memory by default (zero-config), Redis-backed when REDIS_URL is set — for multi-instance deploys behind a load balancer.

memoryredisper-IPper-token
Phase 14

Pretty Terminal Logger

swarmai logs tails server output like docker logs -f. Colour-coded levels, JSON-aware structure, filterable by category, follows rotation.

swarmai logstail -fcolourised
Architecture

How It All Connects

A primary orchestrator agent at the centre, peer agents spawned on demand, channels feeding inbound traffic, the browser extension on the side, vault and master-auth keeping it all locked down.

Inbound — Channels & Triggers
Telegram
WhatsApp
Discord
Slack
Messenger
Dashboard
RSS / IMAP / Webhook
Orchestrator Agent — The Reasoning Loop
Main Session · Orchestrator
Peer Mesh — Spawned at Runtime
security
tech-lead
finance
strategy
legal
sales / ops
custom…
Capability Layer
Tool Registry (200+)
Peer-Bus + Meetings
Browser + Devices
Memory + RAG (Qdrant)
ML Observatory
Cost & Budget
Replay / Branch
Security & State (mode 0600 on disk)
Vault (AES-256-GCM)
Master Auth + MFA
Hardware Keys
HMAC-Sealed LEDGER
Approval Queue
Quick Start

From Zero to Running in Three Steps

The installer takes care of Node, pnpm, git, and the workspace. You only do three things.

1

Install

One command on your platform. Idempotent — re-run anytime to upgrade.

# Windows (PowerShell) powershell -c "irm https://northpeak.app/install.ps1 | iex" # macOS / Linux curl -fsSL https://northpeak.app/install.sh | bash
2

Setup

Interactive walkthrough — pick LLM provider, paste API key, set master passphrase, optionally enroll channels.

swarmai setup
3

Start

Boots the agent server (port 7910) and dashboard (port 18789), and opens your browser.

swarmai start
+

Always-on (optional)

Install as a native daemon — starts at boot, no Docker required.

swarmai setup --install-daemon swarmai daemon status
Device Agent

Run on Every Device You Own

A small daemon turns any machine — phone, laptop, desktop, server — into an endpoint your primary agent can drive. Outbound WebSocket only, so devices behind NAT (carrier networks, corporate firewalls, hotel WiFi) work without port-forwarding, static IPs, or SSH tunnels.

🖥️

Desktop Control

27+ native tools: clipboard read/write, screenshot, window list/focus/close/move, app launch, volume, brightness, caffeinate, system info, battery, network interfaces, process management.

  • macOS · Linux · Windows · Android
  • Master-gated on mutating ops
⚙️

CLI Tools, Auto-Wrapped

50 bundled manifests cover git, gh, docker, kubectl, helm, npm, pnpm, cargo, aws, gcloud, az, psql, redis-cli, terraform, ffmpeg, jq, curl, rg, fd, and 30+ more.

  • Read-only subs auto-pair-gated
  • Mutating subs master-gated
📱

Android via Termux

Full mobile device control with Termux:API + Shizuku — battery, wifi toggle, bluetooth, brightness, volume, screen lock, screenshot, app launch, calls, SMS compose, WhatsApp compose, GPS location, vibration.

  • No root required
  • One-tap pair from gateway
🔌

Editor Protocol (ACP)

JSON-RPC server for VS Code, Zed, JetBrains, Cursor. Agent reads open buffers, fetches diagnostics, proposes inline edits (you accept/reject), drives the integrated terminal.

  • Inline diff preview
  • Editor commands via verbs
🔐

Secure by Default

Single-use 6-char pair codes (5-min TTL, ambiguous chars excluded), sha256-hashed token storage, per-device tool-policy floor, master-revocable from the gateway. Token lives at ~/.swarmai/device-agent.yaml chmod 0600.

  • Bearer-subprotocol WS auth
  • Per-call invocation audit log
📡

Outbound Connection

Device opens a WebSocket OUT to your gateway URL — works behind any NAT/firewall a normal browser works behind. Exponential reconnect backoff, in-flight invocation draining on shutdown.

  • Auto-reconnect 1s → 60s
  • Verbose mode for live MainAgent ↔ Device traffic

Install the daemon

# Windows (PowerShell) powershell -c "irm https://northpeak.app/device-agent.ps1 | iex" # macOS / Linux / Termux (Android) curl -fsSL https://northpeak.app/device-agent.sh | bash

On Android: install Termux + Termux:API from F-Droid first — the installer walks you through Shizuku pairing. Already have SwarmAI installed locally and just want the desktop tools added in-process? Use agent.sh / agent.ps1 instead.

Tech Stack

Built on Boring Tech, Where it Counts

TypeScript strict, ESM-only, plugin-boundary lint enforcement — the codebase is built to be readable and modifiable.

Language & Runtime

TypeScript strict Node 22+ pnpm 10+ ESM-only Turbo

Frontend

React 18 Vite 5 Tailwind 3.4 shadcn/ui Framer Motion Zustand xterm.js

Desktop

Tauri 2 System tray Auto-updater

Storage

SQLite (better-sqlite3) FTS5 Qdrant (vectors) Redis (optional)

Auth & Crypto

@noble/ed25519 otplib (TOTP) @simplewebauthn AES-256-GCM

Channels & Messenger

Baileys (WhatsApp) Telegram bot API Discord interactions Slack events IMAP / SMTP email Messenger (Electron)

Observability

Prometheus /metrics HMAC audit ledger SSE log tail

Quality Gates

Vitest ESLint flat config --max-warnings 0 zod (everywhere) CI lint+typecheck+test
Status

Honest, Up-Front, Pre-1.0

No marketing fluff — here's the real state of the project.

Self-Hosted

Your masters file, vault, audit ledger, and session DB all live on your machine. No data leaves the box except for the LLM provider you wire in.

Free & Open Source

The platform itself is free. You only pay for the LLM calls — with your own provider key. Bring OpenRouter, Anthropic, OpenAI, Gemini, or run Ollama locally for zero LLM cost.

Beta · Pre-1.0

A 28-module dashboard, 200+ tools and 1,200+ tests passing, lint-clean across a ~95-package monorepo — but APIs may still evolve before 1.0. Run it, file issues, contribute fixes.