LOCAL-FIRST AI OPERATING SYSTEM

Your AI works
while you sleep.

DevOS runs 100% on your machine — your data never leaves.
31 agents. 18 tools. 10 LLM providers. 6 channels. Computer control.
Free with Ollama. Auto-detects your hardware. Zero telemetry. Zero compromise.

$ npx devos-ai
Requires Node 18+ and Ollama · Windows, macOS, Linux · Auto-configures for your GPU
CORE ENGINE

Everything wired together.

The foundation for autonomous planning, verified execution, and persistent learning. Built from scratch — no wrappers.

TruthCheck + FaultEngine

Every action verified

TruthCheck verifies postconditions after every action. FaultEngine classifies errors into 8 categories and applies targeted repairs. Zero silent failures.

new · postconditions · 8 error types · auto-repair
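
As a rough illustration of postcondition checking and error classification, here is a minimal sketch. Every name in it (`Postcondition`, `runVerified`, `classifyFault`) is hypothetical, and the four fault categories are an illustrative subset, since this page does not list the actual eight.

```typescript
// Hypothetical types: none of these names come from the DevOS codebase.
type Postcondition = { description: string; holds: () => boolean };

type ActionResult =
  | { ok: true }
  | { ok: false; failed: string[] };

// Run an action, then check every postcondition instead of trusting the action itself.
function runVerified(action: () => void, postconditions: Postcondition[]): ActionResult {
  action();
  const failed = postconditions.filter((p) => !p.holds()).map((p) => p.description);
  return failed.length === 0 ? { ok: true } : { ok: false, failed };
}

// Illustrative subset of fault categories (assumption, not the real eight).
type FaultCategory = "timeout" | "permission" | "not_found" | "unknown";

function classifyFault(message: string): FaultCategory {
  const m = message.toLowerCase();
  if (m.includes("timed out") || m.includes("timeout")) return "timeout";
  if (m.includes("eacces") || m.includes("permission denied")) return "permission";
  if (m.includes("enoent") || m.includes("not found")) return "not_found";
  return "unknown";
}

// Usage: an action that silently does nothing is caught by its postcondition.
const files = new Map<string, string>();
const result = runVerified(
  () => { /* buggy action: forgets to write out.txt */ },
  [{ description: "out.txt exists", holds: () => files.has("out.txt") }],
);
```

A classifier like this would let a repair step choose a targeted response per category rather than retrying blindly.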
GoalEngine v2

Goal → Project → Task

Full hierarchy execution. Parallel tasks per project. Pause, resume, replan mid-run. Smart retry with a different strategy on each failure.

DAG executor · parallel tasks · smart retry
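
One simple way to run a task graph in parallel is to group tasks into waves, where each wave depends only on earlier waves. The sketch below assumes a hypothetical `Task` shape and is not the GoalEngine implementation; a production scheduler would likely start each task as soon as its own dependencies finish rather than waiting for a whole wave.

```typescript
// Hypothetical task shape; names are illustrative only.
interface Task {
  id: string;
  dependsOn: string[];
  run: () => Promise<void>;
}

// Group tasks into waves: every task in a wave has all of its dependencies
// in earlier waves, so tasks inside one wave can run in parallel.
function planWaves(tasks: Task[]): string[][] {
  const done = new Set<string>();
  let remaining = [...tasks];
  const waves: string[][] = [];
  while (remaining.length > 0) {
    const ready = remaining.filter((t) => t.dependsOn.every((d) => done.has(d)));
    if (ready.length === 0) throw new Error("cycle or missing dependency in task graph");
    waves.push(ready.map((t) => t.id));
    ready.forEach((t) => done.add(t.id));
    remaining = remaining.filter((t) => !done.has(t.id));
  }
  return waves;
}

// Run the waves in order; tasks within a wave run concurrently.
async function executeDag(tasks: Task[]): Promise<void> {
  const byId = new Map<string, Task>(tasks.map((t): [string, Task] => [t.id, t]));
  for (const wave of planWaves(tasks)) {
    await Promise.all(wave.map((id) => byId.get(id)!.run()));
  }
}
```

For a graph where `b` and `c` both depend on `a`, and `d` depends on both, the plan is three waves, with `b` and `c` sharing the middle one.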
MissionCanvas

Shared agent state

All 31 agents read and write to a shared canvas. CEO always has full mission context. No direct agent messages — canvas only.

new · shared state · full context
MemoryLayers + MemoryStrategy

Memory that learns

HOT (RAM) → WARM (SQLite) → COLD (compressed). MemoryStrategy tracks successful action sequences per goal and replays them the next time that goal comes up.

new · sql.js · learns from success · cross-session
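
The idea of remembering what worked can be sketched as a lookup keyed by goal. This is an assumption-laden illustration: `StrategyMemory` and `Step` are hypothetical names, and a plain in-memory `Map` stands in for the HOT tier only; the SQLite-backed WARM tier and compressed COLD tier are left out.

```typescript
// Hypothetical sketch: an in-memory Map stands in for the HOT tier only.
type Step = { tool: string; input: string };

class StrategyMemory {
  private byGoal = new Map<string, Step[]>();

  // Normalise the goal text so trivially different phrasings share one entry.
  private key(goal: string): string {
    return goal.trim().toLowerCase().replace(/\s+/g, " ");
  }

  // Store a copy of the steps that completed the goal successfully.
  recordSuccess(goal: string, steps: Step[]): void {
    this.byGoal.set(this.key(goal), steps.map((s) => ({ ...s })));
  }

  // Returns the stored sequence, or undefined if this goal has not succeeded before.
  recall(goal: string): Step[] | undefined {
    return this.byGoal.get(this.key(goal));
  }
}

// Usage: a later request with different casing and spacing finds the same sequence.
const memory = new StrategyMemory();
memory.recordSuccess("Reply to  unread emails", [{ tool: "gmail_api", input: "list unread" }]);
const replay = memory.recall("reply to unread emails");
```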
ModelRouter + SetupWizard

Auto-configures for your GPU

First boot scans your hardware, checks installed models, and recommends the best setup for your specific machine. Never installs what you already have.

new · hardware detection · smart model pick
SkillVault + LiveView

Docker sandboxed execution

Community skills run in isolated Docker containers (512MB, 0.5 CPU, no network). Browser tasks stream via noVNC — watch DevOS browse live.

new · docker sandbox · noVNC live
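
The limits quoted above map directly onto standard `docker run` flags. The sketch below builds such an argument list; the helper name, the `SandboxLimits` type, and the image name in the usage line are hypothetical, and how DevOS actually launches containers is not shown on this page.

```typescript
// Limits taken from the text above; the helper and type names are hypothetical.
interface SandboxLimits {
  memory: string;               // passed to --memory
  cpus: number;                 // passed to --cpus
  network: "none" | "bridge";   // passed to --network
}

const SKILL_LIMITS: SandboxLimits = { memory: "512m", cpus: 0.5, network: "none" };

// Build the argument list for `docker run`. Returning an array rather than a
// shell string avoids quoting problems when it is handed to a process spawner.
function dockerRunArgs(
  image: string,
  command: string[],
  limits: SandboxLimits = SKILL_LIMITS,
): string[] {
  return [
    "run",
    "--rm",
    `--memory=${limits.memory}`,
    `--cpus=${limits.cpus}`,
    `--network=${limits.network}`,
    image,
    ...command,
  ];
}

// Usage with a made-up image name.
const sandboxArgs = dockerRunArgs("community-skill:latest", ["node", "index.js"]);
```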
Executor Unified Layer

Retry, fallback, verify

Every action runs through a unified Executor — timeout protection, exponential backoff retries, automatic fallback actions, TruthCheck on every success.

new · retry + backoff · fallback chain
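
A retry wrapper of the kind described here might look like the following sketch. The function names, option names, and the default delays (200 ms base, 5 s cap) are assumptions chosen for illustration; `verify` stands in for a TruthCheck-style check.

```typescript
// Delay before retry number `attempt` (0-based): base, 2x base, 4x base, capped at maxMs.
function backoffDelay(attempt: number, baseMs = 200, maxMs = 5000): number {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}

const sleep = (ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms));

// Reject if `work` does not settle within `ms` milliseconds.
function withTimeout<T>(work: Promise<T>, ms: number): Promise<T> {
  return new Promise<T>((resolve, reject) => {
    const timer = setTimeout(() => reject(new Error(`timed out after ${ms}ms`)), ms);
    work.then(
      (value) => { clearTimeout(timer); resolve(value); },
      (err) => { clearTimeout(timer); reject(err); },
    );
  });
}

interface ExecOptions<T> {
  retries: number;
  timeoutMs: number;
  verify: (result: T) => boolean;   // stand-in for a postcondition check
  fallback?: () => Promise<T>;
}

// Try the action with a timeout, retry with exponential backoff, then try the fallback.
async function execute<T>(action: () => Promise<T>, opts: ExecOptions<T>): Promise<T> {
  for (let attempt = 0; attempt <= opts.retries; attempt++) {
    try {
      const result = await withTimeout(action(), opts.timeoutMs);
      if (opts.verify(result)) return result;
    } catch {
      // Treat errors and timeouts like a failed verification and retry.
    }
    if (attempt < opts.retries) await sleep(backoffDelay(attempt));
  }
  if (opts.fallback) {
    const result = await withTimeout(opts.fallback(), opts.timeoutMs);
    if (opts.verify(result)) return result;
  }
  throw new Error("action failed after retries and fallback");
}
```

Note that a result only counts as success once `verify` accepts it, so an action that returns without error but leaves the wrong state is retried too.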
PrecisionEdit

Surgical file editing

The LLM plans specific line ranges to change. Never rewrites whole files. TypeScript verification after every edit. Lines changed vs. preserved are logged.

new · line-level edits · tsc verify
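
Line-range editing with a changed-vs-preserved count can be sketched as below. The `LineEdit` format and `applyLineEdits` helper are hypothetical, and the TypeScript verification step is not included.

```typescript
// Hypothetical edit format: replace lines start..end (1-based, inclusive) with `replacement`.
interface LineEdit {
  start: number;
  end: number;
  replacement: string[];
}

interface EditReport {
  text: string;
  linesChanged: number;
  linesPreserved: number;
}

function applyLineEdits(source: string, edits: LineEdit[]): EditReport {
  const lines = source.split("\n");
  const originalCount = lines.length;
  // Apply from the bottom up so earlier line numbers stay valid.
  const ordered = [...edits].sort((a, b) => b.start - a.start);
  let changed = 0;
  let previousStart = Infinity;
  for (const e of ordered) {
    if (e.start < 1 || e.end < e.start || e.end > lines.length) {
      throw new Error(`invalid range ${e.start}-${e.end}`);
    }
    if (e.end >= previousStart) throw new Error("overlapping edits");
    lines.splice(e.start - 1, e.end - e.start + 1, ...e.replacement);
    changed += e.end - e.start + 1;
    previousStart = e.start;
  }
  return {
    text: lines.join("\n"),
    linesChanged: changed,
    linesPreserved: originalCount - changed,
  };
}

// Usage: replace lines 2-3 of a four-line file with a single line.
const report = applyLineEdits("a\nb\nc\nd", [{ start: 2, end: 3, replacement: ["X"] }]);
```

Here `linesChanged` counts original lines that were replaced, so the report for this usage shows two changed and two preserved.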
BYOK · 10 Providers

Any model, any price

Ollama free by default. ModelRouter auto-picks the best model for each task type based on your VRAM. DataGuard routes sensitive data to local only.

ollama · groq · claude · gemini · +6 more
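
Routing sensitive prompts to a local model could work along these lines. This is only a sketch: the provider list is shortened, the sensitivity patterns are crude examples invented for illustration, and `pickProvider` is a hypothetical name rather than the DataGuard API.

```typescript
// Hypothetical provider list and sensitivity patterns, for illustration only.
type Provider = { name: string; local: boolean };

const PROVIDERS: Provider[] = [
  { name: "ollama", local: true },
  { name: "groq", local: false },
];

// Very rough sensitivity check: token-like strings and email addresses.
const SENSITIVE_PATTERNS: RegExp[] = [
  /\b(sk|ghp)_[A-Za-z0-9]{16,}\b/,
  /[\w.+-]+@[\w-]+\.[\w.]+/,
];

function isSensitive(prompt: string): boolean {
  return SENSITIVE_PATTERNS.some((re) => re.test(prompt));
}

// Sensitive prompts may only go to a local provider; others use the preferred one.
// An unknown preferred provider also falls back to local.
function pickProvider(prompt: string, preferred: string): Provider {
  const wanted = PROVIDERS.find((p) => p.name === preferred);
  if (isSensitive(prompt) || !wanted) {
    const local = PROVIDERS.find((p) => p.local);
    if (!local) throw new Error("no local provider configured");
    return local;
  }
  return wanted;
}
```

With this sketch, `pickProvider("summarise this article", "groq")` returns the groq entry, while a prompt containing an email address is sent to ollama regardless of the preference.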
WHAT DEVOS CAN DO

18 built-in capabilities.

Everything DevOS can do autonomously — no plugins, no setup, works out of the box.

Run Commands
Executes terminal commands on your machine — installs packages, runs scripts, anything you'd type yourself.
Create & Edit Files
Creates, reads, updates, and deletes files and folders anywhere on your computer.
Browse the Web
Opens websites, clicks links, fills forms — operates a real browser just like you would.
Search the Web
Searches Google and fetches page content to research topics, find documentation, or gather data.
Run Python Code
Writes and runs Python scripts — data analysis, automation, machine learning, anything Python can do.
Run JavaScript
Executes Node.js code — builds APIs, processes data, runs tests, interacts with npm packages.
Windows Automation
Runs PowerShell scripts natively — manages Windows settings, processes, services, and system tasks.
Send Notifications
Sends alerts when tasks complete, fail, or need your attention, on your desktop and across all 6 channels.
System Info
Reads your GPU, CPU, RAM, disk usage — DevOS knows your hardware and optimises accordingly.
Deploy to Vercel
Pushes your project live to Vercel in one step — no manual dashboard or CLI needed.
Git & GitHub
Commits code, pushes to GitHub, manages branches — full version control built in.
Fetch Any URL
Downloads content from any web address — APIs, JSON data, HTML pages, raw files.
Plugin Support
Connects to any MCP plugin — extend DevOS with community tools, no coding required.
Docker Sandbox
Runs untrusted or community skills in an isolated container — safe, contained, no risk to your system.
Live Browser View
Streams the browser live to your dashboard via noVNC — watch DevOS work in real time.
Screen Capture
Takes screenshots of your screen — used by VisionLoop to see what's happening and decide next actions.
Mouse & Keyboard
Moves the mouse, clicks buttons, types text — real computer control, not simulation.
API-First Execution
Checks for a service API before resorting to UI automation. Gmail API beats clicking — 10x more reliable.
devos — live execution local · no data sent
COMPUTER CONTROL

DevOS can control your computer.

VisionLoop takes a screenshot, asks the vision model what to do next, executes the action, verifies it worked, and repeats. All locally. All safely.

VisionLoop
Screenshot → llava:13b (local) or Claude vision → parse action → execute → verify → repeat. Up to 20 iterations per goal. Confidence gate at 65% — anything lower goes to CommandGate for approval.
APIRegistry
API-first execution. DevOS checks if a service has an API before touching the UI. Gmail, Sheets, GitHub, Notion — use their APIs. UI automation is the fallback, not the default. 10x more reliable.
CommandGate
Approval before any dangerous operation. Low-confidence actions, file deletions, external sends: all require your yes before execution. CLI prompt or Telegram inline keyboard. 60-second timeout.
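
The loop and approval gate described above can be sketched as follows. The 65% threshold and 20-iteration cap come from the text; the action names, the `LoopDeps` interface, and the synchronous style are simplifying assumptions (a real loop would await the vision model and the user).

```typescript
// Hypothetical action shape modelled on a { action, confidence } model reply.
interface ProposedAction {
  action: "click" | "type" | "delete_file" | "send_external";
  confidence: number; // 0..1 as reported by the vision model
}

const CONFIDENCE_GATE = 0.65;
const MAX_ITERATIONS = 20;
const ALWAYS_APPROVE = new Set<ProposedAction["action"]>(["delete_file", "send_external"]);

// Low-confidence or inherently risky actions need a human yes first.
function needsApproval(a: ProposedAction): boolean {
  return a.confidence < CONFIDENCE_GATE || ALWAYS_APPROVE.has(a.action);
}

interface LoopDeps {
  propose: (iteration: number) => ProposedAction | null; // null means the goal is reached
  approve: (a: ProposedAction) => boolean;               // stand-in for an approval prompt
  execute: (a: ProposedAction) => void;
}

// Returns the number of actions executed; stops at MAX_ITERATIONS or on a denied approval.
function visionLoop(deps: LoopDeps): number {
  let executed = 0;
  for (let i = 0; i < MAX_ITERATIONS; i++) {
    const next = deps.propose(i);
    if (next === null) break;
    if (needsApproval(next) && !deps.approve(next)) break;
    deps.execute(next);
    executed++;
  }
  return executed;
}
```

Stopping on a denied approval is one possible policy; asking the model for an alternative action would be another.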

How computer control works

Give DevOS a goal like "reply to my unread emails" and watch it work. It checks for a Gmail API first, uses it if available, and falls back to browser automation if not. Every action is verified by TruthCheck.

API-first · vision model · local llava · claude vision · BYOK · memory learns
devos automate "reply to unread emails"
CommandGate → approval requested
APIRegistry → Gmail API found ✓
VisionLoop → screenshot taken
llava:13b → { action: "click", confidence: 0.92 }
TruthCheck → action verified ✓
MemoryStrategy → sequence saved
Done in 3 iterations · learned for next time
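
The API-first rule can be pictured as a registry lookup with UI automation as the default branch. The registry contents, handler signature, and returned strings below are all hypothetical placeholders, not the APIRegistry interface.

```typescript
// Hypothetical registry: service names map to an API-backed handler when one exists.
type Handler = (goal: string) => string;

const apiRegistry = new Map<string, Handler>([
  ["gmail", (goal: string) => `api:gmail:${goal}`],
]);

// Fallback used only when no API handler is registered for the service.
const uiAutomation: Handler = (goal: string) => `ui:${goal}`;

// Prefer the registered API handler; fall back to UI automation otherwise.
function route(service: string, goal: string): string {
  const viaApi = apiRegistry.get(service.toLowerCase());
  return (viaApi ?? uiAutomation)(goal);
}
```

Here `route("Gmail", "reply")` takes the API path, while a service with no registered handler is sent to the UI fallback.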
FIRST BOOT

Scans your machine.
Configures itself.

DevOS detects your GPU, checks what models you already have, and asks before downloading anything. Works on any hardware — RTX 4090, M3 Pro, GTX 1060, integrated Intel.

devos — first boot setup
┌─────────────────────────────────────────┐
│ DevOS — First Boot │
└─────────────────────────────────────────┘

Hey! I just scanned your machine.

  GPU    →  NVIDIA RTX 3080 · 10GB VRAM · CUDA ✓
  RAM    →  32GB
  OS     →  Windows 11

Good news — I found models already installed:

  chat      →  mistral:7b          ✓ installed, good fit
  code      →  qwen2.5-coder:7b    ✓ installed, perfect
  vision    →  moondream           ↑ upgrade: llava:7b
  reasoning →  mistral:7b          ✓ installed
  embedding →  nomic-embed-text    ✓ installed

  Apply these settings and download llava:7b? (yes / no / skip)
CHAT INTERFACE

Say it. DevOS does it.
You watch it happen.

Chat like a human. Every agent action streams live on the right. Full transparency — no black box. Fully local.

DevOS Dashboard v2 local · offline capable
Chat ollama · qwen2.5-coder:7b
LivePulse live
💬
Natural language
Talk to DevOS like a person. No syntax required — just describe what you want done.
Real verified actions
Creates files, runs code, deploys apps, controls your computer — TruthCheck verifies every single action.
🔍
Full transparency
LivePulse streams every agent thought, tool call, and decision in real time. No black box.
CHANNELS

Control DevOS from anywhere.

Send goals from your phone. Get results on any device. 6 channels working out of the box.

Web Dashboard
● Live
Next.js 3-panel layout. Chat + LivePulse agent feed + navigation. MissionCard inline goal tracking. xterm.js terminal built in.
Telegram
● Live
Streaming edits. /mission /dawn /echo commands. CommandGate phone approvals for dangerous ops before execution.
WhatsApp
● Live
whatsapp-web.js. Send goals from your number. Get notified on completion. No third-party API — runs locally.
Discord
● Live
discord.js bot. Goal notifications to any channel or DM. Auto-starts when token configured.
Slack
● Live
Socket Mode Bolt app. Start goals via DM. Goal completion and failure notifications to any channel.
Email
● Live
IMAP polling every 60s. Email a goal, get a reply when done. SMTP notifications with full execution summary.
AGENT TEAM

31 expert agents.

Each has a 400–600 word specialist system prompt. The CEO orchestrates them all via MissionCanvas. AgentDen gives each agent a private workspace.

CEO Agent
Strategic Orchestrator
Goal decomposition, task assignment, MissionLog tracking.
Engineer Agent
TypeScript Specialist
Writes and runs code. PrecisionEdit for surgical changes.
QA Agent
Testing + Validation
Two-stage review. Blocks Engineer on failure.
Research Agent
Intelligence Specialist
Web research with CRAAP fact-checking.
Designer Agent
UI/UX Specialist
Runs design system before any UI work.
Deployment Agent
DevOps Specialist
Owns all deploys. Auto-rollback on failure.
Marketing Agent
Growth Specialist
Copy, SEO, launch assets. Never invents facts.
Security Auditor
Vulnerability Specialist
OWASP Top 10, CVSS scores with specific fixes.
Debugger Agent
Root Cause Specialist
3 hypotheses, eliminates systematically.
Data Analyst
Analytics Specialist
SQL, Python, data viz. Confidence intervals always stated.
Database Designer
Storage Specialist
3NF by default. Soft deletes always included.
API Architect
REST Specialist
Consistent REST, OpenAPI specs, versioning.
Performance Optimizer
Speed Specialist
Measures first. Before/after metrics always.
Content Writer
Technical Writing
Active voice, code examples over descriptions.
Git Manager
Version Control
Conventional commits. Never force-pushes to main.
Product Manager
Strategy Specialist
Problem before solution. RICE scoring.
Web Scraper
Data Extraction
Checks robots.txt first. Never transmits PII.
SEO Specialist
Search Optimization
Core Web Vitals. Schema markup always.
Automation Engineer
Workflow Specialist
Every automation has a manual fallback.
Cloud Architect
Infrastructure Specialist
Zero-downtime deploys. Health checks post-deploy.
Mobile Developer
React Native Specialist
44pt touch targets. Tests on a real device.
Documentation Writer
Docs Specialist
All examples are copy-paste runnable.
Reddit Strategist
Community Specialist
80/20 give-take rule. Lurks before posting.
Email Marketer
Lifecycle Specialist
5 subject lines, picks best. One CTA per email.
Competitor Analyst
Intel Specialist
HIGH/MEDIUM/LOW threat levels with response plan.
Financial Analyst
Revenue Specialist
LTV:CAC > 3. Payback period always stated.
UX Researcher
User Research
Never asks "would you use this?" Asks "why" five times instead.
AI Model Selector
LLM Routing
Routes by task + VRAM. Logs cost per goal.
ForesightEngine
Prediction Specialist
Predictions with confidence scores + timeframes.
PolicyShield
Security Enforcement
Out-of-process. Cannot be bypassed. Tamper-evident logs.
Desktop Automator
Computer Control
VisionLoop + APIRegistry. Controls your screen. API-first, UI as fallback.
PILOTS

4 always-on scheduled agents.

Activate once and they run forever. You check your dashboard — they do the work.

Startup Scout
Daily · 08:00
GitHub trending, ProductHunt, IndieHackers, HN Show. Curated digest filtered for your stack and interests. Delivered every morning.
Market Monitor
Weekly · Monday
OpenClaw, OpenFang, Cursor, Manus — pricing changes, feature drops, traction. Threat level with recommended response.
AI Researcher
Weekly · Wednesday
New model releases, research papers, benchmarks. Flags models that fit your GPU. Recommends when to switch providers.
Competitor Tracker
Daily · 11:00
Watches competitor GitHub repos for commits, PRs, issues. Alerts on significant feature drops with response recommendation.
COMPARE

How DevOS stacks up.

Honest benchmarks across 6 categories. Scores reflect real capability — where others win, we say so.

Compared: DevOS v1.0 · OpenFang · OpenClaw · Paperclip
EARLY ACCESS · BETA LAUNCHING SOON
312 developers already waiting

Get early access to DevOS

Public beta launching soon. Join the waitlist and be among the first developers to run a fully autonomous AI OS — with computer control — on your own machine.

ROADMAP

What should we build next?

Submit a feature request. Upvote what others are asking for. We read every single one.

Request a feature

Pick a category and describe your idea briefly.
