Features Overview โ
Flightdeck is a multi-agent orchestration platform. This page covers everything it can do, organized by what matters to you as a user.
Crew Orchestration โ
At its core, Flightdeck turns a single task into a coordinated team effort.
Project Lead โ
Every session starts with a Project Lead agent. You describe what you want built, and the lead:
- Analyzes your task and breaks it into subtasks
- Creates a task DAG (directed acyclic graph) with dependencies
- Assembles a crew of specialist agents
- Delegates work, coordinates progress, and synthesizes results
14 Specialized Roles โ
Each agent has a role with a purpose-built system prompt and recommended model:
| Role | Focus |
|---|---|
| Project Lead ๐ | Orchestration, delegation, crew coordination |
| Developer ๐ป | Code implementation, tests, bug fixes |
| Architect ๐๏ธ | System design, architecture decisions. Can also delegate. |
| Code Reviewer ๐ | Readability, maintainability, code patterns |
| Critical Reviewer ๐ก๏ธ | Security, performance, edge cases |
| Readability Reviewer ๐ | Naming, organization, documentation quality |
| Product Manager ๐ฏ | User needs, product quality, UX review |
| Technical Writer ๐ | Documentation, API design, developer experience |
| Designer ๐จ | UI/UX, interaction design, accessibility |
| QA Tester ๐งช | Test strategy, quality assurance, coverage |
| Generalist ๐ง | Cross-disciplinary problem solving |
| Radical Thinker ๐ | Challenge assumptions, propose bold alternatives |
| Secretary ๐ | Plan tracking, status reports, session summaries |
| Agent โ๏ธ | Neutral general-purpose agent |
Custom roles can be created via Settings โ Roles with your own system prompts, colors, and icons.
Task DAG โ
The lead creates a dependency graph that controls execution order. Tasks flow through states: pending โ ready โ running โ done (or failed, paused, skipped). The EagerScheduler automatically assigns ready tasks to idle agents as soon as dependencies resolve.
Three views in the dashboard:
- Graph view โ Interactive dependency graph (ReactFlow) with status colors and critical path highlighting
- Kanban board โ Drag-and-drop columns by status with context menus
- Gantt chart โ Timeline view with dependencies
โ Auto-DAG Guide
Parallel Execution โ
Multiple agents work simultaneously โ a developer writes code while a reviewer checks it, an architect designs the system, and a secretary tracks progress. Up to 50 concurrent agents (configurable).
Multi-Provider Support โ
Flightdeck is provider-agnostic. All agents communicate through the Agent Client Protocol (ACP) โ a standardized message-passing interface that abstracts away provider differences โ so you can mix and match providers in the same crew.
Supported Providers โ
| Provider | Binary | Auth |
|---|---|---|
| GitHub Copilot ๐ | copilot | GitHub CLI (gh auth login) |
| Claude Code ๐ | claude-agent-acp | ANTHROPIC_API_KEY |
| Google Gemini CLI ๐ | gemini | GEMINI_API_KEY |
| Codex ๐ค | codex-acp | OPENAI_API_KEY |
| Cursor โ๏ธ | agent | CURSOR_API_KEY |
| OpenCode ๐ | opencode | (manages own keys) |
| Kimi CLI ๐ | kimi | Kimi auth |
| Qwen Code ๐ฎ | qwen | Qwen auth |
Model Resolution โ
Each role has a default model, but you can override per-role or per-agent. The ModelResolver handles cross-provider translation โ request claude-opus-4.6 from a Gemini agent and it maps to the closest equivalent. Standard/fast/premium tier aliases work across all providers.
Provider Ranking โ
Set a preference order for providers. If the default is unavailable, Flightdeck falls through to the next in the ranking. Configure via flightdeck.config.yaml or Settings โ Providers in the dashboard.
โ Provider Guide
Real-Time Dashboard โ
The web dashboard is your control center, updating in real-time over WebSocket.
Home Dashboard โ
At-a-glance view across all projects:
- Action Required โ Pending decisions and permission requests
- Active Work โ What agents are doing right now, grouped by project
- Decisions Made โ Recent decisions for awareness
- Recent Activity โ Latest events across all projects
- Progress โ Per-project DAG summaries
Lead Dashboard โ
The main working interface for an active session:
- Chat panel โ Send messages to the Project Lead (queue or interrupt mode)
- Decision panel โ Approve, reject, or dismiss pending decisions with optional comments
- Sidebar tabs โ Seven tabs (reorderable): Crew, Comms, Groups, DAG, Models, Costs, Timers
- Catch-up banner โ "While you were away" summary of what happened
Timeline โ
Swim-lane Gantt chart showing agent activity over time:
- One lane per agent with color-coded events
- Communication links between agents
- Zoom, pan, and keyboard navigation (โโ pan, +/- zoom)
- Session replay scrubber for reviewing past sessions
- Live auto-scroll mode during active sessions
โ Timeline Guide
Analytics โ
Session-level analytics and cost tracking:
- Token usage trends โ Input/output tokens over time
- Cost breakdowns โ Per-agent and per-task token attribution from provider data
- Session comparison โ Side-by-side comparison of two sessions
- Auto-generated insights โ Efficiency observations
Overview Dashboard โ
Per-project overview with:
- Quick status bar (running/stopped, agent count, task progress, duration)
- Attention items (failed agents, blocked tasks, pending decisions)
- Decision feed and activity feed
- Session history
Communication โ
Agents communicate through structured messaging channels.
Direct Messaging โ
Agents message each other by ID. Messages can be:
- Queued โ Delivered when the recipient is ready (non-blocking)
- Interrupt โ Breaks into the agent's current work immediately (priority)
Group Chat โ
Create groups by member ID or role. Groups are auto-created when 3+ agents work on the same feature and auto-archived when all members finish. The dashboard shows a dedicated Groups tab for following multi-agent conversations.
Broadcasts โ
Send a message to every active agent at once โ useful for announcing decisions or sharing context that affects the whole crew.
@Mentions โ
Type @ in the chat to autocomplete agent names. Mentioned agents receive the message.
โ Agent Communication ยท Chat Groups
Coordination & Safety โ
These features prevent agents from stepping on each other and keep you in control.
File Locking โ
Before editing a file, an agent must acquire a lock. Locks have:
- TTL โ Expired locks are automatically cleaned up (no deadlocks from crashed agents)
- Glob support โ Lock
src/auth/*to claim an entire directory - Conflict detection โ Overlapping lock requests are rejected with clear error messages
Scoped Commits โ
When an agent commits, git add only stages files the agent has locked. Post-commit verification confirms the right files landed. This prevents git add -A from leaking other agents' uncommitted work.
Trust Dial (Oversight System) โ
Three levels of human oversight:
| Level | Behavior |
|---|---|
| Supervised | Agents explain reasoning before acting. Significant actions require approval. |
| Balanced | Key decisions need approval; routine work proceeds automatically. |
| Autonomous | Agents work independently. Only critical failures require intervention. |
Set globally or per-project. The AttentionBar at the top of the dashboard shows the current level with an escalation indicator (green/yellow/red).
โ Oversight Guide
Governance Pipeline Future Feature โ
The governance infrastructure exists in the codebase but is currently disabled (
enabled: false). It will be activated in a future release.
The governance system is designed to route every agent command through ordered hooks before execution:
- Security โ Blocked patterns, path traversal checks
- Permission โ Role-based access control
- Validation โ Payload schema validation
- Rate Limiting โ Per-command throttling
- Policy โ Custom policy rules
- Approval โ Human approval gates (when configured)
Post-hooks will handle audit logging and metrics collection.
Decision Queue โ
When agents face architectural choices or need permission, they surface decisions to you:
- Review decisions in the Approval Queue (Shift+A shortcut)
- Approve with optional comment, reject with reason, or dismiss
- Auto-deny timer pauses while you're reviewing (no missed decisions)
- Decisions are categorized automatically (architecture, style, testing, etc.)
Security โ
- Prompt injection sanitization โ 4-layer defense at write boundary
- Secret redaction โ 12 regex pattern categories (AWS, GitHub, API keys, etc.) redacted from WS broadcasts, DB writes, logs
- CORS lockdown and rate limiting
- Path traversal validation for CWD and file operations
- Challenge-response auth for integrations
Knowledge & Persistence In Development โ
What agents learn carries across sessions. The core knowledge infrastructure exists and is functional โ the system design is being refined for the best developer experience.
Knowledge Base โ
Four-category knowledge system:
| Category | Purpose | Example |
|---|---|---|
| Core ๐ก๏ธ | Project rules, identity | "Use factory pattern for services" |
| Procedural ๐ง | Patterns, corrections, how-to | "Always run lint before commit" |
| Semantic ๐๏ธ | Architecture, facts, relationships | "Auth module uses JWT with bcrypt" |
| Episodic ๐ | Session summaries, recent events | "Session 5 refactored the API layer" |
Browse, search (fuzzy search via Fuse.js), and manage entries from the Knowledge page in the dashboard.
Knowledge Injection โ
On agent spawn, the KnowledgeInjector automatically injects relevant knowledge into the agent's context:
- Token-budgeted to avoid context overflow
- Priority: Core (always) > Procedural > Semantic > Episodic
- Sanitized against prompt injection
Skills System โ
Drop Markdown files in .github/skills/ and they're hot-reloaded into agent prompts:
.github/skills/
โโโ testing-conventions/
โ โโโ SKILL.md
โโโ api-patterns/
โ โโโ SKILL.md
โโโ error-handling/
โโโ SKILL.mdEach skill has YAML frontmatter with name and description to control when it's loaded.
Collective Memory โ
Cross-session knowledge that compounds over time. Agents can remember() facts and recall() them in future sessions. Knowledge is automatically extracted from completed sessions.
Session Management โ
- Session resume โ Resume from a previous session ID with full context recovery
- Session replay โ Scrub through past sessions with a timeline scrubber. Keyframes capture agent state, messages, and DAG changes. Adjustable playback speed. Shareable via tokenized links.
- Session history โ Browse past sessions with metadata (duration, tasks, token usage)
- Persistent projects โ Projects survive across sessions. Chat history, state, and knowledge auto-load on startup.
Monitoring & Notifications โ
AttentionBar โ
Persistent status bar at the top of every page with three escalation states:
- ๐ข Green โ All clear
- ๐ก Yellow โ Needs attention (pending decisions, context pressure)
- ๐ด Red โ Action required (failed agents, blocked tasks)
Updates via WebSocket push (<3s latency). Sensitivity adjusts with the Trust Dial level.
PulseStrip โ
Compact horizontal strip showing real-time crew health: active agents, token usage, context window pressure per agent, and pending decision count.
Notification Channels โ
Configure alerts for:
- Desktop notifications โ Browser notifications with sound/preview options
- Telegram โ Bot integration with batched delivery and challenge-response auth
- Slack โ Webhooks with optional thread-per-session
- Discord โ Webhooks with optional thread-per-session
Configurable quiet hours (timezone-aware) and per-event-type preferences.
Navigation & Productivity โ
Command Palette โ
Press โK (Mac) or Ctrl+K (Windows/Linux) to:
- Navigate to any page, agent, or setting
- Search across entities
- Execute natural language commands (27 NL commands across 4 categories โ no LLM required)
Keyboard Shortcuts โ
| Shortcut | Action |
|---|---|
| โK / Ctrl+K | Command palette |
| Shift+A | Approval queue |
| Alt+1โ5 | Switch project tabs |
| Escape | Close modals/panels |
Global Search โ
Search across messages, tasks, decisions, and activity from the search dialog (โ+Shift+K).
Historical Data โ
All pages load from the REST API when no live agents are present โ no empty states for existing projects. You can browse any past project's timeline, tasks, decisions, and analytics even when nothing is running.
Onboarding โ
Contextual Coach โ
Behavior-triggered tips appear as toasts when specific conditions are met โ for example, after your first approval, after repeated manual approvals (suggesting you adjust the Trust Dial), or on first agent crash. Tips appear once per trigger and are tracked in localStorage.
QuickStart Templates โ
Pre-configured project templates let new users launch their first crew in seconds. Five built-in templates cover common workflows:
- Code Review โ Lead + 2 developers + reviewer
- Bug Fix โ Architect + 2 developers + QA
- Quick Fix โ Lead + developer (recommended for first-time users)
- Docs Blitz โ Writer + developer
- Full Build โ Lead + architect + 3 developers + reviewer + QA
Spotlight Tour โ
A 6-step interactive guided tour that highlights key UI elements โ the pulse strip, agent cards, approval queue, chat panel, sidebar, and command palette. Each step spotlights one element with a tooltip explanation. Progress is tracked and the tour only appears once.
Progressive Route Disclosure โ
The sidebar starts minimal and reveals more navigation items as you gain experience:
| Tier | Visible Routes | Unlocked When |
|---|---|---|
| Starter | Lead, Overview, Crews, Settings | Always |
| Active | + Tasks, Timeline | 2+ agents spawned or any tasks created |
| Collaboration | + Mission Control | 3+ agents in a session |
| Power | + Analytics, Groups, Org Chart, Database | 3+ sessions or manual expansion |
All routes remain accessible via the command palette and direct URLs regardless of tier.
Data Management โ
Purge old session data from Settings โ Data Management. Preview shows exact record counts before deletion. Configurable retention period (7 days to 1 year).
โ Settings Guide ยท Data Management