Skip to content

Roles & Agents โ€‹

Built-in Roles โ€‹

Flightdeck ships with 13 specialist roles. The Project Lead automatically selects roles based on the task โ€” you don't need to configure them.

RoleIconFocusDefault Model
Project Lead๐ŸŽฏCoordination, delegation, synthesisClaude Opus 4.6
Developer๐Ÿ’ปCode implementation, testsClaude Opus 4.6
Architect๐Ÿ—๏ธSystem design, challenges problem framingClaude Opus 4.6
Code Reviewer๐Ÿ“–Readability, maintainability, patternsGemini 3 Pro
Critical Reviewer๐Ÿ›ก๏ธSecure-by-design review, performance, edge casesGemini 3 Pro
Product Manager๐ŸŽฏUser needs, product quality, UXGPT-5.3 Codex
Technical Writer๐Ÿ“Documentation, API design reviewGPT-5.2
Designer๐ŸŽจUI/UX, interaction design, accessibilityClaude Opus 4.6
Generalist๐Ÿ”งCross-disciplinary problem solvingClaude Opus 4.6
Radical Thinker๐Ÿš€Challenge assumptions, unconventional ideasGemini 3 Pro
Secretary๐Ÿ“‹Plan tracking, progress auditingGPT-4.1
QA Tester๐ŸงชTest strategy, edge cases, quality gatesClaude Sonnet 4.6
Agentโš™๏ธNeutral general-purpose agent, no role-specific instructionsCLI default

When to use each role โ€‹

  • Developer โ€” The workhorse. Use for any coding task: implementing features, writing tests, fixing bugs, refactoring.
  • Architect โ€” Use early in a project for system design, or when a Developer is stuck on a structural decision. Architects challenge problem framing and propose alternatives.
  • Code Reviewer โ€” Automatically assigned after code changes. Focuses on readability and patterns โ€” catches the "this works but is hard to maintain" issues.
  • Critical Reviewer โ€” The skeptic. Operates with a secure-by-design principle: security is a structural requirement, not an afterthought. Use for security-sensitive code, performance-critical paths, or when you need someone to find edge cases others missed.
  • Product Manager โ€” Use when the team needs to prioritize features or think about user experience. Good for reviewing API designs from a consumer's perspective.
  • Technical Writer โ€” Ensures documentation stays accurate. Also reviews API design โ€” if something is hard to document, it's probably too complex.
  • Designer โ€” Use for UI/UX work, accessibility reviews, or interaction design. Thinks about the human experience.
  • QA Tester โ€” Runs code end-to-end, verifies behavior, catches runtime failures that code review cannot detect.
  • Generalist โ€” The Swiss army knife. Use for cross-cutting tasks that don't fit a single specialty: research, build systems, DevOps.
  • Radical Thinker โ€” Deliberately challenges conventional approaches. Useful when the team is stuck or when you want fresh perspectives.
  • Secretary โ€” Tracks progress, maintains checklists, provides status reports. Created automatically for projects using the task DAG.
  • Agent โ€” A blank-slate role with no specialized instructions. Use when you want an agent that just follows the system prompt you provide.

AI-Aware Estimation: A crew of AI agents compresses timelines dramatically โ€” weeks of work become hours. Plan in sessions (30 min โ€“ 2 hours), not sprints. A 10-agent session resolved 8 sub-issues across 6 GitHub issues in ~15 minutes. The bottleneck is coordination (lock contention, review cycles), not implementation speed.

Model Diversity โ€‹

Roles deliberately use different AI models to bring diverse perspectives:

  • Claude (Opus 4.6): Lead, Developer, Architect, Designer, Generalist
  • GPT (5.3 Codex, 5.2, 4.1): Product Manager, Technical Writer, Secretary
  • Gemini (3 Pro): Code Reviewer, Critical Reviewer, Radical Thinker
  • Claude (Sonnet 4.6): QA Tester

The lead can override models per agent via CREATE_AGENT, and users can change models at runtime from the dashboard.

Custom Roles โ€‹

Register custom roles via the Settings page or the API:

bash
curl -X POST http://localhost:3001/api/roles \
  -H 'Content-Type: application/json' \
  -d '{
    "id": "data-engineer",
    "name": "Data Engineer",
    "icon": "๐Ÿ“Š",
    "color": "#4CAF50",
    "systemPrompt": "You are a data engineering specialist...",
    "defaultModel": "claude-opus-4.6"
  }'

Agent Lifecycle โ€‹

creating โ†’ running โ†’ idle โ†’ completed
                 โ†˜         โ†—
                  failed
  • creating: Agent process is starting up
  • running: Actively processing a task
  • idle: Waiting for new work
  • completed: Task finished successfully
  • failed: Process exited with error (may auto-restart)

Auto-Restart & Health โ€‹

  • Agents that crash are automatically restarted (configurable, up to a max restart count)
  • Auto-restart verifies the parent is still alive before restarting, and wraps in error handling
  • Crash counts are reset on successful agent exit
  • A heartbeat monitor detects stalled teams and nudges the lead (DAG-aware)
  • The ContextRefresher re-injects crew context after context window compaction
  • On agent termination, orphaned children are cascade-terminated and delegations cleaned up

Agent Identity โ€‹

Each agent gets:

  • A unique ID (short hash)
  • A .agent.md file in the working directory with role instructions
  • Access to the crew manifest (team roster, active delegations, coordination rules)

Agents reference each other by short ID in commands (e.g., "to": "a1b2c3").

Documentation generated by AI

Documentation generated by AI