AI & Machine Learning

How to Create a Fleet of AI Coding Agents for CI Automation

2026-05-03 13:31:15

Introduction

Imagine your CI pipeline not just running tests, but also investigating failures, triaging issues, writing release notes, and fixing bugs—automatically. That's exactly what the Coding Agent Sandboxes team at Docker achieved with their Fleet: seven AI agent roles that work together autonomously inside microVM sandboxes. In this step-by-step guide, you'll learn how to replicate this approach using Claude Code skills and a local-first development mindset. Whether you're a solo developer or part of a larger team, these principles will help you build a virtual team that ships faster and reduces manual toil.

How to Create a Fleet of AI Coding Agents for CI Automation
Source: www.docker.com

What You Need

Step-by-Step Guide

Step 1: Define Your Agent Roles

Start by listing the repetitive tasks you want to automate. The Docker team created roles like /cli-tester (exploratory testing), /build-engineer (handling builds and releases), /triage-agent (managing issue backlog), and /release-notes-writer. Each role should have a clear persona, set of responsibilities, and allowed tools. Don't think of these as scripts—think of them as autonomous decision-makers. For example, a build engineer's persona might be: "You are a meticulous build engineer who verifies cross-platform compatibility and detects resource leaks."

Step 2: Create Skill Files as Role Descriptions

In Claude Code, a skill is a markdown file that describes the agent's role. It’s not a list of commands to run, but a guide for judgment. For each role, write a skill file that includes:

Save these files in a skills/ directory in your repo. For instance, skills/cli-tester.md.

Step 3: Test Locally First

This is the most important principle from Docker's experience. Always iterate on your agent locally before wiring it into CI. Run the skill file directly in your terminal using Claude Code. Watch the agent think: does it build binaries? Does it exercise CLI commands? Does it find real issues? Adjust the skill file until the behavior matches your expectations. This local iteration cycle takes seconds, not minutes. You avoid the painful commit-push-wait-read-logs loop.

Step 4: Isolate Agents with Sandboxes

Agents need full autonomy without touching your host system. Docker's Coding Agent Sandboxes (sbx) provides secure microVM-based isolation—each agent gets its own Docker daemon, network, and filesystem. You can use any equivalent tool (e.g., Docker-in-Docker, Firecracker, or a managed sandbox service). Ensure the sandbox is configured to mount your workspace and grant necessary permissions (like running sbx commands or installing dependencies).

Step 5: Wire Agents into CI with a Thin Workflow

Once a skill is polished locally, create a CI workflow that calls the same skill file. The Docker team uses GitHub Actions with matrix builds for macOS, Linux, and Windows. The workflow is minimal: set up the environment, check out the code, and invoke the skill (e.g., claude-code skill skills/cli-tester.md). There is no separate "CI version" of the skill—just one file, two runtimes. This consistency ensures that what you test locally is exactly what runs in CI.

How to Create a Fleet of AI Coding Agents for CI Automation
Source: www.docker.com

Step 6: Orchestrate a Fleet of Agents

To create a virtual team, define multiple skill files and schedule them in your CI. For example:

Each agent runs in its own sandbox, independent and isolated. They can even communicate by posting results to a shared issue tracker or Slack channel.

Step 7: Monitor, Iterate, and Expand

Treat your agent fleet as living software. Review their decisions—did they miss an issue? Did they misunderstand a failing test? Update the skill files to improve judgment. Add new roles as your project grows. The Docker team emphasizes that a good skill file is like a good employee manual: it's clear, contextual, and gives the agent the freedom to act intelligently.

Tips for Success

By following these steps, you can build a virtual agent team that ships faster and reduces developer burnout. The key is turning your CI pipeline from a passive verification step into an autonomous problem-solving workforce—all starting with a simple skill file on your laptop.

Explore

Google's TCMalloc Breaks Linux Kernel API, Forces Exception to No-Regressions Rule Rethinking Online Security: Beyond the Bot vs. Human Binary Microsoft Expands Access to Agentic AI Platform That Transforms Scientific R&D Breaking: Aerobic Exercise Tops Landmark 217-Study Review for Knee Arthritis Pain Relief The Unsettling Rise of AI in Job Interviews: What Candidates Need to Know