s17

Autonomous Agents

Multi-Agent Platform

Check the Board, Claim the Task

648 LOC15 toolsAutonomous task claiming

Teammates become useful when they can discover and claim work themselves.

Autonomous Agent Cycle

Agents watch their own idle timer

Agent A

idle

idle timer: 10%

Agent B

idle

idle timer: 0%

Agent C

idle

idle timer: 0%

Shared task board

T1open

Fix auth bug

owner: -

T2open

Add rate limiter

owner: -

T3open

Write docs

owner: -

T4open

Clean tests

owner: -

Nobody assigns tasks directly; agents claim visible open cards when their timers wake them.

Quiet Agents

Autonomous agents start by waiting. The important mental model is a work board, not a central dispatcher.

1/8

中文 · 繁中 · English · 日本語

s01 → ... → s15 → s16 → s17 → s18 → s19 → s20

"Check the board, claim the task" — poll when idle, work when found.

Harness Layer: Autonomy — Self-organizing teammates, no leader assignment needed.

The Problem

s16's teammates can communicate and handshake shutdown. But each teammate waits for Lead to assign tasks — with 10 unclaimed tasks on the board, Lead has to manually assign 10 times. This doesn't scale. Teammates should check the task board themselves, claim unowned tasks, and look for the next one when done.

The Solution

Autonomous Agents Overview

Carries forward S16's teaching-version MessageBus and protocol tools. This chapter adds: idle_poll (poll every 5 seconds when idle), scan_unclaimed_tasks (scan the board for claimable tasks), auto-claim (claim on sight, no Lead needed).

Teammate lifecycle expands from two phases to three:

Phase	Behavior	Exit condition
WORK	inbox → LLM → tool loop	`stop_reason != tool_use`
IDLE	5s poll inbox + task board	60s timeout
SHUTDOWN	Send summary, exit	—

How It Works

idle_poll: Idle Polling

After completing a task, the teammate doesn't exit. It enters the IDLE phase — checking every 5 seconds for new work:

IDLE_POLL_INTERVAL = 5   # seconds
IDLE_TIMEOUT = 60         # seconds

def idle_poll(agent_name, messages, name, role) -> str:
    """Return 'work', 'shutdown', or 'timeout'."""
    for _ in range(IDLE_TIMEOUT // IDLE_POLL_INTERVAL):
        time.sleep(IDLE_POLL_INTERVAL)

        # ① Check inbox (priority)
        inbox = BUS.read_inbox(agent_name)
        if inbox:
            # shutdown_request handled immediately
            for msg in inbox:
                if msg.get("type") == "shutdown_request":
                    # ... reply shutdown_response
                    return "shutdown"
            # Regular messages: inject into context, return to WORK
            messages.append(...)
            return "work"

        # ② Scan task board
        unclaimed = scan_unclaimed_tasks()
        if unclaimed:
            task = unclaimed[0]
            result = claim_task(task["id"], agent_name)
            if "Claimed" in result:
                messages.append(...)
                return "work"
    return "timeout"

Inbox takes priority (may contain protocol messages like shutdown_request), task board second. A shutdown_request received during IDLE is dispatched immediately — no need to wait for the next WORK phase.

scan_unclaimed_tasks: Scan the Task Board

Find tasks that are pending, unowned, with all dependencies completed (can_start):

def scan_unclaimed_tasks() -> list[dict]:
    unclaimed = []
    for f in sorted(TASKS_DIR.glob("task_*.json")):
        task = json.loads(f.read_text())
        if (task.get("status") == "pending"
                and not task.get("owner")
                and can_start(task["id"])):
            unclaimed.append(task)
    return unclaimed

Three conditions: must be pending, no owner, all blockedBy dependencies completed. can_start checks dependency task status — having dependencies doesn't mean the task can't start, only unresolved dependencies block it. Teaching version picks the first by filename; CC uses file locks to prevent multiple teammates from claiming the same task.

claim_task: Owner Check

Auto-claim checks the claim result, not treating failure as success:

def claim_task(task_id: str, owner: str = "agent") -> str:
    task = load_task(task_id)
    if task.status != "pending":
        return f"Task {task_id} is {task.status}, cannot claim"
    if task.owner:
        return f"Task {task_id} already owned by {task.owner}"
    if not can_start(task_id):
        return f"Blocked by: {deps}"
    task.owner = owner
    task.status = "in_progress"
    save_task(task)
    return f"Claimed {task.id} ({task.subject})"

Teaching version has no file locks, so concurrent claims may still race. But the task.owner check avoids the most obvious "last writer wins" problem. CC uses proper-lockfile to protect task files, with claimTask doing read-modify-write inside a file lock (utils/tasks.ts:541-612).

Teammate Lifecycle: WORK → IDLE → SHUTDOWN

s16's teammates exit after finishing. s17 adds the IDLE phase — teammates cycle through WORK → IDLE in an outer loop:

# Outer loop: WORK → IDLE cycle
while True:
    # WORK phase: inner loop (max 10 LLM rounds)
    for _ in range(10):
        # Check inbox, dispatch protocol, call LLM, execute tools
        ...
        if response.stop_reason != "tool_use":
            break  # WORK phase ends

    # IDLE phase
    idle_result = idle_poll(name, messages, name, role)
    if idle_result == "shutdown":
        break
    if idle_result == "timeout":
        break  # 60s timeout → SHUTDOWN

# SHUTDOWN: send summary to Lead
BUS.send(name, "lead", summary, "result")

Key design:

Outer while True: WORK and IDLE alternate until timeout or shutdown request
Inner for 10: WORK phase caps at 10 LLM rounds (prevents infinite loops)
IDLE timeout 60s: 12 polls × 5s = 60s. Timeout sends summary and exits
shutdown_request works in both phases: WORK phase dispatches via handle_inbox_message; IDLE phase's idle_poll checks and replies directly

Identity Re-injection

After autoCompact (s08), a teammate's messages list may be compressed into a summary. On each new WORK phase entry, check:

if len(messages) <= 3:
    messages.insert(0, {"role": "user",
        "content": f"<identity>You are '{name}', role: {role}. "
                   f"Continue your work.</identity>"})

Short messages suggest compression happened — re-inject identity. In real CC, context compaction preserves the system prompt; the teaching version's simplified implementation needs manual handling.

consume_lead_inbox: Unified Inbox Consumer

Both the check_inbox tool and the main loop call the same consume_lead_inbox() function: route protocol responses to update state first, then inject all messages into Lead's conversation history. Teammates' summaries and results don't just print to terminal — Lead's LLM can see them and coordinate next steps.

Putting It Together

1. Lead: "Build the backend — too many tasks, let teammates self-claim"
2. Lead → create_task("Create database schema")
3. Lead → create_task("Write API routes")
4. Lead → create_task("Write unit tests")
5. Lead → spawn_teammate("alice", "backend", "You are a backend developer")
6. Lead → spawn_teammate("bob", "backend", "You are a backend developer")

7. alice thread starts → WORK: no initial inbox → spins → IDLE
8. bob thread starts → WORK: no initial inbox → spins → IDLE

9. alice IDLE poll 1 → scan_unclaimed → finds "Create database schema"
10. alice → claim_task → "Create database schema" → back to WORK
11. bob IDLE poll 1 → scan_unclaimed → finds "Write API routes"
12. bob → claim_task → "Write API routes" → back to WORK

13. alice WORK: write_file("schema.sql", ...) → complete_task → WORK ends
14. alice IDLE → scan → "Write unit tests" → claim → WORK
15. alice WORK: write_file("test_api.py", ...) → complete_task → WORK ends
16. alice IDLE → 60s no new tasks → SHUTDOWN

17. bob similar flow → done → SHUTDOWN
18. Lead consume_lead_inbox → sees alice and bob's summaries

Two teammates claim and work in parallel. Lead only creates tasks and spawns teammates — no manual assignment needed.

Changes from s16

Component	Before (s16)	After (s17)
Task assignment	Lead manually assigns	Teammates auto-claim (can_start checks deps)
Teammate state	WORK or exit	WORK → IDLE (60s poll) → SHUTDOWN
claim_task	No owner check	Rejects tasks that already have an owner
IDLE phase shutdown	Doesn't handle shutdown_request	Dispatches shutdown immediately and exits
Lead inbox	Prints only, not in context	consume_lead_inbox injects into history
New functions	—	idle_poll, scan_unclaimed_tasks, consume_lead_inbox
Identity persistence	System prompt only	Auto re-inject after compression
Lead tools	14 (s16)	14 (unchanged)
Teammate tools	5	8 (+ list_tasks, claim_task, complete_task)
Teammate exit	Exit after task done	Exit only after 60s idle timeout

Try It

cd learn-claude-code
python s17_autonomous_agents/code.py

Try this prompt:

Create 3 tasks on the board, then spawn alice and bob. Watch them auto-claim and work.

What to observe: Do teammates auto-claim unassigned tasks? Are tasks with blockedBy dependencies claimed only after their dependencies complete? Does idle timeout trigger shutdown? Does a shutdown_request in IDLE phase get an immediate response? How do task states change in .tasks/?

What's Next

Teammates self-organize now. But Alice and Bob both work in the same directory — Alice edits config.py, Bob also edits config.py, overwriting each other.

s18 Worktree Isolation → Each task gets its own working directory, no conflicts.

Deep Dive into CC Source

Teaching note: This chapter's idle_poll + auto-claim mechanism is a teaching design, using a unified polling function to demonstrate "find work when idle." CC's actual implementation combines multiple mechanisms, but shares the same goal — reducing Lead's manual assignment burden.

1. CC's Idle Mechanism: Combined Approach, Not Single Polling

Teaching version uses a single idle_poll() to handle both inbox checking and task claiming during idle. CC's actual implementation combines four mechanisms:

idle_notification: After completing a round of work, sendIdleNotification() (inProcessRunner.ts:569-589) sends an idle notification to Lead. Lead knows the teammate is available and can assign new tasks or request shutdown.

mailbox polling: waitForNextPromptOrShutdown() (inProcessRunner.ts:689-868) is a 500ms polling loop that continuously checks three sources: pending user messages, mailbox file messages, and task list. Shutdown requests are prioritized (inProcessRunner.ts:768-804), preventing starvation by regular messages.

task watcher: useTaskListWatcher (hooks/useTaskListWatcher.ts:34-189) uses fs.watch() to monitor the .claude/tasks/ directory with 1-second debounce, triggering checks when new tasks are created or dependencies unblock. The dependency check (L197-207) verifies "no incomplete tasks in blockedBy", not "blockedBy is empty".

active claiming: The polling loop also calls tryClaimNextTask() (inProcessRunner.ts:853-860) — actively claiming tasks from the task list while waiting. So "teammates don't actively poll for tasks" is inaccurate; CC has both passive notification and active claiming.

2. Task Claiming: File Locks + Atomic Operations

claimTask() (utils/tasks.ts:541-612) uses proper-lockfile task-level locks, performing read-check-modify-write within the lock. Checks: owner already exists (L575-576), already completed (L580-581), unresolved blockers in blockedBy (L585-594). claimTaskWithBusyCheck() (utils/tasks.ts:614-692) uses task-list level locks, making busy check and claim atomic to avoid TOCTOU.

findAvailableTask() (inProcessRunner.ts:595-604) checks "all blockedBy completed" using task.blockedBy.every(id => !unresolvedTaskIds.has(id)). tryClaimNextTask() (inProcessRunner.ts:624-657) updates status to in_progress after claiming, so the UI immediately reflects the change.

3. Teaching Version vs CC Comparison

Dimension	Teaching (s17)	CC
Idle mechanism	idle_poll unified polling (5s)	idle_notification + 500ms mailbox polling + task watcher
Task discovery	scan_unclaimed_tasks (polling)	useTaskListWatcher (file watching) + tryClaimNextTask (active polling)
Dependency check	can_start (all blockedBy completed)	findAvailableTask (same semantics)
Concurrency safety	Owner check (no file lock)	proper-lockfile task lock + task-list lock
Shutdown handling	IDLE dispatches directly, WORK via handle_inbox_message	500ms polling loop prioritizes shutdown_request
Timeout exit	60s with no new tasks	No fixed timeout, Lead manual shutdown
Identity persistence	Messages length detection	Context compaction preserves system prompt
Claim failure handling	Check return value, skip on failure	File locks guarantee atomicity

Teaching version's idle_poll() merges CC's four mechanisms into one polling function — a reasonable simplification since the core semantics (find work when idle, claim after deps resolve, prioritize shutdown) are consistent.