feat: ensure local agent completes todos before ending turn (#2601)

## Summary - When a local agent ends its turn with incomplete todos (pending or in_progress), a reminder message is now injected telling it to continue and complete the remaining tasks - This only happens once per turn to avoid infinite loops - Added `hasIncompleteTodos()` and `buildTodoReminderMessage()` helpers to prepare_step_utils.ts - Added `TodoReminderState` to track whether a reminder has already been sent this turn Fixes #2600 ## Test plan - Unit tests added for: - `hasIncompleteTodos()` - correctly detects pending/in_progress todos - `buildTodoReminderMessage()` - builds proper reminder message listing incomplete todos - `prepareStepMessages()` with todoContext: - Injects reminder when agent finishes with incomplete todos - Does not inject reminder when already reminded this turn - Does not inject reminder when all todos are completed - Does not inject reminder when agent has pending tool calls - Combines reminder with existing injected messages 🤖 Generated with [Claude Code](https://claude.com/claude-code)  --- <a href="https://app.devin.ai/review/dyad-sh/dyad/pull/2601" target="_blank"> <picture> <source media="(prefers-color-scheme: dark)" srcset="https://static.devin.ai/assets/gh-open-in-devin-review-dark.svg?v=1"> <img src="https://static.devin.ai/assets/gh-open-in-devin-review-light.svg?v=1" alt="Open with Devin"> </picture> </a>   --- ## Summary by cubic Ensures the local agent completes remaining todos before ending a turn by running a one-time outer-loop follow-up pass that adds a reminder. The reminder is not persisted. Meets #2600. - **New Features** - Outer-loop detection via shouldRunTodoFollowUpPass: runs one follow-up pass when the final step has no tool calls and incomplete todos remain; skips in read-only and plan modes. - Helpers hasIncompleteTodos() and buildTodoReminderMessage(); multi-pass E2E fixture and test; fake LLM server scans all user messages and counts todo reminders to drive passes. - **Refactors** - Removed inner-loop reminder injection from prepareStepMessages; tests cleaned up. - Restructured local_agent_handler into a controlled pass loop with createdAt guards, baseMessageHistoryCount and compaction state reset each pass, AI messages persisted across passes, and synthetic todo reminders excluded from aiMessagesJson; updated compaction test to include toolCalls in mock steps. <sup>Written for commit 70b9c5a6595c5b024d25665e785d93dd77a3076f. Summary will update on new commits.</sup>  --------- Co-authored-by: Will Chen <willchen90@gmail.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>

feat: ensure local agent completes todos before ending turn (#2601)
54444ddf · wwwillchen-bot · GitHub · 30ed10aa · 54444ddf · 54444ddf
--- a/e2e-tests/fixtures/engine/local-agent/todo-followup-loop.ts
+++ b/e2e-tests/fixtures/engine/local-agent/todo-followup-loop.ts
+import type { LocalAgentFixture } from "../../../../testing/fake-llm-server/localAgentTypes";
+
+/**
+ * Fixture that tests the outer loop todo follow-up behavior:
+ *
+ * Pass 1: Agent creates 3 todos, completes only 1 of them, then emits chat text.
+ *         The outer loop detects incomplete todos and sends a reminder.
+ *
+ * Pass 2: After receiving the todo reminder, agent completes the remaining 2 todos.
+ *
+ * This tests that the outer loop correctly:
+ * 1. Detects incomplete todos after a pass
+ * 2. Injects a reminder message
+ * 3. Runs another pass to allow the agent to complete remaining work
+ */
+export const fixture: LocalAgentFixture = {
+  description: "Test outer loop todo follow-up when todos are partially complete",
+  passes: [
+    {
+      // First pass: Create todos and partially complete them
+      turns: [
+        {
+          text: "I'll create a todo list to track these tasks.",
+          toolCalls: [
+            {
+              name: "update_todos",
+              args: {
+                merge: false,
+                todos: [
+                  {
+                    id: "todo-1",
+                    content: "Create utility function",
+                    status: "in_progress",
+                  },
+                  {
+                    id: "todo-2",
+                    content: "Write unit tests",
+                    status: "pending",
+                  },
+                  {
+                    id: "todo-3",
+                    content: "Update documentation",
+                    status: "pending",
+                  },
+                ],
+              },
+            },
+          ],
+        },
+        {
+          text: "Let me create the utility function first.",
+          toolCalls: [
+            {
+              name: "write_file",
+              args: {
+                path: "src/utils/helper.ts",
+                content:
+                  "export function helper(x: number): number {\n  return x * 2;\n}\n",
+                description: "Create helper utility function",
+              },
+            },
+          ],
+        },
+        {
+          text: "Now marking the first task as done.",
+          toolCalls: [
+            {
+              name: "update_todos",
+              args: {
+                merge: true,
+                todos: [
+                  {
+                    id: "todo-1",
+                    status: "completed",
+                  },
+                ],
+              },
+            },
+          ],
+        },
+        {
+          // This text-only response triggers the outer loop check.
+          // Since there are still incomplete todos, it will inject a reminder.
+          text: "I've completed the utility function. Let me continue with the remaining tasks.",
+        },
+      ],
+    },
+    {
+      // Second pass: After receiving todo reminder, complete remaining tasks
+      turns: [
+        {
+          text: "I see there are still incomplete todos. Let me write the unit tests.",
+          toolCalls: [
+            {
+              name: "write_file",
+              args: {
+                path: "src/utils/helper.test.ts",
+                content:
+                  'import { helper } from "./helper";\n\ntest("helper doubles input", () => {\n  expect(helper(5)).toBe(10);\n});\n',
+                description: "Create unit tests for helper",
+              },
+            },
+          ],
+        },
+        {
+          text: "Marking tests as done.",
+          toolCalls: [
+            {
+              name: "update_todos",
+              args: {
+                merge: true,
+                todos: [
+                  {
+                    id: "todo-2",
+                    status: "completed",
+                  },
+                ],
+              },
+            },
+          ],
+        },
+        {
+          text: "Now updating the documentation.",
+          toolCalls: [
+            {
+              name: "write_file",
+              args: {
+                path: "src/utils/README.md",
+                content:
+                  "# Utils\n\n## helper(x)\n\nDoubles the input number.\n",
+                description: "Update documentation",
+              },
+            },
+          ],
+        },
+        {
+          text: "Marking documentation as done.",
+          toolCalls: [
+            {
+              name: "update_todos",
+              args: {
+                merge: true,
+                todos: [
+                  {
+                    id: "todo-3",
+                    status: "completed",
+                  },
+                ],
+              },
+            },
+          ],
+        },
+        {
+          // All todos complete - no more follow-up passes
+          text: "All tasks are now complete! I've created the utility function, written unit tests, and updated the documentation.",
+        },
+      ],
+    },
+  ],
+};
--- a/e2e-tests/local_agent_todo_followup.spec.ts
+++ b/e2e-tests/local_agent_todo_followup.spec.ts
+import { testSkipIfWindows } from "./helpers/test_helper";
+
+/**
+ * E2E test for the outer loop todo follow-up behavior.
+ *
+ * This tests that when an agent creates a todo list but only partially
+ * completes it in the first pass, the outer loop will:
+ * 1. Detect incomplete todos
+ * 2. Inject a reminder message
+ * 3. Run another pass to complete the remaining todos
+ *
+ * Related to issue #2601
+ */
+testSkipIfWindows("local-agent - todo follow-up loop", async ({ po }) => {
+  await po.setUpDyadPro({ localAgent: true });
+  await po.importApp("minimal");
+  await po.chatActions.selectLocalAgentMode();
+
+  // Send prompt that triggers the todo follow-up loop fixture
+  await po.sendPrompt("tc=local-agent/todo-followup-loop");
+
+  // Snapshot the final messages to verify:
+  // 1. All todos were created and completed across two passes
+  // 2. The todo reminder was injected between passes
+  // 3. Files were created in both passes
+  await po.snapshotMessages();
+
+  // Verify files were created in both passes
+  await po.snapshotAppFiles({
+    name: "after-todo-followup",
+    files: [
+      "src/utils/helper.ts", // Created in pass 1
+      "src/utils/helper.test.ts", // Created in pass 2
+      "src/utils/README.md", // Created in pass 2
+    ],
+  });
+});
--- a/e2e-tests/snapshots/local_agent_todo_followup.spec.ts_after-todo-followup.txt
+++ b/e2e-tests/snapshots/local_agent_todo_followup.spec.ts_after-todo-followup.txt
+=== src/utils/helper.test.ts ===
+import { helper } from "./helper";
+
+test("helper doubles input", () => {
+  expect(helper(5)).toBe(10);
+});
+
+
+=== src/utils/helper.ts ===
+export function helper(x: number): number {
+  return x * 2;
+}
+
+
+=== src/utils/README.md ===
+# Utils
+
+## helper(x)
+
+Doubles the input number.
--- a/e2e-tests/snapshots/local_agent_todo_followup.spec.ts_local-agent---todo-follow-up-loop-1.aria.yml
+++ b/e2e-tests/snapshots/local_agent_todo_followup.spec.ts_local-agent---todo-follow-up-loop-1.aria.yml
+- paragraph: /Generate an AI_RULES\.md file for this app\. Describe the tech stack in 5-\d+ bullet points and describe clear rules about what libraries to use for what\./
+- button "file1.txt file1.txt Edit":
+  - img
+  - text: ""
+  - button "Edit":
+    - img
+    - text: ""
+  - img
+- paragraph: More EOM
+- button "Copy":
+  - img
+- img
+- text: Approved
+- img
+- text: claude-opus-4-5
+- img
+- text: less than a minute ago
+- button "Copy Request ID":
+  - img
+  - text: ""
+- paragraph: tc=local-agent/todo-followup-loop
+- paragraph: I'll create a todo list to track these tasks.Let me create the utility function first.
+- 'button "helper.ts src/utils/helper.ts Edit Summary: Create helper utility function"':
+  - img
+  - text: ""
+  - button "Edit":
+    - img
+    - text: ""
+  - img
+  - text: ""
+- paragraph: Now marking the first task as done.I've completed the utility function. Let me continue with the remaining tasks.I see there are still incomplete todos. Let me write the unit tests.
+- 'button "helper.test.ts src/utils/helper.test.ts Edit Summary: Create unit tests for helper"':
+  - img
+  - text: ""
+  - button "Edit":
+    - img
+    - text: ""
+  - img
+  - text: ""
+- paragraph: Marking tests as done.Now updating the documentation.
+- 'button "README.md src/utils/README.md Edit Summary: Update documentation"':
+  - img
+  - text: ""
+  - button "Edit":
+    - img
+    - text: ""
+  - img
+  - text: ""
+- paragraph: Marking documentation as done.All tasks are now complete! I've created the utility function, written unit tests, and updated the documentation.
+- button "Copy":
+  - img
+- img
+- text: claude-opus-4-5
+- img
+- text: less than a minute ago
+- button "Copy Request ID":
+  - img
+  - text: ""
+- button "Undo":
+  - img
+  - text: ""
+- button "Retry":
+  - img
+  - text: ""
\ No newline at end of file
--- a/rules/git-workflow.md
+++ b/rules/git-workflow.md
@@ -83,6 +83,7 @@ The stashed changes will be automatically merged back after the rebase completes
 - When rebasing documentation/table conflicts (e.g., workflow README tables), prefer keeping **both** additions from HEAD and upstream - merge new rows/content from both branches rather than choosing one side
 - **Complementary additions**: When both sides added new sections at the end of a file (e.g., both added different documentation tips), keep both sections rather than choosing one — they're not truly conflicting, just different additions
 - **React component wrapper conflicts**: When rebasing UI changes that conflict on wrapper div classes (e.g., `flex items-start space-x-2` vs `flex items-end gap-1`), keep the newer styling from the incoming commit but preserve any functional components (like dialogs or modals) that exist in HEAD but not in the incoming change
+- **Refactoring conflicts**: When incoming commits refactor code (e.g., extracting inline logic into helper functions), and HEAD has new features in the same area, integrate HEAD's features into the new structure. Example: if incoming code moves streaming logic to `runSingleStreamPass()` and HEAD adds mid-turn compaction to the inline code, add compaction support to the new function rather than keeping the old inline version

 ## Rebasing with uncommitted changes


--- a/src/__tests__/local_agent_handler.test.ts
+++ b/src/__tests__/local_agent_handler.test.ts
@@ -760,6 +760,7 @@ describe("handleLocalAgentStream", () => {
              response: {
                messages: [...preCompactionGenerated],
              },
+              toolCalls: [{}], // First step has tool calls
            },
            {
              response: {
@@ -768,6 +769,7 @@ describe("handleLocalAgentStream", () => {
                  ...postCompactionGenerated,
                ],
              },
+              toolCalls: [], // Last step has no tool calls (ended with text)
            },
          ]),
        };

--- a/src/__tests__/prepare_step_utils.test.ts
+++ b/src/__tests__/prepare_step_utils.test.ts
@@ -4,9 +4,14 @@ import {
  processPendingMessages,
  injectMessagesAtPositions,
  prepareStepMessages,
+  hasIncompleteTodos,
+  buildTodoReminderMessage,
  type InjectedMessage,
 } from "@/pro/main/ipc/handlers/local_agent/prepare_step_utils";
-import type { UserMessageContentPart } from "@/pro/main/ipc/handlers/local_agent/tools/types";
+import type {
+  UserMessageContentPart,
+  Todo,
+} from "@/pro/main/ipc/handlers/local_agent/tools/types";
 import { ImagePart, ModelMessage } from "ai";

 describe("prepare_step_utils", () => {
@@ -798,4 +803,99 @@ describe("prepare_step_utils", () => {
      ).toBe("encrypted-data");
    });
  });
+
+  describe("hasIncompleteTodos", () => {
+    it("returns true when there are pending todos", () => {
+      const todos: Todo[] = [
+        { id: "1", content: "Task 1", status: "pending" },
+        { id: "2", content: "Task 2", status: "completed" },
+      ];
+      expect(hasIncompleteTodos(todos)).toBe(true);
+    });
+
+    it("returns true when there are in_progress todos", () => {
+      const todos: Todo[] = [
+        { id: "1", content: "Task 1", status: "in_progress" },
+        { id: "2", content: "Task 2", status: "completed" },
+      ];
+      expect(hasIncompleteTodos(todos)).toBe(true);
+    });
+
+    it("returns false when all todos are completed", () => {
+      const todos: Todo[] = [
+        { id: "1", content: "Task 1", status: "completed" },
+        { id: "2", content: "Task 2", status: "completed" },
+      ];
+      expect(hasIncompleteTodos(todos)).toBe(false);
+    });
+
+    it("returns false when there are no todos", () => {
+      const todos: Todo[] = [];
+      expect(hasIncompleteTodos(todos)).toBe(false);
+    });
+  });
+
+  describe("buildTodoReminderMessage", () => {
+    it("builds a message listing incomplete todos", () => {
+      const todos: Todo[] = [
+        { id: "1", content: "Implement feature A", status: "in_progress" },
+        { id: "2", content: "Write tests", status: "pending" },
+        { id: "3", content: "Setup project", status: "completed" },
+      ];
+
+      const message = buildTodoReminderMessage(todos);
+
+      expect(message).toContain("2 incomplete todo(s)");
+      expect(message).toContain("[in_progress] Implement feature A");
+      expect(message).toContain("[pending] Write tests");
+      expect(message).not.toContain("Setup project");
+    });
+
+    it("handles a single incomplete todo", () => {
+      const todos: Todo[] = [
+        { id: "1", content: "Last task", status: "pending" },
+      ];
+
+      const message = buildTodoReminderMessage(todos);
+
+      expect(message).toContain("1 incomplete todo(s)");
+      expect(message).toContain("[pending] Last task");
+    });
+  });
+
+  describe("prepareStepMessages with injected messages", () => {
+    it("works with existing injected messages", () => {
+      const pendingUserMessages: UserMessageContentPart[][] = [];
+      const allInjectedMessages: InjectedMessage[] = [
+        {
+          insertAtIndex: 1,
+          sequence: 0,
+          message: {
+            role: "user",
+            content: [{ type: "text", text: "Screenshot from crawl" }],
+          },
+        },
+      ];
+
+      const messages: ModelMessage[] = [
+        { role: "user", content: "Build an app" },
+        { role: "assistant", content: "I analyzed the screenshot." },
+      ];
+
+      const result = prepareStepMessages(
+        { messages },
+        pendingUserMessages,
+        allInjectedMessages,
+      );
+
+      expect(result).toBeDefined();
+      // Should have: user message, injected screenshot, assistant message
+      expect(result!.messages).toHaveLength(3);
+      expect(result!.messages[0].role).toBe("user");
+      expect((result!.messages[1].content as { text: string }[])[0].text).toBe(
+        "Screenshot from crawl",
+      );
+      expect(result!.messages[2].role).toBe("assistant");
+    });
+  });
 });
--- a/src/pro/main/ipc/handlers/local_agent/local_agent_handler.ts
+++ b/src/pro/main/ipc/handlers/local_agent/local_agent_handler.ts
--- a/src/pro/main/ipc/handlers/local_agent/prepare_step_utils.ts
+++ b/src/pro/main/ipc/handlers/local_agent/prepare_step_utils.ts
@@ -6,9 +6,38 @@
 */

 import { ImagePart, ModelMessage, TextPart, UserModelMessage } from "ai";
-import type { UserMessageContentPart } from "./tools/types";
+import type { UserMessageContentPart, Todo } from "./tools/types";
 import { cleanMessageForOpenAI } from "@/ipc/utils/ai_messages_utils";

+/**
+ * Check if a single todo is incomplete (pending or in_progress).
+ */
+const isIncompleteTodo = (todo: Todo): boolean =>
+  todo.status === "pending" || todo.status === "in_progress";
+
+/**
+ * Check if there are incomplete todos (pending or in_progress).
+ */
+export function hasIncompleteTodos(todos: Todo[]): boolean {
+  return todos.some(isIncompleteTodo);
+}
+
+/**
+ * Build a reminder message for incomplete todos.
+ */
+export function buildTodoReminderMessage(todos: Todo[]): string {
+  const incompleteTodos = todos.filter(isIncompleteTodo);
+
+  const todoList = incompleteTodos
+    .map((t) => `- [${t.status}] ${t.content}`)
+    .join("\n");
+
+  // Note: The "incomplete todo(s)" substring is used as a detection marker by test
+  // infrastructure in testing/fake-llm-server/ (chatCompletionHandler.ts and
+  // localAgentHandler.ts). Update those files if this text changes.
+  return `You have ${incompleteTodos.length} incomplete todo(s). Please continue and complete them:\n\n${todoList}`;
+}
+
 /**
 * A message that has been processed and is ready to inject.
 */

--- a/testing/fake-llm-server/chatCompletionHandler.ts
+++ b/testing/fake-llm-server/chatCompletionHandler.ts
@@ -28,38 +28,56 @@ export const createChatCompletionHandler =
    }

    // Check for local-agent fixture requests (tc=local-agent/*)
-    // This needs to be checked on the first user message, not the last (which might be tool results)
-    const lastUserMessage = messages
-      .slice()
-      .reverse()
-      .find((m: any) => m.role === "user");
-
-    // Extract text content from last user message (handles both string and array content)
-    let userTextContent = "";
-    if (lastUserMessage) {
-      if (typeof lastUserMessage.content === "string") {
-        userTextContent = lastUserMessage.content;
-      } else if (Array.isArray(lastUserMessage.content)) {
-        const textPart = lastUserMessage.content.find(
-          (p: any) => p.type === "text",
-        );
-        if (textPart) {
-          userTextContent = textPart.text;
+    // We need to check ALL user messages, not just the last one, because
+    // outer loop follow-up requests inject a todo reminder as the last user message.
+    // The fixture trigger (tc=local-agent/...) will be in an earlier user message.
+    const userMessages = messages.filter((m: any) => m.role === "user");
+
+    // Helper to extract text content from a message (handles both string and array content)
+    const getTextContent = (msg: any): string => {
+      if (typeof msg.content === "string") {
+        return msg.content;
+      } else if (Array.isArray(msg.content)) {
+        const textPart = msg.content.find((p: any) => p.type === "text");
+        return textPart ? textPart.text : "";
+      }
+      return "";
+    };
+
+    // Get the last user message's text content for other checks
+    const lastUserMessage = userMessages[userMessages.length - 1];
+    const userTextContent = lastUserMessage
+      ? getTextContent(lastUserMessage)
+      : "";
+
+    // First, check if the LAST user message is a fixture trigger
+    let localAgentFixture = extractLocalAgentFixture(userTextContent);
+
+    // If last message isn't a fixture but contains a todo reminder, search earlier messages
+    // This handles the outer loop case where a reminder is injected after the original fixture trigger
+    // Note: This magic string must match the reminder text in prepare_step_utils.ts
+    // buildTodoReminderMessage(). Update both if the text changes.
+    if (!localAgentFixture && userTextContent.includes("incomplete todo(s)")) {
+      for (const msg of userMessages) {
+        const textContent = getTextContent(msg);
+        const fixture = extractLocalAgentFixture(textContent);
+        if (fixture) {
+          localAgentFixture = fixture;
+          break; // Use the first (original) fixture trigger found
        }
      }
+    }

-      const localAgentFixture = extractLocalAgentFixture(userTextContent);
-      console.error(
-        `[local-agent] Checking message: "${userTextContent.slice(0, 50)}", fixture: ${localAgentFixture}`,
-      );
-      if (localAgentFixture) {
-        return handleLocalAgentFixture(req, res, localAgentFixture);
-      }
+    console.error(
+      `[local-agent] Checking message: "${userTextContent.slice(0, 50)}", fixture: ${localAgentFixture}`,
+    );
+    if (localAgentFixture) {
+      return handleLocalAgentFixture(req, res, localAgentFixture);
+    }

-      // Route plan acceptance message to exit-plan fixture
-      if (userTextContent.includes("I accept this plan")) {
-        return handleLocalAgentFixture(req, res, "exit-plan");
-      }
+    // Route plan acceptance message to exit-plan fixture
+    if (userTextContent.includes("I accept this plan")) {
+      return handleLocalAgentFixture(req, res, "exit-plan");
    }

    let messageContent = CANNED_MESSAGE;

--- a/testing/fake-llm-server/localAgentHandler.ts
+++ b/testing/fake-llm-server/localAgentHandler.ts
@@ -37,6 +37,30 @@ function getSessionId(messages: any[]): string {
    .digest("hex");
 }

+/**
+ * Check if a message content contains a todo reminder pattern.
+ * The todo reminder is injected by the outer loop when there are incomplete todos.
+ */
+function isTodoReminderMessage(msg: any): boolean {
+  if (msg?.role !== "user") return false;
+  const content = Array.isArray(msg.content)
+    ? msg.content.find((p: any) => p.type === "text")?.text
+    : typeof msg.content === "string"
+      ? msg.content
+      : null;
+  // Note: This magic string must match the reminder text in prepare_step_utils.ts
+  // buildTodoReminderMessage(). Update both if the text changes.
+  return content?.includes("incomplete todo(s)") ?? false;
+}
+
+/**
+ * Count the number of todo reminder messages in the conversation.
+ * This determines which outer loop pass we're on.
+ */
+function countTodoReminderMessages(messages: any[]): number {
+  return messages.filter(isTodoReminderMessage).length;
+}
+
 /**
 * Count the number of tool result messages AFTER the last user message
 * to determine which turn we're on for the current fixture.
@@ -99,9 +123,9 @@ async function loadFixture(fixtureName: string): Promise<LocalAgentFixture> {
    const module = require(fixturePath);
    const fixture = module.fixture as LocalAgentFixture;

-    if (!fixture || !fixture.turns) {
+    if (!fixture || (!fixture.turns && !fixture.passes)) {
      throw new Error(
-        `Invalid fixture: missing 'fixture' export or 'turns' array`,
+        `Invalid fixture: missing 'fixture' export or 'turns'/'passes' array`,
      );
    }

@@ -113,6 +137,30 @@ async function loadFixture(fixtureName: string): Promise<LocalAgentFixture> {
  }
 }

+/**
+ * Get the turns for the current pass from a fixture.
+ * Supports both simple fixtures (with `turns`) and multi-pass fixtures (with `passes`).
+ */
+function getTurnsForPass(
+  fixture: LocalAgentFixture,
+  passIndex: number,
+): Turn[] {
+  // If fixture uses passes, get the appropriate pass
+  if (fixture.passes && fixture.passes.length > 0) {
+    if (passIndex >= fixture.passes.length) {
+      // All passes exhausted
+      return [];
+    }
+    return fixture.passes[passIndex].turns;
+  }
+
+  // Simple fixture with turns - only valid for pass 0
+  if (passIndex > 0) {
+    return [];
+  }
+  return fixture.turns || [];
+}
+
 /**
 * Create a streaming chunk in OpenAI format
 */
@@ -292,26 +340,37 @@ export async function handleLocalAgentFixture(
    const fixture = await loadFixture(fixtureName);
    const sessionId = getSessionId(messages);

-    // Determine which turn we're on based on tool result rounds
+    // Determine which outer loop pass we're on based on todo reminder messages
+    const passIndex = countTodoReminderMessages(messages);
+
+    // Determine which turn we're on within the current pass
    const toolResultRounds = countToolResultRounds(messages);
    const turnIndex = toolResultRounds;

+    // Get the turns for the current pass
+    const turns = getTurnsForPass(fixture, passIndex);
+
    console.error(
-      `[local-agent] Loaded fixture: ${fixtureName}, Session: ${sessionId}, Turn: ${turnIndex}, Tool rounds: ${toolResultRounds}`,
+      `[local-agent] Loaded fixture: ${fixtureName}, Session: ${sessionId}, Pass: ${passIndex}, Turn: ${turnIndex}, Tool rounds: ${toolResultRounds}`,
    );

-    if (turnIndex >= fixture.turns.length) {
-      // All turns exhausted, send a simple completion message
-      console.log(`[local-agent] All turns exhausted, sending completion`);
+    if (turnIndex >= turns.length) {
+      // All turns exhausted for this pass, send a simple completion message
+      console.log(
+        `[local-agent] All turns exhausted for pass ${passIndex}, sending completion`,
+      );
      await streamTextResponse(res, "Task completed.");
      return;
    }

-    const turn = fixture.turns[turnIndex];
-    console.log(`[local-agent] Executing turn ${turnIndex}:`, {
-      hasText: !!turn.text,
-      toolCallCount: turn.toolCalls?.length ?? 0,
-    });
+    const turn = turns[turnIndex];
+    console.log(
+      `[local-agent] Executing pass ${passIndex}, turn ${turnIndex}:`,
+      {
+        hasText: !!turn.text,
+        toolCallCount: turn.toolCalls?.length ?? 0,
+      },
+    );

    // If this turn has tool calls, stream them
    if (turn.toolCalls && turn.toolCalls.length > 0) {

--- a/testing/fake-llm-server/localAgentTypes.ts
+++ b/testing/fake-llm-server/localAgentTypes.ts
@@ -24,9 +24,27 @@ export type Turn = {
  };
 };

+/**
+ * Represents a single outer loop pass.
+ * The outer loop runs when todos are incomplete after a chat response.
+ */
+export type Pass = {
+  /** Ordered turns within this pass */
+  turns: Turn[];
+};
+
 export type LocalAgentFixture = {
  /** Description for debugging */
  description?: string;
-  /** Ordered turns in the conversation */
-  turns: Turn[];
+  /**
+   * Ordered turns in the conversation.
+   * For simple fixtures without outer loop testing.
+   */
+  turns?: Turn[];
+  /**
+   * Ordered passes for testing outer loop behavior.
+   * Each pass contains turns that execute within that outer loop iteration.
+   * Use this when testing todo follow-up loop behavior.
+   */
+  passes?: Pass[];
 };