commandsSource-backedReview first Safety · Privacy ·

TDD Workflow for Claude Code

Implement test-driven development workflows with Claude Code using red-green-refactor cycles, automatic test generation, and AI-guided iteration until all tests pass

by JSONbored·added 2025-10-25·

Claude Code

HarnessClaude Code

Invocation:/tdd-workflow [feature] [options]

Install

Source

/tdd-workflow [feature] [options]

Readiness

TrustReview first
Sourcesource-backed
Safety notesMissing
ReviewedYes

Documentation Source repository Registry JSON · LLM text

Review first — review before installing

Open the source and read safety notes before installing.

Schema details

Install type: cli
Reading time: 11 min
Difficulty score: 100
Troubleshooting: Yes
Breaking changes: No

Runtime and command metadata

Command syntax: /tdd-workflow [feature] [options]

Full copyable content

/tdd-workflow [feature] [options]

About this resource

The /tdd-workflow command implements test-driven development (TDD) in Claude Code with AI-assisted red-green-refactor cycles, preventing hallucination through test-anchored iteration.

Features

Red-Green-Refactor Automation: Full TDD cycle with failing tests → passing code → optimization
Test-First Enforcement: Claude writes tests BEFORE implementing features
Iteration Until Pass: AI refines code until all tests pass (no manual fixes)
Anti-Hallucination: Tests serve as ground truth preventing scope drift
Coverage Tracking: Automatic code coverage measurement (80%+ target)
Test Templates: Pre-built patterns for unit, integration, and E2E tests
Framework Support: Vitest, Jest, Playwright, Cypress
Continuous Verification: Run tests after each code change

Usage

/tdd-workflow [feature] [options]

Workflow Modes

--unit - Unit test TDD workflow (default)
--integration - Integration test workflow
--e2e - End-to-end test workflow
--full - Complete test pyramid (unit + integration + e2e)

Test Frameworks

--vitest - Vitest (default for unit/integration)
--jest - Jest testing framework
--playwright - Playwright for E2E
--cypress - Cypress for E2E

Coverage Options

--coverage - Generate coverage report (default: true)
--min-coverage=<percent> - Minimum coverage threshold (default: 80)
--strict - Fail if coverage below threshold

Behavior Modifiers

--watch - Watch mode, re-run tests on file changes
--debug - Show detailed test execution logs
--commit-on-green - Auto-commit when all tests pass

Examples

Basic TDD Workflow - User Authentication

Command:

/tdd-workflow "user authentication service with email/password" --unit

TDD Cycle:

Phase 1: RED (Write Failing Tests)

// Claude generates: tests/unit/auth.service.test.ts
import { describe, it, expect, beforeEach } from "vitest";
import { AuthService } from "@/services/auth.service";

describe("AuthService", () => {
  let authService: AuthService;

  beforeEach(() => {
    authService = new AuthService();
  });

  describe("login", () => {
    it("should return user token for valid credentials", async () => {
      const result = await authService.login({
        email: "user@example.com",
        password: "ValidPass123",
      });

      expect(result).toHaveProperty("token");
      expect(result).toHaveProperty("user");
      expect(result.user.email).toBe("user@example.com");
    });

    it("should throw error for invalid email", async () => {
      await expect(
        authService.login({ email: "invalid", password: "pass" }),
      ).rejects.toThrow("Invalid email format");
    });

    it("should throw error for incorrect password", async () => {
      await expect(
        authService.login({ email: "user@example.com", password: "wrong" }),
      ).rejects.toThrow("Invalid credentials");
    });

    it("should hash password before comparison", async () => {
      const loginSpy = vi.spyOn(authService, "comparePassword");
      await authService.login({ email: "user@example.com", password: "pass" });

      expect(loginSpy).toHaveBeenCalledWith("pass", expect.any(String));
    });
  });

  describe("register", () => {
    it("should create new user and return token", async () => {
      const result = await authService.register({
        email: "new@example.com",
        password: "ValidPass123",
        name: "New User",
      });

      expect(result).toHaveProperty("token");
      expect(result.user.email).toBe("new@example.com");
    });

    it("should reject weak passwords", async () => {
      await expect(
        authService.register({ email: "user@example.com", password: "123" }),
      ).rejects.toThrow("Password must be at least 8 characters");
    });

    it("should reject duplicate emails", async () => {
      await authService.register({
        email: "dup@example.com",
        password: "ValidPass123",
      });

      await expect(
        authService.register({
          email: "dup@example.com",
          password: "ValidPass123",
        }),
      ).rejects.toThrow("Email already exists");
    });
  });
});

Run Tests (Expect Failures):

pnpm test auth.service.test.ts

❌ FAIL  tests/unit/auth.service.test.ts
  AuthService
    login
      ✗ should return user token for valid credentials
        → Cannot find module '@/services/auth.service'
      ✗ should throw error for invalid email
      ✗ should throw error for incorrect password
      ✗ should hash password before comparison

Claude: "Tests written and failing as expected. Proceeding to GREEN phase."

Phase 2: GREEN (Implement Minimum Code to Pass)

// Claude generates: src/services/auth.service.ts
import { hash, compare } from "bcrypt";
import { sign } from "jsonwebtoken";
import { z } from "zod";

const loginSchema = z.object({
  email: z.string().email("Invalid email format"),
  password: z.string().min(1),
});

const registerSchema = z.object({
  email: z.string().email("Invalid email format"),
  password: z.string().min(8, "Password must be at least 8 characters"),
  name: z.string().min(1),
});

interface User {
  id: string;
  email: string;
  name: string;
  passwordHash: string;
}

export class AuthService {
  private users: Map<string, User> = new Map();

  async login(credentials: { email: string; password: string }) {
    const validated = loginSchema.parse(credentials);

    const user = Array.from(this.users.values()).find(
      (u) => u.email === validated.email,
    );

    if (!user) {
      throw new Error("Invalid credentials");
    }

    const passwordMatch = await this.comparePassword(
      validated.password,
      user.passwordHash,
    );

    if (!passwordMatch) {
      throw new Error("Invalid credentials");
    }

    const token = this.generateToken(user);

    return {
      token,
      user: {
        id: user.id,
        email: user.email,
        name: user.name,
      },
    };
  }

  async register(data: { email: string; password: string; name: string }) {
    const validated = registerSchema.parse(data);

    if (
      Array.from(this.users.values()).some((u) => u.email === validated.email)
    ) {
      throw new Error("Email already exists");
    }

    const passwordHash = await hash(validated.password, 10);
    const user: User = {
      id: crypto.randomUUID(),
      email: validated.email,
      name: validated.name,
      passwordHash,
    };

    this.users.set(user.id, user);

    const token = this.generateToken(user);

    return {
      token,
      user: {
        id: user.id,
        email: user.email,
        name: user.name,
      },
    };
  }

  async comparePassword(password: string, hash: string): Promise<boolean> {
    return compare(password, hash);
  }

  private generateToken(user: User): string {
    return sign({ userId: user.id }, process.env.JWT_SECRET || "dev-secret", {
      expiresIn: "7d",
    });
  }
}

Run Tests Again:

pnpm test auth.service.test.ts

✓ PASS  tests/unit/auth.service.test.ts
  AuthService
    login
      ✓ should return user token for valid credentials (45ms)
      ✓ should throw error for invalid email (12ms)
      ✓ should throw error for incorrect password (38ms)
      ✓ should hash password before comparison (42ms)
    register
      ✓ should create new user and return token (48ms)
      ✓ should reject weak passwords (8ms)
      ✓ should reject duplicate emails (52ms)

Test Coverage:
  File: auth.service.ts
  Statements: 95.2%
  Branches: 100%
  Functions: 100%
  Lines: 94.7%

Claude: "All tests passing! Coverage: 95.2%. Proceeding to REFACTOR phase."

Phase 3: REFACTOR (Optimize While Keeping Tests Green)

// Claude refactors for better organization
export class AuthService {
  private users: Map<string, User> = new Map();
  private readonly saltRounds = 10;
  private readonly tokenExpiry = "7d";

  async login(credentials: LoginCredentials): Promise<AuthResult> {
    const validated = this.validateLoginCredentials(credentials);
    const user = await this.findUserByEmail(validated.email);
    await this.verifyPassword(validated.password, user.passwordHash);

    return this.createAuthResult(user);
  }

  async register(data: RegisterData): Promise<AuthResult> {
    const validated = this.validateRegisterData(data);
    await this.ensureEmailNotExists(validated.email);

    const user = await this.createUser(validated);
    return this.createAuthResult(user);
  }

  private validateLoginCredentials(credentials: unknown): LoginCredentials {
    return loginSchema.parse(credentials);
  }

  private async findUserByEmail(email: string): Promise<User> {
    const user = Array.from(this.users.values()).find((u) => u.email === email);
    if (!user) {
      throw new Error("Invalid credentials");
    }
    return user;
  }

  private async verifyPassword(password: string, hash: string): Promise<void> {
    const match = await compare(password, hash);
    if (!match) {
      throw new Error("Invalid credentials");
    }
  }

  private createAuthResult(user: User): AuthResult {
    return {
      token: this.generateToken(user),
      user: this.sanitizeUser(user),
    };
  }

  // ... more refactored methods
}

Run Tests After Refactor:

pnpm test auth.service.test.ts

✓ All tests still passing
✓ Coverage maintained: 95.2%

Claude: "Refactoring complete. All tests green. Ready for commit."

Integration Testing Workflow

Command:

/tdd-workflow "user registration API endpoint" --integration

Generated Integration Tests:

// tests/integration/auth.api.test.ts
import { describe, it, expect, beforeAll, afterAll } from "vitest";
import request from "supertest";
import { app } from "@/app";
import { db } from "@/lib/db";

describe("POST /api/auth/register", () => {
  beforeAll(async () => {
    await db.migrate.latest();
  });

  afterAll(async () => {
    await db.migrate.rollback();
    await db.destroy();
  });

  it("should register new user and return 201", async () => {
    const response = await request(app).post("/api/auth/register").send({
      email: "newuser@example.com",
      password: "ValidPass123",
      name: "New User",
    });

    expect(response.status).toBe(201);
    expect(response.body).toHaveProperty("token");
    expect(response.body.user.email).toBe("newuser@example.com");
  });

  it("should return 400 for invalid email format", async () => {
    const response = await request(app)
      .post("/api/auth/register")
      .send({ email: "invalid", password: "pass", name: "User" });

    expect(response.status).toBe(400);
    expect(response.body.error).toContain("Invalid email");
  });

  it("should return 409 for duplicate email", async () => {
    await request(app).post("/api/auth/register").send({
      email: "dup@example.com",
      password: "ValidPass123",
      name: "User",
    });

    const response = await request(app).post("/api/auth/register").send({
      email: "dup@example.com",
      password: "ValidPass123",
      name: "User2",
    });

    expect(response.status).toBe(409);
    expect(response.body.error).toContain("already exists");
  });

  it("should store hashed password in database", async () => {
    const password = "PlainPassword123";

    await request(app)
      .post("/api/auth/register")
      .send({ email: "hash@example.com", password, name: "User" });

    const user = await db("users").where({ email: "hash@example.com" }).first();

    expect(user.password).not.toBe(password);
    expect(user.password).toMatch(/^\$2[aby]\$/);
  });
});

E2E Testing Workflow with Playwright

Command:

/tdd-workflow "complete user registration flow" --e2e --playwright

Generated E2E Tests:

// tests/e2e/auth.spec.ts
import { test, expect } from "@playwright/test";

test.describe("User Registration Flow", () => {
  test("should complete full registration process", async ({ page }) => {
    // Navigate to registration page
    await page.goto("/register");

    // Fill registration form
    await page.fill('[name="email"]', "e2e@example.com");
    await page.fill('[name="password"]', "ValidPass123");
    await page.fill('[name="confirmPassword"]', "ValidPass123");
    await page.fill('[name="name"]', "E2E User");

    // Submit form
    await page.click('[type="submit"]');

    // Verify redirect to dashboard
    await expect(page).toHaveURL("/dashboard");

    // Verify user name displayed
    await expect(page.locator('[data-testid="user-name"]')).toHaveText(
      "E2E User",
    );
  });

  test("should show validation errors for invalid inputs", async ({ page }) => {
    await page.goto("/register");

    // Submit empty form
    await page.click('[type="submit"]');

    // Verify error messages
    await expect(page.locator('[data-testid="email-error"]')).toBeVisible();
    await expect(page.locator('[data-testid="password-error"]')).toBeVisible();
  });

  test("should persist session after registration", async ({
    page,
    context,
  }) => {
    await page.goto("/register");

    // Register user
    await page.fill('[name="email"]', "session@example.com");
    await page.fill('[name="password"]', "ValidPass123");
    await page.fill('[name="confirmPassword"]', "ValidPass123");
    await page.fill('[name="name"]', "Session User");
    await page.click('[type="submit"]');

    // Verify cookies set
    const cookies = await context.cookies();
    expect(cookies.find((c) => c.name === "auth_token")).toBeDefined();

    // Reload page - should stay authenticated
    await page.reload();
    await expect(page).toHaveURL("/dashboard");
  });
});

Full Test Pyramid Workflow

Command:

/tdd-workflow "payment processing feature" --full --min-coverage=90

Claude Executes:

Unit Tests (70% of test suite)
- Service layer logic
- Utility functions
- Data validation
- Business rules
Integration Tests (20% of test suite)
- API endpoints
- Database operations
- External service mocks
E2E Tests (10% of test suite)
- Critical user flows
- Payment completion
- Error handling UX

Coverage Report:

Test Results:
  Unit: 45 tests passing
  Integration: 12 tests passing
  E2E: 3 tests passing

Coverage:
  Overall: 92.3% ✓
  Statements: 93.1%
  Branches: 89.7%
  Functions: 95.2%
  Lines: 92.8%

Target: 90% - PASSED ✓

TDD Best Practices

1. Write Tests First (Always)

# Claude will REFUSE to implement before tests exist
User: "Implement user authentication"

Claude: "I'll follow TDD workflow:
1. First, I'll write comprehensive tests
2. Run tests (expect failures)
3. Implement minimal code to pass
4. Refactor while keeping tests green

Starting with test creation..."

2. Test Behavior, Not Implementation

// ❌ Bad: Testing implementation details
it("should call bcrypt.hash with saltRounds=10", () => {
  expect(bcrypt.hash).toHaveBeenCalledWith(password, 10);
});

// ✅ Good: Testing behavior
it("should store hashed password", async () => {
  await authService.register({ email, password });
  const user = await db.users.findOne({ email });

  expect(user.password).not.toBe(password);
  expect(await bcrypt.compare(password, user.password)).toBe(true);
});

3. Descriptive Test Names

// ✅ Clear test names following Given-When-Then
describe("AuthService", () => {
  describe("when user provides valid credentials", () => {
    it("should return auth token and user data", async () => {
      // test
    });
  });

  describe("when password is incorrect", () => {
    it("should throw InvalidCredentialsError", async () => {
      // test
    });
  });
});

4. Arrange-Act-Assert Pattern

it("should update user email", async () => {
  // Arrange: Set up test data
  const user = await createTestUser({ email: "old@example.com" });

  // Act: Perform action
  await userService.updateEmail(user.id, "new@example.com");

  // Assert: Verify outcome
  const updated = await userService.findById(user.id);
  expect(updated.email).toBe("new@example.com");
});

Anti-Hallucination Benefits

Problem: LLM Scope Drift

User: "Add user authentication"

Claude (without TDD):
*Implements auth + session management + password reset + 2FA + OAuth*
# Hallucinated features not requested

Solution: Test-Anchored Iteration

Claude (with TDD):
1. Writes tests ONLY for requested features
2. Tests define exact scope and behavior
3. Implementation guided by test requirements
4. Iteration stops when tests pass
5. No feature creep - tests are ground truth

Watch Mode & Continuous Testing

/tdd-workflow "shopping cart service" --watch

Behavior:

Monitors file changes
Re-runs affected tests automatically
Shows real-time coverage updates
Alerts when tests fail

Terminal Output:

Watching: src/**/*.ts, tests/**/*.test.ts

✓ 24 tests passing
✓ Coverage: 87.3%

Waiting for changes... (press 'q' to quit)

[File changed: src/services/cart.service.ts]
Re-running tests...

✗ 1 test failing
  CartService > should remove item from cart
    Expected: 1 item, Received: 2 items

Coverage: 85.1% (↓ 2.2%)

Configuration

Custom Test Runner

// .claude/tdd.config.json
{
  "framework": "vitest",
  "coverage": {
    "enabled": true,
    "threshold": 80,
    "reportDir": "coverage"
  },
  "testMatch": ["**/*.test.ts", "**/*.spec.ts"],
  "autoCommit": true,
  "commitMessage": "test: {{feature}} - all tests passing ✓"
}

Best Practices Summary

Always Write Tests First: No implementation before failing tests
Iterate Until Green: Let AI refine until all tests pass
Minimum Code: Implement only what's needed to pass tests
Refactor Fearlessly: Tests protect against regressions
Descriptive Names: Test names document expected behavior
High Coverage: Target 80%+ for production code
Fast Feedback: Use watch mode for continuous verification
Commit on Green: Auto-commit when test suite passes

Content outline

Features
Usage
Workflow Modes
Test Frameworks
Coverage Options
Behavior Modifiers
Examples
Basic TDD Workflow - User Authentication
Integration Testing Workflow
E2E Testing Workflow with Playwright
Full Test Pyramid Workflow
TDD Best Practices
1. Write Tests First (Always)
2. Test Behavior, Not Implementation
3. Descriptive Test Names
4. Arrange-Act-Assert Pattern

#tdd#testing#red-green-refactor#test-automation#quality#ai-testing

Source citations

Signals

Loading live community signals…

Schema details

About this resource

Features

Usage

Workflow Modes

Test Frameworks

Coverage Options

Behavior Modifiers

Examples

Basic TDD Workflow - User Authentication

Integration Testing Workflow

E2E Testing Workflow with Playwright

Full Test Pyramid Workflow

TDD Best Practices

1. Write Tests First (Always)

2. Test Behavior, Not Implementation

3. Descriptive Test Names

4. Arrange-Act-Assert Pattern

Anti-Hallucination Benefits

Problem: LLM Scope Drift

Solution: Test-Anchored Iteration

Watch Mode & Continuous Testing

Configuration

Custom Test Runner

Best Practices Summary

Source citations

Related resources

Generate Tests for Claude

Test-Driven Development Enforcer - CLAUDE.md Rules for Claude Code

/review - Code Review Command for Claude Code

/test-advanced - Test Suite Command for Claude Code

Signals