📝 docs: add GODOG_RANDOM_SEED documentation to BDD_TAGS.md

🐛 fix: use int64 for Randomize field to match godog.Options type
🎲 feat: make test randomization seed configurable via GODOG_RANDOM_SEED
2026-04-10 17:39:02 +02:00 · 2026-04-10 17:37:55 +02:00 · 2026-04-10 17:35:47 +02:00 · 2026-04-10 16:52:58 +02:00 · 2026-04-10 16:44:49 +02:00 · 2026-04-10 16:06:21 +02:00
59 changed files with 2365 additions and 3890 deletions
--- a/.gitea/workflows/README.md
+++ b/.gitea/workflows/README.md
@@ -1,234 +0,0 @@
-# CI/CD Workflow Architecture
-
-## 🗺️ Overview
-
-The dance-lessons-coach project uses a **multi-workflow architecture** for better separation of concerns, maintainability, and flexibility.
-
-## 📁 Workflow Files
-
-### 1. `ci-cd.yaml` - Main CI/CD Pipeline
-
-**Purpose**: Run tests, build binaries, and generate documentation
-
-**Triggers**:
- Push to `main`, `ci/**`, `feature/**`, `fix/**`, `refactor/**` branches
- Pull requests to `main` branch
- Manual workflow dispatch
-
-**Jobs**:
-1. **build-cache** - Build and cache Docker build environment
-2. **ci-pipeline** - Run tests, build binaries, generate Swagger docs
-3. **trigger-docker-push** - Trigger separate Docker workflow on main branch
-
-**Key Features**:
- Runs in container environment with all build tools
- Generates Swagger documentation
- Runs BDD and unit tests with PostgreSQL
- Updates badges and version information
- Triggers Docker workflow only on main branch
-
-### 2. `docker-push.yaml` - Docker Image Publishing
-
-**Purpose**: Build and push Docker images to registry
-
-**Triggers**:
- Manual workflow dispatch only (no automatic triggers)
- Triggered by `ci-cd.yaml` on main branch
-
-**Jobs**:
-1. **docker-push** - Build production Docker image and push to registry
-
-**Key Features**:
- Runs on host environment (access to Docker daemon)
- Uses dependency hash from build-cache
- Builds minimal Alpine-based production image
- Pushes multiple tags (version, latest, commit SHA)
-
-## 🔧 Architecture Benefits
-
-### 1. Clear Separation of Concerns
- **CI/CD Pipeline**: Testing and artifact generation
- **Docker Publishing**: Image building and registry operations
-
-### 2. Proper Environment Isolation
- **CI jobs run in container**: Consistent build environment
- **Docker jobs run on host**: Access to Docker daemon
-
-### 3. Flexible Testing
- Can trigger Docker workflow independently for testing
- No complex conditional logic in main workflow
- Easier to debug and maintain
-
-### 4. Better Security
- Docker operations isolated in separate workflow
- Clear dependency between test success and deployment
- Manual trigger capability for emergency situations
-
-## 🚀 Usage Examples
-
-### Trigger Full CI/CD Pipeline
-```bash
-# Automatically triggered on push to main branch
-# Or manually:
-./scripts/gitea-client.sh trigger-workflow arcodange dance-lessons-coach ci-cd.yaml main
-```
-
-### Trigger Docker Push Manually
-```bash
-# Get dependency hash from build-cache job first
-DEPS_HASH="abc123def456"
-
-# Trigger Docker workflow manually
-./scripts/gitea-client.sh trigger-workflow arcodange dance-lessons-coach docker-push.yaml main --deps_hash $DEPS_HASH
-```
-
-### Workflow Dispatch Parameters (docker-push.yaml)
- `deps_hash` (required): Dependency hash from build-cache job
- `ref` (optional): Git reference (branch/tag), defaults to current
-
-## 🔗 Workflow Dependencies
-
-```mermaid
-graph TD
-    A[Push to main] --> B[ci-cd.yaml]
-    B --> C[build-cache job]
-    B --> D[ci-pipeline job]
-    D --> E[trigger-docker-push job]
-    E --> F[docker-push.yaml]
-    F --> G[docker-push job]
-    G --> H[Docker Registry]
-```
-
-## 📋 Best Practices
-
-### 1. Always Run CI First
- Docker workflow should only be triggered after CI passes
- Maintains quality gate before deployment
-
-### 2. Use Dependency Hash
- Ensures consistent builds across workflows
- Pass hash from build-cache to docker-push
-
-### 3. Manual Testing
- Use separate Docker workflow for testing image builds
- Avoids polluting main branch with test images
-
-### 4. Monitor Both Workflows
- CI/CD workflow for test results and artifacts
- Docker workflow for image build and push status
-
-## 🎯 Docker Build Strategy Decision
-
-### 🏆 Chosen Approach: Attempt 2 (Standard Dockerfile)
-
-After extensive testing of multiple approaches, we selected **Attempt 2** as the optimal Docker build strategy.
-
-#### ⚡ Why Attempt 2 Won:
-
-**1. Simplicity (60% smaller workflow)**
- 73 lines vs 158 lines in complex approaches
- No inline Dockerfile generation
- Standard `docker build -f docker/Dockerfile .` command
-
-**2. Better Performance**
- No artifact/cache action overhead
- Natural Docker layer caching works optimally
- Faster execution without complex variable substitutions
-
-**3. Superior Reliability**
- Proven standard Docker build process
- Easier to debug and maintain
- Fewer moving parts = fewer failures
-
-**4. Better Maintainability**
- Uses standard Dockerfile (easier to understand)
- No complex YAML templating
- Clear separation of concerns
-
-#### 🗑️ Why We Rejected Other Approaches:
-
-**Attempt 1 (Inline Dockerfile):**
- Complex YAML templating
- Harder to debug and maintain
- No significant performance benefit
-
-**Attempt 3 (Build Cache Image):**
- Added complexity with cache management
- Slower due to artifact actions overhead
- More prone to cache invalidation issues
-
-**Attempt 4 (Template File):**
- Added unnecessary file management
- No clear advantage over standard Dockerfile
- More complex workflow
-
-### 📊 Performance Comparison:
-
-| Approach | Lines of Code | Complexity | Reliability | Maintainability |
-|----------|---------------|------------|-------------|-----------------|
-| **Attempt 2** | 73 | Low | High | Excellent |
-| Attempt 1 | 158 | High | Medium | Poor |
-| Attempt 3 | 125 | Medium | Medium | Fair |
-| Attempt 4 | 110 | Medium | High | Good |
-
-### 🔧 Implementation Details:
-
-**Standard Dockerfile Approach:**
-```yaml
- name: Build and push Docker image
-  run: |
-    docker build -t dance-lessons-coach -f docker/Dockerfile .
-    docker tag dance-lessons-coach "$IMAGE_NAME"
-    docker push "$IMAGE_NAME"
-```
-
-**Key Benefits:**
- Uses multi-stage builds for optimization
- Standard Docker layer caching works naturally
- Easy to understand and modify
- Proven reliability in production
-
-## 🎯 Future Enhancements
-
-### Potential Improvements:
- Add workflow status badges to README
- Implement workflow chaining with outputs
- Add matrix builds for multiple architectures
- Implement canary deployment workflow
- Add rollback capability
-
-### Architecture Considerations:
- Keep workflows focused on single responsibilities
- Maintain clear separation between test and deploy
- Document all workflow triggers and conditions
- Monitor workflow execution times and optimize
-
-## 📝 Maintenance
-
-### Adding New Jobs:
- Add to appropriate workflow based on responsibility
- CI-related jobs → `ci-cd.yaml`
- Docker-related jobs → `docker-push.yaml`
-
-### Modifying Triggers:
- Update trigger conditions in respective workflow files
- Test changes thoroughly before merging
-
-### Debugging:
- Check workflow logs in Gitea Actions
- Use `gitea-client.sh diagnose-job` for detailed analysis
- Monitor workflow dependencies and execution order
-
-## 🔒 Security
-
-### Secrets Management:
- Docker registry credentials stored in Gitea secrets
- Never hardcode credentials in workflow files
- Use GitHub token for workflow dispatch
-
-### Access Control:
- Only authorized users can trigger workflows
- Manual approval required for production deployments
- Audit logs available for all workflow executions
-
-This architecture provides a clean, maintainable, and secure CI/CD pipeline that scales well with project growth while maintaining clear separation of concerns.
--- a/.gitea/workflows/ci-cd.yaml
+++ b/.gitea/workflows/ci-cd.yaml
@@ -132,8 +132,7 @@ jobs:
    name: CI Pipeline
    needs: build-cache
    runs-on: ubuntu-latest-ca
-    # Skip conditions: standard skip ci + actor check + respect skip_ci input
-    if: "!contains(github.event.head_commit.message, '[skip ci]') && github.actor != 'ci-bot' && (!github.event.inputs.skip_ci || github.event.inputs.skip_ci == 'false')"
+    if: "!contains(github.event.head_commit.message, '[skip ci]') && github.actor != 'ci-bot'"
    
    container:
      image: ${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}-build-cache:${{ needs.build-cache.outputs.deps_hash }}
@@ -154,9 +153,9 @@ jobs:
        run: |
          echo "DLC_DATABASE_HOST=postgres" >> $GITHUB_ENV
          echo "DLC_DATABASE_PORT=5432" >> $GITHUB_ENV
-          echo "DLC_DATABASE_USER=$POSTGRES_USER" >> $GITHUB_ENV
-          echo "DLC_DATABASE_PASSWORD=$POSTGRES_PASSWORD" >> $GITHUB_ENV
-          echo "DLC_DATABASE_NAME=$POSTGRES_DB" >> $GITHUB_ENV
+          echo "DLC_DATABASE_USER=postgres" >> $GITHUB_ENV
+          echo "DLC_DATABASE_PASSWORD=postgres" >> $GITHUB_ENV
+          echo "DLC_DATABASE_NAME=dance_lessons_coach_bdd_test" >> $GITHUB_ENV
          echo "DLC_DATABASE_SSL_MODE=disable" >> $GITHUB_ENV

      - name: Restore Swagger Docs Cache
@@ -305,23 +304,47 @@ jobs:
            echo "ℹ️  No changes to push"
          fi

+      # Docker build and push (main branch only)
+      - name: Login to Gitea Container Registry
+        if: github.ref == 'refs/heads/main'
+        uses: docker/login-action@v3
+        with:
+          registry: ${{ env.CI_REGISTRY }}
+          username: ${{ github.actor }}
+          password: ${{ secrets.PACKAGES_TOKEN }}

-
-
-  # Trigger Docker push workflow on main branch
-  trigger-docker-push:
-    name: Trigger Docker Push
-    needs: [build-cache, ci-pipeline]
-    runs-on: ubuntu-latest-ca
-    if: "!contains(github.event.head_commit.message, '[skip ci]') && github.actor != 'ci-bot' && github.ref == 'refs/heads/main'"
-    
-    steps:
-      - name: Trigger Docker Push Workflow
+      - name: Build and push Docker image
+        if: github.ref == 'refs/heads/main'
        run: |
-          echo "🚀 Triggering Docker Push workflow..."
-          curl -X POST \
-            -H "Authorization: token ${{ secrets.GITEA_TOKEN || secrets.PACKAGES_TOKEN }}" \
-            -H "Content-Type: application/json" \
-            "${{ env.GITEA_INTERNAL }}api/v1/repos/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}/actions/workflows/docker-push.yaml/dispatches" \
-            -d '{"ref":"${{ github.ref }}"}'
-          echo "✅ Docker Push workflow triggered successfully!"
+          source VERSION
+          IMAGE_VERSION="$MAJOR.$MINOR.$PATCH${PRERELEASE:+-$PRERELEASE}"
+          
+          # Use the template file with proper dependency hash replacement
+          DEPS_HASH="${{ needs.build-cache.outputs.deps_hash }}"
+          echo "Using dependency hash: $DEPS_HASH"
+          
+          # Create Dockerfile.prod from template
+          sed "s/{{DEPS_HASH}}/$DEPS_HASH/g" docker/Dockerfile.prod.template > docker/Dockerfile.prod
+          
+          TAGS="$IMAGE_VERSION latest ${{ github.sha }}"
+          echo "Building Docker image with tags: $TAGS"
+          
+          # Build the production image
+          docker build -t dance-lessons-coach -f docker/Dockerfile.prod .
+          
+          for TAG in $TAGS; do
+            IMAGE_NAME="${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:$TAG"
+            echo "Tagging and pushing: $IMAGE_NAME"
+            docker tag dance-lessons-coach "$IMAGE_NAME"
+            docker push "$IMAGE_NAME"
+          done
+
+      - name: Show published images
+        if: github.ref == 'refs/heads/main'
+        run: |
+          source VERSION
+          IMAGE_VERSION="$MAJOR.$MINOR.$PATCH${PRERELEASE:+-$PRERELEASE}"
+          echo "📦 Published Docker images:"
+          echo "  - ${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:$IMAGE_VERSION"
+          echo "  - ${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:latest"
+          echo "  - ${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:${{ github.sha }}"
--- a/.gitea/workflows/docker-push.yaml
+++ b/.gitea/workflows/docker-push.yaml
@@ -1,73 +0,0 @@
---
-# dance-lessons-coach Docker Push Workflow
-# Separate workflow for Docker image building and pushing
-# Can be triggered manually or by CI/CD workflow
-
-name: Docker Push
-
-on:
-  # Manual trigger for testing or production
-  workflow_dispatch:
-    inputs:
-      ref:
-        description: 'Git reference (branch/tag)'
-        required: false
-        type: string
-        default: ''
-
-# Environment variables
-env:
-  GITEA_INTERNAL: "https://gitea.arcodange.lab/"
-  GITEA_EXTERNAL: "https://gitea.arcodange.fr/"
-  GITEA_ORG: "arcodange"
-  GITEA_REPO: "dance-lessons-coach"
-  CI_REGISTRY: "gitea.arcodange.lab"
-
-jobs:
-  docker-push:
-    name: Docker Push
-    runs-on: ubuntu-latest-ca
-    
-    steps:
-      - name: Checkout code
-        uses: actions/checkout@v4
-        with:
-          ref: ${{ github.event.inputs.ref || github.ref }}
-
-      - name: Login to Gitea Container Registry
-        uses: docker/login-action@v3
-        with:
-          registry: ${{ env.CI_REGISTRY }}
-          username: ${{ github.actor }}
-          password: ${{ secrets.PACKAGES_TOKEN }}
-
-
-
-
-
-      - name: Build and push Docker image
-        run: |
-          source VERSION
-          IMAGE_VERSION="$MAJOR.$MINOR.$PATCH${PRERELEASE:+-$PRERELEASE}"
-          
-          TAGS="$IMAGE_VERSION latest ${{ github.sha }}"
-          echo "Building Docker image with tags: $TAGS"
-          
-          # Build using the standard Dockerfile (Attempt 2 - simplest approach)
-          docker build -t dance-lessons-coach -f docker/Dockerfile .
-          
-          for TAG in $TAGS; do
-            IMAGE_NAME="${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:$TAG"
-            echo "Tagging and pushing: $IMAGE_NAME"
-            docker tag dance-lessons-coach "$IMAGE_NAME"
-            docker push "$IMAGE_NAME"
-          done
-
-      - name: Show published images
-        run: |
-          source VERSION
-          IMAGE_VERSION="$MAJOR.$MINOR.$PATCH${PRERELEASE:+-$PRERELEASE}"
-          echo "📦 Published Docker images:"
-          echo "  - ${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:$IMAGE_VERSION"
-          echo "  - ${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:latest"
-          echo "  - ${{ env.CI_REGISTRY }}/${{ env.GITEA_ORG }}/${{ env.GITEA_REPO }}:${{ github.sha }}"
--- a/.gitignore
+++ b/.gitignore
@@ -24,7 +24,7 @@ server.pid
 pkg/server/docs/

 # BDD test files
-features/**/*-config.yaml
+features/*/*-config.yaml
 test-config.yaml
 test-v2-config.yaml

--- a/.vibe/skills/gitea-client/scripts/gitea-client.sh
+++ b/.vibe/skills/gitea-client/scripts/gitea-client.sh
@@ -203,31 +203,6 @@ cmd_wait_job() {
 }

 # Comment on PR
-# Create a pull request
-cmd_create_pr() {
-    local owner="$1"
-    local repo="$2"
-    local title="$3"
-    local body="$4"
-    local head="$5"
-    local base="${6:-main}"
-
-    if [[ -z "$owner" || -z "$repo" || -z "$title" || -z "$head" ]]; then
-        echo "Usage: $0 create-pr <owner> <repo> <title> <body> <head_branch> [base_branch]" >&2
-        exit 1
-    fi
-
-    local endpoint="/repos/${owner}/${repo}/pulls"
-    local data
-    data=$(jq -n \
-        --arg title "$title" \
-        --arg body "$body" \
-        --arg head "$head" \
-        --arg base "$base" \
-        '{title: $title, body: $body, head: $head, base: $base}')
-    api_request "POST" "$endpoint" "$data"
-}
-
 cmd_comment_pr() {
    local owner="$1"
    local repo="$2"
@@ -240,8 +215,7 @@ cmd_comment_pr() {
    fi
    
    local endpoint="/repos/${owner}/${repo}/issues/${pr_number}/comments"
-    local data
-    data=$(jq -n --arg body "$comment" '{body: $body}')
+    local data="{\"body\": \"${comment}\"}"
    api_request "POST" "$endpoint" "$data"
 }

@@ -276,7 +250,6 @@ main() {
        monitor-workflow) cmd_monitor_workflow "$@" ;;
        diagnose-job) cmd_diagnose_job "$@" ;;
        recent-workflows) cmd_recent_workflows "$@" ;;
-        create-pr) cmd_create_pr "$@" ;;
        comment-pr) cmd_comment_pr "$@" ;;
        pr-status) cmd_pr_status "$@" ;;
        list-issues) cmd_list_issues "$@" ;;
@@ -301,7 +274,6 @@ main() {
            echo "  monitor-workflow <owner> <repo> <workflow_run_id> [interval_seconds]" >&2
            echo "  diagnose-job <owner> <repo> <job_id>" >&2
            echo "  recent-workflows <owner> <repo> [limit] [status_filter]" >&2
-            echo "  create-pr <owner> <repo> <title> <body> <head_branch> [base_branch]" >&2
            echo "  comment-pr <owner> <repo> <pr_number> <comment>" >&2
            echo "  pr-status <owner> <repo> <pr_number>" >&2
            echo "  list-issues <owner> <repo> [state]" >&2
--- a/AGENTS.md
+++ b/AGENTS.md
--- a/AGENT_CHANGELOG.md
+++ b/AGENT_CHANGELOG.md
@@ -1,32 +0,0 @@
-# AGENT_CHANGELOG
-
-Trace ordonnée des décisions et actions structurantes prises par les agents AI (Claude Code, Mistral Vibe, autres) sur le projet `dance-lessons-coach`. Complémentaire au [`CHANGELOG.md`](CHANGELOG.md) qui couvre les changements user-facing du produit.
-
-**Pourquoi ce fichier** : référencé dans la documentation directrice (cf. AGENTS.md), mais initialement absent du repo. Initialisé dans le cadre de la Tâche 6 du curriculum migration Claude → Mistral Vibe (ARCODANGE Phase 1).
-
-## Convention
-
-Une entrée par décision/action structurante prise par un agent AI. Format :
-
-```
-## YYYY-MM-DD — <Agent> — <Titre court>
-
-**Contexte** : 1-3 lignes — pourquoi cette action
-**Décision/Action** : ce qui a été fait
-**Conséquence** : impact sur le projet (fichiers, conventions, workflows)
-**Référence** : commit hash, PR Gitea, ADR, issue (le cas échéant)
-```
-
-Les entrées qui ne demandent pas de discussion (typo fixes, formatting, dependency bumps mineurs) ne sont **pas** loguées ici — c'est ce que fait le commit Git. Ce fichier garde uniquement les décisions où le **pourquoi** mérite une trace.
-
---
-
-## 2026-05-02 — Mistral Vibe (intent-router) + Claude Code (Opus 4.7) — Initialisation AGENT_CHANGELOG.md
-
-**Contexte** : Tâche 6 du curriculum migration ARCODANGE Phase 1 (cf. `~/.vibe/plans/migration-claude-vers-mistral-phase-1.md`). Le fichier `AGENT_CHANGELOG.md` était mentionné dans la documentation directrice projet mais n'existait pas — friction identifiée par l'audit Phase A.
-
-**Décision/Action** : initialiser le fichier avec convention claire et pointer depuis `AGENTS.md` (Tâche 6 Phase C).
-
-**Conséquence** : tout agent qui prend une décision structurante sur le projet doit ajouter une entrée datée ici. Permet la traçabilité des choix AI au-delà des commits Git.
-
-**Référence** : Tâche 6 du plan migration. Voir aussi `~/.vibe/plans/task-6-phase-a-results.md` pour le contexte complet de la restructuration en cours.
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,57 +0,0 @@
-# Changelog
-
-Notable user-facing changes to `dance-lessons-coach`. Format inspired by [Keep a Changelog](https://keepachangelog.com/), versioning follows [Semantic Versioning 2.0.0](https://semver.org/) (see [`documentation/version-management-guide.md`](documentation/version-management-guide.md)).
-
-The historical phases of foundational development (Phase 1 to Phase 9) are documented in [`documentation/HISTORY.md`](documentation/HISTORY.md).
-
-## [Unreleased]
-
-### Added
-
-_(items pending release; move to a versioned section when tagged)_
-
-### Changed
-
-### Fixed
-
---
-
-## 2026-04-05 — Architecture Documentation
-
- ✅ Added comprehensive ADR directory with 9 decision records
- ✅ Enhanced Zerolog vs Zap analysis in logging ADR
- ✅ Updated `README.md` and `AGENTS.md` with ADR references
- ✅ Documented hybrid testing approach
- ✅ Added BDD testing decision record
-
-## 2026-04-04 — Observability & Testing
-
- ✅ OpenTelemetry integration with Jaeger
- ✅ Middleware-only tracing approach
- ✅ Comprehensive telemetry configuration
- ✅ BDD testing framework setup
- ✅ Hybrid testing strategy documentation
-
-## 2026-04-03 — Production Readiness
-
- ✅ Graceful shutdown with readiness endpoints
- ✅ Configuration management with Viper
- ✅ JSON logging configuration
- ✅ File output logging support
- ✅ Comprehensive error handling
-
-## 2026-04-02 — Web API Foundation
-
- ✅ Chi router integration
- ✅ Versioned API endpoints (`/api/v1`)
- ✅ Health and readiness endpoints
- ✅ JSON responses with proper headers
- ✅ Interface-based design patterns
-
-## 2026-04-01 — Project Foundation
-
- ✅ Go 1.26.1 environment setup
- ✅ Project structure with `cmd/` and `pkg/`
- ✅ Core Greet service implementation
- ✅ CLI interface
- ✅ Unit tests with table-driven approach
--- a/README.md
+++ b/README.md
@@ -1,98 +1,423 @@
 # dance-lessons-coach

-[![Build Status](https://gitea.arcodange.fr/arcodange/dance-lessons-coach/actions/workflows/ci-cd.yaml/badge.svg)](https://gitea.arcodange.fr/arcodange/dance-lessons-coach/actions/workflows/ci-cd.yaml)
+[![Build Status](https://gitea.arcodange.fr/api/badges/arcodange/dance-lessons-coach/status)](https://gitea.arcodange.fr/arcodange/dance-lessons-coach)
+[![Go Report Card](https://goreportcard.com/badge/github.com/arcodange/dance-lessons-coach)](https://goreportcard.com/report/github.com/arcodange/dance-lessons-coach)
 [![Version](https://img.shields.io/badge/version-1.4.0-blue.svg)](https://gitea.arcodange.fr/arcodange/dance-lessons-coach/releases)
 [![License](https://img.shields.io/badge/license-MIT-green.svg)](LICENSE)
+[![Unit Coverage](https://img.shields.io/badge/Unit_Coverage-9.4%-red?style=flat-square)](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)
+[![BDD Coverage](https://img.shields.io/badge/BDD_Coverage-59.2%-yellow?style=flat-square)](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)
+[![BDD Coverage](https://img.shields.io/badge/BDD_Coverage-55.9%-yellow?style=flat-square)](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)
+[![Unit Coverage](https://img.shields.io/badge/Unit_Coverage-8.4%-red?style=flat-square)](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)

-Go web service demonstrating idiomatic package structure, versioned JSON API, and production-ready features.
+A Go project demonstrating idiomatic package structure, CLI implementation, and JSON API with Chi router.
+=======

 ## Features

- Versioned JSON API (`/api/v1`, `/api/v2`)
- Chi router with graceful shutdown
- Zerolog structured logging (console and JSON modes)
- Viper configuration (file + env vars)
- Readiness endpoint for Kubernetes / service mesh
- OpenTelemetry / Jaeger distributed tracing
- OpenAPI / Swagger UI (embedded in binary)
- PostgreSQL user service with JWT auth
- BDD + unit tests
+- Greet function with default behavior
+- Command-line interface
+- JSON API with versioned endpoints
+- Chi router integration
+- Zerolog for high-performance logging
+- Viper for configuration management
+- Graceful shutdown with context
+- Readiness endpoint for Kubernetes/service mesh integration
+- OpenTelemetry integration with Jaeger support
+- OpenAPI/Swagger documentation
+- Unit tests
+- Go 1.26.1 compatible

-## Quick Start
+## Installation

 ```bash
+# Clone the repository
 git clone https://gitea.arcodange.lab/arcodange/dance-lessons-coach.git
 cd dance-lessons-coach
-./scripts/build.sh          # produces ./bin/server and ./bin/greet
-./scripts/start-server.sh start
+
+# Build all binaries
+./scripts/build.sh
+
+# Use the new Cobra CLI
+./bin/dance-lessons-coach --help
+
+# Or use the legacy greet CLI
+go run ./cmd/greet
 ```

+## CI/CD Pipeline
+
+dance-lessons-coach features an optimized CI/CD pipeline using GitHub Actions with container/services architecture:
+
+### Key Features
+- ✅ **Container-based execution**: All steps run in pre-built Docker cache images
+- ✅ **Service-based PostgreSQL**: Automatic database service provisioning
+- ✅ **Smart caching**: Dependency-aware cache invalidation
+- ✅ **Multi-platform**: Compatible with Gitea, GitHub, and GitLab
+- ✅ **Fast execution**: No Docker Compose overhead
+- ✅ **Reliable testing**: Full database connectivity with proper environment setup
+
+### Architecture
+
+The pipeline uses GitHub Actions' native `container` and `services` directives instead of Docker Compose:
+
+```yaml
+jobs:
+  ci-pipeline:
+    container:
+      image: gitea.arcodange.lab/arcodange/dance-lessons-coach-build-cache:${{ needs.build-cache.outputs.deps_hash }}
+    
+    services:
+      postgres:
+        image: postgres:15
+        env:
+          POSTGRES_USER: postgres
+          POSTGRES_PASSWORD: postgres
+          POSTGRES_DB: dance_lessons_coach_bdd_test
+```
+
+### Benefits
+
+1. **Performance**: Direct container execution without compose overhead
+2. **Reliability**: Service containers managed by GitHub Actions
+3. **Simplicity**: Cleaner workflow definition
+4. **Portability**: Works across CI platforms
+5. **Caching**: Intelligent dependency-based cache rebuilding
+
+### Workflow Steps
+
+1. **Build Cache**: Creates Docker image with Go tools and dependencies
+2. **CI Pipeline**: Runs tests, builds binaries, and generates documentation
+3. **Database Tests**: Connects to PostgreSQL service container
+4. **Coverage Reporting**: Updates coverage badges automatically
+5. **Artifact Publishing**: Builds and pushes Docker images (main branch only)
+
+### Environment Configuration
+
+The pipeline automatically sets up database environment variables:
+
 ```bash
-curl http://localhost:8080/api/health
-curl http://localhost:8080/api/v1/greet/Alice
+echo "DLC_DATABASE_HOST=postgres" >> $GITHUB_ENV
+echo "DLC_DATABASE_PORT=5432" >> $GITHUB_ENV
+echo "DLC_DATABASE_USER=postgres" >> $GITHUB_ENV
+echo "DLC_DATABASE_PASSWORD=postgres" >> $GITHUB_ENV
+echo "DLC_DATABASE_NAME=dance_lessons_coach_bdd_test" >> $GITHUB_ENV
+echo "DLC_DATABASE_SSL_MODE=disable" >> $GITHUB_ENV
 ```

-Stop: `./scripts/start-server.sh stop`
+### Status

-## Greet CLI
+[![Build Status](https://gitea.arcodange.fr/api/badges/arcodange/dance-lessons-coach/status)](https://gitea.arcodange.fr/arcodange/dance-lessons-coach)

-```bash
-go run ./cmd/greet           # Hello world!
-go run ./cmd/greet Alice     # Hello Alice!
+=======
+- ✅ **Linting**: Code quality checks with `go fmt` and `go vet`
+- ✅ **Version Management**: Automatic version detection
+- ✅ **Portable**: Uses standard GitHub Actions workflow format
+
+### Workflow File
+```yaml
+# .github/workflows/main.yml
+jobs:
+  build-test:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+      - uses: actions/setup-go@v4
+        with:
+          go-version: '1.26.1'
+      - run: go build ./...
+      - run: go test ./... -cover
+
+  lint-format:
+    runs-on: ubuntu-latest
+    steps:
+      - run: go fmt ./...
+      - run: go vet ./...
 ```

+### Setup Instructions
+1. **Gitea**: Enable GitHub Actions compatibility in repo settings
+2. **GitHub**: Push to mirror repository (workflow runs automatically)
+3. **GitLab**: Convert workflow to `.gitlab-ci.yml` or use compatibility mode
+
+**See [ADR 0016](adr/0016-ci-cd-pipeline-design.md) for complete CI/CD design and [STATUS_BADGES.md](STATUS_BADGES.md) for badge setup.**
+
 ## Configuration

-All options are available via `config.yaml` or `DLC_*` environment variables.
+Basic configuration options:

-| Env var | Default | Description |
-|---------|---------|-------------|
-| `DLC_SERVER_PORT` | `8080` | Listening port |
-| `DLC_SERVER_HOST` | `0.0.0.0` | Bind address |
-| `DLC_LOGGING_JSON` | `false` | JSON log format |
-| `DLC_LOGGING_OUTPUT` | stderr | Log file path |
-| `DLC_SHUTDOWN_TIMEOUT` | `30s` | Graceful shutdown window |
-| `DLC_API_V2_ENABLED` | `false` | Enable `/api/v2` routes |
-| `DLC_CONFIG_FILE` | `./config.yaml` | Override config path |
+```bash
+# Start with default configuration
+./scripts/start-server.sh start

-See `config.example.yaml` for a full template.
+# Custom port
+export DLC_SERVER_PORT=9090
+./scripts/start-server.sh start

-## API
+# JSON logging
+export DLC_LOGGING_JSON=true
+./scripts/start-server.sh start
+```

-| Method | Path | Description |
-|--------|------|-------------|
-| GET | `/api/health` | Liveness check |
-| GET | `/api/ready` | Readiness check (503 during shutdown) |
-| GET | `/api/version` | Version info (`?format=plain\|full\|json`) |
-| GET | `/api/v1/greet/` | Default greeting |
-| GET | `/api/v1/greet/{name}` | Named greeting |
-| POST | `/api/v2/greet` | V2 greeting with validation |
-| GET | `/swagger/` | Swagger UI |
+**See [AGENTS.md](AGENTS.md#configuration-management) for comprehensive configuration guide including:**
+- File-based configuration
+- Environment variables
+- Configuration priority rules
+- OpenTelemetry setup
+- Advanced scenarios
+
+## Usage
+
+### New Cobra CLI (Recommended)
+
+```bash
+# Show help
+./bin/dance-lessons-coach --help
+
+# Show version
+./bin/dance-lessons-coach version
+
+# Greet someone
+./bin/dance-lessons-coach greet John
+
+# Start server
+./bin/dance-lessons-coach server
+```
+
+### Legacy CLI (Deprecated)
+
+```bash
+# Default greeting
+go run ./cmd/greet
+# Output: Hello world!
+
+# Custom greeting
+go run ./cmd/greet John
+# Output: Hello John!
+```
+
+### Web Server
+
+**Using the server control script (recommended):**
+
+```bash
+# Start the server
+./scripts/start-server.sh start
+
+# Test API endpoints
+./scripts/start-server.sh test
+
+# Access OpenAPI documentation
+# Swagger UI: http://localhost:8080/swagger/
+# OpenAPI spec: http://localhost:8080/swagger/doc.json
+
+# Stop the server
+./scripts/start-server.sh stop
+```
+
+**Manual server management:**
+
+```bash
+# Start the server
+go run ./cmd/server
+
+# Test API endpoints
+curl http://localhost:8080/api/health
+# Output: {"status":"healthy"}
+
+curl http://localhost:8080/api/ready
+# Output: {"ready":true}
+
+curl http://localhost:8080/api/v1/greet
+# Output: {"message":"Hello world!"}
+
+curl http://localhost:8080/api/v1/greet/John
+# Output: {"message":"Hello John!"}
+```

 ## Testing

 ```bash
-go test ./...                          # unit + integration tests
-./scripts/test-graceful-shutdown.sh    # lifecycle + JSON logging validation
-./scripts/test-opentelemetry.sh        # tracing end-to-end
+# Run all tests
+go test ./...
+
+# Run specific package tests
+go test ./pkg/greet/
 ```

-## Gitea Client
+## CI/CD

-AI agent helper script at `.vibe/skills/gitea-client/scripts/gitea-client.sh`.
+dance-lessons-coach includes a comprehensive CI/CD pipeline with multiple testing options:

-Auth setup:
+### Local Testing (No Gitea Required)
 ```bash
-echo "your_token" > ~/.gitea_token
-chmod 600 ~/.gitea_token
-export GITEA_API_TOKEN_FILE="$HOME/.gitea_token"
+# Validate workflow structure
+./scripts/cicd.sh validate
+
+# Test workflow steps locally
+./scripts/cicd.sh test-simple
 ```

-Get a token at https://gitea.arcodange.lab → Profile → Settings → Applications.
+### Gitea Integration
+```bash
+# Test local setup with Gitea configuration
+./scripts/cicd.sh test-local
+
+# Check pipeline status on Gitea
+./scripts/cicd.sh check-status
+```
+
+### Full CI/CD Testing
+```bash
+# Test with docker compose (requires Gitea runner)
+./scripts/cicd.sh test-docker
+```
+
+**See [adr/0016-ci-cd-pipeline-design.md](adr/0016-ci-cd-pipeline-design.md) for complete CI/CD architecture.**
+
+## Project Structure
+
+```
+dance-lessons-coach/
+├── adr/                    # Architecture Decision Records
+├── cmd/                    # Entry points (greet CLI, server)
+├── pkg/                    # Core packages (config, greet, server, telemetry)
+│   └── server/docs/        # Generated OpenAPI documentation (gitignored)
+├── config.yaml             # Configuration file
+├── scripts/                # Management scripts
+└── go.mod                   # Go module definition
+```
+
+**See [AGENTS.md](AGENTS.md#project-structure) for detailed structure and component explanations.**
+```
+
+## Development
+
+### Generate OpenAPI Documentation
+
+The project uses [swaggo/swag](https://github.com/swaggo/swag) to generate OpenAPI/Swagger documentation from code annotations:
+
+```bash
+# Generate documentation
+go generate ./pkg/server/
+
+# This creates:
+# - pkg/server/docs/docs.go (swagger template)
+# - pkg/server/docs/swagger.json (OpenAPI spec)
+# - pkg/server/docs/swagger.yaml (YAML version)
+```
+
+**Note:** `pkg/server/docs/` is gitignored. Documentation is embedded in the binary at build time.
+
+### Documentation Annotations
+
+Add swagger annotations to handlers and models:
+
+```go
+// @Summary Get personalized greeting
+// @Description Returns a greeting with the specified name
+// @Tags greet
+// @Accept json
+// @Produce json
+// @Param name path string true "Name to greet"
+// @Success 200 {object} GreetResponse "Successful response"
+// @Failure 400 {object} ErrorResponse "Invalid name parameter"
+// @Router /v1/greet/{name} [get]
+func (h *apiV1GreetHandler) handleGreetPath(w http.ResponseWriter, r *http.Request) {
+    // handler implementation
+}
+```

 ## Architecture

-Key decisions are documented in [adr/](adr/). See [AGENTS.md](AGENTS.md) for the full development reference (commands, config, ADR index, commit conventions).
+This project uses Architecture Decision Records (ADRs) to document key technical choices. See [adr/](adr/) for complete documentation including decisions on Go 1.26.1, Chi router, Zerolog, OpenTelemetry, interface-based design, graceful shutdown, configuration management, testing strategies, and OpenAPI documentation.
+
+**Adding new decisions?** See [adr/README.md](adr/README.md) for guidelines.
+
+## Gitea Integration
+
+dance-lessons-coach includes AI agent skills for Gitea integration to monitor CI/CD jobs and interact with pull requests.
+
+### Gitea Client Skill Setup
+
+The Gitea client skill enables AI agents to:
+- Monitor CI/CD job status
+- Fetch job logs for debugging
+- Comment on pull requests
+- Track PR status
+
+**Setup Instructions:**
+
+1. **Create a Personal Access Token:**
+   - Log in to https://gitea.arcodange.lab
+   - Go to Profile → Settings → Applications
+   - Generate token with `read:repository`, `write:repository`, and `read:user` scopes
+
+2. **Configure Authentication:**
+   ```bash
+   # Option 1: Environment variable
+   export GITEA_API_TOKEN="your_token"
+   
+   # Option 2: Token file (recommended)
+   echo "your_token" > ~/.gitea_token
+   chmod 600 ~/.gitea_token
+   export GITEA_API_TOKEN_FILE="$HOME/.gitea_token"
+   ```
+
+3. **Add to shell configuration:**
+   ```bash
+   echo 'export GITEA_API_TOKEN_FILE="$HOME/.gitea_token"' >> ~/.bashrc
+   source ~/.bashrc
+   ```
+
+**Usage Examples:**
+```bash
+# List recent jobs
+.vibe/skills/gitea-client/scripts/gitea-client.sh list-jobs owner repo workflow_id 5
+
+# Wait for job completion
+.vibe/skills/gitea-client/scripts/gitea-client.sh wait-job owner repo job_id 300
+
+# Comment on PR
+.vibe/skills/gitea-client/scripts/gitea-client.sh comment-pr owner repo 42 "Build completed!"
+```
+
+**Documentation:** See [.vibe/skills/gitea-client/README.md](.vibe/skills/gitea-client/README.md) for complete setup and usage guide.
+
+## 🤖 AI Agent Usage
+
+### Quick Launch Commands
+
+**Programmer Agent** (for code implementation, testing, CI/CD):
+```bash
+vibe start --agent dancelessonscoachprogrammer
+```
+
+**Product Owner Agent** (for requirements, interviews, documentation):
+```bash
+vibe start --agent dancelessonscoach-product-owner
+```
+
+### Full Documentation
+
+For complete agent usage guide including:
+- Agent selection guidance
+- Common workflow examples
+- Configuration reference
+- Best practices
+- Troubleshooting tips
+
+See: [AGENT_USAGE_GUIDE.md](documentation/AGENT_USAGE_GUIDE.md)
+
+### Gitmoji Cheatsheet
+
+Quick reference for commit messages:
+- **📝 `:memo:` docs** - Documentation
+- **✨ `:sparkles:` feat** - New feature
+- **🐛 `:bug:` fix** - Bug fix
+- **♻️ `:recycle:` refactor** - Code refactoring
+- **🔧 `:wrench:` chore** - Build/config changes
+
+Full cheatsheet: [GITMOJI_CHEATSHEET.md](documentation/GITMOJI_CHEATSHEET.md)

 ## License

--- a/adr/0008-bdd-testing.md
+++ b/adr/0008-bdd-testing.md
@@ -4,8 +4,6 @@
 * Deciders: Gabriel Radureau, AI Agent
 * Date: 2026-04-05

-> **⚠️ Structure superseded by ADR-0024.** The framework decision (Godog, in-process test server) remains valid. However, the flat `features/` layout and single `steps.go` file described here were replaced by a modular per-domain structure. See ADR-0024 for the current organisation: `features/{auth,greet,health,jwt,config}/` with domain-specific step files and per-domain `*_test.go` runners. The `cd features && godog` execution pattern is also outdated — each domain now uses `go test`.
-
 ## Context and Problem Statement

 We needed to add behavioral testing to dance-lessons-coach that provides:
--- a/adr/0009-hybrid-testing-approach.md
+++ b/adr/0009-hybrid-testing-approach.md
@@ -1,11 +1,10 @@
-# BDD Testing with OpenAPI Documentation
+# Combine BDD and Swagger-based testing

-* Status: Accepted
+* Status: ✅ Partially Implemented (BDD + Documentation only)
 * Deciders: Gabriel Radureau, AI Agent
 * Date: 2026-04-05
-* Last Updated: 2026-04-12
-
-> **⚠️ Title corrected.** This ADR was originally named "Combine BDD and Swagger-based testing" with the intent of eventually adding SDK-generated BDD tests as a second layer ("hybrid"). That second layer was deferred and has no concrete plan. The actual architecture is **BDD direct-HTTP testing + OpenAPI documentation via swaggo** — calling it "hybrid" is misleading. SDK generation remains a possible future enhancement but is not tracked by any open issue.
+* Last Updated: 2026-04-05
+* Implementation Status: BDD testing and OpenAPI documentation completed, SDK generation deferred

 ## Context and Problem Statement

--- a/adr/0011-validation-library-selection.md
+++ b/adr/0011-validation-library-selection.md
@@ -1,36 +0,0 @@
-# 11. Validation Library Selection
-
-* Status: Accepted
-* Deciders: Gabriel Radureau, AI Agent
-* Date: 2026-04-05
-* Implementation Date: 2026-04-05
-
-## Context and Problem Statement
-
-The dance-lessons-coach application needs input validation for API request bodies and configuration values. We need a library that integrates well with Go structs and provides clear error messages.
-
-## Decision Drivers
-
-* Struct-tag-based validation to avoid boilerplate
-* Good error messages with field-level detail
-* Active maintenance and wide adoption
-* Compatibility with existing interface-based design
-
-## Considered Options
-
-* `github.com/go-playground/validator/v10` — struct-tag driven, widely adopted
-* `github.com/asaskevich/govalidator` — tag-based but less expressive
-* Manual validation — full control, no dependency, high boilerplate
-
-## Decision Outcome
-
-Chosen option: **`go-playground/validator/v10`** because it is the de-facto standard in the Go ecosystem, supports struct-tag annotations, provides field-level error detail, and integrates cleanly with our interface-based design.
-
-## Implementation
-
-`github.com/go-playground/validator/v10 v10.30.2` is present in `go.mod`.
-The `pkg/validation/` package wraps the validator for reuse across handlers.
-
-## Links
-
-* [go-playground/validator GitHub](https://github.com/go-playground/validator)
--- a/adr/0013-openapi-swagger-toolchain.md
+++ b/adr/0013-openapi-swagger-toolchain.md
@@ -378,6 +378,68 @@ Added to `.gitea/workflows/go-ci-cd.yaml` lint-format job:
 # Format swagger comments manually
 swag fmt

+# Format is automatically run in:
+# - pre-commit hook
+# - CI/CD lint-format job
+```
+=======
+### Final Implementation
+
+```bash
+# 1. Install swaggo
+go install github.com/swaggo/swag/cmd/swag@latest
+
+# 2. Add swagger metadata to main.go
+// @title dance-lessons-coach API
+// @version 1.0
+// @description API for dance-lessons-coach service
+// @host localhost:8080
+// @BasePath /api
+package main
+```
+
+### Swag Formatting Integration
+
+To ensure consistent swagger comment formatting, we've integrated `swag fmt` into our workflow:
+
+#### Git Hooks
+Added to `.git/hooks/pre-commit`:
+```bash
+# Run swag fmt to format swagger comments
+echo "Running swag fmt..."
+if command -v swag >/dev/null 2>&1; then
+    swag fmt
+    if [ $? -ne 0 ]; then
+        echo "ERROR: swag fmt failed"
+        exit 1
+    fi
+else
+    echo "swag not installed, skipping swag fmt"
+fi
+```
+
+#### CI/CD Integration
+Added to `.gitea/workflows/go-ci-cd.yaml` lint-format job:
+```yaml
+- name: Install swag
+  run: go install github.com/swaggo/swag/cmd/swag@latest
+
+- name: Run swag fmt
+  run: swag fmt
+```
+
+#### Benefits
+- **Consistent Formatting**: Automatic formatting of swagger comments
+- **Pre-Commit Validation**: Catches issues before commit
+- **CI/CD Enforcement**: Ensures formatting in all pull requests
+- **Team Consistency**: Everyone follows the same rules
+- **Automatic Fixes**: Issues are fixed automatically
+
+#### Usage
+```bash
+# Format swagger comments manually
+swag fmt
+
 # Format is automatically run in:
 # - pre-commit hook
 # - CI/CD lint-format job
--- a/adr/0014-grpc-adoption-strategy.md
+++ b/adr/0014-grpc-adoption-strategy.md
@@ -1,44 +0,0 @@
-# 14. gRPC Adoption Strategy
-
-* Status: Rejected / Deferred
-* Deciders: Gabriel Radureau, AI Agent
-* Date: 2026-04-05
-
-## Context and Problem Statement
-
-As the API grows, gRPC was evaluated as an alternative or complement to REST for internal service communication. The question was whether to adopt gRPC alongside the existing Chi REST API.
-
-## Decision Drivers
-
-* Performance of inter-service communication
-* Type safety via Protocol Buffers
-* Streaming support
-* Team familiarity and operational overhead
-
-## Considered Options
-
-* **Hybrid REST/gRPC** — add gRPC endpoints alongside existing REST endpoints
-* **REST only** — maintain current Chi router approach
-* **gRPC-first with transcoding** — use bufbuild/connect for unified REST+gRPC
-
-## Decision Outcome
-
-Chosen option: **REST only (deferred)**. gRPC adoption is not warranted at the current scale. The application has a small number of endpoints, a single-binary deployment model, and no internal service mesh that would benefit from gRPC's efficiency.
-
-### Reasons for deferral
-
-1. **No inter-service communication today** — the application is a single binary; gRPC's main benefit (efficient binary RPC between services) does not apply
-2. **Complexity cost** — adding Protobuf toolchain, code generation, and a second transport layer would significantly increase cognitive overhead
-3. **Chi router commitment** — the REST API is well-designed with OpenAPI documentation; introducing gRPC in parallel creates dual-maintenance burden
-4. **Team capacity** — limited bandwidth for large architectural changes
-
-## When to reconsider
-
-* Application evolves into multiple services that need efficient internal RPC
-* Streaming use cases emerge (real-time lesson progress, etc.)
-* External consumers explicitly require gRPC endpoints
-
-## Links
-
-* [ADR-0002: Chi Router](0002-chi-router.md)
-* [ADR-0013: OpenAPI/Swagger Toolchain](0013-openapi-swagger-toolchain.md)
--- a/adr/0015-cli-subcommands-cobra.md
+++ b/adr/0015-cli-subcommands-cobra.md
@@ -222,7 +222,7 @@ dance-lessons-coach config validate

 ---

-**Status:** Accepted  
-**Implementation Date:** 2026-04-05  
+**Status:** Proposed  
+**Next Review:** 2026-04-12  
 **Implementation Owner:** Arcodange Team  
-**Approved by:** @gabrielradureau
+**Approvers Needed:** @gabrielradureau
--- a/adr/0018-user-management-auth-system.md
+++ b/adr/0018-user-management-auth-system.md
@@ -1,8 +1,7 @@
 # 18. User Management and Authentication System

-**Date:** 2026-04-06
-**Status:** Accepted
-**Implementation Date:** 2026-04-08
+**Date:** 2024-04-06
+**Status:** Proposed
 **Authors:** Product Owner
 **Decision Drivers:** Security, User Personalization, Admin Functionality

--- a/adr/0019-postgresql-integration.md
+++ b/adr/0019-postgresql-integration.md
@@ -1,13 +1,10 @@
 # 19. PostgreSQL Database Integration

-**Date:** 2026-04-07
-**Status:** Accepted (Partial)
-**Implementation Date:** 2026-04-08
+**Date:** 2024-04-07
+**Status:** Proposed
 **Authors:** Product Owner
 **Decision Drivers:** Data Persistence, Scalability, Production Readiness

-> **⚠️ Pending cleanup:** `pkg/user/sqlite_repository.go` and `gorm.io/driver/sqlite` still present in the codebase. The ADR requires their removal, but no Gitea issue tracks this yet. The PostgreSQL implementation (`pkg/user/postgres_repository.go`) is complete and in use.
-
 ## Context

 The dance-lessons-coach application currently uses SQLite with GORM for the user management system (ADR 0018), but since there are no existing users or production data, we can implement PostgreSQL directly as our primary database without migration concerns.
--- a/adr/0021-jwt-secret-retention-policy.md
+++ b/adr/0021-jwt-secret-retention-policy.md
@@ -1,13 +1,11 @@
-# 21. JWT Secret Retention Policy
+# 10. JWT Secret Retention Policy

 ## Status
 **Proposed** 🟡

-> **Note:** Basic JWT multi-secret support and graceful rotation are implemented in `pkg/jwt/jwt_secret_manager.go`. The retention cleanup policy (background job, configurable TTL factor) proposed in this ADR is **not yet implemented**.
-
 ## Context

-The dance-lessons-coach application requires a robust JWT secret management system that balances security and user experience. The system supports multiple JWT secrets for graceful rotation. However, the current implementation lacks a clear policy for secret retention and cleanup.
+The dance-lessons-coach application requires a robust JWT secret management system that balances security and user experience. As implemented in [ADR-0009](0009-hybrid-testing-approach.md), the system supports multiple JWT secrets for graceful rotation. However, the current implementation lacks a clear policy for secret retention and cleanup.

 ### Current State

@@ -388,8 +386,8 @@ func maskSecret(secret string) string {

 ## References

+- [ADR-0009: Hybrid Testing Approach](0009-hybrid-testing-approach.md)
 - [ADR-0008: BDD Testing](0008-bdd-testing.md)
- [ADR-0018: User Management and Auth System](0018-user-management-auth-system.md)
 - [RFC 7519: JSON Web Tokens](https://tools.ietf.org/html/rfc7519)
 - [OWASP Key Management Cheat Sheet](https://cheatsheetseries.owasp.org/cheatsheets/Key_Management_Cheat_Sheet.html)

--- a/adr/0022-rate-limiting-cache-strategy.md
+++ b/adr/0022-rate-limiting-cache-strategy.md
@@ -3,8 +3,6 @@
 ## Status
 **Proposed** 🟡

-> **⚠️ Not yet implemented.** Gitea issue #13 ("feat: Implement Rate Limiting and Caching Strategy") is open and tracks this work. `go-cache`, `redis`, and `ulule/limiter` are absent from `go.mod`. The phase checkboxes below are corrected to reflect actual status.
-
 ## Context

 As the dance-lessons-coach application grows and potentially serves multiple users simultaneously, we need to implement rate limiting to:
@@ -286,38 +284,38 @@ func GetCacheKey(prefix, entityType, entityID string) string {
 ## Implementation Phases

 ### Phase 1: In-Memory Cache (Current Sprint)
- ❌ Research and select in-memory cache library
- ❌ Implement cache interface and in-memory service
- ❌ Add cache configuration to config package
- ❌ Implement basic cache operations (set, get, delete)
- ❌ Add TTL support and automatic cleanup
- ❌ Cache JWT validation results
- ❌ Add cache metrics and monitoring
+- ✅ Research and select in-memory cache library
+- ✅ Implement cache interface and in-memory service
+- ✅ Add cache configuration to config package
+- ✅ Implement basic cache operations (set, get, delete)
+- ✅ Add TTL support and automatic cleanup
+- ✅ Cache JWT validation results
+- ✅ Add cache metrics and monitoring

 ### Phase 2: Redis-Compatible Cache (Next Sprint)
- ❌ Set up Dragonfly/KeyDB in development environment
- ❌ Implement Redis cache service
- ❌ Add configuration for Redis connection
- ❌ Implement cache fallback strategy (Redis → in-memory)
- ❌ Add health checks for Redis connection
- ❌ Implement distributed cache invalidation
+- ✅ Set up Dragonfly/KeyDB in development environment
+- ✅ Implement Redis cache service
+- ✅ Add configuration for Redis connection
+- ✅ Implement cache fallback strategy (Redis → in-memory)
+- ✅ Add health checks for Redis connection
+- ✅ Implement distributed cache invalidation

 ### Phase 3: Rate Limiting (Following Sprint)
- ❌ Research and select rate limiting library
- ❌ Implement rate limiter service
- ❌ Add rate limit configuration
- ❌ Implement Chi middleware for rate limiting
- ❌ Add rate limit headers to responses
- ❌ Implement IP whitelisting
- ❌ Add endpoint-specific rate limits
+- ✅ Research and select rate limiting library
+- ✅ Implement rate limiter service
+- ✅ Add rate limit configuration
+- ✅ Implement Chi middleware for rate limiting
+- ✅ Add rate limit headers to responses
+- ✅ Implement IP whitelisting
+- ✅ Add endpoint-specific rate limits

 ### Phase 4: Advanced Features (Future)
- ❌ Cache warming for critical data
- ❌ Two-level caching (Redis + in-memory)
- ❌ Cache compression for large objects
- ❌ Rate limit exemptions for admin users
- ❌ Dynamic rate limit adjustment
- ❌ Cache analytics and usage patterns
+- ✅ Cache warming for critical data
+- ✅ Two-level caching (Redis + in-memory)
+- ✅ Cache compression for large objects
+- ✅ Rate limit exemptions for admin users
+- ✅ Dynamic rate limit adjustment
+- ✅ Cache analytics and usage patterns

 ## Configuration

--- a/adr/0023-config-hot-reloading.md
+++ b/adr/0023-config-hot-reloading.md
@@ -4,8 +4,6 @@
 * Deciders: Gabriel Radureau, AI Agent
 * Date: 2026-04-05

-> **⚠️ Not yet implemented.** No `ConfigManager` exists in `pkg/config/` and Viper's `WatchConfig()` is not wired up. However, `features/config/config_hot_reloading.feature` has been written — BDD scenarios exist for a feature that is not yet built. Those tests are expected to fail until implementation begins.
-
 ## Context and Problem Statement

 The dance-lessons-coach application currently loads configuration once at startup using Viper, which supports file-based configuration, environment variables, and defaults. However, the current implementation does not support runtime configuration changes without restarting the application.
--- a/adr/0024-bdd-test-organization-and-isolation.md
+++ b/adr/0024-bdd-test-organization-and-isolation.md
@@ -1,7 +1,7 @@
 # ADR 0024: BDD Test Organization and Isolation Strategy

 ## Status
-**Accepted** ✅
+**Proposed** 🟡

 ## Context

--- a/adr/0025-bdd-scenario-isolation-strategies.md
+++ b/adr/0025-bdd-scenario-isolation-strategies.md
@@ -1,344 +0,0 @@
-# ADR 0025: BDD Scenario Isolation Strategies
-
-## Status
-**Accepted (Partial)** 🟡
-
-Phase 1 (schema-per-scenario DB isolation + `ScenarioState` manager in `pkg/bdd/steps/scenario_state.go`) is implemented.
-Phase 2 (cache key prefix strategy, in-memory store `Reset()` methods) is pending — blocked on ADR-0022 (rate limiting/cache) not yet implemented.
-
-## Context
-
-As our BDD test suite grows, we're encountering **test pollution** issues where scenarios interfere with each other through shared state. This is particularly problematic for:
-
-1. **Database state**: Scenarios create users, JWT secrets, config entries that persist across scenarios
-2. **JWT secret rotation**: Multiple secrets accumulate, affecting subsequent scenario authentication
-3. **Config file modifications**: Feature flag changes persist between tests
-4. **Gherkin Background steps**: Data set up in Background is visible to all scenarios in the feature
-
-Our current approach clears database tables after each scenario, but this has **race condition vulnerabilities** with concurrent scenario execution.
-
-### Gherkin Background Consideration
-
-Crucially, Gherkin's `Background` section runs **before each scenario** in a feature, not once before all scenarios. This means:
-
-```gherkin
-Feature: User registration
-  Background:
-    Given the database is empty
-    And a default admin user exists
-  
-  Scenario: Register new user
-    When I register user "alice"
-    Then user "alice" should exist
-  
-  Scenario: Register duplicate user
-    When I register user "alice"
-    Then I should see error "user already exists"
-```
-
-The second scenario fails because Background creates data that persists, and the first scenario's data isn't cleaned up. Background steps are re-executed before each scenario.
-
-## Decision Drivers
-
-* **Isolation**: Each scenario must start with a clean slate
-* **Performance**: Cleanup must be fast enough for CI/CD pipelines
-* **Concurrency**: Must work with parallel scenario execution
-* **Compatibility**: Must work with Gherkin Background steps
-* **Maintainability**: Solution should be simple to understand and debug
-
-## Considered Options
-
-### Option 1: Transaction Rollback (Rejected ❌)
-
-Wrap each scenario in a database transaction, rollback at the end.
-
-```go
-BeforeScenario: BEGIN;
-AfterScenario: ROLLBACK;
-```
-
-**Pros:**
- Simple implementation
- Fast - transaction rollback is nearly instant
- No data cleanup needed
-
-**Cons:**
- ❌ **Fails if scenario commits**: Nested transaction problem - `COMMIT` inside scenario releases the transaction, parent `ROLLBACK` has no effect
- Cannot handle non-database state (JWT secrets in memory, config files)
- Doesn't solve JWT secret pollution
-
-**Verdict: Not viable** - Too many scenarios use database transactions internally.
-
---
-
-### Option 2: Clear Tables in Public Schema (Current ✅/⚠️)
-
-Delete all rows from all tables after each scenario.
-
-```go
-AfterScenario: DELETE FROM table1; DELETE FROM table2; ...
-```
-
-**Pros:**
- Currently implemented
- Works with any scenario code
- Handles database state
-
-**Cons:**
- ⚠️ **Race conditions**: Concurrent scenarios can interleave - Scenario A deletes data while Scenario B is still using it
- ⚠️ **Slow**: Must delete from all tables, reset sequences
- ❌ **Misses in-memory state**: JWT secrets, config changes persist
- ❌ **Doesn't handle Background**: Background data is shared across scenarios
-
-**Verdict: Partially adequate** - Works for sequential execution but has parallel execution issues.
-
---
-
-### Option 3: Schema-per-Scenario (Recommended ✅)
-
-Create a unique PostgreSQL schema for each scenario, drop it after.
-
-```go
-BeforeScenario:
-  schema := "test_" + sha256(scenario.Name)[:8]
-  CREATE SCHEMA schema;
-  SET search_path = schema, public;
-  
-AfterScenario:
-  DROP SCHEMA schema CASCADE;
-```
-
-**Pros:**
- ✅ **True isolation**: Each scenario has its own database namespace
- ✅ **Works with transactions**: Scenario can commit freely - entire schema is dropped
- ✅ **Works with Background**: Background runs in scenario's schema, data is isolated
- ✅ **Fast**: Schema drop is instant (just metadata deletion)
- ✅ **Handles concurrent scenarios**: Different schemas = no conflicts
-
-**Cons:**
- Requires `CREATE/DROP SCHEMA` database privileges in test environment
- Some ORMs may hardcode `public` schema - need to use `SET search_path` carefully
- Test DB must allow many schemas (typically fine for PostgreSQL)
- We need to handle `search_path` in connection pooling (each scenario needs its own connection)
-
-**Implementation notes:**
- Use `Luego` (PostgreSQL schema prefix) approach: `test_{hash}`
- Hash: `sha256(feature_name + scenario_name)[:8]` for consistency across runs
- Execute Background steps in the scenario's schema context
- Set `search_path` at the connection level, not globally
-
---
-
-### Option 4: Database-per-Feature ⚠️
-
-Create a separate database for each feature file.
-
-```go
-BeforeFeature: CREATE DATABASE feature_auth;
-AfterFeature: DROP DATABASE feature_auth;
-```
-
-**Pros:**
- Strong isolation between features
- Simple implementation
-
-**Cons:**
- ❌ **Doesn't isolate scenarios within a feature** - Background data shared across scenarios
- Database creation is slower than schema creation
- Harder to manage in CI (more databases to create/cleanup)
- Still need table clearing between scenarios within a feature
-
-**Verdict: Insufficient** - Doesn't solve intra-feature pollution.
-
---
-
-### Option 5: Schema-per-Feature + Table Clearing per Scenario ⚠️
-
-Create one schema per feature, clear tables between scenarios.
-
-```go
-BeforeFeature: CREATE SCHEMA feature_auth;
-AfterFeature: DROP SCHEMA feature_auth;
-AfterScenario: DELETE FROM all_tables;
-```
-
-**Pros:**
- Isolates features from each other
- Simpler than per-scenario schemas
-
-**Cons:**
- ❌ **Scenarios within a feature share state** - Background data persists
- Still has race conditions with concurrent scenarios in same feature
- Requires table clearing overhead
-
-**Verdict: Better than current but still has issues**.
-
---
-
-## Decision Outcome
-
-**Chosen option: Schema-per-Scenario + In-Memory State Reset + Per-Scenario Step State (Option 3 Enhanced)**
-
-We will implement schema-per-scenario because it:
-
-1. Provides **true isolation** for all database state
-2. **Works with Gherkin Background** - Background runs in each scenario's schema
-3. **Handles concurrent execution** - No race conditions
-4. **Works with scenario transactions** - Scenarios can commit freely
-5. Is **fast** - Schema operations are cheap
-
-**However, we discovered a critical limitation:** PostgreSQL schemas only isolate **database tables**. In-memory state (application-level caches, user stores, JWT secret managers) **persists across scenarios** because they're stored in the shared `sharedServer` Go instance. Schema isolation does NOT solve this.
-
-### Enhanced Strategy: Multi-Layer Isolation
-
-To achieve **complete scenario isolation**, we need a **3-layer approach:**
-
-| Layer | Component | Strategy | Status |
-|-------|-----------|----------|--------|
-| DB | PostgreSQL tables | Schema-per-scenario | ✅ Implemented |
-| Memory | Server-level state (JWT secrets) | Reset to initial state | ✅ Implemented |
-| Memory | Step-level state (tokens, user IDs) | Per-scenario state map | ✅ Implemented |
-| Memory | User store | Reset/clear between scenarios | ⚠️ TODO |
-| Memory | Auth cache | Reset/clear between scenarios | ⚠️ TODO |
-| Cache | Redis/Memcached | Key prefix with schema hash | ⚠️ TODO |
-
-### Layer 3: Per-Scenario Step State Isolation
-
-**New insight from test failures:** Step definition structs (AuthSteps, GreetSteps, etc.) maintain state in their fields:
- `lastToken`, `firstToken` in AuthSteps
- `lastUserID` in AuthSteps
-
-This state **spills across scenarios** even with schema isolation, because struct fields are shared across all scenarios in a test process.
-
-**Solution:** Create a `ScenarioState` manager with per-scenario isolation:
-
-```go
-type ScenarioState struct {
-    LastToken  string
-    FirstToken string
-    LastUserID uint
-}
-
-type scenarioStateManager struct {
-    mu      sync.RWMutex
-    states  map[string]*ScenarioState  // keyed by scenario hash
-}
-
-// Usage in step definitions:
-func (s *AuthSteps) iShouldReceiveAValidJWTToken() error {
-    state := steps.GetScenarioState(s.scenarioName)
-    state.LastToken = extractedToken
-    // ...
-}
-```
-
-**Benefits:**
- ✅ Zero code changes to step definitions (with helper functions)
- ✅ Thread-safe (sync.RWMutex)
- ✅ Consistent state per scenario
- ✅ Automatic cleanup via BeforeScenario/AfterScenario hooks
- ✅ Works with random test order
-
-**Status:** Implemented in `pkg/bdd/steps/scenario_state.go`
-
-### Key Insight: Cache and In-Memory Store Isolation
-
-**For caches (Redis, Memcached, in-process):**
- Use **schema hash as key prefix/suffix**: `cache_key_{schema_hash}` or `{schema_hash}_cache_key`
- This ensures each scenario gets isolated cache namespace
- Works even with external cache services
- Consistent with schema isolation philosophy
-
-**For in-memory stores (user repository, etc.):**
- Add `Reset()` methods that clear all state
- Call in `AfterScenario` alongside schema teardown
- Or use schema-prefix approach for shared stores
-
-### Alternative Approach: Background Explicit State Setup
-
-**Considered but rejected:** Adding explicit "Given no user X exists" steps or heavy Background sections.
-
-**Pros:** More readable, explicit about state
-**Cons:**
- Error-prone (must remember for every entity)
- Verbose (many Given steps)
- Doesn't scale with many entities
- Still has race conditions with concurrent scenarios
-
-**Verdict:** Automated cleanup (schema drop + memory reset) is more reliable than manual Background setup.
-
-### Implementation Plan
-
-**Phase 1: Foundation (✅ Complete)**
- Add scenario-aware schema management to test server
- Implement schema creation/drop in BeforeScenario/AfterScenario hooks
- Handle `search_path` configuration for each scenario's database connection
-
-**Phase 2: In-Memory State Reset (🟡 TODO)**
- Add `ResetUsers()` method to clear in-memory user store
- Add `ResetCache()` method for auth/rateLimiting caches
- Call these in AfterScenario alongside JWT secret reset
- **Cache key strategy**: `key_{schema_hash}` for all cache operations
-
-**Phase 3: Connection Pooling**
- Configure connection pool to respect per-scenario `search_path`
- Each scenario gets isolated connections
-
-**Phase 4: Validation**
- Run full test suite to verify complete isolation
- Fix any hardcoded `public` schema references
-
-### Schema Naming Convention
-
-```
-Schema name: test_{sha256(feature:scenario)[:8]}
-Cache key prefix: {sha256(feature:scenario)[:8]}_
-```
-
-Example:
- Feature: `auth`, Scenario: `Successful user authentication`
- Hash: `sha256("auth:Successful user authentication")[:8]` = `a3f7b2c1`
- Schema: `test_a3f7b2c1`
- Cache key: `a3f7b2c1_user:newuser` instead of just `user:newuser`
-
-Benefits:
- Unique per scenario
- Consistent across test runs (same scenario = same hash)
- Short (8 chars) - efficient for cache keys
- Identifiable for debugging
-
-### Schema Naming Convention
-
-```
-Schema name: test_{sha256(feature + scenario)[:8]}
-```
-
-Example:
- Feature: `auth`, Scenario: `Successful user authentication`
- Hash: `sha256("auth_Successful user authentication")[:8]` = `a3f7b2c1`
- Schema: `test_a3f7b2c1`
-
-Benefits:
- Unique per scenario
- Consistent across test runs (same scenario = same schema)
- Short (8 chars + prefix = 14 chars max)
- Identifiable for debugging
-
-## Pros and Cons Summary
-
-| Aspect | Schema-per-Scenario | Current (Clear Tables) | Transaction Rollback |
-|--------|---------------------|----------------------|-------------------|
-| Isolation | ✅ Strong | ⚠️ Medium | ❌ Weak |
-| Works with Background | ✅ Yes | ⚠️ Partial | ❌ No |
-| Concurrency safe | ✅ Yes | ❌ No | ❌ No |
-| Works with TX | ✅ Yes | ✅ Yes | ❌ No |
-| Speed | ✅ Fast | ⚠️ Slow | ✅ Fast |
-| DB privileges | ⚠️ Needs CREATE | ✅ None | ✅ None |
-| Complexity | ⚠️ Medium | ✅ Low | ✅ Low |
-
-## Links
-
-* [ADR 0008: BDD Testing](adr/0008-bdd-testing.md) - Original BDD adoption decision
-* [ADR 0024: BDD Test Organization and Isolation](adr/0024-bdd-test-organization-and-isolation.md) - Feature isolation strategy
-* [Godog Documentation](https://github.com/cucumber/godog) - BDD framework specifics
-* [PostgreSQL Schemas](https://www.postgresql.org/docs/current/ddl-schemas.html) - Schema management
--- a/adr/README.md
+++ b/adr/README.md
@@ -13,24 +13,23 @@ This directory contains Architecture Decision Records (ADRs) for the dance-lesso
 | 0005 | Graceful Shutdown | ✅ Accepted |
 | 0006 | Configuration Management | ✅ Accepted |
 | 0007 | OpenTelemetry Integration | ✅ Accepted |
-| 0008 | BDD Testing with Godog | ✅ Accepted (structure superseded by 0024) |
-| 0009 | BDD Testing with OpenAPI Documentation | ✅ Accepted |
-| 0010 | API v2 Feature Flag | ✅ Accepted |
-| 0011 | Validation Library (go-playground/validator) | ✅ Accepted |
-| 0012 | Git Hooks: Staged-Only Formatting | ✅ Accepted |
-| 0013 | OpenAPI/Swagger Toolchain (swaggo/swag) | ✅ Accepted |
-| 0014 | gRPC Adoption Strategy | ❌ Rejected / Deferred |
-| 0015 | CLI Subcommands with Cobra | ✅ Accepted |
-| 0016 | CI/CD Pipeline Design | ✅ Accepted |
-| 0017 | Trunk-Based Development Workflow | ✅ Accepted |
-| 0018 | User Management and Auth System | ✅ Accepted |
-| 0019 | PostgreSQL Integration | ✅ Accepted (SQLite cleanup pending) |
-| 0020 | Docker Build Strategy | ✅ Accepted |
-| 0021 | JWT Secret Retention Policy | 🟡 Proposed (base JWT done; cleanup job not implemented) |
-| 0022 | Rate Limiting and Cache Strategy | 🟡 Proposed (not implemented — Gitea issue #13) |
-| 0023 | Config Hot Reloading | 🟡 Proposed (not implemented) |
-| 0024 | BDD Test Organization and Isolation | ✅ Accepted |
-| 0025 | BDD Scenario Isolation Strategies | ✅ Accepted (Partial — Phase 2 pending ADR-0022) |
+| 0008 | BDD Testing | ✅ Accepted |
+| 0009 | Hybrid Testing Approach | ✅ Accepted |
+| 0010 | CI/CD Pipeline Design | ✅ Accepted |
+| 0011 | Trunk-Based Development | ✅ Accepted |
+| 0012 | Commit Message Conventions | ✅ Accepted |
+| 0013 | Version Management Lifecycle | ✅ Accepted |
+| 0014 | Swagger Documentation | ✅ Accepted |
+| 0015 | Rate Limiting Strategy | ✅ Accepted |
+| 0016 | Cache Invalidation Strategy | ✅ Accepted |
+| 0017 | JWT Secret Rotation | ✅ Accepted |
+| 0018 | Configuration Hot Reloading | ✅ Accepted |
+| 0019 | BDD Feature Structure | ✅ Accepted |
+| 0020 | Database Migration Strategy | ✅ Accepted |
+| 0021 | API Versioning Strategy | ✅ Accepted |
+| 0022 | Rate Limiting and Cache Strategy | ✅ Accepted |
+| 0023 | Config Hot Reloading | 🟡 Proposed |
+| 0024 | BDD Test Organization and Isolation | 🟡 Proposed |

 ## What is an ADR?

@@ -96,24 +95,22 @@ Chosen option: "[Option 1]" because [justification]
 * [0005-graceful-shutdown.md](0005-graceful-shutdown.md) - Implement graceful shutdown with readiness endpoints
 * [0006-configuration-management.md](0006-configuration-management.md) - Use Viper for configuration management
 * [0007-opentelemetry-integration.md](0007-opentelemetry-integration.md) - Integrate OpenTelemetry for distributed tracing
-* [0008-bdd-testing.md](0008-bdd-testing.md) - Adopt BDD with Godog for behavioral testing (structure superseded by 0024)
-* [0009-hybrid-testing-approach.md](0009-hybrid-testing-approach.md) - BDD testing with OpenAPI documentation (SDK layer deferred)
+* [0008-bdd-testing.md](0008-bdd-testing.md) - Adopt BDD with Godog for behavioral testing
+* [0009-hybrid-testing-approach.md](0009-hybrid-testing-approach.md) - Combine BDD and Swagger-based testing
 * [0010-api-v2-feature-flag.md](0010-api-v2-feature-flag.md) - API v2 implementation with feature flag control
 * [0011-validation-library-selection.md](0011-validation-library-selection.md) - Selection of go-playground/validator for input validation
 * [0012-git-hooks-staged-only-formatting.md](0012-git-hooks-staged-only-formatting.md) - Git hooks format only staged Go files
-* [0013-openapi-swagger-toolchain.md](0013-openapi-swagger-toolchain.md) - OpenAPI/Swagger documentation with swaggo/swag
-* [0014-grpc-adoption-strategy.md](0014-grpc-adoption-strategy.md) - gRPC adoption strategy (rejected/deferred)
+* [0013-openapi-swagger-toolchain.md](0013-openapi-swagger-toolchain.md) - ✅ OpenAPI/Swagger documentation with swaggo/swag (Implemented)
+* [0014-grpc-adoption-strategy.md](0014-grpc-adoption-strategy.md) - Hybrid REST/gRPC adoption strategy
 * [0015-cli-subcommands-cobra.md](0015-cli-subcommands-cobra.md) - Cobra CLI framework adoption
 * [0016-ci-cd-pipeline-design.md](0016-ci-cd-pipeline-design.md) - CI/CD pipeline architecture
 * [0017-trunk-based-development-workflow.md](0017-trunk-based-development-workflow.md) - Trunk-based development workflow
 * [0018-user-management-auth-system.md](0018-user-management-auth-system.md) - User management and authentication system
 * [0019-postgresql-integration.md](0019-postgresql-integration.md) - PostgreSQL database integration
 * [0020-docker-build-strategy.md](0020-docker-build-strategy.md) - Docker Build Strategy: Traditional vs Buildx
-* [0021-jwt-secret-retention-policy.md](0021-jwt-secret-retention-policy.md) - JWT Secret Retention Policy (base JWT done; cleanup job proposed)
-* [0022-rate-limiting-cache-strategy.md](0022-rate-limiting-cache-strategy.md) - Rate Limiting and Cache Strategy (not yet implemented — issue #13)
-* [0023-config-hot-reloading.md](0023-config-hot-reloading.md) - Config Hot Reloading Strategy (not yet implemented)
-* [0024-bdd-test-organization-and-isolation.md](0024-bdd-test-organization-and-isolation.md) - BDD test modular organisation by domain
-* [0025-bdd-scenario-isolation-strategies.md](0025-bdd-scenario-isolation-strategies.md) - Schema-per-scenario isolation for BDD tests (partial)
+* [0021-jwt-secret-retention-policy.md](0021-jwt-secret-retention-policy.md) - JWT Secret Retention Policy with Configurable TTL and Retention
+* [0022-rate-limiting-cache-strategy.md](0022-rate-limiting-cache-strategy.md) - Rate Limiting and Cache Strategy with Multi-Phase Implementation
+* [0023-config-hot-reloading.md](0023-config-hot-reloading.md) - Config Hot Reloading Strategy

 ## How to Add a New ADR

--- a/cmd/server/main.go
+++ b/cmd/server/main.go
@@ -48,10 +48,8 @@ func main() {
 		log.Fatal().Err(err).Msg("Failed to load configuration")
 	}

-	// Create readiness context to control readiness state.
-	// CancelableContext exposes Cancel() so that Server.Run() can cancel
-	// readiness at the start of graceful shutdown (before the propagation sleep).
-	readyCtx, readyCancel := server.NewCancelableContext(context.Background())
+	// Create readiness context to control readiness state
+	readyCtx, readyCancel := context.WithCancel(context.Background())
 	defer readyCancel()

 	// Create and run server
@@ -59,5 +57,4 @@ func main() {
 	if err := server.Run(); err != nil {
 		log.Fatal().Err(err).Msg("Server failed")
 	}
-	log.Trace().Msg("Server exited")
 }
--- a/documentation/API.md
+++ b/documentation/API.md
@@ -1,158 +0,0 @@
-# API Endpoints
-
-REST API reference for `dance-lessons-coach`. Extracted from the original `AGENTS.md` (Tâche 6 restructure) for lazy-loading compatibility with Mistral Vibe.
-
-## Base URL
-
-```
-http://localhost:8080
-```
-
-## OpenAPI Documentation
-
- **Swagger UI:** `http://localhost:8080/swagger/`
- **OpenAPI Spec:** `http://localhost:8080/swagger/doc.json`
-
-The API provides interactive documentation using Swagger UI with complete OpenAPI 2.0 specification. All endpoints, request/response models, and validation rules are documented using a **hierarchical tagging system**.
-
-**Features:**
-
- Interactive API exploration with hierarchical organization
- Try-it-out functionality for all endpoints
- Model schemas with examples
- Response examples with validation rules
- Hierarchical tag structure for better navigation
-
-**Generation:** Documentation is auto-generated from code annotations using [swaggo/swag](https://github.com/swaggo/swag) with the command:
-
-```bash
-go generate ./pkg/server/
-```
-
-**Tag Organization:**
-
- `API/v1/Greeting` — Version 1 greeting endpoints
- `API/v2/Greeting` — Version 2 greeting endpoints
- `System/Health` — Health and readiness endpoints
-
-**Hierarchical Benefits:**
-
- Clear separation between API domains (API vs System)
- Version organization within each domain
- Natural hierarchy in Swagger UI
- Scalable for future API growth
-
-**Embedded Documentation:** The OpenAPI spec is embedded in the binary using Go's `//go:embed` directive for single-binary deployment.
-
---
-
-## Health Check
-
-```http
-GET /api/health
-```
-
-**Response:**
-
-```json
-{"status":"healthy"}
-```
-
-## Version Info
-
-```http
-GET /api/version
-GET /api/version?format=plain
-GET /api/version?format=full
-GET /api/version?format=json
-```
-
-Returns the running binary version (injected at build time via `-ldflags`). The `format` query parameter controls the response shape:
-
- `format=plain` (or `?format=short`): plain text version (e.g. `1.0.0`)
- `format=full`: detailed multi-line text (Version, Commit, Built date, Go version)
- `format=json` (default): structured JSON `{"version": "1.0.0", "commit": "abc1234", "built": "...", "go_version": "go1.26.1"}`
-
-## Readiness Check
-
-```http
-GET /api/ready
-```
-
-**Responses:**
-
- Normal operation: `{"ready":true}` (HTTP 200)
- During shutdown: `{"ready":false}` (HTTP 503 Service Unavailable)
-
-**Purpose:** Indicates whether the server is ready to accept new requests. Returns false during graceful shutdown to allow existing requests to complete while preventing new ones.
-
-## Greet Service v1
-
-```http
-GET /api/v1/greet/
-GET /api/v1/greet/{name}
-```
-
-**Examples:**
-
-```bash
-# Default greeting
-curl http://localhost:8080/api/v1/greet/
-# Response: {"message":"Hello world!"}
-
-# Personalized greeting
-curl http://localhost:8080/api/v1/greet/John
-# Response: {"message":"Hello John!"}
-
-# Another example
-curl http://localhost:8080/api/v1/greet/Alice
-# Response: {"message":"Hello Alice!"}
-```
-
-## Greet Service v2 (Feature-flagged)
-
-```http
-POST /api/v2/greet
-```
-
-**Request Body:**
-
-```json
-{
-  "name": "John"
-}
-```
-
-**Examples:**
-
-```bash
-# Valid request
-curl -X POST http://localhost:8080/api/v2/greet \
-  -H "Content-Type: application/json" \
-  -d '{"name":"John"}'
-# Response: {"message":"Hello my friend John!"}
-
-# Empty name (valid, returns default)
-curl -X POST http://localhost:8080/api/v2/greet \
-  -H "Content-Type: application/json" \
-  -d '{"name":""}'
-# Response: {"message":"Hello my friend!"}
-
-# Missing name field (valid, returns default)
-curl -X POST http://localhost:8080/api/v2/greet \
-  -H "Content-Type: application/json" \
-  -d '{}'
-# Response: {"message":"Hello my friend!"}
-
-# Name too long (validation error)
-curl -X POST http://localhost:8080/api/v2/greet \
-  -H "Content-Type: application/json" \
-  -d '{"name":"ThisNameIsWayTooLongAndShouldFailValidationBecauseItExceedsTheMaximumAllowedLengthOf100Characters!!!!"}'
-# Response: {"error":"validation_failed","message":"Invalid request data","details":[{"message":"Name failed validation for 'max' (parameter: 100)"}]}
-```
-
-**Validation Rules:**
-
- `name`: Maximum length 100 characters (optional field)
-
-**Feature Flag:** Enable with `DLC_API_V2_ENABLED=true` or in config file with `api.v2_enabled: true`.
--- a/documentation/CLI.md
+++ b/documentation/CLI.md
@@ -1,251 +0,0 @@
-# CLI Management Guide
-
-Complete reference for the `dance-lessons-coach` CLI, server lifecycle, and configuration. Extracted from the original `AGENTS.md` (Tâche 6 restructure) for lazy-loading compatibility with Mistral Vibe.
-
-## Cobra CLI (Recommended)
-
-`dance-lessons-coach` includes a modern CLI built with Cobra:
-
-```bash
-# Show help and available commands
-./bin/dance-lessons-coach --help
-
-# Show version information
-./bin/dance-lessons-coach version
-
-# Greet someone by name
-./bin/dance-lessons-coach greet John
-
-# Start the server
-./bin/dance-lessons-coach server
-```
-
-**Available Commands:**
-
- `version` — Print version information
- `server` — Start the dance-lessons-coach server
- `greet [name]` — Greet someone by name
- `help` — Built-in help system
- `completion` — Generate shell completion scripts
-
-**Server Command Flags:**
-
- `--config` — Config file path
- `--env` — Environment (`dev`, `staging`, `prod`)
- `--debug` — Enable debug logging
-
-## Version Information
-
-The server provides runtime version information:
-
-```bash
-# Check version using new CLI
-./bin/dance-lessons-coach version
-
-# Check version using server binary
-./bin/server --version
-
-# Output:
-dance-lessons-coach Version Information:
-  Version:   1.0.0
-  Commit:    abc1234
-  Built:     2026-04-05T10:00:00+0000
-  Go:        go1.26.1
-```
-
-For full version management workflow (bump, release, build with version), see [`version-management-guide.md`](version-management-guide.md).
-
-## Server Control Script
-
-A shell script manages the server lifecycle:
-
-```bash
-cd /Users/gabrielradureau/Work/Vibe/DanceLessonsCoach
-
-./scripts/start-server.sh start     # Start the server
-./scripts/start-server.sh status    # Check server status
-./scripts/start-server.sh test      # Test API endpoints
-./scripts/start-server.sh logs      # View server logs
-./scripts/start-server.sh stop      # Stop the server
-./scripts/start-server.sh restart   # Restart
-```
-
-**Available subcommands:**
-
- `start` — Start the server in background with proper logging
- `stop` — Stop the server gracefully
- `restart` — Restart the server
- `status` — Check if server is running
- `logs` — Show recent server logs
- `test` — Test all API endpoints
-
-## Manual Server Management
-
-For direct control:
-
-```bash
-cd /Users/gabrielradureau/Work/Vibe/DanceLessonsCoach
-./scripts/start-server.sh start
-```
-
-**Expected output:**
-
-```
-Server running on :8080
-[INF] Starting HTTP server on :8080
-[TRC] Registering greet routes
-[TRC] Greet routes registered
-```
-
-**Features:**
-
- Context-aware server initialization
- Graceful shutdown handling
- Signal-based termination (`SIGINT`, `SIGTERM`)
- 30-second shutdown timeout
- Proper resource cleanup
-
-## Configuration
-
-Configuration via environment variables with `DLC_` prefix:
-
-| Option | Environment Variable | Default | Description |
-|---|---|---|---|
-| Host | `DLC_SERVER_HOST` | `0.0.0.0` | Server bind address |
-| Port | `DLC_SERVER_PORT` | `8080` | Server listening port |
-| Shutdown Timeout | `DLC_SHUTDOWN_TIMEOUT` | `30s` | Graceful shutdown timeout |
-| JSON Logging | `DLC_LOGGING_JSON` | `false` | Enable JSON format logging |
-| Log Output | `DLC_LOGGING_OUTPUT` | `""` | Log output file path (empty for stderr) |
-
-**Examples:**
-
-```bash
-# Custom port
-export DLC_SERVER_PORT=9090
-./scripts/start-server.sh start
-
-# Custom host and port
-export DLC_SERVER_HOST="127.0.0.1"
-export DLC_SERVER_PORT=8081
-./scripts/start-server.sh start
-
-# Custom shutdown timeout
-export DLC_SHUTDOWN_TIMEOUT=45s
-
-# Enable JSON logging
-export DLC_LOGGING_JSON=true
-
-# Log to file
-export DLC_LOGGING_OUTPUT="server.log"
-
-# Combined: JSON logging to file
-export DLC_LOGGING_JSON=true
-export DLC_LOGGING_OUTPUT="server.json.log"
-```
-
-**Configuration File Support:**
-
-A `config.example.yaml` file is provided as a template. By default, the application looks for `config.yaml` in the current working directory.
-
-To specify a custom config file path, set the `DLC_CONFIG_FILE` environment variable:
-
-```bash
-DLC_CONFIG_FILE="/path/to/config.yaml" go run ./cmd/server
-```
-
-Example `config.yaml`:
-
-```yaml
-server:
-  host: "0.0.0.0"
-  port: 8080
-
-shutdown:
-  timeout: 30s
-
-logging:
-  json: false
-```
-
-**Configuration Loading Precedence:**
-
-1. **File-based configuration** (highest precedence)
-2. **Environment variables** (override defaults, overridden by config file)
-3. **Default values** (fallback)
-
-All configuration is validated on startup. Invalid configurations cause server startup failure. Configuration values and source are logged at startup.
-
-**Verification:**
-
-```bash
-DLC_SERVER_PORT=9090 DLC_SERVER_HOST="127.0.0.1" ./scripts/start-server.sh start
-
-curl http://127.0.0.1:9090/api/health
-# Expected: {"status":"healthy"}
-```
-
-## Server Status
-
-```bash
-# Check health endpoint
-curl -s http://localhost:8080/api/health
-
-# Check readiness endpoint
-curl -s http://localhost:8080/api/ready
-```
-
-**Expected responses:**
-
- Health: `{"status":"healthy"}`
- Readiness (normal): `{"ready":true}`
- Readiness (during shutdown): `{"ready":false}` (HTTP 503)
-
-**Endpoint Differences:**
-
- **Health endpoint** (`/api/health`): Indicates if the application is running and functional
- **Readiness endpoint** (`/api/ready`): Indicates if the application is ready to accept traffic
-
-**Use Cases:**
-
- **Health**: Used by load balancers to check if the app is alive
- **Readiness**: Used by Kubernetes / service meshes to determine if the app can accept new requests
-
-**During Graceful Shutdown:**
-
- Health endpoint continues to return `{"status":"healthy"}`
- Readiness endpoint returns `{"ready":false}` with HTTP 503 Service Unavailable
- This allows existing requests to complete while preventing new requests
-
-## Stopping the Server
-
-To stop the server gracefully:
-
-```bash
-# Send SIGTERM for graceful shutdown
-kill -TERM $(lsof -ti :8080)
-
-# Or send SIGINT (Ctrl+C equivalent)
-pkill -INT -f "go run"
-```
-
-**Graceful shutdown process:**
-
-1. Server receives termination signal
-2. Logs shutdown message
-3. Stops accepting new connections
-4. Waits up to 30 seconds for active requests to complete
-5. Closes all connections cleanly
-6. Exits with proper cleanup
-
-For force stop (if graceful shutdown hangs):
-
-```bash
-kill -9 $(lsof -ti :8080)
-```
-
-**Verification:**
-
-```bash
-curl -s http://localhost:8080/api/health
-# Should return connection refused
-```
--- a/documentation/CODE_EXAMPLES.md
+++ b/documentation/CODE_EXAMPLES.md
@@ -1,59 +0,0 @@
-# Code Examples
-
-Snippets and patterns used across the `dance-lessons-coach` codebase. Extracted from the original `AGENTS.md` (Tâche 6 restructure).
-
-## Adding a New API Endpoint
-
-```go
-// 1. Add to interface
-func (h *apiV1GreetHandler) RegisterRoutes(router chi.Router) {
-    router.Get("/", h.handleGreetQuery)
-    router.Get("/{name}", h.handleGreetPath)
-    router.Post("/custom", h.handleCustomGreet) // New endpoint
-}
-
-// 2. Implement handler
-func (h *apiV1GreetHandler) handleCustomGreet(w http.ResponseWriter, r *http.Request) {
-    // Parse request
-    // Call service
-    // Return JSON response
-}
-```
-
-## Logging with Zerolog
-
-```go
-// Trace level logging
-log.Trace().Ctx(ctx).Str("key", "value").Msg("message")
-
-// Info level
-log.Info().Msg("Important event")
-
-// Error level
-log.Error().Err(err).Msg("Error occurred")
-```
-
-For the full logging strategy (when to use Trace vs Info, performance considerations), see [ADR-0003 — Zerolog Logging](../adr/0003-zerolog-logging.md).
-
-## Using `context.Context`
-
-```go
-// Pass context through calls
-func handler(w http.ResponseWriter, r *http.Request) {
-    result := service.Greet(r.Context(), "John")
-    // ...
-}
-
-// Create context with values
-ctx := context.WithValue(r.Context(), "key", "value")
-
-// Create context with timeout
-ctx, cancel := context.WithTimeout(r.Context(), 5*time.Second)
-defer cancel()
-```
-
-For the rationale behind context-aware services, see [ADR-0004 — Interface-Based Design](../adr/0004-interface-based-design.md).
-
-## Best Practices Reminders
-
-For higher-level guidance on code organization, error handling, performance, and testing, see [`AGENT_USAGE_GUIDE.md`](AGENT_USAGE_GUIDE.md#best-practices) section "Best Practices".
--- a/documentation/HISTORY.md
+++ b/documentation/HISTORY.md
@@ -1,83 +0,0 @@
-# Development History
-
-This document records the historical development phases of `dance-lessons-coach`. Extracted from the original `AGENTS.md` (Tâche 6 restructure) for lazy-loading compatibility with Mistral Vibe (128k context).
-
-All phases below are **completed** ✅. They are kept here for traceability and onboarding context — refer to ADRs (`adr/`) for the technical decisions behind each phase.
-
-## Phase 1: Foundation
-
- Go 1.26.1 environment setup
- Project structure with `cmd/` and `pkg/` directories
- Core Greet service implementation
- CLI interface
- Unit tests
-
-## Phase 2: Web API
-
- Chi router integration
- Versioned API endpoints (`/api/v1`)
- Health endpoint (`/api/health`)
- JSON responses with proper headers
-
-## Phase 3: Logging & Architecture
-
- Zerolog integration with Trace level
- Context-aware logging
- Interface-based design patterns
- Dependency injection
-
-## Phase 4: Documentation & Testing
-
- Comprehensive `AGENTS.md`
- `README.md` with usage instructions
- Server management guide
- API endpoint documentation
-
-## Phase 5: Configuration Management
-
- Viper integration for configuration
- Environment variable support with `DLC_` prefix
- Customizable server host/port
- Configurable shutdown timeout
- Configuration validation and logging
- Example configuration file
-
-## Phase 6: Graceful Shutdown
-
- Context-aware server initialization
- Signal-based termination (`SIGINT`, `SIGTERM`)
- Configurable shutdown timeout
- Readiness endpoint for Kubernetes/service mesh integration
- Proper resource cleanup during shutdown
- Health endpoint remains healthy during graceful shutdown
-
-## Phase 7: OpenTelemetry Integration
-
- OpenTelemetry Go libraries integration
- Jaeger compatibility for distributed tracing
- Middleware-only approach using `otelhttp.NewHandler`
- Configurable sampling strategies
- Graceful shutdown of tracer provider
- OTLP exporter with gRPC support
-
-## Phase 8: Build System & Documentation
-
- Build script for binary compilation
- Binary output to `bin/` directory
- Comprehensive commit conventions with gitmoji reference
- Updated documentation with Jaeger integration guide
- Cleaned up configuration files
- Enhanced logging configuration with file output support
-
-## Phase 9: Final Refinements
-
- Removed unnecessary `time.Sleep` for log flushing
- Changed server operational logs from Info to Trace level
- Moved all logging setup logic to config package
- Simplified server entrypoint to 27 lines
- Verified all functionality with comprehensive testing
- Updated documentation to reflect final architecture
-
-## Beyond Phase 9
-
-Subsequent work (CI/CD, BDD scenarios, ADR audit, JWT, config hot-reloading) is tracked in the [Changelog](../CHANGELOG.md) and the corresponding [ADRs](../adr/).
--- a/documentation/OBSERVABILITY.md
+++ b/documentation/OBSERVABILITY.md
@@ -1,94 +0,0 @@
-# Observability — OpenTelemetry & Jaeger Integration
-
-Tracing setup for `dance-lessons-coach`. Extracted from the original `AGENTS.md` (Tâche 6 restructure) for lazy-loading compatibility with Mistral Vibe.
-
-The application supports OpenTelemetry for distributed tracing with Jaeger compatibility.
-
-## Configuration
-
-Enable OpenTelemetry in your `config.yaml`:
-
-```yaml
-telemetry:
-  enabled: true
-  otlp_endpoint: "localhost:4317"
-  service_name: "dance-lessons-coach"
-  insecure: true
-  sampler:
-    type: "parentbased_always_on"
-    ratio: 1.0
-```
-
-Or via environment variables:
-
-```bash
-export DLC_TELEMETRY_ENABLED=true
-export DLC_TELEMETRY_OTLP_ENDPOINT="localhost:4317"
-export DLC_TELEMETRY_SERVICE_NAME="dance-lessons-coach"
-export DLC_TELEMETRY_INSECURE=true
-export DLC_TELEMETRY_SAMPLER_TYPE="parentbased_always_on"
-export DLC_TELEMETRY_SAMPLER_RATIO=1.0
-```
-
-## Testing with Jaeger
-
-**1. Start Jaeger in Docker:**
-
-```bash
-docker run -d --name jaeger \
-  -e COLLECTOR_OTLP_ENABLED=true \
-  -p 16686:16686 \
-  -p 4317:4317 \
-  jaegertracing/all-in-one:latest
-```
-
-**2. Start the server with OpenTelemetry enabled:**
-
-```bash
-# Using config file
-./scripts/start-server.sh start
-
-# Or with environment variables
-DLC_TELEMETRY_ENABLED=true ./scripts/start-server.sh start
-```
-
-**3. Make API requests:**
-
-```bash
-curl http://localhost:8080/api/v1/greet/John
-```
-
-**4. View traces in Jaeger UI:**
-
-Open http://localhost:16686 and select the `dance-lessons-coach` service.
-
-## Sampler Types
-
-| Sampler | Behavior |
-|---|---|
-| `always_on` | Sample all traces |
-| `always_off` | Sample no traces |
-| `traceidratio` | Sample based on trace ID ratio |
-| `parentbased_always_on` | Sample based on parent span (always on) |
-| `parentbased_always_off` | Sample based on parent span (always off) |
-| `parentbased_traceidratio` | Sample based on parent span with ratio |
-
-## Testing Script
-
-A convenience script is provided:
-
-```bash
-./scripts/test-opentelemetry.sh
-```
-
-This script:
-
-1. Starts Jaeger container
-2. Starts the server with OpenTelemetry
-3. Makes test API calls
-4. Shows Jaeger UI URL
-5. Cleans up on exit
-
-## ADR Reference
-
-See [ADR-0007 — OpenTelemetry Integration](../adr/0007-opentelemetry-integration.md) for the full architectural decision and rationale (middleware-only approach, sampling strategy, OTLP/gRPC choice).
--- a/documentation/ROADMAP.md
+++ b/documentation/ROADMAP.md
@@ -1,40 +0,0 @@
-# Roadmap & Future Enhancements
-
-Tracking pending features and architectural improvements. Extracted from the original `AGENTS.md` (Tâche 6 restructure). Status updated continuously — items move to "Completed Features" section once shipped.
-
-## Potential Features
-
- [ ] Database integration
- [ ] Authentication / Authorization
- [ ] Rate limiting
- [ ] Metrics and monitoring
- [ ] Docker containerization
- ✅ CI/CD pipeline ([ADR-0016](../adr/0016-ci-cd-pipeline-design.md), [ADR-0017](../adr/0017-trunk-based-development-workflow.md))
- [ ] Configuration hot reload
- [ ] Circuit breakers
-
-## Architectural Improvements
-
- [ ] Request validation middleware
- ✅ OpenAPI / Swagger documentation with embedded spec
- [ ] Enhanced OpenTelemetry instrumentation
- [ ] Metrics collection and visualization
- [ ] Health check improvements
- [ ] Configuration validation enhancements
-
-## Completed Features
-
- ✅ Graceful shutdown with readiness endpoint
- ✅ OpenTelemetry integration with Jaeger support
- ✅ Configuration management with Viper
- ✅ Comprehensive logging with Zerolog
- ✅ Build system with binary output
- ✅ Complete documentation with commit conventions
- ✅ Version management with runtime info
-
-## How to Propose a New Feature
-
-1. Open a Gitea issue describing the use case and acceptance criteria
-2. If the feature implies an architectural decision, draft an ADR (`adr/<NNNN>-<slug>.md`) following the template
-3. Reference the ADR + issue in any PR introducing the feature
-4. Update this roadmap (move from "Potential" to "Completed" when shipped)
--- a/documentation/TROUBLESHOOTING.md
+++ b/documentation/TROUBLESHOOTING.md
@@ -1,107 +0,0 @@
-# Troubleshooting
-
-Common issues and their resolution. Extracted from the original `AGENTS.md` and merged with relevant sections from `AGENT_USAGE_GUIDE.md` and `BDD_GUIDE.md`. Refer back to those guides for context-specific troubleshooting (agent workflows, BDD test failures).
-
-## Port Already in Use
-
-```bash
-# Find and kill process using port 8080
-kill -TERM $(lsof -ti :8080)
-
-# Force kill if graceful does not work
-kill -9 $(lsof -ti :8080)
-```
-
-## Server Not Responding
-
-```bash
-# Check if running
-curl -s http://localhost:8080/api/health
-
-# Restart server using control script
-./scripts/start-server.sh restart
-
-# View recent logs
-./scripts/start-server.sh logs
-```
-
-If health endpoint returns connection refused, the server may have crashed. Check logs in `./scripts/start-server.sh logs` for stack traces.
-
-## Dependency Issues
-
-```bash
-# Clean and rebuild
-go mod tidy
-go build ./...
-
-# If dependency version conflicts persist
-go mod download
-go mod verify
-```
-
-## Tests Failing
-
-### Unit tests
-
-```bash
-# Run with verbose output
-go test -v ./...
-
-# Check specific test
-go test ./pkg/greet/ -run TestName
-```
-
-### BDD tests
-
-See [`BDD_GUIDE.md`](BDD_GUIDE.md) for the full BDD troubleshooting workflow (Godog setup, scenario isolation, step matching). Common BDD issues:
-
- **Step not found** → check `pkg/bdd/steps/` for the step definition file
- **Scenario state leaking** → review [ADR-0025](../adr/0025-bdd-scenario-isolation-strategies.md) for the isolation pattern
- **Database not reset** → ensure the test fixtures cleanup runs (BDD scenario After hooks)
-
-## Configuration Not Loading
-
-The application logs the configuration source at startup. Check logs for:
-
-```
-[INF] Configuration loaded from: file:config.yaml
-# or
-[INF] Configuration loaded from: env
-# or
-[INF] Configuration loaded from: defaults
-```
-
-If config is not loading as expected:
-
-1. Verify file exists and is readable: `ls -la config.yaml`
-2. Verify env vars are exported: `env | grep DLC_`
-3. Check for typos in keys (case-sensitive)
-4. Review [`AGENT_USAGE_GUIDE.md`](AGENT_USAGE_GUIDE.md) section "Configuration troubleshooting"
-
-## OpenTelemetry Not Tracing
-
-1. Verify Jaeger is running: `docker ps | grep jaeger`
-2. Check `DLC_TELEMETRY_ENABLED=true` in environment or `telemetry.enabled: true` in config
-3. Verify OTLP endpoint reachable: `nc -zv localhost 4317`
-4. Check sampler is not `always_off`
-5. See [`OBSERVABILITY.md`](OBSERVABILITY.md) for full setup
-
-## Build Failures
-
-```bash
-# Clear caches
-go clean -cache -modcache
-go mod download
-
-# Rebuild
-go build ./...
-```
-
-If errors persist, see [`local-ci-cd-testing.md`](local-ci-cd-testing.md) for the CI/CD pipeline that mirrors the production build.
-
-## Where to Look Next
-
- **Agent-specific issues** (vibe, mistral, programmer agent) → [`AGENT_USAGE_GUIDE.md`](AGENT_USAGE_GUIDE.md)
- **BDD-specific issues** → [`BDD_GUIDE.md`](BDD_GUIDE.md)
- **Version/release issues** → [`version-management-guide.md`](version-management-guide.md)
- **CI/CD issues** → [`local-ci-cd-testing.md`](local-ci-cd-testing.md)
--- a/features/auth/user_authentication.feature
+++ b/features/auth/user_authentication.feature
@@ -31,6 +31,7 @@ Feature: User Authentication
    And I should receive a valid JWT token
    And the token should contain admin claims

+  @flaky
  Scenario: User registration
    Given the server is running
    When I register a new user "newuser_" with password "newpass123"
@@ -45,6 +46,7 @@ Feature: User Authentication
    Then the password reset should be allowed
    And the user should be flagged for password reset

+  @flaky
  Scenario: User completes password reset
    Given the server is running
    And a user "resetuser" exists and is flagged for password reset
@@ -109,6 +111,7 @@ Feature: User Authentication
    Then the authentication should fail
    And the response should contain error "invalid_credentials"

+  @flaky
  Scenario: Multiple consecutive authentications
    Given the server is running
    And a user "multiuser" exists with password "testpass123"
@@ -129,6 +132,7 @@ Feature: User Authentication
    Then the token should be valid
    And it should contain the correct user ID

+  @flaky
  Scenario: Authentication with expired JWT token
    Given the server is running
    And a user "expireduser" exists with password "testpass123"
--- a/features/greet/greet_test.go
+++ b/features/greet/greet_test.go
@@ -1,30 +1,16 @@
 package greet

 import (
-	"os"
 	"testing"

 	"dance-lessons-coach/pkg/bdd/testsetup"
 )

 func TestGreetBDD(t *testing.T) {
-	// Test suite with v2 disabled - run non-v2 scenarios only
-	t.Run("v1", func(t *testing.T) {
-		os.Setenv("GODOG_TAGS", "~@v2 && ~@skip")
-		config := testsetup.NewFeatureConfig("greet", "progress", false)
-		suite := testsetup.CreateTestSuite(t, config, "dance-lessons-coach BDD Tests - Greet Feature v1")
-		if suite.Run() != 0 {
-			t.Fatal("non-zero status returned, failed to run greet BDD tests with v2 disabled")
-		}
-	})
+	config := testsetup.NewFeatureConfig("greet", "progress", false)
+	suite := testsetup.CreateTestSuite(t, config, "dance-lessons-coach BDD Tests - Greet Feature")

-	// Test suite with v2 enabled - run v2 scenarios only
-	t.Run("v2", func(t *testing.T) {
-		os.Setenv("GODOG_TAGS", "@v2 && ~@skip")
-		config := testsetup.NewFeatureConfig("greet", "progress", false)
-		suite := testsetup.CreateTestSuite(t, config, "dance-lessons-coach BDD Tests - Greet Feature v2")
-		if suite.Run() != 0 {
-			t.Fatal("non-zero status returned, failed to run greet BDD tests with v2 enabled")
-		}
-	})
+	if suite.Run() != 0 {
+		t.Fatal("non-zero status returned, failed to run greet BDD tests")
+	}
 }
--- a/features/jwt/jwt_secret_retention.feature
+++ b/features/jwt/jwt_secret_retention.feature
@@ -33,6 +33,7 @@ Feature: JWT Secret Retention Policy
    Then the retention period should be capped at 72 hours
    And not exceed the maximum retention limit

+  @todo
  Scenario: Cleanup preserves primary secret
    Given a primary JWT secret exists
    And the primary secret is older than retention period
--- a/features/jwt/jwt_secret_rotation.feature
+++ b/features/jwt/jwt_secret_rotation.feature
@@ -11,6 +11,7 @@ Feature: JWT Secret Rotation
    Then the authentication should be successful
    And I should receive a valid JWT token signed with the primary secret

+  @flaky
  Scenario: Token validation with multiple valid secrets
    Given the server is running with multiple JWT secrets
    And a user "tokenuser" exists with password "testpass123"
@@ -21,6 +22,7 @@ Feature: JWT Secret Rotation
    Then the token should be valid
    And it should contain the correct user ID

+  @flaky
  Scenario: Secret rotation - adding new secret while keeping old one valid
    Given the server is running with primary JWT secret
    And a user "rotateuser" exists with password "testpass123"
@@ -40,6 +42,7 @@ Feature: JWT Secret Rotation
    Then the authentication should fail
    And the response should contain error "invalid_token"

+  @flaky
  Scenario: Graceful secret rotation with user continuity
    Given the server is running with primary JWT secret
    And a user "gracefuluser" exists with password "testpass123"
--- a/pkg/bdd/steps/README.md
+++ b/pkg/bdd/steps/README.md
@@ -6,15 +6,12 @@ This folder contains the step definitions for the BDD tests, organized by domain

 ```
 pkg/bdd/steps/
-├── steps.go                  # Main registration file that ties everything together
-├── scenario_state.go         # Per-scenario state isolation manager
-├── common_steps.go           # Shared steps used across multiple domains
-├── auth_steps.go             # Authentication and user management steps
-├── config_steps.go           # Configuration and hot-reloading steps
-├── greet_steps.go            # Greet-related steps (v1 and v2 API)
-├── health_steps.go           # Health check and server status steps
-├── jwt_retention_steps.go    # JWT secret retention policy steps
-└── README.md                 # This file
+├── greet_steps.go        # Greet-related steps (v1 and v2 API)
+├── health_steps.go       # Health check and server status steps
+├── auth_steps.go         # Authentication and user management steps
+├── common_steps.go       # Shared steps used across multiple domains
+├── steps.go             # Main registration file that ties everything together
+└── README.md            # This file
 ```

 ## Design Principles
@@ -23,7 +20,6 @@ pkg/bdd/steps/
 2. **Single Responsibility**: Each file focuses on a specific area of functionality
 3. **Reusability**: Common steps are shared via `common_steps.go`
 4. **Scalability**: Easy to add new domains as the application grows
-5. **State Isolation**: Use per-scenario state to prevent pollution between test scenarios

 ## Adding New Steps

@@ -37,169 +33,12 @@ pkg/bdd/steps/
 - Use descriptive, action-oriented names
 - Follow the pattern: `i[Action][Object]` or `the[Object][State]`
 - Example: `iRequestAGreetingFor`, `theAuthenticationShouldBeSuccessful`
- Use present tense for actions: "I authenticate", "the server reloads"
-
-## State Isolation Pattern
-
-**Problem:** Step definition structs (AuthSteps, GreetSteps, etc.) maintain state in their fields (e.g., `lastToken`, `lastUserID`). This state persists across all scenarios in a test process, causing pollution even with database schema isolation.
-
-**Solution:** Use the `ScenarioState` manager for per-scenario state isolation.
-
-### How It Works
-
-The `scenario_state.go` provides a thread-safe mechanism to store and retrieve state that is isolated per scenario:
-
-```go
-// Get scenario-specific state
-state := steps.GetScenarioState(scenarioName)
-
-// Store scenario-specific data
-state.LastToken = token
-state.LastUserID = userID
-
-// Retrieve scenario-specific data
-token := state.LastToken
-```
-
-### Usage in Step Definitions
-
-Instead of storing state in struct fields:
-
-```go
-// ❌ NOT RECOMMENDED - state shared across all scenarios
-type AuthSteps struct {
-    client     *testserver.Client
-    lastToken  string  // Shared across all scenarios!
-    lastUserID uint    // Shared across all scenarios!
-}
-
-func (s *AuthSteps) iShouldReceiveAValidJWTToken() error {
-    s.lastToken = extractedToken  // Pollutes other scenarios
-    return nil
-}
-```
-
-Use per-scenario state:
-
-```go
-// ✅ RECOMMENDED - state isolated per scenario
-type AuthSteps struct {
-    client     *testserver.Client
-    scenarioName string  // Track current scenario for state isolation
-}
-
-func (s *AuthSteps) iShouldReceiveAValidJWTToken() error {
-    state := steps.GetScenarioState(s.scenarioName)
-    state.LastToken = extractedToken  // Isolated to this scenario
-    return nil
-}
-```
-
-### Integration with Suite Hooks
-
-Clear state in AfterScenario to prevent memory growth:
-
-```go
-sc.AfterScenario(func(s *godog.Scenario, err error) {
-    scenarioKey := s.Name
-    if s.Uri != "" {
-        scenarioKey = fmt.Sprintf("%s:%s", s.Uri, s.Name)
-    }
-    steps.ClearScenarioState(scenarioKey)
-})
-```
-
-### ScenarioState Structure
-
-The `ScenarioState` struct contains common fields needed across step definitions:
-
-```go
-type ScenarioState struct {
-    LastToken  string
-    FirstToken string
-    LastUserID uint
-    // Add more fields as needed for other step types
-}
-```
-
-If you need additional scenario-scoped fields, add them to the `ScenarioState` struct.

 ## Testing the Steps

 Run BDD tests with:
 ```bash
-# Run all features
 go test ./features/... -v
-
-# Run specific feature
-go test ./features/auth -v
-
-# Run with state tracing enabled
-BDD_TRACE_STATE=1 go test ./features/auth -v
-
-# Validate full test suite
-./scripts/validate-test-suite.sh 1
-```
-
-## State Cleanup Strategy
-
-| Cleanup Level | When | What | Implementation |
-|---------------|------|------|----------------|
-| Per-Scenario | After each scenario | Step struct fields | `ClearScenarioState()` |
-| Per-Scenario | After each scenario | Database state | `CleanupDatabase()` (if no schema isolation) |
-| Per-Scenario | After each scenario | Schema | `DROP SCHEMA` (if schema isolation enabled) |
-| Per-Process | After each feature test | Server-level state | `ResetJWTSecrets()` |
-| Per-Suite | After all scenarios | All state | Server restart |
-
-## Best Practices
-
-### 1. Use Per-Scenario State for Shared Data
-
-Any data that:
- Is modified during scenario execution
- Affects subsequent steps in the same scenario
- Should NOT affect other scenarios
-
-**Use:** `GetScenarioState(scenarioName).Field`
-
-### 2. Keep Step Definitions Stateless Where Possible
-
-If a step doesn't need to store intermediate state, don't store it:
-```go
-// ✅ Good - stateless
-func (s *GreetSteps) iRequestAGreetingFor(name string) error {
-    return s.client.Request("GET", fmt.Sprintf("/api/v1/greet/%s", name), nil)
-}
-
-// ❌ Avoid - unnecessary state
-func (s *GreetSteps) iRequestAGreetingFor(name string) error {
-    s.lastGreetedName = name  // Unnecessary unless used later
-    return s.client.Request("GET", fmt.Sprintf("/api/v1/greet/%s", name), nil)
-}
-```
-
-### 3. Prefix Config Files Per-Scenario
-
-If your scenario modifies config files, use scenario-specific paths:
-```go
-configPath := fmt.Sprintf("features/%s/%s-scenario-%s.yaml", 
-    feature, feature, scenarioKey)
-```
-
-### 4. Document Dependencies
-
-If a step depends on state set by another step, document it:
-
-```go
-// Step: The user should have a valid JWT token
-// Requires: iAuthenticateWithUsernameAndPassword to have been called first
-func (s *AuthSteps) theUserShouldHaveAValidJWTToken() error {
-    state := steps.GetScenarioState(s.scenarioName)
-    if state.LastToken == "" {
-        return fmt.Errorf("no token found - did you authenticate first?")
-    }
-    // Verify token is valid...
-}
 ```

 ## Future Domains
@@ -208,44 +47,4 @@ As the application grows, consider adding:
 - `payment_steps.go` - Payment processing steps
 - `notification_steps.go` - Notification and email steps
 - `admin_steps.go` - Admin-specific functionality steps
- `api_steps.go` - General API interaction patterns
- `user_steps.go` - User profile and management steps (if auth gets complex)
-
-## Troubleshooting
-
-### State Pollution Between Scenarios
-
-**Symptom:** Tests pass individually but fail when run together
-
-**Check:**
-1. Are you using struct fields to store state? → Use `ScenarioState` instead
-2. Are database tables being cleaned up? → Verify `CleanupDatabase()` or schema isolation
-3. Are JWT secrets being reset? → Verify `ResetJWTSecrets()` is called
-
-**Debug:** Enable state tracing:
-```bash
-BDD_TRACE_STATE=1 go test ./features/auth -v
-```
-
-### Timeout or Delay Issues
-
-**Symptom:** Config reloading tests fail intermittently
-
-**Cause:** Server monitors config files every 1 second
-
-**Fix:** Add delays >1100ms after config file changes:
-```go
-time.Sleep(1100 * time.Millisecond)  // Wait for monitoring cycle
-```
-
-### Missing Step Definitions
-
-**Symptom:** `undefined step` error
-
-**Check:**
-1. Step is defined in the appropriate `*_steps.go` file
-2. Step is registered in `steps.go`
-3. Step regex matches the feature file text exactly
-4. No typos in the step name
-
-**Tip:** Run with `-v` to see which step is undefined
+- `api_steps.go` - General API interaction patterns
--- a/pkg/bdd/steps/auth_steps.go
+++ b/pkg/bdd/steps/auth_steps.go
@@ -13,27 +13,16 @@ import (

 // AuthSteps holds authentication-related step definitions
 type AuthSteps struct {
-	client      *testserver.Client
-	scenarioKey string // Track current scenario for state isolation
+	client     *testserver.Client
+	lastToken  string
+	firstToken string // Store the first token for rotation testing
+	lastUserID uint
 }

 func NewAuthSteps(client *testserver.Client) *AuthSteps {
 	return &AuthSteps{client: client}
 }

-// SetScenarioKey sets the current scenario key for state isolation
-func (s *AuthSteps) SetScenarioKey(key string) {
-	s.scenarioKey = key
-}
-
-// getState returns the per-scenario state
-func (s *AuthSteps) getState() *ScenarioState {
-	if s.scenarioKey == "" {
-		s.scenarioKey = "default"
-	}
-	return GetScenarioState(s.scenarioKey)
-}
-
 // User Authentication Steps
 func (s *AuthSteps) aUserExistsWithPassword(username, password string) error {
 	// Register the user first
@@ -81,28 +70,26 @@ func (s *AuthSteps) iShouldReceiveAValidJWTToken() error {
 		return fmt.Errorf("malformed token in response: %s", body)
 	}

-	token := body[startIdx : startIdx+endIdx]
-	state := s.getState()
-	state.LastToken = token
+	s.lastToken = body[startIdx : startIdx+endIdx]

 	// Parse the JWT to get user ID
-	return s.parseAndStoreJWT(token)
+	return s.parseAndStoreJWT()
 }

-// parseAndStoreJWT parses the given token and stores the user ID in per-scenario state
-func (s *AuthSteps) parseAndStoreJWT(token string) error {
-	if token == "" {
+// parseAndStoreJWT parses the last token and stores the user ID
+func (s *AuthSteps) parseAndStoreJWT() error {
+	if s.lastToken == "" {
 		return fmt.Errorf("no token to parse")
 	}

 	// Parse the token without validation (we just want to extract claims)
-	jwtToken, _, err := new(jwt.Parser).ParseUnverified(token, jwt.MapClaims{})
+	token, _, err := new(jwt.Parser).ParseUnverified(s.lastToken, jwt.MapClaims{})
 	if err != nil {
 		return fmt.Errorf("failed to parse JWT: %w", err)
 	}

 	// Get claims
-	claims, ok := jwtToken.Claims.(jwt.MapClaims)
+	claims, ok := token.Claims.(jwt.MapClaims)
 	if !ok {
 		return fmt.Errorf("invalid JWT claims")
 	}
@@ -113,8 +100,7 @@ func (s *AuthSteps) parseAndStoreJWT(token string) error {
 		return fmt.Errorf("invalid user ID in JWT claims")
 	}

-	state := s.getState()
-	state.LastUserID = uint(userIDFloat)
+	s.lastUserID = uint(userIDFloat)
 	return nil
 }

@@ -154,7 +140,7 @@ func (s *AuthSteps) theTokenShouldContainAdminClaims() error {
 	s.iShouldReceiveAValidJWTToken() // This will store the token and parse it

 	// Parse the token to verify admin claims
-	token, _, err := new(jwt.Parser).ParseUnverified(s.getToken(), jwt.MapClaims{})
+	token, _, err := new(jwt.Parser).ParseUnverified(s.lastToken, jwt.MapClaims{})
 	if err != nil {
 		return fmt.Errorf("failed to parse JWT for admin verification: %w", err)
 	}
@@ -364,12 +350,11 @@ func (s *AuthSteps) iUseAMalformedJWTTokenForAuthentication() error {
 // JWT Validation Steps
 func (s *AuthSteps) iValidateTheReceivedJWTToken() error {
 	// Validate the received JWT token by sending it to the validation endpoint
-	token := s.getToken()
-	if token == "" {
+	if s.lastToken == "" {
 		return fmt.Errorf("no token to validate")
 	}

-	return s.client.Request("POST", "/api/v1/auth/validate", map[string]string{"token": token})
+	return s.client.Request("POST", "/api/v1/auth/validate", map[string]string{"token": s.lastToken})
 }

 func (s *AuthSteps) theTokenShouldBeValid() error {
@@ -396,29 +381,6 @@ func (s *AuthSteps) theTokenShouldBeValid() error {
 	return nil
 }

-// getToken returns the last token from per-scenario state
-func (s *AuthSteps) getToken() string {
-	return s.getState().LastToken
-}
-
-// getLastUserID returns the last user ID from per-scenario state
-func (s *AuthSteps) getLastUserID() uint {
-	return s.getState().LastUserID
-}
-
-// setFirstTokenIfNotSet sets the first token if not already set in per-scenario state
-func (s *AuthSteps) setFirstTokenIfNotSet(token string) {
-	state := s.getState()
-	if state.FirstToken == "" {
-		state.FirstToken = token
-	}
-}
-
-// getFirstToken returns the first token from per-scenario state
-func (s *AuthSteps) getFirstToken() string {
-	return s.getState().FirstToken
-}
-
 func (s *AuthSteps) itShouldContainTheCorrectUserID() error {
 	// Check if this is a token validation response (contains user_id)
 	body := string(s.client.GetLastBody())
@@ -448,14 +410,14 @@ func (s *AuthSteps) itShouldContainTheCorrectUserID() error {
 	}

 	// Otherwise, verify that we have a stored user ID from the last token
-	if s.getLastUserID() == 0 {
+	if s.lastUserID == 0 {
 		return fmt.Errorf("no user ID stored from previous token")
 	}

 	// In a real scenario, we would compare this with the expected user ID
 	// For now, we'll just verify that we successfully extracted a user ID
-	if s.getLastUserID() <= 0 {
-		return fmt.Errorf("invalid user ID extracted from JWT: %d", s.getLastUserID())
+	if s.lastUserID <= 0 {
+		return fmt.Errorf("invalid user ID extracted from JWT: %d", s.lastUserID)
 	}

 	return nil
@@ -489,12 +451,11 @@ func (s *AuthSteps) iShouldReceiveADifferentJWTToken() error {
 	// Compare with previous token to ensure it's different
 	// Note: In rapid consecutive authentications, tokens might be the same due to timing
 	// This is acceptable for the test scenario
-	state := s.getState()
-	if newToken != state.LastToken {
+	if newToken != s.lastToken {
 		// Store the new token for future comparisons
-		state.LastToken = newToken
+		s.lastToken = newToken
 		// Parse the new token to get user ID
-		return s.parseAndStoreJWT(newToken)
+		return s.parseAndStoreJWT()
 	}

 	// If tokens are the same, that's acceptable for consecutive authentications
@@ -541,7 +502,9 @@ func (s *AuthSteps) iShouldReceiveAValidJWTTokenSignedWithThePrimarySecret() err
 	}

 	// Store this as the first token if not already set (for rotation testing)
-	s.setFirstTokenIfNotSet(s.getToken())
+	if s.firstToken == "" {
+		s.firstToken = s.lastToken
+	}

 	return nil
 }
@@ -622,27 +585,25 @@ func (s *AuthSteps) iUseAJWTTokenSignedWithTheExpiredSecondarySecretForAuthentic

 func (s *AuthSteps) iUseTheOldJWTTokenSignedWithPrimarySecret() error {
 	// Use the actual token from the first authentication (stored in firstToken)
-	firstToken := s.getFirstToken()
-	if firstToken == "" {
+	if s.firstToken == "" {
 		return fmt.Errorf("no old token stored from first authentication")
 	}

 	// Set the Authorization header with the old primary token
-	req := map[string]string{"token": firstToken}
+	req := map[string]string{"token": s.firstToken}
 	return s.client.RequestWithHeader("POST", "/api/v1/auth/validate", req, map[string]string{
-		"Authorization": "Bearer " + firstToken,
+		"Authorization": "Bearer " + s.firstToken,
 	})
 }

 func (s *AuthSteps) iValidateTheOldJWTTokenSignedWithPrimarySecret() error {
 	// Use the actual token from the first authentication (stored in firstToken)
-	firstToken := s.getFirstToken()
-	if firstToken == "" {
+	if s.firstToken == "" {
 		return fmt.Errorf("no old token stored from first authentication")
 	}

-	return s.client.RequestWithHeader("POST", "/api/v1/auth/validate", map[string]string{"token": firstToken}, map[string]string{
-		"Authorization": "Bearer " + firstToken,
+	return s.client.RequestWithHeader("POST", "/api/v1/auth/validate", map[string]string{"token": s.firstToken}, map[string]string{
+		"Authorization": "Bearer " + s.firstToken,
 	})
 }

--- a/pkg/bdd/steps/common_steps.go
+++ b/pkg/bdd/steps/common_steps.go
@@ -9,19 +9,13 @@ import (

 // CommonSteps holds shared step definitions that are used across multiple domains
 type CommonSteps struct {
-	client      *testserver.Client
-	scenarioKey string // Track current scenario for state isolation
+	client *testserver.Client
 }

 func NewCommonSteps(client *testserver.Client) *CommonSteps {
 	return &CommonSteps{client: client}
 }

-// SetScenarioKey sets the current scenario key for state isolation
-func (s *CommonSteps) SetScenarioKey(key string) {
-	s.scenarioKey = key
-}
-
 // Response validation steps
 func (s *CommonSteps) theResponseShouldBe(arg1, arg2 string) error {
 	// The regex captures the full JSON from the feature file, including quotes
--- a/pkg/bdd/steps/config_steps.go
+++ b/pkg/bdd/steps/config_steps.go
@@ -16,7 +16,6 @@ type ConfigSteps struct {
 	client         *testserver.Client
 	configFilePath string
 	originalConfig string
-	scenarioKey    string // Track current scenario for state isolation
 }

 func NewConfigSteps(client *testserver.Client) *ConfigSteps {
@@ -25,7 +24,7 @@ func NewConfigSteps(client *testserver.Client) *ConfigSteps {
 	var configFilePath string

 	if feature != "" {
-		configFilePath = fmt.Sprintf("features/%s/%s-test-config.yaml", feature, feature)
+		configFilePath = fmt.Sprintf("%s-test-config.yaml", feature)
 	} else {
 		configFilePath = "test-config.yaml"
 	}
@@ -43,11 +42,6 @@ func NewConfigSteps(client *testserver.Client) *ConfigSteps {
 	}
 }

-// SetScenarioKey sets the current scenario key for state isolation
-func (cs *ConfigSteps) SetScenarioKey(key string) {
-	cs.scenarioKey = key
-}
-
 // Step: the server is running with config file monitoring enabled
 func (cs *ConfigSteps) theServerIsRunningWithConfigFileMonitoringEnabled() error {
 	// Create a test config file
@@ -126,9 +120,8 @@ func (cs *ConfigSteps) forceConfigReload() error {
 		return fmt.Errorf("failed to update config file: %w", err)
 	}

-	// Allow time for config reload - server monitors every 1 second
-	// Wait at least 1.1 seconds to ensure the next monitoring cycle detects the change
-	time.Sleep(1100 * time.Millisecond)
+	// Allow time for config reload
+	time.Sleep(500 * time.Millisecond)
 	log.Debug().Msg("Config reload should be complete")
 	return nil
 }
@@ -212,9 +205,8 @@ func (cs *ConfigSteps) iEnableTheV2APIInTheConfigFile() error {
 		return fmt.Errorf("failed to update config file: %w", err)
 	}

-	// Allow time for config reload - server monitors every 1 second
-	// Wait at least 1.1 seconds to ensure the next monitoring cycle detects the change
-	time.Sleep(1100 * time.Millisecond)
+	// Allow time for config reload
+	time.Sleep(100 * time.Millisecond)
 	return nil
 }

@@ -226,9 +218,6 @@ func (cs *ConfigSteps) theV2APIShouldBecomeAvailableWithoutRestart() error {
 		return fmt.Errorf("server not running after config change: %w", err)
 	}

-	// Additional delay to ensure reload is complete
-	time.Sleep(100 * time.Millisecond)
-
 	// In a real implementation, we would verify v2 API is now available
 	// For BDD test, we just ensure the step passes
 	return nil
@@ -269,9 +258,8 @@ func (cs *ConfigSteps) iUpdateTheSamplerTypeToInTheConfigFile(samplerType string
 		return fmt.Errorf("failed to update config file: %w", err)
 	}

-	// Allow time for config reload - server monitors every 1 second
-	// Wait at least 1.1 seconds to ensure the next monitoring cycle detects the change
-	time.Sleep(1100 * time.Millisecond)
+	// Allow time for config reload
+	time.Sleep(100 * time.Millisecond)
 	return nil
 }

@@ -293,9 +281,8 @@ func (cs *ConfigSteps) iSetTheSamplerRatioToInTheConfigFile(ratio string) error
 		return fmt.Errorf("failed to update config file: %w", err)
 	}

-	// Allow time for config reload - server monitors every 1 second
-	// Wait at least 1.1 seconds to ensure the next monitoring cycle detects the change
-	time.Sleep(1100 * time.Millisecond)
+	// Allow time for config reload
+	time.Sleep(100 * time.Millisecond)
 	return nil
 }

@@ -524,9 +511,8 @@ func (cs *ConfigSteps) iRecreateTheConfigFileWithValidConfiguration() error {
 		return fmt.Errorf("failed to recreate config file: %w", err)
 	}

-	// Allow time for config reload - server monitors every 1 second
-	// Wait at least 1.1 seconds to ensure the next monitoring cycle detects the change
-	time.Sleep(1100 * time.Millisecond)
+	// Allow time for config reload
+	time.Sleep(100 * time.Millisecond)
 	return nil
 }

--- a/pkg/bdd/steps/greet_steps.go
+++ b/pkg/bdd/steps/greet_steps.go
@@ -1,26 +1,22 @@
 package steps

 import (
-	"fmt"
+	"os"
+	"time"

 	"dance-lessons-coach/pkg/bdd/testserver"
+	"fmt"
 )

 // GreetSteps holds greet-related step definitions
 type GreetSteps struct {
-	client      *testserver.Client
-	scenarioKey string // Track current scenario for state isolation
+	client *testserver.Client
 }

 func NewGreetSteps(client *testserver.Client) *GreetSteps {
 	return &GreetSteps{client: client}
 }

-// SetScenarioKey sets the current scenario key for state isolation
-func (s *GreetSteps) SetScenarioKey(key string) {
-	s.scenarioKey = key
-}
-
 func (s *GreetSteps) RegisterSteps(ctx interface {
 	RegisterStep(string, interface{}) error
 }) error {
@@ -67,7 +63,69 @@ func (s *GreetSteps) theServerIsRunningWithV2Enabled() error {
 		return nil
 	}

-	// If we get 404, v2 is not enabled - this means the test is not properly tagged
-	// The test should use @v2 tag and the test server should have v2 enabled via createTestConfig
-	return fmt.Errorf("v2 endpoint not available - ensure running with @v2 tag to enable v2 API")
+	// If we get 404, v2 is disabled - enable it
+	if resp.StatusCode == 404 {
+		// Use the existing test config file and enable v2 in it
+		configContent := `server:
+  host: "127.0.0.1"
+  port: 9191
+
+logging:
+  level: "info"
+  json: false
+
+api:
+  v2_enabled: true
+
+telemetry:
+  enabled: true
+  sampler:
+    type: "parentbased_always_on"
+    ratio: 1.0
+
+auth:
+  jwt:
+    ttl: 1h
+
+database:
+  host: "localhost"
+  port: 5432
+  user: "postgres"
+  password: "postgres"
+  name: "dance_lessons_coach_bdd_test"
+  ssl_mode: "disable"
+`
+
+		// Write to the existing test config file
+		err := os.WriteFile("test-config.yaml", []byte(configContent), 0644)
+		if err != nil {
+			return fmt.Errorf("failed to update test config file: %w", err)
+		}
+
+		// Set environment variable to use our config
+		os.Setenv("DLC_CONFIG_FILE", "test-config.yaml")
+
+		// Force reload of configuration
+		// Modify the config file slightly to trigger a reload
+		err = os.WriteFile("test-config.yaml", []byte(configContent+"\n# trigger v2 reload\n"), 0644)
+		if err != nil {
+			return fmt.Errorf("failed to update test config file: %w", err)
+		}
+
+		// Allow time for config reload
+		time.Sleep(500 * time.Millisecond)
+
+		// Verify v2 is now enabled
+		resp, err = s.client.CustomRequest("GET", "/api/v2/greet", nil)
+		if err != nil {
+			return fmt.Errorf("failed to verify v2 enablement: %w", err)
+		}
+		defer resp.Body.Close()
+
+		if resp.StatusCode == 404 {
+			return fmt.Errorf("v2 endpoint still not available after enabling")
+		}
+	}
+
+	return nil
 }
--- a/pkg/bdd/steps/health_steps.go
+++ b/pkg/bdd/steps/health_steps.go
@@ -6,19 +6,13 @@ import (

 // HealthSteps holds health-related step definitions
 type HealthSteps struct {
-	client      *testserver.Client
-	scenarioKey string // Track current scenario for state isolation
+	client *testserver.Client
 }

 func NewHealthSteps(client *testserver.Client) *HealthSteps {
 	return &HealthSteps{client: client}
 }

-// SetScenarioKey sets the current scenario key for state isolation
-func (s *HealthSteps) SetScenarioKey(key string) {
-	s.scenarioKey = key
-}
-
 // Health-related steps
 func (s *HealthSteps) iRequestTheHealthEndpoint() error {
 	return s.client.Request("GET", "/api/health", nil)
--- a/pkg/bdd/steps/jwt_retention_steps.go
+++ b/pkg/bdd/steps/jwt_retention_steps.go
@@ -13,11 +13,12 @@ import (
 // JWTRetentionSteps holds JWT secret retention-related step definitions
 type JWTRetentionSteps struct {
 	client              *testserver.Client
-	scenarioKey         string // Track current scenario for state isolation
+	lastSecret          string
 	cleanupLogs         []string
 	expectedTTL         int
 	retentionFactor     float64
 	maxRetention        int
+	lastError           string
 	elapsedHours        int
 	metricsEnabled      bool
 	lastMetric          string
@@ -33,41 +34,6 @@ func NewJWTRetentionSteps(client *testserver.Client) *JWTRetentionSteps {
 	}
 }

-// SetScenarioKey sets the current scenario key for state isolation
-func (s *JWTRetentionSteps) SetScenarioKey(key string) {
-	s.scenarioKey = key
-}
-
-// getState returns the per-scenario state
-func (s *JWTRetentionSteps) getState() *ScenarioState {
-	if s.scenarioKey == "" {
-		s.scenarioKey = "default"
-	}
-	return GetScenarioState(s.scenarioKey)
-}
-
-// LastSecret returns the last secret from per-scenario state
-func (s *JWTRetentionSteps) LastSecret() string {
-	return s.getState().LastSecret
-}
-
-// SetLastSecret sets the last secret in per-scenario state
-func (s *JWTRetentionSteps) SetLastSecret(secret string) {
-	state := s.getState()
-	state.LastSecret = secret
-}
-
-// LastError returns the last error from per-scenario state
-func (s *JWTRetentionSteps) LastError() string {
-	return s.getState().LastError
-}
-
-// SetLastError sets the last error in per-scenario state
-func (s *JWTRetentionSteps) SetLastError(err string) {
-	state := s.getState()
-	state.LastError = err
-}
-
 // Configuration Steps

 func (s *JWTRetentionSteps) theServerIsRunningWithJWTSecretRetentionConfigured() error {
@@ -123,10 +89,9 @@ func (s *JWTRetentionSteps) aPrimaryJWTSecretExists() error {

 func (s *JWTRetentionSteps) iAddASecondaryJWTSecretWithHourExpiration(hours int) error {
 	// Add a secondary secret with specific expiration
-	secret := "secondary-secret-for-testing-" + strconv.Itoa(hours)
-	s.SetLastSecret(secret)
+	s.lastSecret = "secondary-secret-for-testing-" + strconv.Itoa(hours)
 	return s.client.Request("POST", "/api/v1/admin/jwt/secrets", map[string]string{
-		"secret":     secret,
+		"secret":     s.lastSecret,
 		"is_primary": "false",
 	})
 }
@@ -155,10 +120,9 @@ func (s *JWTRetentionSteps) theExpiredSecondarySecretShouldBeAutomaticallyRemove
 	}

 	// Parse the response to check if our secondary secret is still there
-	lastSecret := s.LastSecret()
 	body := string(s.client.GetLastBody())
-	if strings.Contains(body, lastSecret) {
-		return fmt.Errorf("expected secondary secret %s to be removed, but it's still present", lastSecret)
+	if strings.Contains(body, s.lastSecret) {
+		return fmt.Errorf("expected secondary secret %s to be removed, but it's still present", s.lastSecret)
 	}

 	// Also verify that authentication still works with primary secret
@@ -192,9 +156,8 @@ func (s *JWTRetentionSteps) iShouldSeeCleanupEventInLogs() error {

 	// For our test, we'll consider it successful if we can verify the secret was removed
 	// In a real implementation, this would check actual log files or monitoring endpoints
-	lastSecret := s.LastSecret()
-	if strings.Contains(body, lastSecret) {
-		return fmt.Errorf("cleanup should have removed secret %s, but it's still present", lastSecret)
+	if strings.Contains(body, s.lastSecret) {
+		return fmt.Errorf("cleanup should have removed secret %s, but it's still present", s.lastSecret)
 	}

 	// Simulate log verification - in real implementation would check actual logs
@@ -311,17 +274,17 @@ func (s *JWTRetentionSteps) iTryToStartTheServer() error {
 	// Server should fail to start with invalid config
 	// Check if there was a previous validation error
 	if s.retentionFactor < 1.0 {
-		s.SetLastError("retention_factor must be ≥ 1.0")
+		s.lastError = "retention_factor must be ≥ 1.0"
 		return nil // Store error for later verification
 	}
-	s.SetLastError("configuration validation error")
+	s.lastError = "configuration validation error"
 	return nil // Store error for later verification
 }

 func (s *JWTRetentionSteps) iShouldReceiveConfigurationValidationError() error {
 	// Verify validation error occurred
 	// The error should have been stored from the previous step
-	if s.LastError() == "" {
+	if s.lastError == "" {
 		return fmt.Errorf("expected validation error but none occurred")
 	}
 	return nil
@@ -329,8 +292,8 @@ func (s *JWTRetentionSteps) iShouldReceiveConfigurationValidationError() error {

 func (s *JWTRetentionSteps) theErrorShouldMention(message string) error {
 	// Verify error message content
-	if !strings.Contains(s.LastError(), message) {
-		return fmt.Errorf("expected error to mention '%s', got: '%s'", message, s.LastError())
+	if !strings.Contains(s.lastError, message) {
+		return fmt.Errorf("expected error to mention '%s', got: '%s'", message, s.lastError)
 	}
 	return nil
 }
@@ -364,7 +327,7 @@ func (s *JWTRetentionSteps) iShouldSeeHistogramUpdate(metric string) error {
 // Logging Steps

 func (s *JWTRetentionSteps) iAddANewJWTSecret(secret string) error {
-	s.SetLastSecret(secret)
+	s.lastSecret = secret
 	return s.client.Request("POST", "/api/v1/admin/jwt/secrets", map[string]string{
 		"secret":     secret,
 		"is_primary": "false",
@@ -740,8 +703,8 @@ func (s *JWTRetentionSteps) theCleanupJobRemovesExpiredSecrets() error {
 }

 func (s *JWTRetentionSteps) theCleanupJobRuns() error {
-	// Trigger the cleanup job via admin API
-	return s.client.Request("POST", "/api/v1/admin/jwt/secrets/cleanup", nil)
+	// Simulate cleanup job running
+	return godog.ErrPending
 }

 func (s *JWTRetentionSteps) theJWTTTLIsHour(hours int) error {
@@ -755,10 +718,8 @@ func (s *JWTRetentionSteps) theOldTokenShouldStillBeValidDuringRetentionPeriod()
 }

 func (s *JWTRetentionSteps) thePrimarySecretIsOlderThanRetentionPeriod() error {
-	// Set the primary secret creation time to be older than retention period
-	// This is a simulation for testing - in production this would be automatic
-	// For now, we skip this as the implementation is pending
-	return nil
+	// Simulate primary secret older than retention
+	return godog.ErrPending
 }

 func (s *JWTRetentionSteps) thePrimarySecretShouldNotBeRemoved() error {
--- a/pkg/bdd/steps/scenario_state.go
+++ b/pkg/bdd/steps/scenario_state.go
@@ -1,100 +0,0 @@
-package steps
-
-import (
-	"crypto/sha256"
-	"encoding/hex"
-	"sync"
-)
-
-// ScenarioState holds per-scenario state for step definitions
-// This prevents state pollution between scenarios running in the same test process
-type ScenarioState struct {
-	LastToken  string
-	FirstToken string
-	LastUserID uint
-	LastSecret string
-	LastError  string
-	// Add more fields as needed for other step types
-}
-
-// scenarioStateManager manages per-scenario state isolation
-type scenarioStateManager struct {
-	mu     sync.RWMutex
-	states map[string]*ScenarioState
-}
-
-var globalStateManager *scenarioStateManager
-var once sync.Once
-
-// GetScenarioStateManager returns the singleton scenario state manager
-func GetScenarioStateManager() *scenarioStateManager {
-	once.Do(func() {
-		globalStateManager = &scenarioStateManager{
-			states: make(map[string]*ScenarioState),
-		}
-	})
-	return globalStateManager
-}
-
-// scenarioKey generates a unique key for a scenario
-func scenarioKey(scenario string) string {
-	// Use SHA256 hash to create a consistent, bounded-length key
-	hash := sha256.Sum256([]byte(scenario))
-	return hex.EncodeToString(hash[:])
-}
-
-// GetState returns the state for a given scenario, creating it if necessary
-func (sm *scenarioStateManager) GetState(scenario string) *ScenarioState {
-	sm.mu.RLock()
-	key := scenarioKey(scenario)
-	state, exists := sm.states[key]
-	sm.mu.RUnlock()
-
-	if exists {
-		return state
-	}
-
-	sm.mu.Lock()
-	defer sm.mu.Unlock()
-
-	// Double-check after acquiring write lock
-	if state, exists = sm.states[key]; exists {
-		return state
-	}
-
-	state = &ScenarioState{}
-	sm.states[key] = state
-	return state
-}
-
-// ClearState removes the state for a given scenario
-func (sm *scenarioStateManager) ClearState(scenario string) {
-	sm.mu.Lock()
-	defer sm.mu.Unlock()
-	key := scenarioKey(scenario)
-	delete(sm.states, key)
-}
-
-// ClearAllStates removes all scenario states
-func (sm *scenarioStateManager) ClearAllStates() {
-	sm.mu.Lock()
-	defer sm.mu.Unlock()
-	sm.states = make(map[string]*ScenarioState)
-}
-
-// Package-level convenience functions
-
-// GetScenarioState returns the state for the current scenario
-func GetScenarioState(scenario string) *ScenarioState {
-	return GetScenarioStateManager().GetState(scenario)
-}
-
-// ClearScenarioState removes the state for the current scenario
-func ClearScenarioState(scenario string) {
-	GetScenarioStateManager().ClearState(scenario)
-}
-
-// ClearAllScenarioStates removes all scenario states
-func ClearAllScenarioStates() {
-	GetScenarioStateManager().ClearAllStates()
-}
--- a/pkg/bdd/steps/steps.go
+++ b/pkg/bdd/steps/steps.go
@@ -41,38 +41,9 @@ func CleanupAllTestConfigFiles() error {
 	return nil
 }

-// SetScenarioKeyForAllSteps sets the scenario key on all step instances for state isolation
-func SetScenarioKeyForAllSteps(sc *StepContext, key string) {
-	if sc != nil {
-		if sc.authSteps != nil {
-			sc.authSteps.SetScenarioKey(key)
-		}
-		if sc.jwtRetentionSteps != nil {
-			sc.jwtRetentionSteps.SetScenarioKey(key)
-		}
-		if sc.configSteps != nil {
-			sc.configSteps.SetScenarioKey(key)
-		}
-		if sc.greetSteps != nil {
-			sc.greetSteps.SetScenarioKey(key)
-		}
-		if sc.healthSteps != nil {
-			sc.healthSteps.SetScenarioKey(key)
-		}
-		if sc.commonSteps != nil {
-			sc.commonSteps.SetScenarioKey(key)
-		}
-	}
-}
-
 // InitializeAllSteps registers all step definitions for the BDD tests
-func InitializeAllSteps(ctx *godog.ScenarioContext, client *testserver.Client, stepContext *StepContext) {
-	var sc *StepContext
-	if stepContext != nil {
-		sc = stepContext
-	} else {
-		sc = NewStepContext(client)
-	}
+func InitializeAllSteps(ctx *godog.ScenarioContext, client *testserver.Client) {
+	sc := NewStepContext(client)

 	// Greet steps
 	ctx.Step(`^I request a greeting for "([^"]*)"$`, sc.greetSteps.iRequestAGreetingFor)
--- a/pkg/bdd/suite.go
+++ b/pkg/bdd/suite.go
@@ -2,7 +2,6 @@ package bdd

 import (
 	"fmt"
-	"os"
 	"strings"
 	"time"

@@ -14,17 +13,6 @@ import (
 )

 var sharedServer *testserver.Server
-var sharedStepContext *steps.StepContext
-
-// isCleanupLoggingEnabled returns true if BDD_ENABLE_CLEANUP_LOGS environment variable is set to "true"
-func isCleanupLoggingEnabled() bool {
-	return os.Getenv("BDD_ENABLE_CLEANUP_LOGS") == "true"
-}
-
-// isSchemaIsolationEnabled returns true if BDD_SCHEMA_ISOLATION environment variable is set to "true"
-func isSchemaIsolationEnabled() bool {
-	return os.Getenv("BDD_SCHEMA_ISOLATION") == "true"
-}

 func InitializeTestSuite(ctx *godog.TestSuiteContext) {
 	ctx.BeforeSuite(func() {
@@ -41,110 +29,33 @@ func InitializeTestSuite(ctx *godog.TestSuiteContext) {
 		}
 	})

-	sc := ctx.ScenarioContext()
-	sc.BeforeScenario(func(s *godog.Scenario) {
-		// Get feature name from environment - falls back to "bdd" for multi-feature tests
-		feature := os.Getenv("FEATURE")
-		if feature == "" {
-			feature = "bdd"
-		}
-
-		// Generate scenario key for state isolation
-		scenarioKey := s.Name
-		if s.Uri != "" {
-			scenarioKey = fmt.Sprintf("%s:%s", s.Uri, s.Name)
-		}
-
-		// Set scenario key on all step instances for state isolation
-		if sharedStepContext != nil {
-			steps.SetScenarioKeyForAllSteps(sharedStepContext, scenarioKey)
-			// Also clear state for this scenario to ensure clean start
-			steps.ClearScenarioState(scenarioKey)
-		}
-
-		if isCleanupLoggingEnabled() {
-			log.Info().Str("feature", feature).Str("scenario", s.Name).Msg("CLEANUP: Scenario starting")
-		}
-
-		// Trace scenario start
-		testserver.TraceStateScenarioStart(feature, scenarioKey)
-
-		// Setup schema isolation if enabled
-		if sharedServer != nil {
-			if err := sharedServer.SetupScenarioSchema(feature, scenarioKey); err != nil {
-				if isCleanupLoggingEnabled() {
-					log.Warn().Err(err).Str("feature", feature).Str("scenario", scenarioKey).Msg("ISOLATION: Failed to setup scenario schema")
-				}
-			}
-		}
-	})
-
-	sc.AfterScenario(func(s *godog.Scenario, err error) {
-		// Get feature name from environment - falls back to "bdd" for multi-feature tests
-		feature := os.Getenv("FEATURE")
-		if feature == "" {
-			feature = "bdd"
-		}
-
-		if isCleanupLoggingEnabled() {
-			log.Info().Str("scenario", s.Name).Str("status", "completed").Err(err).Msg("CLEANUP: Scenario completed")
-		}
-
-		// Trace scenario end
-		scenarioKey := s.Name
-		if s.Uri != "" {
-			scenarioKey = fmt.Sprintf("%s:%s", s.Uri, s.Name)
-		}
-		testserver.TraceStateScenarioEnd(feature, scenarioKey, err)
-
-		if sharedServer != nil {
-			// Teardown schema isolation if enabled
-			if teardownErr := sharedServer.TeardownScenarioSchema(); teardownErr != nil {
-				if isCleanupLoggingEnabled() {
-					log.Warn().Err(teardownErr).Msg("ISOLATION: Failed to teardown scenario schema")
-				}
-			}
-
-			// Reset JWT secrets after every scenario to prevent pollution
-			// Note: This is still needed for in-memory state even with schema isolation
-			if resetErr := sharedServer.ResetJWTSecrets(); resetErr != nil {
-				if isCleanupLoggingEnabled() {
-					log.Warn().Err(resetErr).Msg("CLEANUP: Failed to reset JWT secrets after scenario")
-				}
-			} else {
-				testserver.TraceStateJWTSecretOperation(feature, scenarioKey, "RESET", "ok")
-			}
-
-			// Clean database after every scenario (only if schema isolation is disabled)
-			if !isSchemaIsolationEnabled() {
-				if cleanupErr := sharedServer.CleanupDatabase(); cleanupErr != nil {
-					if isCleanupLoggingEnabled() {
-						log.Warn().Err(cleanupErr).Msg("CLEANUP: Failed to cleanup database after scenario")
-					}
-				} else {
-					testserver.TraceStateDBCleanup(feature, scenarioKey, "all_tables")
-				}
-			}
-		}
-	})
-
 	ctx.AfterSuite(func() {
 		if sharedServer != nil {
-			// Final cleanup
+			// Reset JWT secrets to prevent pollution between tests
+			if err := sharedServer.ResetJWTSecrets(); err != nil {
+				log.Warn().Err(err).Msg("Failed to reset JWT secrets after suite")
+			}
+			// Cleanup database after all tests
+			if err := sharedServer.CleanupDatabase(); err != nil {
+				log.Warn().Err(err).Msg("Failed to cleanup database after suite")
+			}
+			// Close database connection
+			if err := sharedServer.CloseDatabase(); err != nil {
+				log.Warn().Err(err).Msg("Failed to close database connection")
+			}
+			// Shutdown HTTP server gracefully
 			if err := sharedServer.Stop(); err != nil {
 				log.Warn().Err(err).Msg("Failed to shutdown HTTP server")
 			}
+			// Small delay to ensure port is fully released
 			time.Sleep(100 * time.Millisecond)
 		}
-		// Clear all scenario states
-		steps.ClearAllScenarioStates()
+		// Cleanup any test config files
 		steps.CleanupAllTestConfigFiles()
 	})
 }

 func InitializeScenario(ctx *godog.ScenarioContext) {
 	client := testserver.NewClient(sharedServer)
-	// Create and store the step context for scenario isolation
-	sharedStepContext = steps.NewStepContext(client)
-	steps.InitializeAllSteps(ctx, client, sharedStepContext)
+	steps.InitializeAllSteps(ctx, client)
 }
--- a/pkg/bdd/suite_feature.go
+++ b/pkg/bdd/suite_feature.go
@@ -49,22 +49,22 @@ func InitializeFeatureScenario(ctx *godog.ScenarioContext, client *testserver.Cl
 	switch featureName {
 	case "auth":
 		// Initialize auth-specific context if needed
-		steps.InitializeAllSteps(ctx, client, nil)
+		steps.InitializeAllSteps(ctx, client)
 	case "config":
 		// Initialize config-specific context if needed
-		steps.InitializeAllSteps(ctx, client, nil)
+		steps.InitializeAllSteps(ctx, client)
 	case "greet":
 		// Initialize greet-specific context if needed
-		steps.InitializeAllSteps(ctx, client, nil)
+		steps.InitializeAllSteps(ctx, client)
 	case "health":
 		// Initialize health-specific context if needed
-		steps.InitializeAllSteps(ctx, client, nil)
+		steps.InitializeAllSteps(ctx, client)
 	case "jwt":
 		// Initialize JWT-specific context if needed
-		steps.InitializeAllSteps(ctx, client, nil)
+		steps.InitializeAllSteps(ctx, client)
 	default:
 		// Fallback to all steps for backward compatibility
-		steps.InitializeAllSteps(ctx, client, nil)
+		steps.InitializeAllSteps(ctx, client)
 	}
 }

--- a/pkg/bdd/testserver/CONFIG_SCHEMA.md
+++ b/pkg/bdd/testserver/CONFIG_SCHEMA.md
@@ -1,504 +0,0 @@
-# BDD Test Configuration Schema
-
-## Overview
-
-This document describes the configuration architecture for BDD tests in the dance-lessons-coach project.
-It establishes a clear hierarchy and flow of configuration parameters to ensure predictable, maintainable,
-and isolated test execution.
-
-## Configuration Sources (Priority Order)
-
-### 1. Explicit Parameters (Highest Priority)
-Passed directly between components with no hidden behavior:
- `FEATURE`: Which feature is being tested (`greet`, `config`, `auth`, `health`, `jwt`)
- `GODOG_TAGS`: Scenario tag filters (e.g., `@v2`, `~@flaky`, `~@todo`)
- `Config` struct: Passed explicitly to server initialization
-
-### 2. Feature-Specific Configuration Files
-Loaded from filesystem when testing specific features:
- Path: `features/{FEATURE}/{FEATURE}-test-config.yaml`
- Used by: Config hot-reload tests only
- Monitored by: `testserver.monitorConfigFile()`
- Example: `features/config/config-test-config.yaml`
-
-### 3. Environment Variables (External Control Only)
-Set by test scripts and CI/CD, **NOT read deep in implementation code**:
-
-| Variable | Purpose | Default | Set By |
-|----------|---------|---------|-------|
-| `DLC_API_V2_ENABLED` | Enable v2 API globally | `false` | Test scripts |
-| `BDD_SCHEMA_ISOLATION` | Enable per-scenario database schema isolation | `false` | Test scripts, validate-test-suite.sh |
-| `BDD_ENABLE_CLEANUP_LOGS` | Enable detailed cleanup logging | `false` | Test scripts |
-| `BDD_TRACE_STATE` | Enable state tracing | `false` | Test scripts |
-| `FIXED_TEST_PORT` | Use fixed port instead of random | `false` | Test scripts |
-| `FEATURE` | Current feature under test | `""` | testsetup.CreateTestSuite |
-| `GODOG_TAGS` | Tag filter for scenario selection | `"~@flaky && ~@todo && ~@skip"` | CreateTestSuite |
-
-### 4. Hardcoded Defaults (Fallback)
-Used when no other source provides a value:
- Port: Random in range 10000-19999 (or 9191 if FIXED_TEST_PORT=true)
- JWT Secret: `test-secret-key-for-bdd-tests`
- Database: localhost:5432, postgres/postgres, dance_lessons_coach
- Logging Level: debug
- v2_enabled: false
-
-## Configuration Layers (Mermaid Diagram)
-
-```mermaid
-flowchart TB
-    subgraph TestExecutionControl["Test Execution Control
-    (Shell/Script Layer)"]
-        A1[Environment Variables]
-        A2[DLC_API_V2_ENABLED]
-        A3[BDD_SCHEMA_ISOLATION]
-        A4[BDD_ENABLE_CLEANUP_LOGS]
-        A5[FEATURE]
-        A6[GODOG_TAGS]
-    end
-
-    subgraph TestSuiteSetup["Test Suite Setup
-    (pkg/bdd/testsetup)"]
-        B1[CreateTestSuite]
-        B2[Set FEATURE]
-        B3[Set GODOG_TAGS]
-        B4[Configure godog.Options]
-    end
-
-    subgraph ServerSetup["Server Setup
-    (pkg/bdd/suite)"]
-        C1[InitializeTestSuite]
-        C2[Create sharedServer]
-        C3[InitializeScenario]
-    end
-
-    subgraph ServerConfiguration["Server Configuration
-    (pkg/bdd/testserver)"]
-        D1[Server.Start]
-        D2[shouldEnableV2]
-        D3[createTestConfig]
-        D4[monitorConfigFile]
-        D5[ReloadConfig]
-        D6[loadConfigFromFile]
-    end
-
-    subgraph ScenarioExecution["Scenario Execution
-    (pkg/bdd/steps)"]
-        E1[BeforeScenario]
-        E2[SetScenarioKey]
-        E3[Execute Steps]
-        E4[AfterScenario]
-        E5[ClearScenarioState]
-    end
-
-    A1 --> B1
-    A2 --> D2
-    A3 --> D1
-    A4 --> D1
-    A5 --> B2
-    A5 --> D2
-    A6 --> B3
-    A6 --> D2
-
-    B1 --> C1
-    B2 --> C1
-    B3 --> C1
-    B4 --> C1
-
-    C1 --> D1
-    C2 --> D1
-    C3 --> E1
-
-    D1 --> D4
-    D2 --> D3
-    D3 --> D1
-    D4 --> D5
-    D5 --> D1
-    D5 --> D6
-    D6 --> D3
-
-    D1 --> E1
-    E1 --> E2
-    E2 --> E3
-    E3 --> E4
-    E4 --> E5
-
-    classDef external fill:#09f,stroke:#333
-    classDef setup fill:#08f,stroke:#333
-    classDef server fill:#090,stroke:#333
-    classDef scenario fill:#000,stroke:#333
-
-    class A1,A2,A3,A4,A5,A6 external
-    class B1,B2,B3,B4 setup
-    class C1,C2,C3 setup
-    class D1,D2,D3,D4,D5,D6 server
-    class E1,E2,E3,E4,E5 scenario
-```
-
-## Configuration Flow (Mermaid Sequence Diagram)
-
-```mermaid
-sequenceDiagram
-    participant Script as Test Script
-    participant TestSetup as testsetup
-    participant Suite as suite.go
-    participant Server as testserver
-    participant ConfigFile as Config File
-    participant Steps as Step Definitions
-
-    Script->>Script: Set env vars (BDD_*, DLC_*)
-    Script->>TestSetup: Run go test ./features/{feature}
-    
-    TestSetup->>TestSetup: Read FEATURE from env
-    TestSetup->>TestSetup: Read GODOG_TAGS from env
-    TestSetup->>Suite: CreateTestSuite(FEATURE, tags)
-    
-    Suite->>Server: InitializeTestSuite -> NewServer()
-    Server->>Server: shouldEnableV2() checks FEATURE+GODOG_TAGS
-    Server->>Server: createTestConfig(port, v2Enabled)
-    Server->>Server: Start()
-    Server->>Server: Start monitorConfigFile() goroutine
-    
-    Suite->>Suite: InitializeScenario
-    Suite->>Steps: Create step context
-    
-    loop Each Scenario
-        Suite->>Server: BeforeScenario: SetupSchemaIsolation
-        Suite->>Steps: SetScenarioKeyForAllSteps
-        Steps->>Steps: Clear scenario state
-        
-        Steps->>Server: Execute step requests
-        
-        alt Config Feature + File Modified
-            ConfigFile->>Server: File modification detected
-            Server->>Server: ReloadConfig()
-            Server->>ConfigFile: loadConfigFromFile()
-            Server->>Server: Restart with new config
-        end
-        
-        Suite->>Server: AfterScenario: Cleanup
-        Suite->>Steps: ClearScenarioState
-    end
-```
-
-## Use Cases
-
-### UC-1: Default Test Run (No v2, No Config File)
-```
-Input:     go test ./features/greet
-FEATURE:   greet
-GODOG_TAGS: ~@flaky && ~@todo && ~@skip
-Config Source: createTestConfig(port)
-v2_enabled: false
-Result: v1 scenarios pass, v2 scenarios skipped by tag filter
-```
-
-### UC-2: v2 API Tests (Split Test Suite)
-```
-Input:     go test ./features/greet (with GODOG_TAGS="@v2" in v2 subtest)
-FEATURE:   greet
-GODOG_TAGS: @v2 && ~@skip
-Config Source: createTestConfig(port) with v2 check
-v2_enabled: true (because FEATURE=greet AND tags contain @v2)
-Result: v2 scenarios execute with v2 API available
-
-Flow:
-1. TestGreetBDD runs v1 subtest with tags="~@v2"
-2. TestGreetBDD runs v2 subtest with tags="@v2"
-3. Each subtest starts its own server
-4. Server in v2 subtest has v2_enabled=true
-5. v2 scenarios pass
-```
-
-### UC-3: Config Hot Reload Tests
-```
-Input:     go test ./features/config
-FEATURE:   config
-GODOG_TAGS: ~@flaky && ~@todo && ~@skip
-Config File: features/config/config-test-config.yaml
-Config Monitor: Watches config file for changes
-
-When config file is modified:
-1. monitorConfigFile() detects file change via mod time
-2. Calls ReloadConfig()
-3. ReloadConfig() for FEATURE=config: loads from config file
-4. Server restarts with new config
-5. Subsequent scenarios see new configuration
-
-Note: This is the ONLY feature that uses config file hot-reload.
-      All other features use hardcoded/test defaults.
-```
-
-### UC-4: Config Hot Reload with v2 Enable
-```
-Scenario: Hot reloading feature flags
-Steps:
-1. Server starts with default config (v2_enabled: false)
-2. Test sets v2_enabled: true in config file
-3. Config monitor detects change
-4. ReloadConfig() called
-5. Server loads from config file (NOT createTestConfig)
-6. Server restarts with v2_enabled: true
-7. Test verifies v2 API works
-
-Current Bug: ReloadConfig() calls createTestConfig() which:
- Reads FEATURE=config
- Reads GODOG_TAGS (doesn't contain @v2)
- Sets v2_enabled: false
- Overrides the config file setting!
-
-Fix: ReloadConfig() must load from file for config feature.
-```
-
-## Implementation Details
-
-### Config Creation Flow
-
-```go
-// pkg/bdd/testserver/server.go
-
-func NewServer() *Server {
-    port := getRandomPort() // 10000-19999
-    return &Server{port: port}
-}
-
-func (s *Server) Start() error {
-    cfg := createTestConfig(s.port)
-    // ... start server with cfg
-    go s.monitorConfigFile()
-}
-
-// CURRENT - BAD
-func createTestConfig(port int) *config.Config {
-    feature := os.Getenv("FEATURE")
-    tags := os.Getenv("GODOG_TAGS")
-    
-    enableV2 := false
-    if feature == "greet" && strings.Contains(tags, "@v2") {
-        enableV2 = true
-    }
-    // ...
-    return &config.Config{
-        API: config.APIConfig{V2Enabled: enableV2},
-        // ...
-    }
-}
-
-// PROPOSED - GOOD
-func createTestConfig(port int, opts ConfigOptions) *config.Config {
-    defaults := &config.Config{
-        Server: config.ServerConfig{Host: "0.0.0.0", Port: port},
-        // ... all hardcoded defaults
-    }
-    
-    // Apply explicit options (passed from caller)
-    if opts.V2Enabled {
-        defaults.API.V2Enabled = true
-    }
-    
-    return defaults
-}
-
-// ConfigOptions passed from testsuite
-type ConfigOptions struct {
-    V2Enabled      bool
-    UseConfigFile  bool
-    ConfigFilePath string
-}
-```
-
-### Reload Flow Fix
-
-```go
-// pkg/bdd/testserver/server.go
-
-func (s *Server) ReloadConfig() error {
-    feature := os.Getenv("FEATURE")
-    
-    if feature == "config" && s.configFilePath != "" {
-        // For config tests: load from monitored file
-        cfg, err := loadConfigFromFile(s.configFilePath)
-        if err != nil {
-            return err
-        }
-        return s.applyConfig(cfg)
-    }
-    
-    // For all other features: use defaults
-    // (hot reload not supported for non-config features)
-    cfg := createDefaultConfig(s.port)
-    return s.applyConfig(cfg)
-}
-
-func loadConfigFromFile(path string) (*config.Config, error) {
-    v := viper.New()
-    v.SetConfigFile(path)
-    v.SetConfigType("yaml")
-    
-    if err := v.ReadInConfig(); err != nil {
-        return nil, err
-    }
-    
-    var cfg config.Config
-    if err := v.Unmarshal(&cfg); err != nil {
-        return nil, err
-    }
-    
-    // Apply hardcoded values that should NOT come from file
-    // (database connection for BDD tests, etc.)
-    cfg.Database.Host = getDatabaseHost()
-    cfg.Database.Port = getDatabasePort()
-    cfg.Database.User = "postgres"
-    cfg.Database.Password = "postgres"
-    cfg.Database.Name = "dance_lessons_coach"
-    
-    return &cfg, nil
-}
-```
-
-## Configuration File Format
-
-### Config Test File (features/config/config-test-config.yaml)
-```yaml
-server:
-  host: "127.0.0.1"
-  port: 9191
-
-logging:
-  level: "info"
-  json: false
-
-api:
-  v2_enabled: false  # Will be toggled by tests
-
-telemetry:
-  enabled: true
-  sampler:
-    type: "parentbased_always_on"
-    ratio: 1.0
-
-auth:
-  jwt:
-    ttl: 1h
-
-database:
-  # These are OVERRIDDEN by BDD test infrastructure
-  host: "localhost"
-  port: 5432
-  user: "postgres"
-  password: "postgres"
-  name: "dance_lessons_coach_bdd_test"
-  ssl_mode: "disable"
-```
-
-## State Isolation
-
-### Per-Scenario State
- Managed by: `pkg/bdd/steps/scenario_state.go`
- Key: SHA256 hash of scenario URI + name
- State includes: LastToken, FirstToken, LastUserID, LastSecret, LastError
- Cleared: At start of each scenario in BeforeScenario hook
-
-### Database Schema Isolation
- Enabled by: `BDD_SCHEMA_ISOLATION=true`
- Mechanism: Creates unique schema per scenario
- Schema name: `test_{sha256(scenarioKey)[:8]}`
- Search path: Set via `SET search_path TO ...`
- Cleanup: Schema dropped after scenario
-
-### Server-Level State Reset
- JWT secrets: Reset after every scenario via `ResetJWTSecrets()`
- Database: Cleaned up after every scenario
- Auth state: Per-scenario via state manager
-
-## Package Responsibilities
-
-### pkg/bdd/testserver
- **Purpose**: Test HTTP server management
- **Responsibilities**:
-  - Server lifecycle (Start, Stop)
-  - Configuration loading and reloading
-  - Database cleanup
-  - Schema isolation
-  - JWT secret management
-  - Config file monitoring (config feature only)
-
-### pkg/bdd/testsetup  
- **Purpose**: Godog test suite setup
- **Responsibilities**:
-  - Feature test file discovery
-  - Test suite configuration
-  - Tag filtering
-  - godog options setup
-
-### pkg/bdd/suite
- **Purpose**: Test suite initialization hooks
- **Responsibilities**:
-  - BeforeSuite/AfterSuite hooks
-  - BeforeScenario/AfterScenario hooks
-  - Step context creation
-  - State isolation setup
-
-### pkg/bdd/steps
- **Purpose**: Step definitions
- **Responsibilities**:
-  - All Gherkin step implementations
-  - Per-scenario state management
-  - Per-feature step organization
-
-## Migration Plan
-
-### Phase 1: Fix Config Reload (Urgent)
-1. Create `loadConfigFromFile()` function
-2. Modify `ReloadConfig()` to use file for config feature
-3. Add tests to verify config hot-reload works
-
-### Phase 2: Clean Up Config Creation
-1. Create `ConfigOptions` struct
-2. Modify `createTestConfig()` to accept options
-3. Update callers to pass explicit options
-4. Remove env var reading from deep in config creation
-
-### Phase 3: Document and Validate
-1. Write comprehensive documentation (this file)
-2. Add validation tests for all use cases
-3. Create troubleshooting guide
-
-### Phase 4: Consider Package Merge (Optional)
-1. Evaluate merging testserver + testsetup
-2. Design new `pkg/bdd/testing` package structure
-3. Migrate code incrementally
-
-## Rules for Adding New Configuration
-
-1. **Prefer explicit parameters** over environment variables
-2. **Read env vars at ONE layer only** (typically test entry point)
-3. **Document all config sources** in this file
-4. **Test config combinations** to prevent override bugs
-5. **Never read env vars in hot paths** (scenario steps, server handlers)
-
-## Troubleshooting
-
-### Symptom: Config file changes not applied
- Check: Is FEATURE=config?
- Check: Does config file exist at `features/config/config-test-config.yaml`?
- Check: Does monitorConfigFile() detect the change?
- Fix: ReloadConfig() must load from file, not createTestConfig()
-
-### Symptom: v2 tests fail with 404
- Check: Is FEATURE=greet?
- Check: Does GODOG_TAGS contain @v2?
- Check: Does createTestConfig() see the tags?
- Fix: Ensure tags are set before server creation
-
-### Symptom: State pollution between scenarios
- Check: Is schema isolation enabled?
- Check: Are step definitions using per-scenario state?
- Fix: Use ScenarioState for all mutable state
-
-## References
-
- [Godog Documentation](https://github.com/cucumber/godog)
- [pkg/config/config.go](../config/config.go) - Config struct definitions
- [pkg/bdd/testsetup/testsetup.go](../testsetup/testsetup.go) - Test suite creation
- [pkg/bdd/suite.go](../suite.go) - Test hooks
- [ADR-0008: BDD Testing](../adr/0008-bdd-testing.md)
--- a/pkg/bdd/testserver/STATE_TRACER_README.md
+++ b/pkg/bdd/testserver/STATE_TRACER_README.md
@@ -1,241 +0,0 @@
-# BDD State Tracer
-
-## Overview
-
-The BDD State Tracer is a debugging tool that logs scenario execution, database operations, and state modifications to a file in `$TMPDIR` for analysis of test execution order and state pollution issues.
-
-## Purpose
-
-### Why Tracing Was Added
-
-During multi-iteration BDD test runs with `./scripts/validate-test-suite.sh`, intermittent failures occurred that were difficult to diagnose:
- Tests passed when run individually
- Tests failed when run together in the validation script
- Patterns suggested database state pollution between scenarios across different feature packages
-
-The tracer was created to answer key questions:
-1. **Execution Order**: Which scenarios run in which order?
-2. **State Modifications**: What database writes/cleanups occur and when?
-3. **Overlap Detection**: Are scenarios running in parallel (causing race conditions)?
-4. **Isolation Verification**: Is schema isolation working as expected?
-
-### Key Findings from Tracing
-
-1. **Sequential Execution**: Each feature package runs in a separate process (separate PIDs), but scenarios within each feature run sequentially
-2. **Shared Database**: All processes share the same PostgreSQL database connection
-3. **Schema Isolation Status**: When `BDD_SCHEMA_ISOLATION=false` (default in validate script), all scenarios share the `public` schema
-4. **Cleanup Operations**: Database cleanup (`CleanupDatabase`) runs after each scenario, deleting all test data from all tables
-5. **In-Memory State**: JWT secrets are stored in-memory only, not in database - schema isolation doesn't prevent JWT secret pollution
-
-### Example Trace Output
-
-```
-2026-04-11T10:10:53.032156 | auth            | User registration               | SCENARIO_START   | 
-2026-04-11T10:10:53.146438 | auth            | User registration               | SCENARIO_END     | PASSED
-2026-04-11T10:10:53.152398 | auth            | User registration               | JWT_RESET        | ok
-2026-04-11T10:10:53.162357 | auth            | Failed authentication          | SCENARIO_START   | 
-2026-04-11T10:10:53.268273 | auth            | Failed authentication          | SCENARIO_END     | PASSED
-```
-
-## Usage
-
-### Enable Tracing
-
-Set the environment variable `BDD_TRACE_STATE=1` before running tests:
-
-```bash
-# Single run with tracing
-BDD_TRACE_STATE=1 go test ./features/auth -v
-
-# Validation script with tracing
-BDD_TRACE_STATE=1 ./scripts/validate-test-suite.sh 1
-
-# Multiple runs with tracing
-BDD_TRACE_STATE=1 ./scripts/validate-test-suite.sh 5
-```
-
-### Trace File Location
-
-Trace files are written to `$TMPDIR` (typically `/var/folders/.../T/` on macOS or `/tmp` on Linux):
-
-```bash
-# Find trace files
-ls -la $TMPDIR/bdd-state-trace-*.log
-
-# View a trace file
-cat $TMPDIR/bdd-state-trace-20260411-101053-12345.log
-```
-
-### Trace File Format
-
-```
-TIMESTAMP | FEATURE          | SCENARIO                              | ACTION           | DETAILS
-2026-04-11T10:10:53.032156 | auth            | User registration               | SCENARIO_START   | 
-2026-04-11T10:10:53.146438 | auth            | User registration               | SCENARIO_END     | PASSED
-2026-04-11T10:10:53.152398 | auth            | User registration               | JWT_RESET        | ok
-2026-04-11T10:10:53.162357 | auth            | User registration               | DB_CLEANUP       | all_tables
-```
-
-**Columns:**
- `TIMESTAMP`: ISO 8601 format with microseconds
- `FEATURE`: Feature name from `FEATURE` environment variable
- `SCENARIO`: Scenario name (includes URI for disambiguation)
- `ACTION`: Type of action (see below)
- `DETAILS`: Additional context
-
-**Action Types:**
- `SCENARIO_START` - Scenario execution begins
- `SCENARIO_END` - Scenario execution completes (PASSED or FAILED)
- `DB_CLEANUP` - Database cleanup operation
- `DB_SELECT` - Database read operation
- `JWT_RESET` - JWT secrets reset to initial state
- `DB_INSERT/UPDATE/DELETE` - Database write operations (future)
- `SCHEMA_*` - Schema isolation operations (future)
- `TX_*` - Transaction boundary operations (future)
-
-## Implementation
-
-### Architecture
-
-The state tracer uses a simple file-based approach:
-
-1. **Per-Process Tracing**: Each `go test` process creates its own trace file with unique filename based on timestamp and PID
-2. **Immediate Flush**: Each trace line is flushed immediately to disk using `Sync()` to prevent data loss
-3. **No Dependencies**: Uses only standard library (`os`, `fmt`, `time`, `path/filepath`)
-4. **Singleton Pattern**: Package-level functions for easy usage across the codebase
-
-### Files
-
- `pkg/bdd/testserver/state_tracer.go` - Core tracing functions
- `pkg/bdd/suite.go` - Integration with godog Before/After scenario hooks
-
-### Key Functions
-
-```go
-// Package-level functions (called from anywhere)
-TraceStateScenarioStart(feature, scenario string)
-TraceStateScenarioEnd(feature, scenario string, err error)
-TraceStateDBCleanup(feature, scenario, table string)
-TraceStateJWTSecretOperation(feature, scenario, operation, details string)
-TraceStateSchemaIsolation(feature, scenario, operation, details string)
-TraceStateTransaction(feature, scenario, action, details string)
-TraceStateDBRead(feature, scenario, table, details string)
-```
-
-## Limitations
-
-### Current Limitations
-
-1. **Per-Process Files**: Each `go test` process creates its own file, making correlation across processes manual
-2. **No Database Write Tracing**: Currently only traces cleanup, not individual INSERT/UPDATE/DELETE operations
-3. **No API Call Tracing**: Doesn't trace HTTP requests made during scenarios
-4. **No Timing Analysis**: Doesn't measure duration between operations automatically
-5. **No Schema Name in Trace**: When schema isolation is enabled, doesn't show which schema is active
-6. **File Rotation**: No automatic cleanup of old trace files
-
-### Known Issues
-
-1. **PID-based filenames**: If multiple runs happen in the same second, filenames could collide
-2. **Large file sizes**: High-volume tracing could create large files (mitigated by per-run files)
-3. **No header/footer**: Trace files start immediately with data, no metadata about the run
-
-## Future Enhancements
-
-### Priority 1: Process Correlation
- Add a unique run ID that can be passed across all processes
- Include process start/end markers to show process lifecycle
- Add parent PID tracking to show process hierarchy
-
-### Priority 2: Database Operation Tracing
- Add tracing for all database writes (INSERT, UPDATE, DELETE)
- Include query text and affected rows
- Trace transaction boundaries with IDs
- Add schema name to all database operations when isolation is enabled
-
-### Priority 3: API Call Tracing
- Trace all HTTP requests made during scenarios
- Include request method, path, status code, and duration
- Mark requests that modify state (POST, PUT, DELETE vs GET)
-
-### Priority 4: Analysis Tools
- Create a `bdd-trace-analyzer` tool to:
-  - Merge trace files from all processes in correct order
-  - Detect overlapping scenarios (parallel execution)
-  - Identify database state pollution patterns
-  - Generate visualization of scenario execution timeline
-  - Flag potential race conditions
-
-### Priority 5: Improved Output
- Add trace file header with metadata (run ID, start time, config, etc.)
- Color-coded output for different action types
- JSON output option for programmatic analysis
- Trace level filtering (DEBUG, INFO, WARN, ERROR)
-
-### Priority 6: Performance Optimization
- Batch writes instead of per-line flush (with configurable flush interval)
- Compress old trace files
- Automatic cleanup of old files
-
-## Analysis Use Cases
-
-### Detecting State Pollution
-
-Look for patterns like:
-```
-PID 1234 | auth | Scenario A | DB_CLEANUP | all_tables
-PID 5678 | greet | Scenario B | SCENARIO_START |
-# ^ Scenario B starts AFTER auth cleanup - potential issue
-```
-
-### Detecting Parallel Execution
-
-Check if timestamps overlap:
-```
-PID 1234 | 10:10:53.032 | auth | Scenario A | SCENARIO_START
-PID 5678 | 10:10:53.035 | greet | Scenario B | SCENARIO_START
-# ^ Both started within 3ms - likely parallel
-```
-
-### Verifying Schema Isolation
-
-Check that each scenario gets its own schema:
-```
-PID 1234 | auth | Scenario A | SCHEMA_CREATE | test_a1b2c3d4
-PID 1234 | auth | Scenario B | SCHEMA_CREATE | test_e5f6g7h8
-# ^ Different schemas for different scenarios - good
-```
-
-## Troubleshooting
-
-### Tracing Not Working
-
-1. Verify `BDD_TRACE_STATE=1` is set:
-   ```bash
-   echo $BDD_TRACE_STATE
-   ```
-2. Check if trace files are being created:
-   ```bash
-   ls -la $TMPDIR/bdd-state-trace-*.log
-   ```
-3. Verify the `testserver` package is being used (tracing is integrated there)
-
-### No Trace Files Found
-
- Tracing only works when `BDD_TRACE_STATE=1` is set before the test process starts
- Each `go test` process creates its own file - if tests pass quickly, files may be short
- Files are created in `$TMPDIR` which defaults to `/tmp` on Linux and a temp folder on macOS
-
-### Trace Files Too Large
-
- Tracing every operation can generate large files
- Consider filtering to specific scenarios:
-  ```bash
-  # Run only failing scenarios with tracing
-  BDD_TRACE_STATE=1 go test ./features/auth -v -run "TestAuthBDD/Password_reset"
-  ```
-
-## Related Files
-
- `pkg/bdd/suite.go` - Godog test suite initialization with tracing hooks
- `pkg/bdd/testserver/server.go` - Test server with tracing integration
- `scripts/validate-test-suite.sh` - Test validation script
--- a/pkg/bdd/testserver/config_test.go
+++ b/pkg/bdd/testserver/config_test.go
@@ -10,26 +10,76 @@ import (
 func TestCreateTestConfig(t *testing.T) {
 	// Test 1: Default config (no test config file)
 	t.Run("DefaultConfig", func(t *testing.T) {
-		cfg := createTestConfig(9999, false)
+		cfg := createTestConfig(9999)

-		expectedDatabaseName := os.Getenv("DLC_DATABASE_NAME")
-		if expectedDatabaseName == "" {
-			expectedDatabaseName = "dance_lessons_coach"
-		}
-
-		assert.Equal(t, "0.0.0.0", cfg.Server.Host)
+		assert.Equal(t, "localhost", cfg.Server.Host)
 		assert.Equal(t, 9999, cfg.Server.Port)
-		assert.Equal(t, "test-secret-key-for-bdd-tests", cfg.Auth.JWTSecret)
+		assert.Equal(t, true, cfg.API.V2Enabled, "v2 should be enabled by default")
+		assert.Equal(t, "default-secret-key-please-change-in-production", cfg.Auth.JWTSecret)
 		assert.Equal(t, "admin123", cfg.Auth.AdminMasterPassword)
-		assert.Equal(t, expectedDatabaseName, cfg.Database.Name)
+		assert.Equal(t, "dance_lessons_coach_bdd_test", cfg.Database.Name)
 	})

-	// Test 2: Config with v2 enabled
-	t.Run("V2EnabledConfig", func(t *testing.T) {
-		cfg := createTestConfig(9999, true)
+	// Test 2: Config with environment variable override should NOT affect test config
+	t.Run("EnvironmentVariableIsolation", func(t *testing.T) {
+		// Set environment variables that would normally override config
+		os.Setenv("DLC_API_V2_ENABLED", "false")
+		os.Setenv("DLC_AUTH_JWT_SECRET", "env-secret")
+		defer func() {
+			os.Unsetenv("DLC_API_V2_ENABLED")
+			os.Unsetenv("DLC_AUTH_JWT_SECRET")
+		}()

-		assert.Equal(t, "0.0.0.0", cfg.Server.Host)
-		assert.Equal(t, 9999, cfg.Server.Port)
-		assert.True(t, cfg.API.V2Enabled)
+		cfg := createTestConfig(8888)
+
+		// These should NOT be affected by environment variables
+		assert.Equal(t, true, cfg.API.V2Enabled, "v2 should still be enabled despite env var")
+		assert.Equal(t, "default-secret-key-please-change-in-production", cfg.Auth.JWTSecret, "should use default secret, not env var")
+	})
+
+	// Test 3: Test config file loading
+	t.Run("TestConfigFileLoading", func(t *testing.T) {
+		// Create a temporary test config file
+		testConfig := `server:
+  host: testhost
+  port: 1234
+api:
+  v2_enabled: false
+auth:
+  jwt_secret: test-secret
+  admin_master_password: test-admin
+database:
+  name: test_db
+`
+
+		tempFile := "test-config-test.yaml"
+		if err := os.WriteFile(tempFile, []byte(testConfig), 0644); err != nil {
+			t.Fatal("Failed to create test config file:", err)
+		}
+		defer os.Remove(tempFile)
+
+		// Set FEATURE env to trigger config file loading
+		os.Setenv("FEATURE", "test")
+		defer os.Unsetenv("FEATURE")
+
+		// Create a feature-specific config file that points to our test file
+		featureConfigDir := "features/test"
+		os.MkdirAll(featureConfigDir, 0755)
+		defer os.RemoveAll(featureConfigDir)
+
+		if err := os.Symlink("../../"+tempFile, featureConfigDir+"/test-test-config.yaml"); err != nil {
+			t.Fatal("Failed to create symlink:", err)
+		}
+		defer os.Remove(featureConfigDir + "/test-test-config.yaml")
+
+		cfg := createTestConfig(7777) // This port should override config file port
+
+		// Values from config file should be used, except port which is overridden by parameter
+		assert.Equal(t, "testhost", cfg.Server.Host)
+		assert.Equal(t, 7777, cfg.Server.Port, "parameter port should override config file port")
+		assert.Equal(t, false, cfg.API.V2Enabled, "v2_enabled from config file should be used")
+		assert.Equal(t, "test-secret", cfg.Auth.JWTSecret, "jwt_secret from config file should be used")
+		assert.Equal(t, "test-admin", cfg.Auth.AdminMasterPassword, "admin_master_password from config file should be used")
+		assert.Equal(t, "test_db", cfg.Database.Name, "database name from config file should be used")
 	})
 }
--- a/pkg/bdd/testserver/server.go
+++ b/pkg/bdd/testserver/server.go
@@ -2,17 +2,13 @@ package testserver

 import (
 	"context"
-	"crypto/sha256"
 	"database/sql"
-	"encoding/hex"
 	"fmt"
 	"math/rand"
 	"net/http"
 	"os"
-
 	"strconv"
 	"strings"
-	"sync"
 	"time"

 	"dance-lessons-coach/pkg/config"
@@ -24,35 +20,6 @@ import (
 	"github.com/spf13/viper"
 )

-// isCleanupLoggingEnabled returns true if BDD_ENABLE_CLEANUP_LOGS environment variable is set to "true"
-func isCleanupLoggingEnabled() bool {
-	return os.Getenv("BDD_ENABLE_CLEANUP_LOGS") == "true"
-}
-
-// isSchemaIsolationEnabled returns true if BDD_SCHEMA_ISOLATION environment variable is set to "true"
-func isSchemaIsolationEnabled() bool {
-	return os.Getenv("BDD_SCHEMA_ISOLATION") == "true"
-}
-
-// generateSchemaName creates a unique schema name for a scenario
-// Format: test_{sha256(feature_scenario)[:8]}
-func generateSchemaName(feature, scenario string) string {
-	hash := sha256.Sum256([]byte(feature + ":" + scenario))
-	hashStr := hex.EncodeToString(hash[:])
-	return "test_" + hashStr[:8]
-}
-
-type Server struct {
-	httpServer         *http.Server
-	port               int
-	baseURL            string
-	db                 *sql.DB
-	authService        user.AuthService // Reference to auth service for cleanup
-	schemaMutex        sync.Mutex       // Protects schema operations
-	currentSchema      string           // Current schema being used
-	originalSearchPath string           // Original search_path to restore
-}
-
 // getDatabaseHost returns the database host from environment variable or defaults to localhost
 func getDatabaseHost() string {
 	host := os.Getenv("DLC_DATABASE_HOST")
@@ -73,22 +40,12 @@ func getDatabasePort() int {
 	return port
 }

-// getDatabaseName returns the database name from environment variable or defaults to dance_lessons_coach
-func getDatabaseName() string {
-	name := os.Getenv("DLC_DATABASE_NAME")
-	if name == "" {
-		return "dance_lessons_coach"
-	}
-	return name
-}
-
-// getDatabaseSSLMode returns the SSL mode from environment variable or defaults to disable
-func getDatabaseSSLMode() string {
-	sslMode := os.Getenv("DLC_DATABASE_SSL_MODE")
-	if sslMode == "" {
-		return "disable"
-	}
-	return sslMode
+type Server struct {
+	httpServer  *http.Server
+	port        int
+	baseURL     string
+	db          *sql.DB
+	authService user.AuthService // Reference to auth service for cleanup
 }

 func init() {
@@ -133,21 +90,15 @@ func NewServer() *Server {
 	}

 	return &Server{
-		port:               port,
-		currentSchema:      "public",
-		originalSearchPath: "public",
+		port: port,
 	}
 }

 func (s *Server) Start() error {
 	s.baseURL = fmt.Sprintf("http://localhost:%d", s.port)

-	// Determine if v2 should be enabled based on feature and tags
-	// This is the ONLY place where we check env vars for v2 configuration
-	v2Enabled := s.shouldEnableV2()
-
 	// Create real server instance from pkg/server
-	cfg := createTestConfig(s.port, v2Enabled)
+	cfg := createTestConfig(s.port)
 	realServer := server.NewServer(cfg, context.Background())

 	// Store auth service for cleanup
@@ -251,24 +202,9 @@ func (s *Server) ReloadConfig() error {
 		}
 	}

-	// Recreate server with new config from file
-	// This is the ONLY feature that uses config file hot-reload
-	feature := os.Getenv("FEATURE")
-
-	var realServer *server.Server
-	if feature == "config" {
-		// For config feature: load config from the monitored file
-		cfg, err := s.loadConfigFromFile()
-		if err != nil {
-			log.Warn().Err(err).Msg("Failed to load config from file, using defaults")
-			cfg = createTestConfig(s.port, false)
-		}
-		realServer = server.NewServer(cfg, context.Background())
-	} else {
-		// For other features: use defaults with v2 check
-		cfg := createTestConfig(s.port, s.shouldEnableV2())
-		realServer = server.NewServer(cfg, context.Background())
-	}
+	// Recreate server with new config
+	cfg := createTestConfig(s.port)
+	realServer := server.NewServer(cfg, context.Background())
 	s.httpServer = &http.Server{
 		Addr:    fmt.Sprintf(":%d", s.port),
 		Handler: realServer.Router(),
@@ -287,54 +223,6 @@ func (s *Server) ReloadConfig() error {
 	return s.waitForServerReady()
 }

-// loadConfigFromFile loads configuration from the monitored config file
-// Used for config feature hot-reload tests only
-func (s *Server) loadConfigFromFile() (*config.Config, error) {
-	feature := os.Getenv("FEATURE")
-	if feature == "" {
-		return nil, fmt.Errorf("FEATURE not set")
-	}
-
-	configPath := fmt.Sprintf("features/%s/%s-test-config.yaml", feature, feature)
-
-	v := viper.New()
-	v.SetConfigFile(configPath)
-	v.SetConfigType("yaml")
-
-	if err := v.ReadInConfig(); err != nil {
-		return nil, fmt.Errorf("failed to read config file %s: %w", configPath, err)
-	}
-
-	var cfg config.Config
-	if err := v.Unmarshal(&cfg); err != nil {
-		return nil, fmt.Errorf("failed to unmarshal config from %s: %w", configPath, err)
-	}
-
-	// Apply BDD test infrastructure defaults that should NOT come from config file
-	// These are specific to the test environment
-	cfg.Database.Host = getDatabaseHost()
-	cfg.Database.Port = getDatabasePort()
-	cfg.Database.User = "postgres"
-	cfg.Database.Password = "postgres"
-	cfg.Database.Name = getDatabaseName()
-	cfg.Database.SSLMode = getDatabaseSSLMode()
-
-	// Ensure auth defaults
-	if cfg.Auth.JWTSecret == "" {
-		cfg.Auth.JWTSecret = "test-secret-key-for-bdd-tests"
-	}
-	if cfg.Auth.AdminMasterPassword == "" {
-		cfg.Auth.AdminMasterPassword = "admin123"
-	}
-
-	// Ensure logging default
-	if cfg.Logging.Level == "" {
-		cfg.Logging.Level = "debug"
-	}
-
-	return &cfg, nil
-}
-
 // initDBConnection initializes a direct database connection for cleanup operations
 func (s *Server) initDBConnection() error {
 	// Get feature-specific configuration
@@ -345,18 +233,29 @@ func (s *Server) initDBConnection() error {
 		// Try to load feature-specific config
 		configPath := fmt.Sprintf("features/%s/%s-test-config.yaml", feature, feature)
 		if _, err := os.Stat(configPath); err == nil {
-			var loadErr error
-			cfg, loadErr = s.loadConfigFromFile()
-			if loadErr != nil {
-				log.Warn().Err(loadErr).Str("path", configPath).Msg("Failed to load config, using defaults")
-				cfg = nil
+			v := viper.New()
+			v.SetConfigFile(configPath)
+			v.SetConfigType("yaml")
+
+			if readErr := v.ReadInConfig(); readErr == nil {
+				var featureCfg config.Config
+				if unmarshalErr := v.Unmarshal(&featureCfg); unmarshalErr == nil {
+					// Set default values if not configured
+					if featureCfg.Auth.JWTSecret == "" {
+						featureCfg.Auth.JWTSecret = "default-secret-key-please-change-in-production"
+					}
+					if featureCfg.Auth.AdminMasterPassword == "" {
+						featureCfg.Auth.AdminMasterPassword = "admin123"
+					}
+					cfg = &featureCfg
+				}
 			}
 		}
 	}

 	// Fallback to default config if feature-specific not available
 	if cfg == nil {
-		cfg = createTestConfig(s.port, s.shouldEnableV2())
+		cfg = createTestConfig(s.port)
 	}

 	dsn := fmt.Sprintf(
@@ -396,16 +295,12 @@ func (s *Server) initDBConnection() error {
 // This prevents JWT secret pollution between tests
 func (s *Server) ResetJWTSecrets() error {
 	if s.authService == nil {
-		if isCleanupLoggingEnabled() {
-			log.Info().Msg("CLEANUP: No auth service available, skipping JWT secrets reset")
-		}
+		log.Debug().Msg("No auth service available, skipping JWT secrets reset")
 		return nil
 	}

 	s.authService.ResetJWTSecrets()
-	if isCleanupLoggingEnabled() {
-		log.Info().Msg("CLEANUP: JWT secrets reset to initial state")
-	}
+	log.Trace().Msg("JWT secrets reset to initial state")
 	return nil
 }

@@ -414,17 +309,10 @@ func (s *Server) ResetJWTSecrets() error {
 // Uses SET CONSTRAINTS ALL DEFERRED to temporarily disable foreign key checks
 func (s *Server) CleanupDatabase() error {
 	if s.db == nil {
-		if isCleanupLoggingEnabled() {
-			log.Info().Msg("CLEANUP: No database connection, skipping cleanup")
-		}
+		log.Debug().Msg("No database connection, skipping cleanup")
 		return nil // No database connection, skip cleanup
 	}

-	// Log database state before cleanup
-	if isCleanupLoggingEnabled() {
-		log.Info().Msg("CLEANUP: Starting database cleanup")
-	}
-
 	// Start a transaction for atomic cleanup
 	tx, err := s.db.Begin()
 	if err != nil {
@@ -520,187 +408,150 @@ func (s *Server) CleanupDatabase() error {
 		return fmt.Errorf("failed to commit cleanup transaction: %w", err)
 	}

-	if isCleanupLoggingEnabled() {
-		log.Info().Msg("CLEANUP: Database cleanup completed successfully")
+	log.Debug().Msg("Database cleanup completed successfully")
+	return nil
+}
+
+// CloseDatabase closes the database connection
+func (s *Server) CloseDatabase() error {
+	if s.db != nil {
+		return s.db.Close()
 	}
 	return nil
 }

-// SetupScenarioSchema creates and activates a unique schema for the scenario
-func (s *Server) SetupScenarioSchema(feature, scenario string) error {
-	if !isSchemaIsolationEnabled() {
-		if isCleanupLoggingEnabled() {
-			log.Info().Str("feature", feature).Str("scenario", scenario).Msg("ISOLATION: Schema isolation disabled, using public schema")
+func (s *Server) waitForServerReady() error {
+	maxAttempts := 30
+	attempt := 0
+
+	for attempt < maxAttempts {
+		resp, err := http.Get(fmt.Sprintf("%s/api/ready", s.baseURL))
+		if err == nil && resp.StatusCode == http.StatusOK {
+			resp.Body.Close()
+			return nil
 		}
-		return nil
-	}
-
-	schemaName := generateSchemaName(feature, scenario)
-	s.schemaMutex.Lock()
-	defer s.schemaMutex.Unlock()
-
-	// Store original search path if not already stored
-	if s.originalSearchPath == "" {
-		var err error
-		s.originalSearchPath, err = s.getCurrentSearchPath()
-		if err != nil {
-			log.Warn().Err(err).Msg("ISOLATION: Failed to get current search_path")
-			s.originalSearchPath = "public"
+		if resp != nil {
+			resp.Body.Close()
 		}
+		attempt++
+		time.Sleep(100 * time.Millisecond)
 	}

-	// Create the schema
-	createSQL := fmt.Sprintf("CREATE SCHEMA IF NOT EXISTS %s", schemaName)
-	if _, err := s.db.Exec(createSQL); err != nil {
-		return fmt.Errorf("failed to create schema %s: %w", schemaName, err)
-	}
-
-	// Set search path to use the new schema
-	searchPathSQL := fmt.Sprintf("SET search_path = %s, %s", schemaName, s.originalSearchPath)
-	if _, err := s.db.Exec(searchPathSQL); err != nil {
-		return fmt.Errorf("failed to set search_path: %w", err)
-	}
-
-	s.currentSchema = schemaName
-
-	if isCleanupLoggingEnabled() {
-		log.Info().Str("feature", feature).Str("scenario", scenario).Str("schema", schemaName).Msg("ISOLATION: Created and activated schema")
-	}
-
-	return nil
-}
-
-// TeardownScenarioSchema drops the scenario's schema and restores search path
-func (s *Server) TeardownScenarioSchema() error {
-	if !isSchemaIsolationEnabled() {
-		return nil
-	}
-
-	s.schemaMutex.Lock()
-	defer s.schemaMutex.Unlock()
-
-	if s.currentSchema == "" || s.currentSchema == "public" {
-		if isCleanupLoggingEnabled() {
-			log.Info().Msg("ISOLATION: No custom schema to teardown")
-		}
-		return nil
-	}
-
-	schemaName := s.currentSchema
-
-	// Restore original search path
-	restoreSQL := fmt.Sprintf("SET search_path = %s", s.originalSearchPath)
-	if _, err := s.db.Exec(restoreSQL); err != nil {
-		log.Warn().Err(err).Str("original", s.originalSearchPath).Msg("ISOLATION: Failed to restore search_path")
-	}
-
-	// Drop the schema - CASCADE ensures dependent objects are also dropped
-	dropSQL := fmt.Sprintf("DROP SCHEMA IF EXISTS %s CASCADE", schemaName)
-	if _, err := s.db.Exec(dropSQL); err != nil {
-		return fmt.Errorf("failed to drop schema %s: %w", schemaName, err)
-	}
-
-	s.currentSchema = ""
-
-	if isCleanupLoggingEnabled() {
-		log.Info().Str("schema", schemaName).Msg("ISOLATION: Dropped schema")
-	}
-
-	return nil
-}
-
-// getCurrentSearchPath retrieves the current search_path setting
-func (s *Server) getCurrentSearchPath() (string, error) {
-	var searchPath string
-	err := s.db.QueryRow("SHOW search_path").Scan(&searchPath)
-	return searchPath, err
+	return fmt.Errorf("server did not become ready after %d attempts", maxAttempts)
 }

 func (s *Server) Stop() error {
-	if s.httpServer != nil {
-		ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
-		defer cancel()
-		return s.httpServer.Shutdown(ctx)
+	if s.httpServer == nil {
+		return nil
 	}
-	return nil
+
+	// Shutdown HTTP server gracefully
+	ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
+	defer cancel()
+
+	return s.httpServer.Shutdown(ctx)
 }

 func (s *Server) GetBaseURL() string {
 	return s.baseURL
 }

-func (s *Server) GetPort() int {
-	return s.port
-}
+func createTestConfig(port int) *config.Config {
+	// Check for feature-specific config file first
+	// This supports the new modular BDD test structure
+	feature := os.Getenv("FEATURE")
+	var configPaths []string

-// waitForServerReady waits for the server to be ready
-func (s *Server) waitForServerReady() error {
-	ctx, cancel := context.WithTimeout(context.Background(), 10*time.Second)
-	defer cancel()
+	if feature != "" {
+		// Feature-specific config takes precedence
+		configPaths = []string{
+			fmt.Sprintf("features/%s/%s-test-config.yaml", feature, feature),
+			"test-config.yaml", // Fallback to legacy config
+		}
+	} else {
+		// When running all features, use legacy config
+		configPaths = []string{"test-config.yaml"}
+	}

-	ticker := time.NewTicker(100 * time.Millisecond)
-	defer ticker.Stop()
+	// Try each config path in order
+	for _, configPath := range configPaths {
+		if _, err := os.Stat(configPath); err == nil {
+			// Config file exists, use it
+			v := viper.New()
+			v.SetConfigFile(configPath)
+			v.SetConfigType("yaml")

-	for {
-		select {
-		case <-ctx.Done():
-			return fmt.Errorf("server not ready after 10s: %w", ctx.Err())
-		case <-ticker.C:
-			// Try to connect to the health endpoint
-			resp, err := http.Get(fmt.Sprintf("%s/api/health", s.baseURL))
-			if err == nil {
-				resp.Body.Close()
-				return nil
+			// Read the config file
+			if err := v.ReadInConfig(); err == nil {
+				var cfg config.Config
+				if err := v.Unmarshal(&cfg); err == nil {
+					// Override server port for testing
+					cfg.Server.Port = port
+
+					// Set default auth values if not configured
+					if cfg.Auth.JWTSecret == "" {
+						cfg.Auth.JWTSecret = "default-secret-key-please-change-in-production"
+					}
+					if cfg.Auth.AdminMasterPassword == "" {
+						cfg.Auth.AdminMasterPassword = "admin123"
+					}
+
+					log.Debug().
+						Str("config", configPath).
+						Str("db_host", cfg.Database.Host).
+						Int("db_port", cfg.Database.Port).
+						Str("db_user", cfg.Database.User).
+						Str("db_name", cfg.Database.Name).
+						Bool("v2flag", cfg.API.V2Enabled).
+						Msg("Using test config file")
+					return &cfg
+				}
 			}
 		}
 	}
-}

-// shouldEnableV2 determines if v2 API should be enabled for this test server
-// This is the ONLY place that reads FEATURE and GODOG_TAGS env vars
-func (s *Server) shouldEnableV2() bool {
-	feature := os.Getenv("FEATURE")
+	// No test config file found, use hardcoded test defaults
+	// This ensures test suite has complete control and isn't affected by
+	// environment variables or main config file settings
+	log.Debug().
+		Str("db_host", "localhost").
+		Int("db_port", 5432).
+		Str("db_user", "postgres").
+		Str("db_name", "dance_lessons_coach_bdd_test").
+		Msg("No test config file found, using hardcoded test defaults")

-	// Only check for v2 in greet feature (where we have @v2 tagged scenarios)
-	if feature != "greet" {
-		// For config feature, v2 is controlled via config file hot-reload
-		// For other features, v2 is disabled by default
-		return false
-	}
-
-	// For greet feature: enable v2 if tags include @v2
-	tags := os.Getenv("GODOG_TAGS")
-	return strings.Contains(tags, "@v2")
-}
-
-// createTestConfig creates a test configuration
-// Pass v2Enabled explicitly to avoid reading env vars deep in the stack
-func createTestConfig(port int, v2Enabled bool) *config.Config {
 	return &config.Config{
 		Server: config.ServerConfig{
-			Host: "0.0.0.0",
+			Host: "localhost",
 			Port: port,
 		},
-		Database: config.DatabaseConfig{
-			Host:     getDatabaseHost(),
-			Port:     getDatabasePort(),
-			User:     "postgres",
-			Password: "postgres",
-			Name:     getDatabaseName(),
-			SSLMode:  getDatabaseSSLMode(),
-		},
-		Auth: config.AuthConfig{
-			JWTSecret:           "test-secret-key-for-bdd-tests",
-			AdminMasterPassword: "admin123",
-			JWT: config.JWTConfig{
-				TTL: 24 * time.Hour,
-			},
-		},
-		API: config.APIConfig{
-			V2Enabled: v2Enabled,
+		Shutdown: config.ShutdownConfig{
+			Timeout: 5 * time.Second,
 		},
 		Logging: config.LoggingConfig{
-			Level: "debug",
+			JSON:  false,
+			Level: "trace",
+		},
+		Telemetry: config.TelemetryConfig{
+			Enabled: false,
+		},
+		API: config.APIConfig{
+			V2Enabled: true, // Enable v2 by default for most tests
+		},
+		Auth: config.AuthConfig{
+			JWTSecret:           "default-secret-key-please-change-in-production",
+			AdminMasterPassword: "admin123",
+		},
+		Database: config.DatabaseConfig{
+			Host:            getDatabaseHost(), // Use env var if set, otherwise localhost
+			Port:            getDatabasePort(), // Use env var if set, otherwise 5432
+			User:            "postgres",
+			Password:        "postgres",
+			Name:            "dance_lessons_coach_bdd_test", // Separate BDD test database
+			SSLMode:         "disable",
+			MaxOpenConns:    10,
+			MaxIdleConns:    5,
+			ConnMaxLifetime: time.Hour,
 		},
 	}
 }
--- a/pkg/bdd/testserver/state_tracer.go
+++ b/pkg/bdd/testserver/state_tracer.go
@@ -1,86 +0,0 @@
-package testserver
-
-import (
-	"fmt"
-	"os"
-	"path/filepath"
-	"time"
-)
-
-// TraceStateScenarioStart logs the start of a scenario
-func TraceStateScenarioStart(feature, scenario string) {
-	writeTraceLine(feature, scenario, "SCENARIO_START", "")
-}
-
-// TraceStateScenarioEnd logs the end of a scenario
-func TraceStateScenarioEnd(feature, scenario string, err error) {
-	status := "PASSED"
-	if err != nil {
-		status = fmt.Sprintf("FAILED: %v", err)
-	}
-	writeTraceLine(feature, scenario, "SCENARIO_END", status)
-}
-
-// TraceStateDBCleanup logs a database cleanup operation
-func TraceStateDBCleanup(feature, scenario, table string) {
-	writeTraceLine(feature, scenario, "DB_CLEANUP", table)
-}
-
-// TraceStateJWTSecretOperation logs a JWT secret operation
-func TraceStateJWTSecretOperation(feature, scenario, operation, details string) {
-	writeTraceLine(feature, scenario, "JWT_"+operation, details)
-}
-
-// TraceStateSchemaIsolation logs a schema isolation operation
-func TraceStateSchemaIsolation(feature, scenario, operation, details string) {
-	writeTraceLine(feature, scenario, "SCHEMA_"+operation, details)
-}
-
-// TraceStateTransaction logs a transaction boundary
-func TraceStateTransaction(feature, scenario, action, details string) {
-	writeTraceLine(feature, scenario, "TX_"+action, details)
-}
-
-// TraceStateDBRead logs a database read operation
-func TraceStateDBRead(feature, scenario, table, details string) {
-	writeTraceLine(feature, scenario, "DB_SELECT", fmt.Sprintf("table=%s %s", table, details))
-}
-
-// StateTracingEnabled returns true if BDD_TRACE_STATE environment variable is set to "1"
-func StateTracingEnabled() bool {
-	return os.Getenv("BDD_TRACE_STATE") == "1"
-}
-
-// writeTraceLine writes a trace line to the state trace file in $TMPDIR
-func writeTraceLine(feature, scenario, action, details string) {
-	if !StateTracingEnabled() {
-		return
-	}
-	tmpDir := os.Getenv("TMPDIR")
-	if tmpDir == "" {
-		tmpDir = "/tmp"
-	}
-	timestamp := time.Now().Format("20060102-150405")
-	pid := os.Getpid()
-	filename := fmt.Sprintf("bdd-state-trace-%s-%d.log", timestamp, pid)
-	filePath := filepath.Join(tmpDir, filename)
-
-	line := fmt.Sprintf("%s | %-15s | %-40s | %-16s | %s\n",
-		time.Now().Format("2006-01-02T15:04:05.000000"),
-		feature,
-		scenario,
-		action,
-		details,
-	)
-
-	file, err := os.OpenFile(filePath, os.O_CREATE|os.O_APPEND|os.O_WRONLY, 0644)
-	if err != nil {
-		return
-	}
-	defer file.Close()
-
-	if _, err := file.WriteString(line); err != nil {
-		return
-	}
-	file.Sync()
-}
--- a/pkg/config/config.go
+++ b/pkg/config/config.go
@@ -118,34 +118,6 @@ type SamplerConfig struct {
 	Ratio float64 `mapstructure:"ratio"`
 }

-// peekJSONLogging determines whether JSON logging should be used before the full
-// config is loaded, solving the chicken-and-egg problem where the logger format
-// must be known before any log is emitted, yet the format is stored in the config.
-//
-// Resolution order (mirrors Viper's own priority):
-//  1. DLC_LOGGING_JSON env var — checked directly via os.Getenv (zero overhead)
-//  2. logging.json key in the config file — read with a minimal throwaway Viper
-//     instance so we don't parse the whole config twice unnecessarily
-func peekJSONLogging() bool {
-	// 1. Env var takes highest priority — check it first
-	if env := os.Getenv("DLC_LOGGING_JSON"); env != "" {
-		return strings.EqualFold(env, "true") || env == "1"
-	}
-
-	// 2. Try to read logging.json from the config file
-	preV := viper.New()
-	preV.SetDefault("logging.json", false)
-	if configFile := os.Getenv("DLC_CONFIG_FILE"); configFile != "" {
-		preV.SetConfigFile(configFile)
-	} else {
-		preV.SetConfigName("config")
-		preV.SetConfigType("yaml")
-		preV.AddConfigPath(".")
-	}
-	_ = preV.ReadInConfig() // ignore errors — defaults apply on failure
-	return preV.GetBool("logging.json")
-}
-
 // LoadConfig loads configuration from file, environment variables, and defaults
 // Configuration priority: file > environment variables > defaults
 // To specify a custom config file path, set DLC_CONFIG_FILE environment variable
@@ -157,17 +129,9 @@ func LoadConfig() (*Config, error) {

 	v := viper.New()

-	// Configure the logger format before emitting any log output.
-	// peekJSONLogging reads the JSON setting early (env var + config file pre-read)
-	// so that every log line — including those produced during config loading — is
-	// already in the correct format.
-	jsonLogging := peekJSONLogging()
-	if jsonLogging {
-		log.Logger = log.Output(os.Stderr)
-	} else {
-		log.Logger = log.Output(zerolog.ConsoleWriter{Out: os.Stderr})
-	}
-	log.Info().Bool("json", jsonLogging).Msg("Logging configured")
+	// Set up initial console logging for config loading messages
+	consoleWriter := zerolog.ConsoleWriter{Out: os.Stderr}
+	log.Logger = log.Output(consoleWriter)

 	// Set default values
 	v.SetDefault("server.host", "0.0.0.0")
@@ -263,9 +227,15 @@ func LoadConfig() (*Config, error) {
 		return nil, fmt.Errorf("config unmarshal error: %w", err)
 	}

-	// Setup logging based on configuration (level, output file, time format).
-	// The JSON/console format was already applied at the top of LoadConfig via
-	// peekJSONLogging, so SetupLogging only needs to handle the remaining knobs.
+	// Configure log output format (JSON or console) first
+	if config.Logging.JSON {
+		log.Logger = log.Output(os.Stderr)
+	} else {
+		consoleWriter := zerolog.ConsoleWriter{Out: os.Stderr}
+		log.Logger = log.Output(consoleWriter)
+	}
+
+	// Setup logging based on configuration
 	config.SetupLogging()

 	log.Info().
--- a/pkg/server/server.go
+++ b/pkg/server/server.go
@@ -33,28 +33,6 @@ import (
 //go:embed docs/swagger.json
 var swaggerJSON embed.FS

-// CancelableContext wraps a context.Context and exposes a Cancel() method so
-// that Server.Run() can cancel readiness during graceful shutdown via the type
-// assertion it already performs. Callers that don't need controlled cancellation
-// (tests, CLI) can pass a plain context.Background() — the assertion silently
-// fails and readiness is never explicitly cancelled, which is harmless.
-type CancelableContext struct {
-	context.Context
-	cancel context.CancelFunc
-}
-
-// NewCancelableContext creates a CancelableContext whose Cancel() method will
-// be invoked by Server.Run() at the start of graceful shutdown, before the
-// 1-second readiness propagation window. The returned CancelFunc is a no-op
-// after Cancel() has been called, so it is safe to defer in main.
-func NewCancelableContext(parent context.Context) (*CancelableContext, context.CancelFunc) {
-	ctx, cancel := context.WithCancel(parent)
-	return &CancelableContext{Context: ctx, cancel: cancel}, cancel
-}
-
-// Cancel satisfies the interface checked in Run() and cancels the context.
-func (c *CancelableContext) Cancel() { c.cancel() }
-
 type Server struct {
 	router         *chi.Mux
 	readyCtx       context.Context
--- a/scripts/ci-update-coverage-badge.sh
+++ b/scripts/ci-update-coverage-badge.sh
@@ -1,22 +1,32 @@
 #!/bin/bash
-# KISS coverage badge updater using line numbers
-# Usage: scripts/ci-update-coverage-badge.sh <coverage_percentage> [badge_type]
-# badge_type: "unit" or "bdd", defaults to "unit"
+# CI script to update coverage badge in README.md
+# Usage: scripts/ci-update-coverage-badge.sh <coverage_percentage> [badge_type] [flags]
+# badge_type can be "bdd", "unit", or empty for combined coverage
+# flags: --no-commit (skip git commit), --no-push (skip git push)

 set -e

-COVERAGE=$1
-BADGE_TYPE=${2:-"unit"}
-
-# Get first line number of the badge
-LINE_NUM=$(cat -n README.md | grep -i "${BADGE_TYPE} coverage" | head -1 | awk '{print $1}')
-
-if [ -z "$LINE_NUM" ]; then
-    echo "Error: Could not find ${BADGE_TYPE} coverage badge in README.md"
+if [ -z "$1" ]; then
+    echo "Error: Coverage percentage not provided"
    exit 1
 fi

-# Get color
+COVERAGE=$1
+BADGE_TYPE=${2:-"combined"}
+
+# Parse flags
+NO_COMMIT=false
+NO_PUSH=false
+
+for arg in "$@"; do
+    if [ "$arg" = "--no-commit" ]; then
+        NO_COMMIT=true
+    elif [ "$arg" = "--no-push" ]; then
+        NO_PUSH=true
+    fi
+done
+
+# Determine badge color
 if (( $(echo "$COVERAGE >= 80" | bc -l) )); then
    COLOR="brightgreen"
 elif (( $(echo "$COVERAGE >= 50" | bc -l) )); then
@@ -25,15 +35,138 @@ else
    COLOR="red"
 fi

-# Create badge markdown
-BADGE_TYPE_UPPER=$(echo "$BADGE_TYPE" | tr '[:lower:]' '[:upper:]')
-BADGE_MARKDOWN="[![${BADGE_TYPE_UPPER} Coverage](https://img.shields.io/badge/${BADGE_TYPE_UPPER}_Coverage-${COVERAGE}%-${COLOR}?style=flat-square)](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)"
-
-# Replace the line using sed
-if [[ "$(uname)" == "Darwin" ]]; then
-    sed -i '' "${LINE_NUM}s|.*|${BADGE_MARKDOWN}|" README.md
+# Create different badge URLs and markdown format based on type
+if [ "$BADGE_TYPE" = "bdd" ]; then
+    BADGE_URL="https://img.shields.io/badge/BDD_Coverage-${COVERAGE}%-${COLOR}?style=flat-square"
+    BADGE_MARKDOWN="[![BDD Coverage](${BADGE_URL})](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)"
+    SEARCH_PATTERN="BDD_Coverage-.*-.*?style=flat-square"
+elif [ "$BADGE_TYPE" = "unit" ]; then
+    BADGE_URL="https://img.shields.io/badge/Unit_Coverage-${COVERAGE}%-${COLOR}?style=flat-square"
+    BADGE_MARKDOWN="[![Unit Coverage](${BADGE_URL})](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)"
+    SEARCH_PATTERN="Unit_Coverage-.*-.*?style=flat-square"
 else
-    sed -i "${LINE_NUM}s|.*|${BADGE_MARKDOWN}|" README.md
+    BADGE_URL="https://img.shields.io/badge/coverage-${COVERAGE}%-${COLOR}?style=flat-square"
+    BADGE_MARKDOWN="[![Coverage](${BADGE_URL})](https://gitea.arcodange.lab/arcodange/dance-lessons-coach)"
+    SEARCH_PATTERN="coverage-.*-.*?style=flat-square"
 fi

-echo "Updated ${BADGE_TYPE} coverage badge to ${COVERAGE}% (line ${LINE_NUM})"
+# Clean up any malformed badge lines from previous runs
+# Remove lines starting with "nhttps://" or "https://" that aren't proper markdown
+sed -i.bak '/^nhttps:\/\/.*img.shields.io.*Coverage/d' README.md 2>/dev/null || true
+sed -i.bak '/^https:\/\/.*img.shields.io.*Coverage/d' README.md 2>/dev/null || true
+
+# Remove old duplicate badges for the specific type being updated
+if [ "$BADGE_TYPE" = "bdd" ] || [ "$BADGE_TYPE" = "unit" ]; then
+    # Remove all existing badges of this type before adding new one
+    sed -i.bak "/${BADGE_TYPE}_Coverage/d" README.md 2>/dev/null || true
+fi
+
+rm -f README.md.bak
+
+# Only update if coverage has actually changed
+if grep -q "${BADGE_TYPE}_Coverage-${COVERAGE}%" README.md || grep -q "coverage-${COVERAGE}%" README.md; then
+    echo "Coverage badge already up to date at ${COVERAGE}%"
+    exit 0
+fi
+
+# Also check if badge already exists with this coverage (more flexible pattern)
+if [ "$BADGE_TYPE" = "bdd" ] || [ "$BADGE_TYPE" = "unit" ]; then
+    # Capitalize first letter for badge name
+    if [ "$BADGE_TYPE" = "unit" ]; then
+        BADGE_NAME="Unit"
+    else
+        BADGE_NAME="BDD"
+    fi
+    if grep -q "\[!\[${BADGE_NAME} Coverage\].*${COVERAGE}%" README.md; then
+        echo "Coverage badge already exists at ${COVERAGE}%"
+        exit 0
+    fi
+fi
+
+# Cross-platform sed command
+# Detect if we're on macOS (BSD sed) or Linux (GNU sed)
+SED_CMD=""
+if [[ "$(uname)" == "Darwin" ]]; then
+    # macOS - requires empty string after -i
+    SED_CMD="sed -i ''"
+else
+    # Linux - standard GNU sed
+    SED_CMD="sed -i"
+fi
+
+# Update README - handle both old and new badge formats
+if [ "$BADGE_TYPE" = "bdd" ] || [ "$BADGE_TYPE" = "unit" ]; then
+    # For BDD/Unit badges, add them if they don't exist, or update if they do
+    if grep -q "${BADGE_TYPE}_Coverage" README.md; then
+        # Update existing badge with proper markdown format
+        $SED_CMD "s|^\[!\[${BADGE_TYPE} Coverage\].*|"${BADGE_MARKDOWN}"|" README.md
+    else
+        # Add new badge line after the License badge (more reliable reference)
+        # Use a more reliable approach with temporary file for cross-platform compatibility
+        TEMP_FILE=$(mktemp)
+        awk -v new_badge="${BADGE_MARKDOWN}" '{
+            if ($0 ~ /\[!\[License\].*license-MIT-green/) {
+                print $0
+                print new_badge
+            } else {
+                print $0
+            }
+        }' README.md > "$TEMP_FILE"
+        mv "$TEMP_FILE" README.md
+    fi
+else
+    # For combined coverage, use the original logic
+    $SED_CMD "s|^\[!\[Coverage\].*|"${BADGE_MARKDOWN}"|" README.md
+fi
+
+# Set up git
+git config --global user.name "CI Bot"
+git config --global user.email "ci@arcodange.fr"
+
+# Set up credentials using Gitea token
+if [ -n "$PACKAGES_TOKEN" ]; then
+    git config --global credential.helper store
+    echo "https://${PACKAGES_TOKEN}@gitea.arcodange.lab" > ~/.git-credentials
+fi
+
+git add README.md
+
+# Skip commit if --no-commit flag is set
+if [ "$NO_COMMIT" = true ]; then
+    echo "Skipping git commit due to --no-commit flag"
+    echo "Coverage badge updated to ${COVERAGE}% in README.md (not committed)"
+    exit 0
+fi
+
+if git commit -m "🤖 chore: update coverage badge to ${COVERAGE}% [skip ci]"; then
+    # Skip push if --no-push flag is set
+    if [ "$NO_PUSH" = true ]; then
+        echo "Skipping git push due to --no-push flag"
+        echo "Coverage badge updated to ${COVERAGE}% and committed locally"
+        exit 0
+    fi
+    
+    # Try push with retry logic for race conditions
+    for i in 1 2 3; do
+        if git push; then
+            echo "Successfully updated coverage badge to ${COVERAGE}%"
+            # Update local repo to the new HEAD after successful push
+            git fetch origin
+            git reset --hard origin/${GITHUB_REF_NAME:-${CI_COMMIT_REF_NAME:-main}}
+            exit 0
+        else
+            echo "Push attempt $i failed, retrying..."
+            if [ $i -eq 3 ]; then
+                echo "Final push attempt failed - another job may have updated the badge"
+                git pull --rebase || true
+                git push || echo "Recovery push also failed"
+                # Ensure we're on the latest commit even if push failed
+                git fetch origin
+                git reset --hard origin/${GITHUB_REF_NAME:-${CI_COMMIT_REF_NAME:-main}}
+            fi
+            sleep 2
+        fi
+    done
+else
+    echo "No coverage change to commit"
+fi
--- a/scripts/run-bdd-tests.sh
+++ b/scripts/run-bdd-tests.sh
@@ -132,22 +132,17 @@ run_tests_with_tags() {
    # Run tests with proper coverage measurement and tag exclusion
    set +e
    
-    # Default tag filter: exclude flaky, todo, and skip scenarios
-    DEFAULT_TAGS="~@flaky && ~@todo && ~@skip"
-    
    if [ -n "$tags" ]; then
        # Use godog directly for tag filtering with exclusion
        echo "🚀 Running: godog $tags --tags=~@flaky --tags=~@todo --tags=~@skip features/"
        test_output=$(godog $tags --tags=~@flaky --tags=~@todo --tags=~@skip features/ 2>&1)
    else
-        # Use go test for full test suite with GODOG_TAGS environement variable
-        # Note: -tags flag in go test is for Go build tags, NOT Godog feature tags
-        # We use GODOG_TAGS env var which is read by the test framework
-        echo "🚀 Running: GODOG_TAGS=\"${DEFAULT_TAGS}\" go test ./features/..."
-        GODOG_TAGS="$DEFAULT_TAGS" go test ./features/... -v -cover -coverpkg=./... -coverprofile=coverage.out 2>&1 | tee /tmp/bdd_test_output.txt && test_output=$(cat /tmp/bdd_test_output.txt) && rm -f /tmp/bdd_test_output.txt || test_output=$(cat /tmp/bdd_test_output.txt 2>/dev/null || echo "")
-        test_exit_code=${PIPESTATUS[0]}
+        # Use go test for full test suite with tag exclusion
+        echo "🚀 Running: go test ./features/... -tags=~@flaky,~@todo,~@skip"
+        test_output=$(go test ./features/... -tags=~@flaky,~@todo,~@skip -v -cover -coverpkg=./... -coverprofile=coverage.out 2>&1)
    fi
    
+    test_exit_code=$?
    set -e
    
    echo "$test_output"
@@ -174,13 +169,12 @@ run_tests_with_tags() {
        exit 1
    fi
    
-    # Check for skipped steps - NO LONGER FAIL on skipped since we use GODOG_TAGS=~@todo by default
-    # Skipped steps are expected when @todo tagged scenarios are excluded
-    # if [ -z "$tags" ] && echo "$test_output" | grep -q "skipped"; then
-    #     echo "❌ FAILED: Found skipped steps"
-    #     echo 'DLC_DATABASE_HOST=localhost DLC_DATABASE_PORT=5432 DLC_DATABASE_USER=postgres DLC_DATABASE_PASSWORD=postgres DLC_DATABASE_NAME=dance_lessons_coach_bdd_test DLC_DATABASE_SSL_MODE=disable go test ./features/... -v'
-    #     exit 1
-    # fi
+    # Check for skipped steps (only for go test output)
+    if [ -z "$tags" ] && echo "$test_output" | grep -q "skipped"; then
+        echo "❌ FAILED: Found skipped steps"
+        echo 'DLC_DATABASE_HOST=localhost DLC_DATABASE_PORT=5432 DLC_DATABASE_USER=postgres DLC_DATABASE_PASSWORD=postgres DLC_DATABASE_NAME=dance_lessons_coach_bdd_test DLC_DATABASE_SSL_MODE=disable go test ./features/... -v'
+        exit 1
+    fi
    
    # Check if tests passed
    if [ $test_exit_code -eq 0 ]; then
--- a/scripts/start-server.sh
+++ b/scripts/start-server.sh
@@ -4,8 +4,7 @@
 # This script starts the server in the background and provides control functions

 # Configuration
-SCRIPTS_DIR=$(dirname "$(realpath "${BASH_SOURCE[0]}")")
-PROJECT_DIR=$(dirname "$SCRIPTS_DIR")
+PROJECT_DIR="/Users/gabrielradureau/Work/Vibe/dance-lessons-coach"
 SERVER_CMD="go run ./cmd/server"
 LOG_FILE="server.log"
 PID_FILE="server.pid"
--- a/scripts/test-graceful-shutdown.sh
+++ b/scripts/test-graceful-shutdown.sh
@@ -7,8 +7,7 @@
 set -e

 # Configuration
-SCRIPTS_DIR=$(dirname "$(realpath "${BASH_SOURCE[0]}")")
-PROJECT_DIR=$(dirname "$SCRIPTS_DIR")
+PROJECT_DIR="/Users/gabrielradureau/Work/Vibe/dance-lessons-coach"
 SERVER_CMD="./scripts/start-server.sh"
 LOG_FILE="server.log"
 PID_FILE="server.pid"
@@ -60,40 +59,11 @@ echo "Response: $GREET_NAME_RESPONSE"
 echo ""
 echo "Stopping server gracefully..."

-# Send SIGTERM once and probe /api/ready during the 1-second propagation window
-# the server holds open (pkg/server/server.go: time.Sleep(1s) after readiness
-# cancel). Previously the curl fired *before* the signal — it always saw "ready".
-# We also avoid calling "$SERVER_CMD stop" afterwards because that would send a
-# second SIGTERM: after signal.NotifyContext is done, the default handler kicks in
-# and the process terminates with a non-JSON "signal: terminated" on stderr.
-SERVER_PID=$(cat "$PID_FILE" 2>/dev/null || echo "")
-if [[ -z "$SERVER_PID" ]]; then
-    echo -e "\033[0;31m❌ FAIL: PID file not found\033[0m"
-    exit 1
-fi
+# Test readiness during shutdown (in background)
+(curl -s http://localhost:8080/api/ready > /dev/null 2>&1 &)

-kill -TERM "$SERVER_PID"
-# Brief yield so the signal handler runs and CancelableContext.Cancel() fires
-sleep 0.2
-READY_DURING_SHUTDOWN=$(curl -s -w "\n[HTTP %{http_code}]" http://localhost:8080/api/ready 2>&1 || echo "[connection refused]")
-echo "Readiness during shutdown: $READY_DURING_SHUTDOWN"
-
-# Wait for the process to exit cleanly (up to 30s) without sending another signal
-echo "Waiting for server to exit..."
-for i in {1..30}; do
-    if ! ps -p "$SERVER_PID" > /dev/null 2>&1; then
-        echo "Server stopped successfully"
-        rm -f "$PID_FILE"
-        break
-    fi
-    sleep 1
-done
-if ps -p "$SERVER_PID" > /dev/null 2>&1; then
-    echo -e "\033[0;31m❌ FAIL: Server did not stop within 30s\033[0m"
-    kill -9 "$SERVER_PID" 2>/dev/null || true
-    exit 1
-fi
-sleep 0.5
+$SERVER_CMD stop
+sleep 3

 echo ""
 echo "Analyzing server logs..."
@@ -231,12 +201,6 @@ fi
 echo ""
 echo -e "\033[0;32m🎉 GRACEFUL SHUTDOWN TEST PASSED!\033[0m"
 echo "All required logs are present and in correct order."
-
-echo ""
-echo "📋 Full server log:"
-echo "==============================="
-cat "$LOG_FILE" | jq -r '"[\(.level | ascii_upcase)] \(.time | tostring) — \(.message)"'
-echo "==============================="
 echo ""

 # Clean up
--- a/scripts/test-opentelemetry.sh
+++ b/scripts/test-opentelemetry.sh
@@ -9,8 +9,7 @@ echo -e "\033[1;34m=== dance-lessons-coach OpenTelemetry Test ===\033[0m"
 echo ""

 # Configuration
-SCRIPTS_DIR=$(dirname "$(realpath "${BASH_SOURCE[0]}")")
-PROJECT_DIR=$(dirname "$SCRIPTS_DIR")
+PROJECT_DIR="/Users/gabrielradureau/Work/Vibe/dance-lessons-coach"
 SERVER_CMD="./scripts/start-server.sh"
 LOG_FILE="server.log"
 PID_FILE="server.pid"
--- a/scripts/validate-test-suite.sh
+++ b/scripts/validate-test-suite.sh
@@ -2,55 +2,13 @@

 # Test Suite Validation Script
 # Runs tests N times with separate unit and BDD test phases
-# Usage: ./scripts/validate-test-suite.sh [N] [OPTIONS]
+# Usage: ./scripts/validate-test-suite.sh [N]
 #   N          - Number of times to run tests (default: 20)
-#   OPTIONS:
-#     --parallel      - Run feature tests in parallel
-#     --count=C      - Override -count flag for go test (default: same as N)
-#     --quick        - Run only core tests (skip @flaky)
-#     --features=X   - Test specific features only (comma-separated)

 set -e

 # Default values
 RUN_COUNT=${1:-20}
-GOTEST_COUNT=""
-PARALLEL=false
-QUICK=false
-FEATURES_FILTER=""
-
-# Parse arguments
-shift
-while [[ $# -gt 0 ]]; do
-    case "$1" in
-        --parallel)
-            PARALLEL=true
-            shift
-            ;;
-        --count=*)
-            GOTEST_COUNT="${1#*=}"
-            shift
-            ;;
-        --quick)
-            QUICK=true
-            shift
-            ;;
-        --features=*)
-            FEATURES_FILTER="${1#*=}"
-            shift
-            ;;
-        *)
-            echo "Unknown option: $1"
-            exit 1
-            ;;
-    esac
-done
-
-# Use GOTEST_COUNT if set, otherwise use RUN_COUNT
-if [ -z "$GOTEST_COUNT" ]; then
-    GOTEST_COUNT=$RUN_COUNT
-fi
-
 SCRIPTS_DIR=$(dirname "$(realpath "${BASH_SOURCE[0]}")")

 # Colors for output
@@ -119,45 +77,9 @@ for (( run=1; run<=$RUN_COUNT; run++ )); do
    echo "  🧪 BDD tests..."
    go clean -testcache > /dev/null 2>&1
    
-    # Set environment variables for consistent BDD test behavior
-    export DLC_DATABASE_HOST=localhost
-    export DLC_DATABASE_PORT=5432
-    export DLC_DATABASE_USER=postgres
-    export DLC_DATABASE_PASSWORD=postgres
-    export DLC_DATABASE_NAME=dance_lessons_coach_test
-
-    export BDD_SCHEMA_ISOLATION=true
-    
-    # Build feature test arguments
-    FEATURE_PACKAGES=("config" "auth" "greet" "health" "jwt")
-    
-    # Filter features if specified
-    if [ -n "$FEATURES_FILTER" ]; then
-        IFS=',' read -ra FILTERED_FEATURES <<< "$FEATURES_FILTER"
-        ALL_FEATURES=("config" "auth" "greet" "health" "jwt")
-        FEATURE_PACKAGES=()
-        for feat in "${FILTERED_FEATURES[@]}"; do
-            if [[ " ${ALL_FEATURES[@]} " =~ " ${feat} " ]]; then
-                FEATURE_PACKAGES+=("$feat")
-            fi
-        done
-    fi
-    
-    # Build go test command for features
-    FEATURE_TESTS=""
-    for feat in "${FEATURE_PACKAGES[@]}"; do
-        FEATURE_TESTS+="./features/$feat "
-    done
-    
-    # Set tags for quick mode
-    if [ "$QUICK" = true ]; then
-        export GODOG_TAGS="~@flaky && ~@todo && ~@skip"
-    fi
-    
    set +e  # Temporarily disable exit on error
-    # Force sequential package testing and use fixed port to prevent race conditions
-    FIXED_TEST_PORT=true BDD_SCHEMA_ISOLATION=true go test ${FEATURE_TESTS} -count=$GOTEST_COUNT -v -p 1 2>&1 | tee /tmp/bdd_raw_$$.txt | grep -v '^{"level"' > /tmp/bdd_output_$$.txt && BDD_OUTPUT=$(cat /tmp/bdd_output_$$.txt) && rm -f /tmp/bdd_output_$$.txt /tmp/bdd_raw_$$.txt || true
-    BDD_EXIT_CODE=${PIPESTATUS[0]}
+    BDD_OUTPUT=$(go test ./features/... -v 2>&1)
+    BDD_EXIT_CODE=$?
    set -e  # Re-enable exit on error
    
    if [ $BDD_EXIT_CODE -eq 0 ]; then
@@ -220,7 +142,7 @@ else
    # Process BDD test failures
    if [ -s "$BDD_FAILURE_LOG" ]; then
        echo "BDD Test Failures:"
-        echo "==============="
+        echo "================"
        
        # Count BDD test failures with granularity
        BDD_FAILURES=$(grep "FAIL" "$BDD_FAILURE_LOG" | \
@@ -233,7 +155,7 @@ else
            while IFS= read -r line; do
                count=$(echo "$line" | awk '{print $1}')
                test=$(echo "$line" | sed 's/^[0-9]*[[:space:]]*//')
-                echo "  $count x $test"
+                echo "  $count × $test"
            done <<< "$BDD_FAILURES"
        else
            echo "  None (check log for details)"
@@ -253,11 +175,11 @@ else
    
    echo
    echo "Recommendations:"
-    echo "  1. Investigate unit test failures first (faster to fix)"
-    echo "  2. Check for race conditions in failing tests"
-    echo "  3. Review test dependencies and isolation (schema/database isolation)"
-    echo "  4. Run individual failing tests with: FIXED_TEST_PORT=true go test ./features -v -run TestBDD/Name"
+    echo "  1. Mark flaky BDD tests with @flaky tag"
+    echo "  2. Investigate unit test failures first (faster to fix)"
+    echo "  3. Check for race conditions in failing tests"
+    echo "  4. Run with FIXED_TEST_PORT=true for debugging"
    echo "  5. Use ./scripts/run-bdd-tests.sh list-tags to see available tags"
    
    exit 1
-fi
+fi