Outline · v0.1 Draft

Developer KPI Policy — AI-Oriented SDLC

Performance metrics and operating rules for the Techies engineering team as we shift from traditional SDLC to an AI-assisted workflow built on Claude Code and GitHub.

Owner: Paul (Techies) Scope: All developers (team of 1–10) Use: Individual + team evaluation Status: Outline for review

1Purpose & Principles

Why this policy exists and the values it must protect.

1.1 Purpose

1.2 Guiding principles

2The SDLC Shift

What changes when we move to an AI-oriented lifecycle — and what it means for how we measure people.

Traditional SDLC

Manual coding, line-by-line authorship, effort ≈ output, reviews catch most defects late.

AI-Oriented SDLC

Claude Code drafts, refactors, and tests; developer becomes director/reviewer; speed rises, so quality gates and judgement matter more.

2.1 Implications for KPIs

3KPI Framework Overview

Four categories, balanced weights. Tune weights before rollout.

CategoryWhat it measuresSuggested weight
Productivity & VelocityThroughput and flow of shipped work25% (tune)
Code QualityDefects, review health, test coverage30% (tune)
AI AdoptionEffective use of Claude Code20% (tune)
Delivery & ReliabilityDORA-style deploy & stability metrics25% (tune)
Legend: Individual tracked per developer   Team tracked at team level. Most metrics inform both.

4Productivity & Velocity

Flow of value through the system — not raw output.

KPIDefinitionTargetSource
Cycle time IndFirst commit → merged to mainset baselineGitHub
PR throughput TeamMerged PRs per dev per weekset baselineGitHub
PR size IndMedian lines/files per PR (smaller = better)< 400 LOCGitHub
Time to first review TeamPR opened → first review< 4 working hrsGitHub
Work-in-progress IndConcurrent open PRs/issuesset limitGitHub
⚠️ Do not use commit count, lines of code, or hours as performance metrics. They reward the wrong behavior in an AI workflow.

5Code Quality

Speed is only valuable if what ships is correct and maintainable.

KPIDefinitionTargetSource
Change failure rate Ind% of merged PRs causing a bug/revert/hotfix< 15%GitHub
PR review pass rate Ind% PRs approved without major reworkset baselineGitHub
Test coverage Team% lines/branches covered on changed code≥ 80%CI / GitHub
Escaped defects TeamBugs found in production per releasetrend downIssues
Rework ratio IndCode rewritten within 21 days of mergeset baselineGitHub

5.1 Rule — AI output is owned by the author

6AI Adoption (Claude Code)

Measure effective use — adoption is a leading indicator, not the end goal.

KPIDefinitionTargetSource
Active usage IndActive days using Claude Code / weekset baselineClaude Code
AI-assisted PRs Team% of PRs where Claude Code was usedgrowth trendTag / convention
Proficiency tier IndSelf + lead rating: Novice → Fluent → AdvancedFluent by Q3Review
Workflow contribution TeamReusable prompts, CLAUDE.md, agents/skills sharedqualitativeRepo

6.1 Adoption expectations

7Delivery & Reliability

DORA-aligned team metrics — the real test of whether AI velocity is healthy.

KPI (DORA)DefinitionTargetSource
Deployment frequency TeamHow often we ship to production≥ weeklyGitHub Actions
Lead time for changes TeamCommit → running in productionset baselineGitHub
Change failure rate Team% deploys causing a failure< 15%GitHub / incidents
Mean time to restore TeamIncident start → resolved< 24 hrsIncidents
CI pass rate Ind% pipeline runs green on first pushtrend upGitHub Actions

8Data Sources & Measurement

Where each number comes from and how it's collected.

Automation note: aim to auto-collect GitHub + CI metrics into a dashboard; reserve manual input for the qualitative items only. [Define dashboard owner & tooling.]

9Review Cadence & Scoring

How and how often KPIs are reviewed.

CadenceFocusUse
WeeklyTeam flow metrics (velocity, CI, reviews)Process tuning
MonthlyQuality & reliability trendsCoaching
QuarterlyFull scorecard, individual + teamPerformance review

9.1 Scoring approach

10Guardrails & Anti-Gaming

Protect the metrics from distortion.

11Rollout Plan

Phased introduction so the policy lands as support, not surveillance.

PhaseActivityTiming
1 · BaselineCollect current metrics, no targets yet[Weeks 1–4]
2 · CalibrateSet targets/weights from real data, share openly[Weeks 5–8]
3 · CoachRun cadence; metrics drive 1:1s only[Quarter 1]
4 · FormalizeScorecard enters review cycle[Quarter 2+]

12Appendix & Glossary