01Clone a broken production codebase
Each repo is a real production pattern turned into a broken codebase — a type system gap, an async boundary that fails under load, a retrieval pipeline that passes CI but breaks at edge cases. Not puzzles. Real patterns.
02Form a hypothesis — then fix it and explain it
Before you open the AI, write your hypothesis. Before you merge, open a PR — CI verifies tests pass — and write why the bug existed, why your fix works, and where your judgment overrode the model. The methodology is the proof.
03Get your weekly snapshot
After each merge, your skills profile updates. You get a personal report — what moved, what to focus on next, with specific guidance.
Pukkaship Weekly Snapshot · Alex R. · Week 10 of 20
This week: pe-10-judge — LLM-as-judge · PR #10 merged ✓
Current standing
Measurement & evals Strong ↑ Proficient
AI-collaboration Strong ↑ Proficient
Debugging discipline Proficient → unchanged
What moved this week
Your journal named the distinction between a hard gate and a soft signal without being prompted. That moved both dimensions.
Focus for pe-11
Evals: Your judge rubric has three criteria but no fixture for when two conflict. Add one before you merge.
AI workflow: Before asking what a function does, write your hypothesis first. Loom should show that sequence.
— Sudi