1. Author the benchmark
A domain expert writes 20 to 40 scenarios using the structured shape above. Each scenario takes 15 to 30 minutes to write once the pattern is internalised. The benchmark is the first asset: it tells the platform what "better" means before any model trains against it.
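As a rough planning aid, the figures in this step can be turned into an effort estimate. The sketch below is illustrative only: the function name `authoring_hours` is hypothetical, and it assumes the per-scenario time applies uniformly, so it excludes the ramp-up needed to internalise the pattern.

```python
# Hypothetical helper for budgeting expert time; not part of any platform API.
# Inputs come from the text: 20 to 40 scenarios at 15 to 30 minutes each.
def authoring_hours(scenarios: int, minutes_each: int) -> float:
    """Return total authoring time in hours."""
    return scenarios * minutes_each / 60


low = authoring_hours(20, 15)   # fewest scenarios, fastest pace
high = authoring_hours(40, 30)  # most scenarios, slowest pace
print(f"Estimated authoring time: {low:g} to {high:g} hours")
```

Under these assumptions the benchmark costs somewhere between 5 and 20 hours of a domain expert's time.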