Roadmap

Adtech is active. Future verticals wait for evidence.

A future candidate is not a released result. Adtech is the current released benchmark surface. A non-adtech vertical enters the active refresh cadence only after its first dataset, rubric, and result table are frozen.

Assay-Adtech v1

Adtech is the operator-of-origin vertical for Agentsia and the current public evidence base. Real-time bidding (RTB) has a hard end-to-end latency budget (commonly about 100 ms, which exchanges communicate per request through the OpenRTB tmax field, with 50 ms for the DSP bid decision inside it), and frontier model APIs often struggle to meet that budget under production-style routing. Decisioning is high-volume, narrow, and scorable against ground truth on most axes.

Released
Active evidence

Assay-Fintech v1

Fintech decisioning rhymes with adtech: narrow, high-volume, regulated, latency-sensitive. It remains a future candidate until customer-approved rubrics, scenario provenance, and public results exist.

Future
Candidate only

Assay-Legaltech v1

Statutory interpretation, contract analysis, regulatory reasoning. Rubric design is harder here because ground truth is partially subjective, so the expert panel structure is being designed first.

2027
Reviewer model

Assay-Health v1

Clinical triage, decision support, documentation quality under HIPAA-aligned constraints. The scoring rubric depends on clinical reviewers, so a dated release target will be set only once reviewer availability is confirmed.

2027
Panel recruitment

Assay-Auto v1

ADAS perception edge cases, in-vehicle agent reasoning, driving-telemetry interpretation. Multimodal elements make the evaluator architecture more demanding. Scheduled after the three text-first candidate verticals (Fintech, Legaltech, Health) ship.

2027
Evaluator design

Have a commercial vertical that deserves an Assay and is missing from the list? Use the contact form with a short brief.