Pilot Proposal — OKO / ATLAS

Metric	What it measures	What success looks like
Time-to-review	Wall-clock time from package upload to human signature.	Faster than the current baseline.
Missing-evidence catch rate	Percentage of MOP rows with a missing required photo that are correctly flagged.	High recall on missing evidence: the system does not let a gap slip past the reviewer.
Override rate	Percentage of machine recommendations the human overrides.	Low for routine cases; preserved for the ambiguous ones — that's the point of the human in the loop.
Receipt completeness	Percentage of decisions with a fully-formed sealed receipt — manifest, evidence, finding, decision.	Approaching 100%. Receipts are the artifact, not a side effect.
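
The four metrics above reduce to simple ratios once each decision and each MOP row is logged. Below is a minimal Python sketch of one way they could be computed. The record shapes (`Decision`, `MopRow`), field names, and helper functions are hypothetical; the proposal does not specify a schema. Catch rate is read here as recall over rows whose required photo is actually missing, and time-to-review is taken as a mean in hours.

```python
from dataclasses import dataclass
from datetime import datetime

# Assumption: a sealed receipt is complete when all four parts are present.
RECEIPT_PARTS = ("manifest", "evidence", "finding", "decision")


@dataclass
class Decision:
    # Hypothetical per-package record; not a schema from the proposal.
    uploaded_at: datetime
    signed_at: datetime
    machine_recommendation: str
    human_decision: str
    receipt: dict  # part name -> content; absent or empty parts count as missing


@dataclass
class MopRow:
    # Hypothetical per-row record from the workbook.
    photo_required: bool
    photo_present: bool
    flagged_missing: bool


def pct(numerator, denominator):
    """Percentage, or None when there is nothing to measure."""
    return None if denominator == 0 else 100.0 * numerator / denominator


def time_to_review_hours(decisions):
    """Mean wall-clock hours from package upload to human signature."""
    if not decisions:
        return None
    total = sum((d.signed_at - d.uploaded_at).total_seconds() for d in decisions)
    return total / len(decisions) / 3600.0


def missing_evidence_catch_rate(rows):
    """Recall: share of rows with a missing required photo that were flagged."""
    missing = [r for r in rows if r.photo_required and not r.photo_present]
    caught = sum(1 for r in missing if r.flagged_missing)
    return pct(caught, len(missing))


def override_rate(decisions):
    """Share of machine recommendations the human changed."""
    overridden = sum(
        1 for d in decisions if d.human_decision != d.machine_recommendation
    )
    return pct(overridden, len(decisions))


def receipt_completeness(decisions):
    """Share of decisions whose receipt carries all four parts."""
    complete = sum(
        1 for d in decisions if all(d.receipt.get(p) for p in RECEIPT_PARTS)
    )
    return pct(complete, len(decisions))
```

Each function returns None rather than zero when there is nothing to measure, so an empty package family cannot be mistaken for a perfect or a failing score. Splitting override rate into routine and ambiguous cases, as the table suggests, would need a case-type field that this sketch does not assume.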

Why now

The category is forming. The patterns are not set.

Closeout-review automation is becoming a buyer category. Carriers, EPCs, and infrastructure operators are starting to ask the same question — "why is the most expensive part of every project the part where humans line up photos against a workbook?" — and the answers they accept now will set the patterns the rest of the industry adopts.

The carriers that pilot now do three things at once. They cut their own time-to-review on a workflow that bleeds margin. They earn a seat at the table when category standards crystallize: what a sealed receipt looks like, what override rate is acceptable, what fail-closed parsing means in practice. And they do it with a reviewer-in-the-loop posture that keeps regulator-facing accountability where it belongs: with the licensed human who signs the package.

The carriers that wait inherit whatever the early movers normalized. That's a worse position whether you're a buyer, an investor, or a senior operator who's seen this pattern in adjacent categories. Pilot now, set the pattern. Pilot later, accept it.

Start the conversation.

One reply. One named reviewer on your side. One package family. We come back with a scoped pilot agreement inside a week.

Machine recommends. Human decides. Plan-acceptance is not implementation-authorization. The pilot's job is to make the next decision easy — including, honestly, the decision to stop.

From plan to receipts in eight weeks.

Five commitments to get the pilot moving

Authorize the pilot

Provide one representative workbook shape and one redacted/synthetic photo set

Name the reviewer in the loop and the rubric they use today

Agree on success metrics

After the pilot, decide together whether to expand

Twelve weeks. Five phases. One readout.

Discovery

Parser

Indexing

Cockpit

Shadow + readout

Four KPIs. Your packages. No borrowed numbers.

Working software. Sealed receipts. An honest readout.

Working ATLAS workflow instance

Sealed receipts for every package

Override / audit trail

End-of-pilot KPI readout

Recommendation packet

Four price points. None of them quotes.

Six things this pilot is not.
