AINA data engine room - technical handoff - 2026-06-11

AINA Production Runtime Readiness Handoff

A next-agent map for the new headless runtime harness, its receipts, its gates, and what remains before real production.

Codex execution lane · next agent, technical collaborator, and Ali · branch ali/personalization-engine-mission-2026-06-09

The Single Idea

The completed local ICP title map now has a headless production-runtime readiness harness. The repo can run representative title requests through the beta readiness map, generate local synthetic learning decisions for allowed cases, run deterministic evaluator receipts, block held/excluded/unknown titles, and prove the runtime gates through full validation.

01

Current State

9golden runtime cases
9golden expectations passed
6local synthetic plans
2fallback plans
1reviewed hold
2blocked excluded/unknown cases
6evaluator runs and passes
0production unlocks

Scope remains VDS-local, headless, and synthetic. This milestone prepares the runtime boundary; it does not add UI, auth, public serving, production telemetry, or real-user data.

02

Files Added Or Changed

PathPurpose
src/aina_data_engine/production_runtime_readiness.pyNew runtime readiness harness, golden cases, receipt writer, generated report writer, deterministic evaluator runner, and safety gates.
src/aina_data_engine/cli.pyAdds production-runtime-readiness command and wires the artifact into validate.
src/aina_data_engine/reports.pyAdds production-runtime readiness checks to full validation.
tests/test_production_runtime_readiness.pyAdds unit/CLI coverage for the runtime harness and blocked/planned case behavior.
artifacts/validation/production_runtime_readiness_v0.jsonSummary receipt for the runtime rehearsal.
artifacts/validation/production_runtime_readiness_v0.jsonlRow-level case receipt for all golden cases.
artifacts/runtime/production_runtime_readiness_v0_golden_cases.jsonlGolden runtime title requests.
artifacts/validation/full_validation.jsonRegenerated validation receipt with production-runtime checks.
docs/reports/2026-06-11-founder-production-runtime-readiness.md/htmlFounder-facing report pair.
docs/handoff/2026-06-11-production-runtime-readiness-handoff.md/htmlTechnical handoff pair.
03

Runtime Contract

The command ensures the source-backed warehouse and beta readiness path exist, then runs the production-runtime readiness harness. Each golden case is validated against the current beta title map and assigned exactly one runtime action.

Codex · Runtime readiness · run the headless harness
uv run aina-data-engine --root /srv/aina/aina-data-engine-room production-runtime-readiness
Watch: do not treat a valid local receipt as real production approval.
Runtime actionBehavior
plan_local_syntheticCalls generate_data_decision(..., persist=False), records packet/plan/module/exercise/rubric details, and runs evaluate_submission.
hold_for_reviewRefuses planning because the title is a reviewed residual hold.
block_not_icpRefuses planning because the title is excluded or outside the ICP path.
block_unknown_titleRefuses planning because the title is not in the current beta title map.
All case records explicitly set these production gates to false: real production runtime, real-user data, external writes, production telemetry, and production claims.
04

Golden Cases

All nine cases passed expectation checks in the live VDS run.

CaseTitleExpected cohortRuntime action
serve_now_seasonal_sales_associateseasonal sales associateserve_nowplan_local_synthetic
serve_now_support_associate_somasupport associate - somaserve_nowplan_local_synthetic
serve_now_director_bidirector of business intelligenceserve_nowplan_local_synthetic
serve_now_retail_sales_associatepart-time retail sales associateserve_nowplan_local_synthetic
fallback_case_managercase managerserve_with_fallbackplan_local_synthetic
fallback_technical_support_assistant_managertechnical support assistant managerserve_with_fallbackplan_local_synthetic
hold_family_law_attorneyfamily law attorneyreviewed_residual_holdhold_for_review
excluded_registered_nurseregistered nurse - 1755724excluded_or_not_icpblock_not_icp
unknown_future_jobtotally unknown future jobunknown_unmapped_titleblock_unknown_title
05

Validation Checks Added

CheckWhy it matters
beta_readiness_receipt_validConfirms the runtime rehearsal is based on the current beta title map.
all_golden_expectations_metConfirms each title resolves to the intended cohort and action.
planned_cases_have_modulesConfirms planned cases are not empty shells.
planned_cases_have_exercisesConfirms practice exists for planned cases.
planned_cases_have_rubricsConfirms evaluator criteria exist.
planned_cases_evaluator_passedConfirms deterministic evaluator checks pass.
blocked_and_held_cases_do_not_planConfirms holds/exclusions/unknowns do not generate plans.
fallback_cases_caveatedConfirms fallback plans carry caveats.
source_grounding_visibleConfirms title/source grounding appears in the receipt.
domain_review_required_visibleConfirms sensitive review tags are visible.
no_real_production_runtime_unlocksConfirms the harness unlocks no real production path.
auth_privacy_runtime_gates_declaredConfirms missing production boundaries are named.
06

Review Outcome

Two read-only Codex review agents checked the milestone before checkpointing.

ReviewerFindingResolution
CorrectnessFull validation did not gate the golden-cases JSONL artifact.Fixed in CLI ensure, full validation, receipt metadata, and tests.
CorrectnessHeld/blocked and fallback safety checks could false-green through aggregate counts.Fixed with row-level checks and a regression test.
Safety/claimsNo concrete safety or overclaim bug found.Auth/session/privacy gaps remain declared as missing production work.
07

Validation Commands

Codex · Validation · rerun the milestone proof
uv run ruff check src/aina_data_engine/production_runtime_readiness.py src/aina_data_engine/cli.py src/aina_data_engine/reports.py tests/test_production_runtime_readiness.py
uv run pytest tests/test_production_runtime_readiness.py -q
uv run aina-data-engine --root /srv/aina/aina-data-engine-room production-runtime-readiness
uv run aina-data-engine --root /srv/aina/aina-data-engine-room validate
uv run pytest -q
Watch: rerun this block if the runtime receipt, validation gates, or reports move again.
CommandResult
Targeted ruffPass
Targeted production-runtime tests4 passed
Production-runtime commandValid receipt, no failed checks
Full engine validationstatus: pass
Full pytest181 passed
08

What Is Still Not Production

GapWhy it matters
No auth/session/tenant integrationA real product must not trust client-supplied learner identity or tenant context.
No real-user data policy enforcementThe current harness uses synthetic profiles only.
No production telemetry sinkLocal receipts exist; production observability does not.
No external write pathCorrect for this repo, but production needs explicit write boundaries.
No real-beta allowlistLocal serviceability is broader than real learner eligibility.
No UINot needed for this milestone, but a thin internal tester could help later.
Domain review still requiredSensitive role classes remain gated.
09

Resume Commands

Codex · Resume · inspect current truth
git status --short
uv run aina-data-engine --root /srv/aina/aina-data-engine-room production-runtime-readiness
jq '{status, valid, metrics, failed_checks, scope}' artifacts/validation/production_runtime_readiness_v0.json
uv run aina-data-engine --root /srv/aina/aina-data-engine-room validate
uv run pytest tests/test_production_runtime_readiness.py -q
uv run pytest -q
Watch: do not skip the receipt check if the repo has moved since this handoff.
10

Recommended Next Build Step

The next real milestone should keep the repo self-contained but make the runtime boundary more production-shaped.

StepPurpose
Add a real-beta allowlist policy stricter than local synthetic serviceabilitySeparates internal proof from learner eligibility.
Add request/response JSON schema fixtures for the future runtime APIMakes integration assumptions testable before UI/auth work.
Add failing gates for missing auth/session/tenant assumptionsKeeps account boundaries visible.
Add privacy, consent, retention, deletion, and redaction fixturesPrepares for real learners without accepting real data yet.
Add telemetry sink policy with local-only and production-approved modesPrevents observability from becoming accidental data leakage.
Expand the golden title suite beyond 9 casesCreates regression coverage across representative ICP slices.
Add a static internal tester only if it helps inspect decisions fasterKeeps UI optional rather than making it a dependency.
Where to start

Start by rerunning the production-runtime receipt, then expand the runtime boundary from local synthetic proof toward real-beta allowlist gates.