AINA Data Engine Room · 2026-06-13 · VDS-local checkpoint

AI Fluency Public O*NET Proof-Tail Repair

A conservative source-authority repair slice that lifts clean proof-tail rows while leaving risky title guesses blocked.

Ali Mehdi Mukadam · co-authored with Codex · local handoff

The Single Idea

This checkpoint repairs the AI Fluency proof-tail only where public O*NET evidence and existing AINA/JD evidence are strong enough. It raises top-1,000 local judgment/proof coverage from 984 to 991 while keeping 9 rows blocked instead of laundering broad title guesses into runtime authority.

01 - What Changed

O*NET Is Authority Only When The Match Is Clean

The repair lane now has a curated public O*NET authority path. Exact public anchors can repair; special-education family repairs need JD context; electrical technician needs related AINA title-alias evidence; subject/domain guesses stay blocked as context-only.

ExactO*NET occupation and task refs can repair clear titles.
FamilySpecial education keeps grade-band caveats and needs JD context.
DomainElectrical technician needs related AINA title-alias support.
FilePurpose
src/aina_data_engine/ai_fluency_proof_tail_authority_repair.pyLoads O*NET, indexes AINA title aliases, separates authority from context-only evidence, and keeps embeddings/runtime disabled.
tests/test_ai_fluency_proof_tail_authority_repair.pyAdds exact O*NET success, context-only block, and electrical alias-gated repair coverage.
docs/planning/aina-production-readiness-board-2026-06-12.mdUpdates the production board to the 11-ready/9-blocked proof-tail state.
02 - Current Proof

991 Of The Top 1,000 Now Have Local Proof Proxies

MetricValue
Cumulative ready repairs11
Public O*NET ready repairs7
Still blocked rows9
Top-1,000 local judgment/proof proxies991
Top-500 local judgment/proof proxies497
Embedding, batch, production unlocks0 / 0 / 0

The repaired rows are still local evaluator/proof fixtures, not learner-observed proof and not embedding authority. Public runtime, real-user data, external writes, production telemetry, and runtime embedding authority remain disabled.

03 - Ready And Blocked

The Remaining 9 Are Blocked On Purpose

Ready examplesFunction
special education teacher, elementary school teachereducation
cassandra dba, geographic information system specialistdata_analytics
dog trainerphysical_frontline
electrical technicianengineering_hardware
Blocked rowReason
teacher-science / teacher-social studies / teacher-computerSubject and grade authority are underdetermined.
lab assistantDomain could be medical, research, veterinary, dental, chemical, or academic.
landscape designerLandscape Architect context exists, but designer vs licensed architect is too risky.
AML SME, ecommerce manager, food and beverage lead auditorExact audit/source authority is still unmapped or too broad.
04 - Review And Boundary

Multi-Agent Review Kept The Scope Tight

Three read-only Codex subagents recommended the 7-current-repair/9-block split. Claude CLI passed the diff and flagged two refinements: dog trainer should not be human education, and electrical should use title-alias support only. Both were implemented before validation.

Claude dynamic workflows were not used here because shared schema/artifact writes were safer through one orchestrator. They remain useful later for large read-only or disjoint source-mining sweeps.

05 - Linear Payload

Linear Auth Was Revoked During Closeout

The MCP returned auth_revoked, so the issue comment was not posted. Use this exact payload for AIN-510, with AIN-520 and AIN-527 as related proof references.

AI Fluency proof-tail public O*NET repair checkpoint is locally green on `/srv/aina/aina-data-engine-room`, branch `ali/ain-506-p0-gate-2026-06-12`.

Proof:
- Proof-tail authority repair: 11 cumulative ready repairs, 9 still blocked, 7 public O*NET-backed repairs, 3 top-500 ready repairs.
- Top-band coverage: top 1,000 local judgment/proof proxies increased to 991/1000; top 500 increased to 497/500; vector coverage remains 1000/1000 and 500/500.
- Runtime boundary: no live Gemini calls, no embedding manifests, no batch manifests, no public runtime, no production telemetry, no external writes, no real-user data, runtime embedding authority remains false.
- Final validators passed: focused pytest, ruff, proof-tail repair, proof-tail fixtures, capability coverage, AIN-506, AIN-510, production-runtime-readiness, and full `validate`.

Handoff:
`/srv/aina/aina-data-engine-room/docs/handoff/2026-06-13-ai-fluency-public-onet-proof-tail-repair-handoff.md`
06 - Resume Commands

Cold-Start From Here

cd /srv/aina/aina-data-engine-room
git status --short --branch
git log -5 --oneline
uv run aina-data-engine --root /srv/aina/aina-data-engine-room ain-506-p0-gate
uv run aina-data-engine --root /srv/aina/aina-data-engine-room ain-510-retrieval-promotion-gate
uv run aina-data-engine --root /srv/aina/aina-data-engine-room ai-fluency-proof-tail-authority-repair
uv run aina-data-engine --root /srv/aina/aina-data-engine-room ai-fluency-proof-tail-fixtures
uv run aina-data-engine --root /srv/aina/aina-data-engine-room ai-fluency-capability-coverage
uv run aina-data-engine --root /srv/aina/aina-data-engine-room production-runtime-readiness
uv run aina-data-engine --root /srv/aina/aina-data-engine-room validate
Where To Start

Harvest stronger source authority for the remaining 9 blocked rows before embedding or runtime reliance.