AINA Data Engine Room · Source Authority · 2026-06-15

Prior Work Source Authority Promotion Handoff

A durable receipt for the thing Ali kept pointing at: harvest and verify prior validated work before spending more tokens on title-level review.

Ali Mehdi Mukadam · co-authored with Codex and Claude · 6 minute read
The Single Idea

This milestone makes the anti-loop rule executable. If E5/E6 ledgers, source-authority receipts, donor salvage maps, export manifests, and validation proof already account for a title/source family, the engine blocks redundant title-level LLM review and routes the work to deterministic promotion or repair instead.

01 · What Changed

The repo now has a promotion gate, not another plan.

The new command is prior-work-source-authority-promotion. It writes validation receipts, JSONL row proof, a deterministic repair queue, and paired Markdown/HTML reports. The main validation command now includes its summary, so future runs cannot silently drift back into title-by-title rework.

uv run aina-data-engine --root /srv/aina/aina-data-engine-room prior-work-source-authority-promotion
02 · Current Receipt

The receipt passes and blocks broad title re-review.

19assets checked
8accounted
11promote candidates
5repair rows
MeasureValue
Statuspass
Missing assets0
Title-level LLM review blocked19 assets
Trusted jobs-research titles counted15,104
Clean candidate rows counted44,440
Current vector count151,983
Boundary locks remain shut: no donor mutation, no live Gemini call, no real-user data, no public runtime, no production telemetry, no runtime embedding authority, and no bulk title-level LLM review.
03 · Accounted Work

E5, E6, registry, export, vector, and salvage proof are now linked.

The accounted set includes the E5 title ledger, the E6 mapping-chain ledger, source-authority registries, the Academy-safe export manifest, vector reconciliation, and donor retirement/salvage route maps. This does not claim full runtime perfection. It says title-level LLM review is the wrong next move for these already-accounted families.

AreaWhat it proves
E5 title lineageTitle/service-tier history is already recorded.
E6 mapping chainRole-to-workflow topology and gaps are named.
Source authorityCurrent registry rows, chunk families, and authority classes pass.
Export manifestTop 500/top 1,000 Academy-safe payloads are readable and bounded.
Vector reconciliation151,983 vectors exist with zero stale vectors, but vectors are not runtime truth.
04 · Repair Queue

Remaining work is deterministic and named.

PriorityClassNext move
p0mapping_chain_gapClose role-to-workflow promotion with existing workflow intelligence.
p0runtime_source_authority_gapComplete bridge or abstain decisions for incomplete top-band rows.
p1export_workflow_coverage_gapAdd workflow/practice refs from mapping-chain assets.
p1clean_candidate_promotion_backlogPromote clean candidates through deterministic family gates.
p1donor_lineage_deltaDiff donor/salvage candidates against the current registry.
05 · Cross-Engine Review

The reviewers narrowed the milestone to proof, not hope.

Codex subagents handled source mining, data-integrity review, and adversarial review. Claude Opus ran as a read-only CLI checkpoint copilot. All converged on the same operating constraint: old LLM verdicts are provenance, not authority, and the closeout must be the receipt plus deterministic exception ledger.

Promote useful donor work by deterministic diff, not broad LLM review.

Start with aina-jobs-research source-intelligence exports and the 6,866 workflow seed queue, then wire aina-core canonical evidence parquets where hash sidecars already exist. Treat old PE scripts and schemas as concept/script donors unless current receipts promote them.

Where to Start

Run the new promotion gate first; then repair the named deterministic gaps.