PE Donor Promotion Package Checkpoint

2026-06-15

PE Donor Promotion Package Checkpoint

Date: 2026-06-15 Branch: codex/pe-donor-promotion-2026-06-15 Latest package command: uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-industry-taxonomy-support

The Single Idea

The old personalization-engine-aina work is now represented inside the engine room as a self-contained promotion package, not as a donor repo that future agents have to trust wholesale. Contract-shaped and evidence-shaped rows can be derived into engine-room receipts; prompt/workflow/ontology bulk stays advisory; known-bad market/K2 lineage stays quarantined.

What Changed

Added pe_donor_promotion_package_v1, a first-class artifact lane that reads the existing prior_work_source_authority_promotion_v1 receipt and extracts only the personalization_engine_aina rows into a smaller promotion queue.

Added pe_donor_derived_contracts_v1, a second artifact lane that promotes only two contract-shaped rows from that package into engine-room derived contract candidates:

Added pe_donor_export_runtime_mapping_v1, a third artifact lane that maps those two derived contracts onto current engine-room consumer surfaces without mutating exports or runtime contracts.

Added pe_donor_foundation_source_mapping_v1, a fourth artifact lane that promotes the verified PE foundation-status source as repair-reduction lineage only. This is the narrow bridge for the old repo’s useful status/ontology/tool/title-mapping work: it points to current engine-room proof surfaces instead of re-importing donor rows wholesale.

Added pe_donor_workflow_grounding_mapping_v1, a fifth artifact lane that captures the post-May-15 workflow-grounding consensus batches as lineage and anti-loop proof. This is intentionally not a bulk row promotion: the donor summary reviewed 250 rows, but only 4 were production-allowed and 246 stayed not promoted or pending. The engine room now preserves that work so agents do not redo it, while still keeping workflow bulk behind current repair and semantic QA gates.

Added pe_donor_title_taxonomy_gate_v1, a sixth artifact lane that captures the deterministic PE title-taxonomy bucket output as gate-pending lineage. The donor output is useful because it canonically pooled JDs by bucket before normalization and produced 2,026 deterministic bucket records, but its bundled 50-row audit template is unscored. The engine room now records the source, provenance, SHA256s, and current replacement proof while blocking row promotion, runtime authority, embedding authority, and batch authority until a scored audit or current-repo diff proof exists.

Added pe_donor_title_taxonomy_audit_v1, a seventh artifact lane that deterministically checks the donor’s 50-row audit sample against the donor JSONL, donor hashes, and current engine-room aggregate function support. This does not use an LLM and does not promote rows. It proves the donor files are fresh against the gate receipt, confirms all 50 sampled buckets still exist and match donor JD counts, and separates 32 replacement-diff candidates from 11 generic/noisy lineage-only rows and 7 blocked rows.

Added pe_donor_prompt_workflow_ontology_inventory_v1, an eighth artifact lane that splits the old repo’s prompt/workflow/ontology bulk into source-family inventory rows. This is Ali’s “do not lose the valuable prompt, image prompt, workflow, and ontology work” concern captured as durable proof: 10 donor families are hashed and counted, but still blocked from row promotion, runtime authority, embedding authority, and batch authority until each family passes repair/diff/quality gates.

Added pe_donor_curriculum_release_lineage_v1, a ninth artifact lane that captures the old repo’s curriculum-engine release packets, curricula, polished packet, and reports as repair lineage. This preserves the valuable packet/curriculum/mastery-gate shape without importing raw profile text or promoting stale role/workflow joins. The live receipt hashes 14 files, sees 46 curriculum modules, blocks 6 packet rows for empty role tasks/workflows plus supply-safety bypass evidence, and blocks the weak founder/ecommerce role match before any runtime, export, embedding, or batch authority.

Added pe_donor_source_intelligence_scaleout_lineage_v1, a tenth artifact lane that captures the old repo’s source-intelligence scaleout package as a 30-family import-decision ledger. This closes the loop on the parallel audit reports Ali ran: the 71-file donor package is useful as lineage, deterministic script comparison input, alpha feedback support, review-packet support, and future diff candidates, but it is not direct runtime/export/embedding/batch/row authority. The receipt explicitly preserves 4 future import candidates, 7 repair-first families, 8 already-accounted-for families, 6 advisory-only families, 4 superseded families, and 1 blocked raw surface.

Added e5_source_authority_reconciliation_v1, an eleventh artifact lane that turns the E5 title ledger, E6 mapping-chain ledger, source-authority registry, Academy export manifest, LinkedIn/JD source-intake, JD-aware role context, chunk/vector reconciliation, prior-work source promotion, and full validation into an explicit anti-loop receipt. This is the machine-checkable answer to Ali’s concern that agents were re-reviewing individual titles when contextual source authority already existed. The live receipt accounts for 9 of 9 required assets, blocks fresh title-level LLM review for all accounted assets, records 129,165 LinkedIn/JD context rows and 151,983 current vectors, and still grants no runtime, embedding, batch, donor-mutation, external-write, or public-runtime authority.

Added prior_work_promotion_delta_closure_v1, a twelfth artifact lane that reconciles the original 16 promote_candidate_verify_first rows from prior_work_source_authority_promotion_v1 against the specialized gates that now exist. This closes the generic donor backlog without pretending the rows are production data: 6 are covered by current engine-room receipts, 7 by specialized donor gates, 2 are advisory-only, and 1 is a future deterministic diff lane. Fresh title-level LLM review remains blocked for all 16 rows.

Added pe_donor_alpha_feedback_support_v1, a thirteenth artifact lane that turns the source-intelligence alpha_feedback_bundle_v2 import candidate into a hashed review-support receipt. It preserves 40 examples, marks 36 as review-support candidates, blocks 4 noisy marketplace-title examples, excludes raw workflow/practice/proof text, and still grants no export, runtime, row-promotion, embedding, batch, public-runtime, real-user-data, external-write, or donor-mutation authority.

Added pe_donor_review_packet_support_v1, a fourteenth artifact lane that turns the source-intelligence review_packets_v1 import candidate into a packet-footprint support receipt. It preserves the four 500-row review packet file pairs as hashes, column shapes, and packet counts, excludes all 2,000 donor packet rows and old auto-reviewed labels, redacts the legacy status column name, and still grants no export, runtime, row-promotion, embedding, batch, public-runtime, real-user-data, external-write, or donor-mutation authority.

Added pe_donor_industry_taxonomy_support_v1, a fifteenth artifact lane that turns the source-intelligence industry_taxonomy_decisions_v1 import candidate into aggregate repair-reduction support. It verifies the full 17,118-row donor JSONL/CSV against provenance, summarizes only four decision groups, excludes individual title/category rows and old auto-reviewed labels, and still grants no export, runtime, row-promotion, embedding, batch, public-runtime, real-user-data, external-write, or donor-mutation authority.

Promoted that mapping into the actual consumer receipts as lineage-only input:

The lane writes four durable artifacts:

It is also wired into the CLI as:

uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-promotion-package
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-derived-contracts
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-export-runtime-mapping
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-foundation-source-mapping
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-workflow-grounding-mapping
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-title-taxonomy-gate
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-title-taxonomy-audit
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-prompt-workflow-ontology-inventory
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-curriculum-release-lineage
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-source-intelligence-scaleout-lineage
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-alpha-feedback-support
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-review-packet-support
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-industry-taxonomy-support
uv run aina-data-engine --root /srv/aina/aina-data-engine-room e5-source-authority-reconciliation
uv run aina-data-engine --root /srv/aina/aina-data-engine-room prior-work-promotion-delta-closure

Current Proof

The live package receipt is valid and reports:

Metric Value
PE donor rows 7
Derive-ready rows 5
Advisory-only rows 1
Quarantined rows 1
Embedding rows allowed now 0
Runtime authority rows allowed now 0

The derived-contract receipt is also valid and reports:

Metric Value
Derived contract rows 2
Expected source rows present 2
Embedding rows allowed now 0
Runtime authority rows allowed now 0

The export/runtime mapping receipt is valid and reports:

Metric Value
Mapping rows 2
Mapped surface refs 6
Embedding rows allowed now 0
Runtime authority rows allowed now 0

It maps the donor-derived contracts to:

The verified-foundation mapping receipt is valid and reports:

Metric Value
Mapping rows 1
Mapped source-authority surfaces 5
Embedding rows allowed now 0
Runtime authority rows allowed now 0
LLM title review allowed rows 0

It maps pe_aina_verified_data_foundation_status to:

The workflow-grounding mapping receipt is valid and reports:

Metric Value
Mapping rows 1
Mapped proof surfaces 6
Donor reviewed rows 250
Donor production-allowed rows 4
Donor not-promoted rows 246
Donor hard-stop rows 245
Workflow-seed semantic QA failures 5
Workflow-intelligence repaired QA pass count 50
Embedding rows allowed now 0
Runtime authority rows allowed now 0
Batch rows allowed now 0

It maps pe_aina_workflow_grounding_consensus_batches to the current workflow proof surfaces:

The title-taxonomy gate receipt is valid and reports:

Metric Value
Mapping rows 1
Donor bucket count 2,026
Donor provenance bucket count 2,026
Donor audit sample rows 50
Donor audit verified rows 0
Donor audit unscored rows 50
Current beta title rows 74,225
Current serviceable title rows 50,053
Clean candidate rows 44,440
Trusted jobs-research titles 15,104
Top 500 titles with role context 484
Top 1,000 titles with role context 964
Row promotion allowed now 0
Embedding rows allowed now 0
Runtime authority rows allowed now 0
Batch rows allowed now 0

It maps pe_aina_title_taxonomy_bucket_outputs to the current title/source proof surfaces:

The title-taxonomy donor paths are intentionally not serialized as live host paths. Receipts now use sanitized external-ref:* references plus SHA256s so the engine room preserves provenance without exposing local donor filesystem layout.

The title-taxonomy deterministic audit receipt is valid and reports:

Metric Value
Audit rows parsed 50
Buckets found in donor JSONL 50
JD-count matches 50
Current function-supported rows 43
Replacement-diff candidates 32
Generic/noisy lineage-only rows 11
Blocked missing/mismatch rows 7
Noisy tool rows 0
Noisy responsibility rows 0
Row promotion allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove the donor bucket JSONL, provenance file, and audit template hashes still match the gate receipt. This closes the immediate freshness loop Claude flagged, while keeping the actual promotion threshold and replacement-diff application as a future deliberate slice.

The prompt/workflow/ontology inventory receipt is valid and reports:

Metric Value
Inventory rows 10
Existing donor source families 10
Files inventoried 7,916
JSON files 3,040
Markdown files 2,010
Schema files 13
JSONL lines counted 48,541
Parseable JSON rows 48,534
Invalid JSON rows 8
Rows with role IDs 48,513
Rows with workflow IDs 39,863
Rows with prompt instructions 39,863
Legacy reviewer wording rows 14,798
Row promotion allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove the coarse PE bulk row exists in the donor package, source refs are sanitized, legacy human_review fields are not serialized, every row is inventory-only, donor repos stay read-only, and no live Gemini call was made. The 8 invalid JSON rows and legacy wording count are now explicit repair inputs instead of hidden risk.

The curriculum-release lineage receipt is valid and reports:

Metric Value
Donor files inventoried 14
Packet JSON files 5
Polished packet JSON files 1
Curriculum JSON files 5
Report markdown files 3
Curriculum modules seen 46
Packet rows with empty role tasks 6
Packet rows with empty role workflows 6
Rows with supply-safety bypass evidence 6
Rows with profile or learner context excluded 11
Weak role-match rows 1
Row promotion allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove donor files are hashed, source refs are sanitized, raw profile text is not serialized, profile/learner context is excluded from import, the weak founder/ecommerce match is blocked, current contract receipts are present, donor repos stay read-only, and no live Gemini call was made. This is the engine-room proof that the old curriculum work is valuable lineage and shape evidence, not current product authority.

The source-intelligence scaleout lineage receipt is valid and reports:

Metric Value
Donor files inventoried 71
Asset families classified 30
Already-accounted-for families 8
Future import candidates 4
Repair-first families 7
Advisory-only families 6
Superseded-by-engine-room families 4
Blocked raw surfaces 1
Row promotion allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove the real donor package is present, has the expected 71-file footprint, matches the 30-family classification ledger from the parallel source-intelligence audit, uses sanitized external source refs, removes legacy human-review wording from derived receipt rows, keeps raw JD/original source references blocked, keeps workflow bulk repair-first, treats alpha feedback as candidate-only, keeps donor repos read-only, and makes no live Gemini call.

The alpha-feedback support receipt is valid and reports:

Metric Value
Donor examples 40
Review-support candidates 36
Blocked noisy-title examples 4
Artifact types 5
Audience buckets 6
Row promotion allowed now 0
Export allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove the source-intelligence lineage receipt still marks alpha_feedback_bundle_v2 as an import candidate, the six donor alpha files are present, row counts match the JSON/JSONL/summary footprint, noisy marketplace titles are blocked, raw workflow/practice/proof text is excluded from the engine-room receipt, only hashes and compact role/tool metadata are serialized, legacy review fields are absent, donor repos stay read-only, and no live Gemini call was made. This gives future review-support or curriculum-critique lanes a safe starting point without promoting the alpha examples to product authority.

The review-packet support receipt is valid and reports:

Metric Value
Review packets 4
CSV rows counted 2,000
Packets with 500 rows 4
CSV files 4
Markdown files 4
Row promotion allowed now 0
Export allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove the source-intelligence lineage receipt still marks review_packets_v1 as an import candidate, all four expected packet pairs are present, each CSV has 500 rows, packet rows are not serialized, old reviewer values are absent from the derived receipt, only hashes and column shapes are carried forward, donor repos stay read-only, and no live Gemini call was made. This gives future deterministic repair, QA, or semantic comparison lanes a bounded prior-work reference without making any old packet row or label a product authority.

The industry-taxonomy support receipt is valid and reports:

Metric Value
Decision rows verified 17,118
CSV rows verified 17,118
Decision groups serialized 4
Generic/caveated rows 14,906
Spelling-normalization rows 1,749
Accepted-industry rows 304
Role-label-not-industry rows 159
Row promotion allowed now 0
Export allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove the source-intelligence lineage receipt still marks industry_taxonomy_decisions_v1 as an import candidate, the full donor JSONL and CSV hashes match provenance, the packaged summary count matches the 17,118-row donor file, only aggregate decision groups are serialized, old reviewer values are absent from the derived receipt, donor repos stay read-only, and no live Gemini call was made. This gives future company/industry repair lanes a verified prior-work reference without making any donor industry label product truth.

The E5/E6 source-authority reconciliation receipt is valid and reports:

Metric Value
Required assets 9
Accounted assets 9
Context/JD-aware gate assets 2
Fresh LLM review allowed assets 0
Top 500 export rows 500
Top 1,000 export rows 1,000
LinkedIn/JD context rows 129,165
Current vectors accounted 151,983
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove docs/TITLE-LEDGER.md, docs/MAPPING-CHAIN-LEDGER.md, source_authority_registry_v2, engine_room_export_manifest_v1, top_band_linkedin_source_authority_intake_v1, jd_aware_role_context_evidence_v1, production_chunk_vector_reconciliation_v1, prior_work_source_authority_promotion_v1, and full_validation.json are present enough to block fresh title-only LLM review unless a missing, stale, broken, or conflicting source-authority signal is named.

The prior-work promotion delta closure receipt is valid and reports:

Metric Value
Promote-candidate rows reconciled 16
Prior promote-candidate rows found 16
Covered by current engine-room receipts 6
Covered by specialized donor gates 7
Advisory-only rows 2
Future deterministic diff lanes 1
Generic promote-candidate rows still open 0
Fresh title-level LLM review allowed rows 0
Row promotion allowed now 0
Runtime authority rows allowed now 0
Embedding rows allowed now 0
Batch rows allowed now 0

Its checks prove every prior promote-candidate row is now accounted for by a current receipt or specific future lane, every row has complete proof, the generic promote-candidate backlog is closed, redundant title-level LLM review is blocked, donor repos stay read-only, and no live Gemini call was made.

The source-authority and JD-aware receipts are also valid after this slice:

Metric Value
Source-authority registry rows 48
PE donor lineage rows in source authority 2
PE foundation source rows in source authority 1
PE workflow-grounding rows in source authority 1
PE curriculum-release rows in source authority 1
PE curriculum-release files 14
PE curriculum-release modules 46
PE curriculum-release repair blockers 6
PE prompt/workflow/ontology inventory rows in source authority 1
PE prompt/workflow/ontology candidate families 10
PE source-intelligence scaleout rows in source authority 1
E5/E6 reconciliation rows in source authority 1
E5/E6 accounted assets 9
E5/E6 fresh LLM review allowed assets 0
Prior-work delta closure rows in source authority 1
Prior-work delta candidate rows 16
Prior-work generic promote candidates still open 0
Prior-work specific future lanes 3
PE prompt/workflow/ontology files 7,916
PE prompt/workflow/ontology parseable rows 48,534
PE source-intelligence rows in source authority 1
PE source-intelligence asset families 30
PE source-intelligence repair-first families 7
PE source-intelligence import candidates 4
PE alpha-feedback rows in source authority 1
PE alpha-feedback examples 40
PE alpha-feedback review-support candidates 36
PE alpha-feedback noisy-title blocks 4
PE review-packet rows in source authority 1
PE review packets 4
PE review-packet CSV rows 2,000
PE review packets with 500 rows 4
PE industry-taxonomy rows in source authority 1
PE industry-taxonomy decision rows 17,118
PE industry-taxonomy decision groups 4
PE title-taxonomy gate rows in source authority 1
PE title-taxonomy deterministic audit candidates 32
Combined chunks covered 467,436
Gemini vectors covered 151,983
JD-aware role-context rows 1,056
Rows with JD context 1,018
Top 500 titles with role context 484 / 499
Top 1,000 titles with role context 964 / 996
E2E fixture rows 50
Teaching-ready / guardrail fixtures 34 / 16

The JD-aware recovery is deterministic: it first uses explicit source refs, then gold-spine evidence_jd_ids, then bounded exact linkedin_jobs.title_normalized matches. Consumer artifacts still redact raw job IDs, company refs, summaries, and snippets according to the artifact exposure policy.

All package checks passed:

Why This Matters

This reduces the loop Ali flagged: we should not repeatedly review one title at a time when prior repo work already contains useful validated concepts, prompts, workflows, ontologies, and status reports. This package gives the next slice a clean way to promote good prior work into engine-room contracts without also importing unverified bulk or known-bad lineage.

Parallel Report Reconciliation

Ali ran three parallel report lanes before this closeout. I checked them before landing this checkpoint:

Parallel report Path How it affects this lane
Source intelligence scaleout audit /srv/aina/worktrees/aina-data-engine-room-source-intelligence-scaleout/docs/handoff/2026-06-15-source-intelligence-scaleout-audit.md Confirms the 71-file donor package is useful as lineage/review support only. It explicitly identifies alpha_feedback_bundle_v2 as a future import candidate, which this checkpoint now converts into a hashed support receipt without row/runtime/export/embedding authority.
Curriculum release audit /srv/aina/worktrees/aina-data-engine-room-curriculum-release-audit/docs/reports/2026-06-15-agent-curriculum-release-audit.md Confirms old curriculum packets, mastery gates, and rubrics are valuable shape evidence, but must remain repair-lineage because of synthetic learner context, empty task/workflow fields, supply-safety bypass evidence, and weak role-match examples.
Academy static export consumption report /srv/aina/worktrees/aina-academy-engine-room-export-consumption/ops/reports/2026-06-15-engine-room-static-export-consumption.md Confirms Academy should consume pinned, static, versioned export bundles only. No Cloudflare runtime should call VDS paths, DuckDB, Python, Gemini jobs, or live engine-room internals.

The common decision across all three is consistent with the current engine-room posture: preserve and hash useful donor work, use it to reduce rework, but do not promote donor rows, raw JDs, old labels, or vector presence as production authority.

Exact Resume Commands

cd /srv/aina/aina-data-engine-room
git status --short --branch
uv run aina-data-engine --root /srv/aina/aina-data-engine-room prior-work-source-authority-promotion
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-promotion-package
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-derived-contracts
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-export-runtime-mapping
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-foundation-source-mapping
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-workflow-grounding-mapping
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-title-taxonomy-gate
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-title-taxonomy-audit
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-curriculum-release-lineage
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-source-intelligence-scaleout-lineage
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-alpha-feedback-support
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-review-packet-support
uv run aina-data-engine --root /srv/aina/aina-data-engine-room pe-donor-industry-taxonomy-support
uv run aina-data-engine --root /srv/aina/aina-data-engine-room e5-source-authority-reconciliation
uv run aina-data-engine --root /srv/aina/aina-data-engine-room prior-work-promotion-delta-closure
uv run aina-data-engine --root /srv/aina/aina-data-engine-room production-runtime-contracts
uv run aina-data-engine --root /srv/aina/aina-data-engine-room engine-room-export-manifest
uv run aina-data-engine --root /srv/aina/aina-data-engine-room source-authority-registry-v2
uv run aina-data-engine --root /srv/aina/aina-data-engine-room jd-aware-role-context-evidence
uv run pytest tests/test_production_runtime_contracts.py tests/test_engine_room_export_manifest.py tests/test_source_authority_registry_v2.py tests/test_jd_aware_role_context.py tests/test_pe_donor_export_runtime_mapping.py tests/test_pe_donor_foundation_source_mapping.py tests/test_pe_donor_workflow_grounding_mapping.py tests/test_pe_donor_title_taxonomy_gate.py tests/test_pe_donor_curriculum_release_lineage.py tests/test_pe_donor_source_intelligence_scaleout_lineage.py tests/test_pe_donor_alpha_feedback_support.py tests/test_pe_donor_review_packet_support.py tests/test_pe_donor_industry_taxonomy_support.py tests/test_e5_source_authority_reconciliation.py tests/test_prior_work_promotion_delta_closure.py tests/test_pe_donor_derived_contracts.py tests/test_pe_donor_promotion_package.py tests/test_prior_work_source_authority_promotion.py -q
uv run aina-data-engine --root /srv/aina/aina-data-engine-room artifact-exposure-scan
uv run aina-data-engine --root /srv/aina/aina-data-engine-room ain-506-p0-gate
uv run aina-data-engine --root /srv/aina/aina-data-engine-room production-runtime-readiness
uv run aina-data-engine --root /srv/aina/aina-data-engine-room ain-510-retrieval-promotion-gate
uv run aina-data-engine --root /srv/aina/aina-data-engine-room validate

The two most contract-shaped rows now have derived, mapping, runtime-contract, export-manifest, source-authority, and JD-aware lineage proof:

The verified foundation-status source now has repair-reduction source-evidence proof:

The workflow-grounding consensus source now has source-evidence proof and a hard boundary against bulk promotion:

The title-taxonomy bucket source now has gate-pending source-evidence proof, freshness proof, and deterministic sample-audit proof, with a hard boundary against row/runtime/vector promotion:

The prompt/workflow/ontology bulk source now has source-family inventory proof, hashes, counts, parse-risk disclosure, and a hard boundary against row/runtime/ vector/batch promotion:

The curriculum-release source now has packet/curriculum/report lineage proof, module counts, repair blockers, and a hard boundary against row/runtime/vector/ batch promotion:

The source-intelligence scaleout source now has import-decision ledger proof, parallel-audit reconciliation, repair-first/import-candidate classification, legacy wording sanitization, and a hard boundary against row/runtime/vector/ batch promotion:

The alpha-feedback bundle now has review-support proof, hashed examples, noisy-title blocking, raw-text exclusion, and a hard boundary against export, row/runtime/vector/batch promotion:

The review-packet bundle now has packet-footprint proof, four preserved 500-row packet pairs, legacy status redaction, row serialization exclusion, and a hard boundary against export, row/runtime/vector/batch promotion:

The industry-taxonomy decisions now have aggregate repair-reduction proof, full-file provenance hash verification, four decision-group summaries, legacy status exclusion, row serialization exclusion, and a hard boundary against export, row/runtime/vector/batch promotion:

The E5/E6 reconciliation source now has anti-loop proof across current ledgers, export manifests, JD-aware context, chunk/vector receipts, prior-work promotion, and validation, with fresh title-level LLM review blocked for accounted assets:

The prior-work promotion delta closure source now has anti-loop proof that the old generic promote-candidate backlog is closed into named gates/specific lanes:

Next, keep moving through the same evidence-first pattern:


Ali Mehdi Mukadam · co-authored with Codex · 2026-06-15

topics:
  - personalization-engine
  - source-authority
  - donor-promotion
subtopics:
  - personalization-engine-aina
  - engine-room-receipts
  - quarantine-before-runtime