IWA Evidence Family Complete Embedding Checkpoint
A small, clean source-evidence family moved from repair-first to fully embedded without retrieval regression.
The iwa_evidence source family is now fully embedded in the local Gemini vector authority. It passed repair, repaired-input semantic QA, dry-run selection, live Gemini embedding, AIN-510, vector reconciliation, source authority registry, and full validation.
The full family moved through the clean-before-embed lane.
| Step | Result |
|---|---|
| Repair queue | 476 rows ready |
| Repaired corpus | 476 chunks, 0 skipped |
| Semantic QA | 50/50 pass, 0 raw JD hits |
| Live Gemini | 476 vectors added, 0 failed |
It was small, source-backed, and did not move the quality floor.
This was a compact source-evidence family with deterministic repair and a passing repaired-input semantic QA sample. Unlike the broad mixed semantic_review 5k expansion, this family did not reduce known-pair separation, did not introduce provider failures, and did not require rollback.
This completes another M2 source family.
| Mission slice | Status |
|---|---|
| M2.S1 source-family eligibility | Completed for iwa_evidence |
| M2.S2 progressive Gemini runs | Completed family in one 476-row live run after dry-run proof |
| AIN-510 retrieval proof | Still promotion_ready |
| Runtime boundary | Still local-only and unpromoted |
Keep harvesting small clean families first.
Continue with small, high-signal families before revisiting broad mixed families. Good candidates should have repaired-input QA pass, enough text signal, no label-only rows, and a clean 500-or-smaller live proof. Keep jobs_research_role blocked until richer context repair exists, and do not retry semantic_review at 5k as one mixed family until it is partitioned or the quality-pair suite is improved.
Restart from the 151,983-vector authority.
cd /srv/aina/aina-data-engine-room git status --short --branch uv run aina-data-engine --root /srv/aina/aina-data-engine-room ain-510-retrieval-promotion-gate uv run aina-data-engine --root /srv/aina/aina-data-engine-room production-chunk-vector-reconciliation uv run aina-data-engine --root /srv/aina/aina-data-engine-room source-authority-registry-v2 uv run aina-data-engine --root /srv/aina/aina-data-engine-room validate
Start from the 151,983-vector authority and keep choosing source families that pass repaired-input QA before live spend.