AINA Data Engine Room · Source Authority · 2026-06-13

Source Authority Registry v2

A source-family registry tied to the current combined corpus, not stale planning counts.

The Single Idea

Every current chunk family now has an authority class, vector count, and next action before further embedding or runtime promotion.

322519chunks covered
6510vectors covered
25families

Families

FamilyAuthorityChunksVectorsNext
onet_task_evidencesource_evidence1310950hold_for_source_authority_or_semantic_qa_before_embedding
serviceable_titlecanonical601003440use_repaired_overlay_as_current_authority_then_continue_progressive_embedding
semantic_reviewcanonical54686500use_repaired_overlay_as_current_authority_then_continue_progressive_embedding
jobs_research_responsibilitydonor_clean431960run_source_family_eligibility_then_progressive_embedding
workflow_seeddonor_clean72770run_source_family_eligibility_then_progressive_embedding
jobs_research_roledonor_clean66560run_source_family_eligibility_then_progressive_embedding
affordance_packsource_evidence66260hold_for_source_authority_or_semantic_qa_before_embedding
workflow_intelligencesource_evidence31520hold_for_source_authority_or_semantic_qa_before_embedding
workflow_ai_affordancesource_evidence30510hold_for_source_authority_or_semantic_qa_before_embedding
onet_occupation_evidencesource_evidence28280hold_for_source_authority_or_semantic_qa_before_embedding
top_worked_titlecanonical10841000use_repaired_overlay_as_current_authority_then_continue_progressive_embedding
hf_role_signalsource_evidence907826partially_vectorized_continue_after_quality_gate
iwa_evidencesource_evidence4760hold_for_source_authority_or_semantic_qa_before_embedding
jd_aware_role_contextcanonical292292vectorized_current_snapshot
jobs_research_workflowdonor_clean26733use_repaired_overlay_as_current_authority_then_continue_progressive_embedding
realism_corpussource_evidence2300hold_for_source_authority_or_semantic_qa_before_embedding
gdpval_tasksource_evidence220220vectorized_current_snapshot
jobs_research_tooldonor_clean14673partially_vectorized_continue_after_quality_gate
jobs_research_ai_affordancedonor_clean890hold_for_source_authority_or_semantic_qa_before_embedding
ai_fluency_headless_loopcanonical4848vectorized_current_snapshot
alipe_vision_docadvisory_lineage3232vectorized_current_snapshot
named_tool_authoritycanonical2020vectorized_current_snapshot
workflow_tool_evidencesource_evidence2010partially_vectorized_continue_after_quality_gate
harvest_source_mapcanonical1616vectorized_current_snapshot
qualitative_corpussource_evidence50hold_for_source_authority_or_semantic_qa_before_embedding

Checks

CheckStatus
all_chunk_families_classifiedPASS
chunk_vector_reconciliation_validPASS
donor_repos_read_onlyPASS
family_chunk_counts_match_reconciliationPASS
family_vector_counts_match_reconciliationPASS
labels_are_metadata_not_truthPASS
no_live_gemini_api_invokedPASS
raw_market_rows_not_embedding_authorityPASS
source_assets_carried_from_v1PASS
source_authority_registry_v1_validPASS