AINA data engine room - Codex - 2026-06-13

GDPval Calibration Packet v1

GDPval calibration packets turn remaining Hugging Face rubric holds into reviewer-ready, privacy-bounded handoffs without approving unattended evaluation, external beta, or public release.

The Single Idea

The remaining Hugging Face GDPval hold is reviewer-ready, but not approved: source files resolve, the rubric burden is explicit, and external beta stays blocked.

01

Snapshot

Status: ready_for_structured_model_calibration
Packets: 1
Unique tasks: 1
Structured model decisions: 1
Auto-approved: 0
External beta: blocked

02

Checks

Check | Result
deployment_readiness_valid | PASS
gdpval_hold_closeout_valid | PASS
all_remaining_calibration_holds_have_packets | PASS
all_packet_task_rows_resolved | PASS
all_packet_tasks_have_reference_files | PASS
all_packet_tasks_have_deliverable_examples | PASS
all_large_rubric_reasons_recorded | PASS
no_packet_auto_approved | PASS
redacted_samples_only | PASS
external_beta_still_blocked_until_structured_model_decision | PASS
no_public_runtime | PASS
no_external_writes | PASS
no_real_user_data | PASS

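As a rough illustration of how the check results above could be consumed, the sketch below loads them into a mapping and reports any check that is not PASS. The check names are copied from this report; the dictionary layout and the `summarize_checks` helper are hypothetical and are not part of the gdpval-calibration-packet tooling.

```python
# Hypothetical sketch: check names come from the report above; the data
# layout and summarize_checks() are assumptions made for illustration only.
CHECKS = {
    "deployment_readiness_valid": "PASS",
    "gdpval_hold_closeout_valid": "PASS",
    "all_remaining_calibration_holds_have_packets": "PASS",
    "all_packet_task_rows_resolved": "PASS",
    "all_packet_tasks_have_reference_files": "PASS",
    "all_packet_tasks_have_deliverable_examples": "PASS",
    "all_large_rubric_reasons_recorded": "PASS",
    "no_packet_auto_approved": "PASS",
    "redacted_samples_only": "PASS",
    "external_beta_still_blocked_until_structured_model_decision": "PASS",
    "no_public_runtime": "PASS",
    "no_external_writes": "PASS",
    "no_real_user_data": "PASS",
}


def summarize_checks(checks):
    """Return (all_passed, failed_names) for a name -> result mapping."""
    failed = sorted(name for name, result in checks.items() if result != "PASS")
    return (not failed, failed)


if __name__ == "__main__":
    all_passed, failed = summarize_checks(CHECKS)
    print("all checks passed:", all_passed)
    for name in failed:
        print("failed:", name)
```

Passing every check does not by itself approve anything here: the report keeps external beta blocked until a structured model decision is recorded.
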
03

Calibration Packets

Function | Level | Module | Owner | Task Ids | Raw Rubric Items | Status | Next Action
finance | level_3 | Build repeatable quality-reviewed workflows for high-frequency finance tasks. | domain_risk_reviewer | 43dc9778-450b-4b46-b77e-b6d82b202035 | 67 | awaiting_structured_model_decision | Run structured model/domain calibration on the large GDPval rubric before using it for unattended beta evaluation.

04

Task Evidence

Task | Occupation | Ref Files | Deliverables | Raw Rubric Items | Redacted Sample Criteria
43dc9778-450b-4b46-b77e-b6d82b202035 | Accountants and Auditors | 152676

Where to start

A domain reviewer should inspect the referenced HF task row and the full rubric, record an approve, replace, or hold decision, then rerun gdpval-calibration-packet and validate.
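
The reviewer step above might be captured in a small record like the one sketched here. The three decision values (approve, replace, hold) and the task id come from this report; the `ReviewerDecision` class, its field names, and the `pending_tasks` helper are hypothetical and do not describe the actual packet schema.

```python
from dataclasses import dataclass

# Decision values named in this report.
ALLOWED_DECISIONS = {"approve", "replace", "hold"}


@dataclass(frozen=True)
class ReviewerDecision:
    """Hypothetical record of one reviewer decision (field names are assumptions)."""
    task_id: str
    decision: str
    reviewer_role: str

    def __post_init__(self):
        # Reject anything other than the three decisions the report names.
        if self.decision not in ALLOWED_DECISIONS:
            raise ValueError(f"unknown decision: {self.decision!r}")


def pending_tasks(decisions, required_task_ids):
    """Return the required task ids that have no recorded decision yet."""
    decided = {d.task_id for d in decisions}
    return sorted(set(required_task_ids) - decided)


if __name__ == "__main__":
    task_id = "43dc9778-450b-4b46-b77e-b6d82b202035"
    print(pending_tasks([], [task_id]))
    recorded = [ReviewerDecision(task_id, "hold", "domain_risk_reviewer")]
    print(pending_tasks(recorded, [task_id]))
```

Under this sketch a task counts as decided once any of the three values is recorded; whether a given decision lifts the external beta block is left to the real tooling.
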