GDPval Calibration Packet v1
GDPval calibration packets turn remaining Hugging Face rubric holds into reviewer-ready, privacy-bounded handoffs without approving unattended evaluation, external beta, or public release.
The Single Idea
The remaining Hugging Face GDPval hold is reviewer-ready, but not approved: source files resolve, the rubric burden is explicit, and external beta stays blocked.
01
Snapshot
| Metric | Value |
|---|---|
| status | ready_for_structured_model_calibration |
| packets | 1 |
| unique tasks | 1 |
| structured model decisions | 1 |
| auto-approved | 0 |
| external beta | blocked |
02
Checks
| Check | Result |
|---|---|
| deployment_readiness_valid | PASS |
| gdpval_hold_closeout_valid | PASS |
| all_remaining_calibration_holds_have_packets | PASS |
| all_packet_task_rows_resolved | PASS |
| all_packet_tasks_have_reference_files | PASS |
| all_packet_tasks_have_deliverable_examples | PASS |
| all_large_rubric_reasons_recorded | PASS |
| no_packet_auto_approved | PASS |
| redacted_samples_only | PASS |
| external_beta_still_blocked_until_structured_model_decision | PASS |
| no_public_runtime | PASS |
| no_external_writes | PASS |
| no_real_user_data | PASS |
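The checks above act as a gate: the packet counts as reviewer-ready only if every row reads PASS. A minimal sketch of that gate follows, assuming the results are available as a plain name-to-result mapping. The `CHECKS` dict and both function names are hypothetical illustrations, not part of the packet tooling.

```python
# Hypothetical in-memory copy of the Checks table; the tool's real
# output format is not shown in this report.
CHECKS = {
    "deployment_readiness_valid": "PASS",
    "gdpval_hold_closeout_valid": "PASS",
    "all_remaining_calibration_holds_have_packets": "PASS",
    "all_packet_task_rows_resolved": "PASS",
    "all_packet_tasks_have_reference_files": "PASS",
    "all_packet_tasks_have_deliverable_examples": "PASS",
    "all_large_rubric_reasons_recorded": "PASS",
    "no_packet_auto_approved": "PASS",
    "redacted_samples_only": "PASS",
    "external_beta_still_blocked_until_structured_model_decision": "PASS",
    "no_public_runtime": "PASS",
    "no_external_writes": "PASS",
    "no_real_user_data": "PASS",
}


def failing_checks(checks: dict[str, str]) -> list[str]:
    """Return the names of checks whose result is not exactly 'PASS'."""
    return [name for name, result in checks.items() if result != "PASS"]


def packet_is_reviewer_ready(checks: dict[str, str]) -> bool:
    """Treat the packet as reviewer-ready only when no check fails."""
    return not failing_checks(checks)


if __name__ == "__main__":
    failed = failing_checks(CHECKS)
    print("reviewer-ready" if not failed else f"blocked by: {failed}")
```

Listing the failing names, rather than returning a bare boolean, keeps the handoff explicit about which check blocked it.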
03
Calibration Packets
| Function | Level | Module | Owner | Task Ids | Raw Rubric Items | Status | Next Action |
|---|---|---|---|---|---|---|---|
| finance | level_3 | Build repeatable quality-reviewed workflows for high-frequency finance tasks. | domain_risk_reviewer | 43dc9778-450b-4b46-b77e-b6d82b202035 | 67 | awaiting_structured_model_decision | Run structured model/domain calibration on the large GDPval rubric before using it for unattended beta evaluation. |
04
Task Evidence
| Task | Occupation | Ref Files | Deliverables | Raw Rubric Items | Redacted Sample Criteria |
|---|---|---|---|---|---|
| 43dc9778-450b-4b46-b77e-b6d82b202035 | Accountants and Auditors | 15 | 2 | 67 | 6 |
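The packet row and the evidence row describe the same task, so their shared fields should agree. The sketch below shows one way a reviewer might cross-check them, assuming both tables have been loaded into simple records. The `PacketRow` and `EvidenceRow` types and the `evidence_problems` function are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PacketRow:
    task_id: str
    raw_rubric_items: int


@dataclass(frozen=True)
class EvidenceRow:
    task_id: str
    ref_files: int
    deliverables: int
    raw_rubric_items: int
    redacted_sample_criteria: int


def evidence_problems(packets: list[PacketRow], evidence: list[EvidenceRow]) -> list[str]:
    """Describe every mismatch between packet rows and task evidence rows."""
    by_task = {row.task_id: row for row in evidence}
    problems = []
    for packet in packets:
        row = by_task.get(packet.task_id)
        if row is None:
            problems.append(f"{packet.task_id}: no evidence row")
            continue
        if row.ref_files == 0:
            problems.append(f"{packet.task_id}: no reference files")
        if row.deliverables == 0:
            problems.append(f"{packet.task_id}: no deliverable examples")
        if row.raw_rubric_items != packet.raw_rubric_items:
            problems.append(f"{packet.task_id}: rubric item counts differ")
    return problems


# Values copied from the two tables in this report.
TASK_ID = "43dc9778-450b-4b46-b77e-b6d82b202035"
PACKETS = [PacketRow(task_id=TASK_ID, raw_rubric_items=67)]
EVIDENCE = [
    EvidenceRow(
        task_id=TASK_ID,
        ref_files=15,
        deliverables=2,
        raw_rubric_items=67,
        redacted_sample_criteria=6,
    )
]

if __name__ == "__main__":
    print(evidence_problems(PACKETS, EVIDENCE) or "evidence is consistent")
```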
Where to start
A domain reviewer should inspect the referenced HF task row and full rubric, record approve/replace/hold, then rerun gdpval-calibration-packet and validate.
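The approve/replace/hold step can be captured as a small structured record so that free-text outcomes never enter the packet. This is a sketch under stated assumptions: the `Decision` enum, the `ReviewDecision` record, the `record_decision` helper, and the rule that a rationale is required are all hypothetical, since the report does not define a decision schema.

```python
from dataclasses import dataclass
from enum import Enum


class Decision(Enum):
    """The three outcomes named in this report."""
    APPROVE = "approve"
    REPLACE = "replace"
    HOLD = "hold"


@dataclass(frozen=True)
class ReviewDecision:
    task_id: str
    reviewer_role: str
    decision: Decision
    rationale: str


def record_decision(task_id: str, reviewer_role: str, decision: str, rationale: str) -> ReviewDecision:
    """Validate a reviewer's input and return an immutable decision record."""
    # Enum lookup raises ValueError for anything other than approve/replace/hold.
    parsed = Decision(decision.strip().lower())
    # Assumption: a non-empty rationale is required; the report does not say so.
    if not rationale.strip():
        raise ValueError("a rationale is required")
    return ReviewDecision(task_id, reviewer_role, parsed, rationale.strip())


if __name__ == "__main__":
    record = record_decision(
        task_id="43dc9778-450b-4b46-b77e-b6d82b202035",
        reviewer_role="domain_risk_reviewer",
        decision="hold",
        rationale="Placeholder rationale for illustration only.",
    )
    print(record.decision.value)
```

Recording the decision does not itself lift any hold. The packet tool and validation still need to be rerun afterwards, as described above.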