Supervision contract
Form gold versus sense evidence
Form-level evaluation uses adjudicated gold targets. Sense-level evaluation uses source-linked targets. The form gold should not be reconstructed by mechanically unioning or voting over linked sense rows.
- 507 polysemous forms keep one coarse and one fine label across senses.
- 199 polysemous forms keep one coarse label but multiple fine labels.
- 408 forms remain coarse-diverse across senses.