| Canonical surface form / headword strings |
Allowed |
Allowed |
Allowed for fusion text side and retrieval text query; not ASR side info |
Lexical-unit text input. |
| Linked definition text (ZH / EN) |
Allowed if present in the released lexical export |
Allowed |
Allowed only on the fusion text side; not used by ASR or retrieval |
Must be reported as released lexical input, not as hidden metadata. |
Normalized phonetics (jyutping, ipa) |
Allowed |
Not a sense-side benchmark input |
Allowed only on the fusion text side |
Part of the released form object. |
| Canonical waveform |
No |
No |
Allowed for audio-only, fusion, ASR cascade, and retrieval |
One controlled clip per released form. |
| ASR transcript |
No |
No |
Allowed only inside the ASR cascade after speech decoding |
Not an extra released lexical side channel. |
| Source provenance, review flags, missingness indicators |
Diagnostic only |
Diagnostic only |
Diagnostic only |
Used for audit and slice analysis, not for benchmark scoring inputs. |
| Speaker ids, filenames, split ids, ordering fields |
Forbidden |
Forbidden |
Forbidden |
Identity and bookkeeping fields are never model inputs. |
| Gold-label and adjudication fields |
Forbidden |
Forbidden |
Forbidden |
Targets define evaluation only. |