The modality that this token count applies to.
The Part contains audio.
Part
The Part contains a document, such as a PDF.
The Part contains an image.
When a modality is not specified, it is treated as TEXT.
TEXT
The Part contains plain text.
The Part contains a video.
The modality that this token count applies to.