Task:
    | "text_to_video"
    | "image_to_video"
    | "reference_to_video"
    | "edit"
    | string & {}

Optional task mode for video generation. If not specified, the model automatically determines the appropriate mode based on the provided text prompt and input media.
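
As a rough illustration of how this type behaves, the sketch below reproduces the union and classifies a supplied value. The trailing `string & {}` member is a common TypeScript idiom that keeps editor autocomplete for the listed literals while still accepting any other string. The `VideoOptions` interface, the `KNOWN_TASKS` list, and the `describeTask` helper are hypothetical names introduced only for this example; they are not part of the documented API.

```typescript
// The task union, copied from the type shown above.
type Task =
  | "text_to_video"
  | "image_to_video"
  | "reference_to_video"
  | "edit"
  | (string & {}); // accepts any other string without losing literal autocomplete

// Hypothetical options shape: only the optional `task` field comes from the docs.
interface VideoOptions {
  prompt: string;
  task?: Task;
}

// Hypothetical list of the documented literals, used for the runtime check below.
const KNOWN_TASKS: readonly string[] = [
  "text_to_video",
  "image_to_video",
  "reference_to_video",
  "edit",
];

// Hypothetical helper: reports whether the task was omitted (the model picks
// the mode), is one of the documented literals, or is some other string.
function describeTask(options: VideoOptions): string {
  if (options.task === undefined) {
    return "auto";
  }
  return KNOWN_TASKS.includes(options.task)
    ? `known:${options.task}`
    : `custom:${options.task}`;
}

console.log(describeTask({ prompt: "a cat surfing" }));
console.log(describeTask({ prompt: "a cat surfing", task: "image_to_video" }));
console.log(describeTask({ prompt: "a cat surfing", task: "some_future_mode" }));
```

Omitting `task` corresponds to the automatic behavior described above; passing a string outside the listed literals still type-checks, so whether a provider accepts it has to be verified at runtime.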