OptionalexportOptional. If set to true, disable intermediate checkpoints for Preference Optimization and only the last checkpoint will be exported. Otherwise, enable intermediate checkpoints for Preference Optimization. Default is false.
OptionalhyperOptional. Hyperparameters for Preference Optimization.
OptionaltrainingRequired. Cloud Storage path to file containing training dataset for preference optimization tuning. The dataset must be formatted as a JSONL file.
OptionalvalidationOptional. Cloud Storage path to file containing validation dataset for preference optimization tuning. The dataset must be formatted as a JSONL file.
Preference optimization tuning spec for tuning.