OptionalcompositeComposite reward function configuration for reinforcement tuning. Mutually exclusive with single_reward_config.
OptionalconfigOptional parameters for the request.
Required. The example to validate the reward configuration.
Required. The resource name of the Location to validate the reward in, e.g. projects/{project}/locations/{location}.
Required. The sample response for validating the reward configuration.
OptionalsingleSingle reward function configuration for reinforcement tuning. Mutually exclusive with composite_reward_config.
Parameters for the validate_reward method.
Validates a reinforcement tuning reward configuration against a sample response and example before creating a reinforcement tuning job.