Interface ValidateRewardParameters

Parameters for the validate_reward method.

Validates a reinforcement tuning reward configuration against a sample response and example before creating a reinforcement tuning job.

Properties

Composite reward function configuration for reinforcement tuning. Mutually exclusive with single_reward_config.

Optional parameters for the request.

Required. The example to validate the reward configuration.

parent: string

Required. The resource name of the Location to validate the reward in, e.g. projects/{project}/locations/{location}.

sampleResponse: Content

Required. The sample response for validating the reward configuration.

Single reward function configuration for reinforcement tuning. Mutually exclusive with composite_reward_config.