Class ReinforcementTuningHyperParameters

java.lang.Object
com.google.genai.JsonSerializable
com.google.genai.types.ReinforcementTuningHyperParameters

@InternalApi public abstract class ReinforcementTuningHyperParameters extends JsonSerializable
Hyperparameters for Reinforcement Tuning.
  • Constructor Details

    • ReinforcementTuningHyperParameters

      public ReinforcementTuningHyperParameters()
  • Method Details

    • epochCount

      public abstract Optional<Long> epochCount()
      Number of training epochs for the tuning job.
    • learningRateMultiplier

      public abstract Optional<Float> learningRateMultiplier()
      Learning rate multiplier for Reinforcement Learning.
    • adapterSize

      public abstract Optional<AdapterSize> adapterSize()
      Adapter size for Reinforcement Tuning.
    • samplesPerPrompt

      public abstract Optional<Integer> samplesPerPrompt()
      Number of different responses to generate per prompt during tuning.
    • batchSize

      public abstract Optional<Integer> batchSize()
      Batch size for the tuning job. How many prompts to process at a train step. If not set, the batch size will be determined automatically.
    • evaluateInterval

      public abstract Optional<Integer> evaluateInterval()
      How often (in steps) to evaluate the tuning job during training. If not set, evaluation will run per epoch.
    • checkpointInterval

      public abstract Optional<Integer> checkpointInterval()
      How often (in steps) to save checkpoints during training. If not set, one checkpoint per epoch will be saved.
    • maxOutputTokens

      public abstract Optional<Integer> maxOutputTokens()
      The maximum number of tokens to generate per prompt. If not set, defaults to 32768.
    • thinkingLevel

      public abstract Optional<ReinforcementTuningThinkingLevel> thinkingLevel()
      Indicates the maximum thinking depth. Use with earlier models shall result in error.
    • builder

      Instantiates a builder for ReinforcementTuningHyperParameters.
    • toBuilder

      public abstract ReinforcementTuningHyperParameters.Builder toBuilder()
      Creates a builder with the same values as this instance.
    • fromJson

      public static ReinforcementTuningHyperParameters fromJson(String jsonString)
      Deserializes a JSON string to a ReinforcementTuningHyperParameters object.