Class ReinforcementTuningHyperParameters.Builder
- Enclosing class:
- ReinforcementTuningHyperParameters
-
Constructor Summary
Constructors -
Method Summary
Method: Description
adapterSize(AdapterSize adapterSize): Setter for adapterSize.
adapterSize(AdapterSize.Known knownType): Setter for adapterSize given a known enum.
adapterSize(String adapterSize): Setter for adapterSize given a string.
batchSize: Setter for batchSize.
abstract ReinforcementTuningHyperParameters build()
checkpointInterval(Integer checkpointInterval): Setter for checkpointInterval.
clearAdapterSize(): Clears the value of adapterSize field.
clearBatchSize(): Clears the value of batchSize field.
clearCheckpointInterval(): Clears the value of checkpointInterval field.
clearEpochCount(): Clears the value of epochCount field.
clearEvaluateInterval(): Clears the value of evaluateInterval field.
clearLearningRateMultiplier(): Clears the value of learningRateMultiplier field.
clearMaxOutputTokens(): Clears the value of maxOutputTokens field.
clearSamplesPerPrompt(): Clears the value of samplesPerPrompt field.
clearThinkingLevel(): Clears the value of thinkingLevel field.
epochCount(Long epochCount): Setter for epochCount.
evaluateInterval(Integer evaluateInterval): Setter for evaluateInterval.
learningRateMultiplier(Float learningRateMultiplier): Setter for learningRateMultiplier.
maxOutputTokens(Integer maxOutputTokens): Setter for maxOutputTokens.
samplesPerPrompt(Integer samplesPerPrompt): Setter for samplesPerPrompt.
thinkingLevel(ReinforcementTuningThinkingLevel thinkingLevel): Setter for thinkingLevel.
thinkingLevel(ReinforcementTuningThinkingLevel.Known knownType): Setter for thinkingLevel given a known enum.
thinkingLevel(String thinkingLevel): Setter for thinkingLevel given a string.
-
Constructor Details
-
Builder
public Builder()
-
-
Method Details
-
epochCount
Setter for epochCount.
epochCount: Number of training epochs for the tuning job.
-
clearEpochCount
Clears the value of epochCount field.
-
learningRateMultiplier
public abstract ReinforcementTuningHyperParameters.Builder learningRateMultiplier(Float learningRateMultiplier)
Setter for learningRateMultiplier.
learningRateMultiplier: Learning rate multiplier for Reinforcement Learning.
-
clearLearningRateMultiplier
@CanIgnoreReturnValue public ReinforcementTuningHyperParameters.Builder clearLearningRateMultiplier()
Clears the value of learningRateMultiplier field.
-
adapterSize
Setter for adapterSize.
adapterSize: Adapter size for Reinforcement Tuning.
-
clearAdapterSize
Clears the value of adapterSize field.
-
adapterSize
@CanIgnoreReturnValue public ReinforcementTuningHyperParameters.Builder adapterSize(AdapterSize.Known knownType)
Setter for adapterSize given a known enum.
adapterSize: Adapter size for Reinforcement Tuning.
-
adapterSize
@CanIgnoreReturnValue public ReinforcementTuningHyperParameters.Builder adapterSize(String adapterSize)
Setter for adapterSize given a string.
adapterSize: Adapter size for Reinforcement Tuning.
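The three adapterSize overloads (wrapper type, known enum, string) can be pictured with a small stand-in. The sketch below is not the SDK source: the class names, the two enum constants, and the string-matching rule are all hypothetical, chosen only to show how an enum overload and a string overload might both funnel into a single wrapper-typed setter.

```java
import java.util.Optional;

// Hypothetical stand-in for the three adapterSize overloads; this is not the SDK source.
final class AdapterSizeOverloadsSketch {

  // Illustrative constants only; the real AdapterSize.Known values are not listed on this page.
  enum Known { SIZE_SMALL, SIZE_LARGE }

  // Wrapper that keeps the raw string and, when it matches a constant, the known value.
  static final class Size {
    final String raw;
    final Optional<Known> known;

    Size(Known known) {
      this.raw = known.name();
      this.known = Optional.of(known);
    }

    Size(String raw) {
      this.raw = raw;
      Optional<Known> match = Optional.empty();
      for (Known candidate : Known.values()) {
        if (candidate.name().equals(raw)) {
          match = Optional.of(candidate);
        }
      }
      this.known = match;
    }
  }

  static final class Builder {
    Optional<Size> value = Optional.empty();

    // Primary setter: the other two overloads delegate here.
    Builder adapterSize(Size size) { value = Optional.of(size); return this; }
    Builder adapterSize(Known knownType) { return adapterSize(new Size(knownType)); }
    Builder adapterSize(String size) { return adapterSize(new Size(size)); }
    Builder clearAdapterSize() { value = Optional.empty(); return this; }
  }

  public static void main(String[] args) {
    Builder fromEnum = new Builder().adapterSize(Known.SIZE_LARGE);
    Builder fromString = new Builder().adapterSize("SIZE_LARGE");
    // Both routes store the same raw value.
    System.out.println(fromEnum.value.get().raw.equals(fromString.value.get().raw)); // prints true
    // An unrecognized string is kept, but has no known constant in this sketch.
    System.out.println(new Builder().adapterSize("SIZE_HUGE").value.get().known.isPresent()); // prints false
    // Clearing returns the field to the unset state.
    System.out.println(fromEnum.clearAdapterSize().value.isPresent()); // prints false
  }
}
```

In this sketch the string overload is the tolerant one: it accepts values the enum does not know about, which is one plausible reason for offering both forms.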
-
samplesPerPrompt
public abstract ReinforcementTuningHyperParameters.Builder samplesPerPrompt(Integer samplesPerPrompt)
Setter for samplesPerPrompt.
samplesPerPrompt: Number of different responses to generate per prompt during tuning.
-
clearSamplesPerPrompt
Clears the value of samplesPerPrompt field.
-
batchSize
Setter for batchSize.
batchSize: Batch size for the tuning job, that is, the number of prompts processed per training step. If not set, the batch size is determined automatically.
-
clearBatchSize
Clears the value of batchSize field.
-
evaluateInterval
public abstract ReinforcementTuningHyperParameters.Builder evaluateInterval(Integer evaluateInterval)
Setter for evaluateInterval.
evaluateInterval: How often (in steps) to evaluate the tuning job during training. If not set, evaluation will run per epoch.
-
clearEvaluateInterval
Clears the value of evaluateInterval field.
-
checkpointInterval
public abstract ReinforcementTuningHyperParameters.Builder checkpointInterval(Integer checkpointInterval)
Setter for checkpointInterval.
checkpointInterval: How often (in steps) to save checkpoints during training. If not set, one checkpoint per epoch will be saved.
-
clearCheckpointInterval
Clears the value of checkpointInterval field.
-
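Because evaluateInterval and checkpointInterval are expressed in steps while epochCount is expressed in epochs, a quick back-of-the-envelope calculation can help when choosing values. The sketch below rests on assumptions this page does not confirm: that one training step consumes batchSize prompts, that a final partial batch counts as a step, and that an interval of N fires at steps N, 2N, and so on. The class and method names are hypothetical.

```java
// Hypothetical arithmetic helper; the step accounting below is an assumption, not SDK behavior.
final class IntervalSketch {

  // Assumes one training step consumes batchSize prompts, with a final partial batch counted.
  static long stepsPerEpoch(long promptCount, int batchSize) {
    return (promptCount + batchSize - 1) / batchSize;
  }

  // Number of times an every-N-steps event (evaluation or checkpoint) fires within totalSteps.
  static long eventCount(long totalSteps, int intervalInSteps) {
    return totalSteps / intervalInSteps;
  }

  public static void main(String[] args) {
    long perEpoch = stepsPerEpoch(1000, 32);   // 1000 prompts, batchSize = 32
    long total = perEpoch * 2;                 // epochCount = 2
    System.out.println(perEpoch);              // prints 32
    System.out.println(eventCount(total, 10)); // checkpointInterval = 10, prints 6
    System.out.println(eventCount(total, 16)); // evaluateInterval = 16, prints 4
  }
}
```

Under these assumptions, leaving both intervals unset would give one evaluation and one checkpoint per epoch (two of each in this example), so an explicit interval is mainly useful when finer granularity is wanted.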
maxOutputTokens
Setter for maxOutputTokens.
maxOutputTokens: The maximum number of tokens to generate per prompt. If not set, defaults to 32768.
-
clearMaxOutputTokens
Clears the value of maxOutputTokens field.
-
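The relationship between a setter, its clear method, and the "if not set" default can be modeled with an Optional field. The sketch below is a hypothetical stand-in, not the SDK class; in particular, it assumes the 32768 default is simply what applies whenever the field is empty, and the method effectiveMaxOutputTokens is invented for illustration.

```java
import java.util.Optional;

// Hypothetical stand-in; it only models the set / clear / "if not set" behavior described above.
final class MaxOutputTokensSketch {

  // Default quoted in the maxOutputTokens description.
  static final int DOCUMENTED_DEFAULT = 32768;

  private Optional<Integer> maxOutputTokens = Optional.empty();

  MaxOutputTokensSketch maxOutputTokens(Integer value) {
    this.maxOutputTokens = Optional.of(value);
    return this;
  }

  MaxOutputTokensSketch clearMaxOutputTokens() {
    this.maxOutputTokens = Optional.empty();
    return this;
  }

  // Value that would apply, assuming an unset field falls back to the documented default.
  int effectiveMaxOutputTokens() {
    return maxOutputTokens.orElse(DOCUMENTED_DEFAULT);
  }

  public static void main(String[] args) {
    MaxOutputTokensSketch builder = new MaxOutputTokensSketch();
    System.out.println(builder.effectiveMaxOutputTokens());                        // prints 32768
    System.out.println(builder.maxOutputTokens(4096).effectiveMaxOutputTokens());  // prints 4096
    System.out.println(builder.clearMaxOutputTokens().effectiveMaxOutputTokens()); // prints 32768
  }
}
```

Clearing is distinct from setting the default explicitly: after a clear the field is absent, so any future change to the default would apply, whereas an explicit 32768 would stay pinned.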
thinkingLevel
public abstract ReinforcementTuningHyperParameters.Builder thinkingLevel(ReinforcementTuningThinkingLevel thinkingLevel)
Setter for thinkingLevel.
thinkingLevel: Indicates the maximum thinking depth. Using it with earlier models results in an error.
-
clearThinkingLevel
Clears the value of thinkingLevel field.
-
thinkingLevel
@CanIgnoreReturnValue public ReinforcementTuningHyperParameters.Builder thinkingLevel(ReinforcementTuningThinkingLevel.Known knownType)
Setter for thinkingLevel given a known enum.
thinkingLevel: Indicates the maximum thinking depth. Using it with earlier models results in an error.
-
thinkingLevel
@CanIgnoreReturnValue public ReinforcementTuningHyperParameters.Builder thinkingLevel(String thinkingLevel)
Setter for thinkingLevel given a string.
thinkingLevel: Indicates the maximum thinking depth. Using it with earlier models results in an error.
-
build
-