Interface ReinforcementTuningStringMatchRewardScorer

Scores parsed responses for string matching use cases.

Properties

correctAnswerReward?: number

Reward returned when the evaluator evaluates to true. All correct answers receive the same reward.

Uses a JSON match expression to evaluate the parsed response.

Uses a string match expression to evaluate the parsed response.

wrongAnswerReward?: number

Reward returned when the evaluator evaluates to false. All wrong answers receive the same reward.
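
The two reward properties describe a binary mapping from the evaluator's result to a number. The following is a minimal sketch of that mapping, not the library's implementation. The interface `StringMatchRewardScorerSketch`, the helper `rewardFor`, and the fallback values of 1 and 0 for unset fields are all assumptions made for illustration; the actual defaults are not stated here.

```typescript
// Hypothetical local mirror of the two reward fields documented above.
interface StringMatchRewardScorerSketch {
  correctAnswerReward?: number;
  wrongAnswerReward?: number;
}

// Hypothetical helper: maps a boolean evaluator result to a reward.
// The fallbacks (1 and 0) are assumptions, not documented defaults.
function rewardFor(
  scorer: StringMatchRewardScorerSketch,
  evaluatorResult: boolean,
): number {
  const correct = scorer.correctAnswerReward ?? 1;
  const wrong = scorer.wrongAnswerReward ?? 0;
  return evaluatorResult ? correct : wrong;
}

const scorer: StringMatchRewardScorerSketch = {
  correctAnswerReward: 1.0,
  wrongAnswerReward: -0.5,
};

// Every true result maps to the same reward, and so does every false result.
const rewards = [true, false, true].map((ok) => rewardFor(scorer, ok));
console.log(rewards);
```

Because the reward depends only on the boolean outcome, this kind of scorer cannot grant partial credit: a near miss and a completely wrong answer receive the same `wrongAnswerReward`.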