Package com.google.genai.proto
Class SentencepieceModel.NormalizerSpec.Builder
java.lang.Object
com.google.protobuf.AbstractMessageLite.Builder
com.google.protobuf.AbstractMessage.Builder<BuilderT>
com.google.protobuf.GeneratedMessageV3.Builder<BuilderT>
com.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
com.google.genai.proto.SentencepieceModel.NormalizerSpec.Builder
- All Implemented Interfaces:
SentencepieceModel.NormalizerSpecOrBuilder,com.google.protobuf.GeneratedMessageV3.ExtendableMessageOrBuilder<SentencepieceModel.NormalizerSpec>,com.google.protobuf.Message.Builder,com.google.protobuf.MessageLite.Builder,com.google.protobuf.MessageLiteOrBuilder,com.google.protobuf.MessageOrBuilder,Cloneable
- Enclosing class:
- SentencepieceModel.NormalizerSpec
public static final class SentencepieceModel.NormalizerSpec.Builder
extends com.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
implements SentencepieceModel.NormalizerSpecOrBuilder
NormalizerSpec encodes a various parameters for string normalizationProtobuf type
com.google.genai.proto.NormalizerSpec-
Method Summary
Modifier and TypeMethodDescriptionaddExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, List<Type>> extension, Type value) addRepeatedField(com.google.protobuf.Descriptors.FieldDescriptor field, Object value) build()clear()Adds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.Replaces whitespace with meta symbol.clearExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, T> extension) clearField(com.google.protobuf.Descriptors.FieldDescriptor field) name of normalization rule.Custom normalization rule file in TSV format.clearOneof(com.google.protobuf.Descriptors.OneofDescriptor oneof) Pre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method.Removes leading, trailing, and duplicate internal whitespace.clone()booleanAdds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.static final com.google.protobuf.Descriptors.Descriptorcom.google.protobuf.Descriptors.DescriptorbooleanReplaces whitespace with meta symbol.getName()name of normalization rule.com.google.protobuf.ByteStringname of normalization rule.Custom normalization rule file in TSV format.com.google.protobuf.ByteStringCustom normalization rule file in TSV format.com.google.protobuf.ByteStringPre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method.booleanRemoves leading, trailing, and duplicate internal whitespace.booleanAdds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.booleanReplaces whitespace with meta symbol.booleanhasName()name of normalization rule.booleanCustom normalization rule file in TSV format.booleanPre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method.booleanRemoves leading, trailing, and duplicate internal whitespace.final booleanmergeFrom(com.google.protobuf.CodedInputStream input, com.google.protobuf.ExtensionRegistryLite extensionRegistry) mergeFrom(com.google.protobuf.Message other) mergeUnknownFields(com.google.protobuf.UnknownFieldSet unknownFields) setAddDummyPrefix(boolean value) Adds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.setEscapeWhitespaces(boolean value) Replaces whitespace with meta symbol.setExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, List<Type>> extension, int index, Type value) setExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, Type> extension, Type value) name of normalization rule.setNameBytes(com.google.protobuf.ByteString value) name of normalization rule.setNormalizationRuleTsv(String value) Custom normalization rule file in TSV format.setNormalizationRuleTsvBytes(com.google.protobuf.ByteString value) Custom normalization rule file in TSV format.setPrecompiledCharsmap(com.google.protobuf.ByteString value) Pre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method.setRemoveExtraWhitespaces(boolean value) Removes leading, trailing, and duplicate internal whitespace.setRepeatedField(com.google.protobuf.Descriptors.FieldDescriptor field, int index, Object value) setUnknownFields(com.google.protobuf.UnknownFieldSet unknownFields) Methods inherited from class com.google.protobuf.GeneratedMessageV3.ExtendableBuilder
addExtension, addExtension, clearExtension, clearExtension, getAllFields, getExtension, getExtension, getExtension, getExtension, getExtension, getExtension, getExtensionCount, getExtensionCount, getExtensionCount, getField, getFieldBuilder, getRepeatedField, getRepeatedFieldBuilder, getRepeatedFieldCount, hasExtension, hasExtension, hasExtension, hasField, newBuilderForField, setExtension, setExtension, setExtension, setExtensionMethods inherited from class com.google.protobuf.GeneratedMessageV3.Builder
getOneofFieldDescriptor, getUnknownFields, hasOneofMethods inherited from class com.google.protobuf.AbstractMessage.Builder
findInitializationErrors, getInitializationErrorString, mergeFrom, mergeFrom, mergeFrom, mergeFrom, mergeFrom, mergeFrom, mergeFrom, mergeFrom, mergeFrom, toStringMethods inherited from class com.google.protobuf.AbstractMessageLite.Builder
mergeDelimitedFrom, mergeDelimitedFrom, mergeFromMethods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, wait, wait, waitMethods inherited from interface com.google.protobuf.GeneratedMessageV3.ExtendableMessageOrBuilder
getExtension, getExtension, getExtension, getExtension, getExtension, getExtension, getExtensionCount, getExtensionCount, getExtensionCount, hasExtension, hasExtension, hasExtensionMethods inherited from interface com.google.protobuf.Message.Builder
mergeDelimitedFrom, mergeDelimitedFromMethods inherited from interface com.google.protobuf.MessageLite.Builder
mergeFromMethods inherited from interface com.google.protobuf.MessageOrBuilder
findInitializationErrors, getAllFields, getField, getInitializationErrorString, getOneofFieldDescriptor, getRepeatedField, getRepeatedFieldCount, getUnknownFields, hasField, hasOneof
-
Method Details
-
getDescriptor
public static final com.google.protobuf.Descriptors.Descriptor getDescriptor() -
clear
- Specified by:
clearin interfacecom.google.protobuf.Message.Builder- Specified by:
clearin interfacecom.google.protobuf.MessageLite.Builder- Overrides:
clearin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
getDescriptorForType
public com.google.protobuf.Descriptors.Descriptor getDescriptorForType()- Specified by:
getDescriptorForTypein interfacecom.google.protobuf.Message.Builder- Specified by:
getDescriptorForTypein interfacecom.google.protobuf.MessageOrBuilder- Overrides:
getDescriptorForTypein classcom.google.protobuf.GeneratedMessageV3.Builder<SentencepieceModel.NormalizerSpec.Builder>
-
getDefaultInstanceForType
- Specified by:
getDefaultInstanceForTypein interfacecom.google.protobuf.GeneratedMessageV3.ExtendableMessageOrBuilder<SentencepieceModel.NormalizerSpec>- Specified by:
getDefaultInstanceForTypein interfacecom.google.protobuf.MessageLiteOrBuilder- Specified by:
getDefaultInstanceForTypein interfacecom.google.protobuf.MessageOrBuilder
-
build
- Specified by:
buildin interfacecom.google.protobuf.Message.Builder- Specified by:
buildin interfacecom.google.protobuf.MessageLite.Builder
-
buildPartial
- Specified by:
buildPartialin interfacecom.google.protobuf.Message.Builder- Specified by:
buildPartialin interfacecom.google.protobuf.MessageLite.Builder
-
clone
- Specified by:
clonein interfacecom.google.protobuf.Message.Builder- Specified by:
clonein interfacecom.google.protobuf.MessageLite.Builder- Overrides:
clonein classcom.google.protobuf.GeneratedMessageV3.Builder<SentencepieceModel.NormalizerSpec.Builder>
-
setField
public SentencepieceModel.NormalizerSpec.Builder setField(com.google.protobuf.Descriptors.FieldDescriptor field, Object value) - Specified by:
setFieldin interfacecom.google.protobuf.Message.Builder- Overrides:
setFieldin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
clearField
public SentencepieceModel.NormalizerSpec.Builder clearField(com.google.protobuf.Descriptors.FieldDescriptor field) - Specified by:
clearFieldin interfacecom.google.protobuf.Message.Builder- Overrides:
clearFieldin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
clearOneof
public SentencepieceModel.NormalizerSpec.Builder clearOneof(com.google.protobuf.Descriptors.OneofDescriptor oneof) - Specified by:
clearOneofin interfacecom.google.protobuf.Message.Builder- Overrides:
clearOneofin classcom.google.protobuf.GeneratedMessageV3.Builder<SentencepieceModel.NormalizerSpec.Builder>
-
setRepeatedField
public SentencepieceModel.NormalizerSpec.Builder setRepeatedField(com.google.protobuf.Descriptors.FieldDescriptor field, int index, Object value) - Specified by:
setRepeatedFieldin interfacecom.google.protobuf.Message.Builder- Overrides:
setRepeatedFieldin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
addRepeatedField
public SentencepieceModel.NormalizerSpec.Builder addRepeatedField(com.google.protobuf.Descriptors.FieldDescriptor field, Object value) - Specified by:
addRepeatedFieldin interfacecom.google.protobuf.Message.Builder- Overrides:
addRepeatedFieldin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
setExtension
public <Type> SentencepieceModel.NormalizerSpec.Builder setExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, Type> extension, Type value) - Overrides:
setExtensionin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
setExtension
public <Type> SentencepieceModel.NormalizerSpec.Builder setExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, List<Type>> extension, int index, Type value) - Overrides:
setExtensionin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
addExtension
public <Type> SentencepieceModel.NormalizerSpec.Builder addExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, List<Type>> extension, Type value) - Overrides:
addExtensionin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
clearExtension
public <T> SentencepieceModel.NormalizerSpec.Builder clearExtension(com.google.protobuf.GeneratedMessage.GeneratedExtension<SentencepieceModel.NormalizerSpec, T> extension) - Overrides:
clearExtensionin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
mergeFrom
- Specified by:
mergeFromin interfacecom.google.protobuf.Message.Builder- Overrides:
mergeFromin classcom.google.protobuf.AbstractMessage.Builder<SentencepieceModel.NormalizerSpec.Builder>
-
mergeFrom
-
isInitialized
public final boolean isInitialized()- Specified by:
isInitializedin interfacecom.google.protobuf.MessageLiteOrBuilder- Overrides:
isInitializedin classcom.google.protobuf.GeneratedMessageV3.ExtendableBuilder<SentencepieceModel.NormalizerSpec,SentencepieceModel.NormalizerSpec.Builder>
-
mergeFrom
public SentencepieceModel.NormalizerSpec.Builder mergeFrom(com.google.protobuf.CodedInputStream input, com.google.protobuf.ExtensionRegistryLite extensionRegistry) throws IOException - Specified by:
mergeFromin interfacecom.google.protobuf.Message.Builder- Specified by:
mergeFromin interfacecom.google.protobuf.MessageLite.Builder- Overrides:
mergeFromin classcom.google.protobuf.AbstractMessage.Builder<SentencepieceModel.NormalizerSpec.Builder>- Throws:
IOException
-
hasName
public boolean hasName()name of normalization rule.
optional string name = 1;- Specified by:
hasNamein interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- Whether the name field is set.
-
getName
name of normalization rule.
optional string name = 1;- Specified by:
getNamein interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The name.
-
getNameBytes
public com.google.protobuf.ByteString getNameBytes()name of normalization rule.
optional string name = 1;- Specified by:
getNameBytesin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The bytes for name.
-
setName
name of normalization rule.
optional string name = 1;- Parameters:
value- The name to set.- Returns:
- This builder for chaining.
-
clearName
name of normalization rule.
optional string name = 1;- Returns:
- This builder for chaining.
-
setNameBytes
name of normalization rule.
optional string name = 1;- Parameters:
value- The bytes for name to set.- Returns:
- This builder for chaining.
-
hasPrecompiledCharsmap
public boolean hasPrecompiledCharsmap()Pre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method. Usually this field is set by Builder::GetNormalizerSpec() method.
optional bytes precompiled_charsmap = 2;- Specified by:
hasPrecompiledCharsmapin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- Whether the precompiledCharsmap field is set.
-
getPrecompiledCharsmap
public com.google.protobuf.ByteString getPrecompiledCharsmap()Pre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method. Usually this field is set by Builder::GetNormalizerSpec() method.
optional bytes precompiled_charsmap = 2;- Specified by:
getPrecompiledCharsmapin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The precompiledCharsmap.
-
setPrecompiledCharsmap
public SentencepieceModel.NormalizerSpec.Builder setPrecompiledCharsmap(com.google.protobuf.ByteString value) Pre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method. Usually this field is set by Builder::GetNormalizerSpec() method.
optional bytes precompiled_charsmap = 2;- Parameters:
value- The precompiledCharsmap to set.- Returns:
- This builder for chaining.
-
clearPrecompiledCharsmap
Pre-compiled normalization rule created by Builder::GetPrecompiledCharsMap() or Builder::CompileCharsMap() method. Usually this field is set by Builder::GetNormalizerSpec() method.
optional bytes precompiled_charsmap = 2;- Returns:
- This builder for chaining.
-
hasAddDummyPrefix
public boolean hasAddDummyPrefix()Adds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.
optional bool add_dummy_prefix = 3 [default = true];- Specified by:
hasAddDummyPrefixin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- Whether the addDummyPrefix field is set.
-
getAddDummyPrefix
public boolean getAddDummyPrefix()Adds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.
optional bool add_dummy_prefix = 3 [default = true];- Specified by:
getAddDummyPrefixin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The addDummyPrefix.
-
setAddDummyPrefix
Adds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.
optional bool add_dummy_prefix = 3 [default = true];- Parameters:
value- The addDummyPrefix to set.- Returns:
- This builder for chaining.
-
clearAddDummyPrefix
Adds dummy whitespace at the beginning of text in order to treat "world" in "world" and "hello world" in the same way.
optional bool add_dummy_prefix = 3 [default = true];- Returns:
- This builder for chaining.
-
hasRemoveExtraWhitespaces
public boolean hasRemoveExtraWhitespaces()Removes leading, trailing, and duplicate internal whitespace.
optional bool remove_extra_whitespaces = 4 [default = true];- Specified by:
hasRemoveExtraWhitespacesin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- Whether the removeExtraWhitespaces field is set.
-
getRemoveExtraWhitespaces
public boolean getRemoveExtraWhitespaces()Removes leading, trailing, and duplicate internal whitespace.
optional bool remove_extra_whitespaces = 4 [default = true];- Specified by:
getRemoveExtraWhitespacesin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The removeExtraWhitespaces.
-
setRemoveExtraWhitespaces
Removes leading, trailing, and duplicate internal whitespace.
optional bool remove_extra_whitespaces = 4 [default = true];- Parameters:
value- The removeExtraWhitespaces to set.- Returns:
- This builder for chaining.
-
clearRemoveExtraWhitespaces
Removes leading, trailing, and duplicate internal whitespace.
optional bool remove_extra_whitespaces = 4 [default = true];- Returns:
- This builder for chaining.
-
hasEscapeWhitespaces
public boolean hasEscapeWhitespaces()Replaces whitespace with meta symbol. This field must be true to train sentence piece model.
optional bool escape_whitespaces = 5 [default = true];- Specified by:
hasEscapeWhitespacesin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- Whether the escapeWhitespaces field is set.
-
getEscapeWhitespaces
public boolean getEscapeWhitespaces()Replaces whitespace with meta symbol. This field must be true to train sentence piece model.
optional bool escape_whitespaces = 5 [default = true];- Specified by:
getEscapeWhitespacesin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The escapeWhitespaces.
-
setEscapeWhitespaces
Replaces whitespace with meta symbol. This field must be true to train sentence piece model.
optional bool escape_whitespaces = 5 [default = true];- Parameters:
value- The escapeWhitespaces to set.- Returns:
- This builder for chaining.
-
clearEscapeWhitespaces
Replaces whitespace with meta symbol. This field must be true to train sentence piece model.
optional bool escape_whitespaces = 5 [default = true];- Returns:
- This builder for chaining.
-
hasNormalizationRuleTsv
public boolean hasNormalizationRuleTsv()Custom normalization rule file in TSV format. https://github.com/google/sentencepiece/blob/master/doc/normalization.md This field is only used in SentencePieceTrainer::Train() method, which compiles the rule into the binary rule stored in `precompiled_charsmap`.
optional string normalization_rule_tsv = 6;- Specified by:
hasNormalizationRuleTsvin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- Whether the normalizationRuleTsv field is set.
-
getNormalizationRuleTsv
Custom normalization rule file in TSV format. https://github.com/google/sentencepiece/blob/master/doc/normalization.md This field is only used in SentencePieceTrainer::Train() method, which compiles the rule into the binary rule stored in `precompiled_charsmap`.
optional string normalization_rule_tsv = 6;- Specified by:
getNormalizationRuleTsvin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The normalizationRuleTsv.
-
getNormalizationRuleTsvBytes
public com.google.protobuf.ByteString getNormalizationRuleTsvBytes()Custom normalization rule file in TSV format. https://github.com/google/sentencepiece/blob/master/doc/normalization.md This field is only used in SentencePieceTrainer::Train() method, which compiles the rule into the binary rule stored in `precompiled_charsmap`.
optional string normalization_rule_tsv = 6;- Specified by:
getNormalizationRuleTsvBytesin interfaceSentencepieceModel.NormalizerSpecOrBuilder- Returns:
- The bytes for normalizationRuleTsv.
-
setNormalizationRuleTsv
Custom normalization rule file in TSV format. https://github.com/google/sentencepiece/blob/master/doc/normalization.md This field is only used in SentencePieceTrainer::Train() method, which compiles the rule into the binary rule stored in `precompiled_charsmap`.
optional string normalization_rule_tsv = 6;- Parameters:
value- The normalizationRuleTsv to set.- Returns:
- This builder for chaining.
-
clearNormalizationRuleTsv
Custom normalization rule file in TSV format. https://github.com/google/sentencepiece/blob/master/doc/normalization.md This field is only used in SentencePieceTrainer::Train() method, which compiles the rule into the binary rule stored in `precompiled_charsmap`.
optional string normalization_rule_tsv = 6;- Returns:
- This builder for chaining.
-
setNormalizationRuleTsvBytes
public SentencepieceModel.NormalizerSpec.Builder setNormalizationRuleTsvBytes(com.google.protobuf.ByteString value) Custom normalization rule file in TSV format. https://github.com/google/sentencepiece/blob/master/doc/normalization.md This field is only used in SentencePieceTrainer::Train() method, which compiles the rule into the binary rule stored in `precompiled_charsmap`.
optional string normalization_rule_tsv = 6;- Parameters:
value- The bytes for normalizationRuleTsv to set.- Returns:
- This builder for chaining.
-
setUnknownFields
public final SentencepieceModel.NormalizerSpec.Builder setUnknownFields(com.google.protobuf.UnknownFieldSet unknownFields) - Specified by:
setUnknownFieldsin interfacecom.google.protobuf.Message.Builder- Overrides:
setUnknownFieldsin classcom.google.protobuf.GeneratedMessageV3.Builder<SentencepieceModel.NormalizerSpec.Builder>
-
mergeUnknownFields
public final SentencepieceModel.NormalizerSpec.Builder mergeUnknownFields(com.google.protobuf.UnknownFieldSet unknownFields) - Specified by:
mergeUnknownFieldsin interfacecom.google.protobuf.Message.Builder- Overrides:
mergeUnknownFieldsin classcom.google.protobuf.GeneratedMessageV3.Builder<SentencepieceModel.NormalizerSpec.Builder>
-