cerebras.modelzoo.tools.checkpoint_converters.bert.ConfigConverter_Bert_HF_CS21#

class cerebras.modelzoo.tools.checkpoint_converters.bert.ConfigConverter_Bert_HF_CS21[source]#

Bases: cerebras.modelzoo.tools.checkpoint_converters.bert.ConfigConverter_Bert_HF_CS18

CS 2.1 config is the same as CS 2.0

Methods

assert_factory_fn

assert_mlm_nonlinearity

attempt_mup_to_sp

Determines whether muP -> sP conversion is should be attempted.

convert

convert_all_keys

convert_helper

Converts all keys in a config from converter_indices.direction format to the other format.

convert_key

Attempts to convert the old key by matching against the list of conversion rules.

convert_position_embedding_type

file_formats

formats

get_converter_indices

get_mup_converter

Allows models to override the default muP converters with their own

load

post_config_convert

pre_config_convert

replaceKey

Copies value that exists at old_state_dict's old_key to new_state_dict's new_key.

save

supports_conversion

supports_mup_conversion

Determines whether muP -> sP conversion is supported for this model.

attempt_mup_to_sp()#

Determines whether muP -> sP conversion is should be attempted. Some HF models (such as BTLM) should not attempt muP -> sP conversion since they can natively handle muP.

convert_helper(config, converter_indices, drop_unmatched_keys=False, no_progress_bar=True, debug=False)#

Converts all keys in a config from converter_indices.direction format to the other format. Conversion will fail if at least one of the keys did not match on any conversion rules and drop_unmatched_keys is not enabled. Returns the newly converted config.

convert_key(old_key, old_state_dict, new_state_dict, from_index, match_start=0, prefix='', action_fn_args=None, debug=False)#

Attempts to convert the old key by matching against the list of conversion rules. The first rule to match is used for conversion (i.e. even if multiple rules would match, the latter ones are never used). Returns True if a conversion occurred.

get_mup_converter()#

Allows models to override the default muP converters with their own

static replaceKey(old_key, new_key, old_state_dict, new_state_dict, from_index, action_fn_args=None)#

Copies value that exists at old_state_dict’s old_key to new_state_dict’s new_key.

supports_mup_conversion()#

Determines whether muP -> sP conversion is supported for this model.