Getting Started
Concepts
Model Zoo
CS Torch
cerebras.pytorch
cerebras.pytorch.amp
cerebras.pytorch.optim
cerebras.pytorch.sparse
cerebras.pytorch.metrics
Cluster Monitoring
Fundamentals
Support
Bases: object
object
Class to manage checkpoints created within one run.
Initializes CkptInfo.
ckpt_dir (str) – directory where checkpoints are to be saved
Methods
update
Save ckpt_info if last checkpoint path changed.
ckpt_path (str) – path to the last checkpoint
max_store (Optional[int]) – maximum number of checkpoints to store
previous
cerebras.modelzoo.common.checkpoint_utils.get_latest_checkpoint
next
cerebras.modelzoo.common.dump_context