Getting Started
Concepts
Model Zoo
CS Torch
cerebras.pytorch
cerebras.pytorch.amp
cerebras.pytorch.optim
cerebras.pytorch.sparse
cerebras.pytorch.metrics
Cluster Monitoring
Fundamentals
Support
dedup
deduplicate_dataset
generate_connected_components
generate_duplicate_pairs
This script is used for duplicate pairs generation.
to_hash
previous
cerebras.modelzoo.data_preparation.data_preprocessing.custom_tokenizer_example.CustomLlama3Tokenizer.CustomLlama3Tokenizer
next
cerebras.modelzoo.data_preparation.data_preprocessing.data_dedup.dedup