cerebras.modelzoo.data_preparation.nlp.bert
Common pre-processing functions for BERTSUM data processing |
|
Preprocessed CSV data generator for BERT pretraining from raw text documents. |
|
Script to write HDF5 files for MLM_only and MLM + NSP datasets. |
|
Common pre-processing functions adapted, with minor modifications, from https://github.com/NVIDIA/DeepLearningExamples/blob/master/TensorFlow/LanguageModeling/BERT/run_ner.py |
|