cerebras.modelzoo.data_preparation.utils.whitespace_tokenize#

cerebras.modelzoo.data_preparation.utils.whitespace_tokenize(text, lower=False)[source]#

Splits a piece of text based on whitespace characters