cudf.core.column.string.StringMethods.token_count#

StringMethods.token_count(delimiter: str = ' ') SeriesOrIndex#

Each string is split into tokens using the provided delimiter. The returned integer sequence is the number of tokens in each string.

Parameters
delimiterstr or list of strs, Default is whitespace.

The characters or strings used to locate the split points of each string.

Returns
Series or Index.

Examples

>>> import cudf
>>> ser = cudf.Series(["hello world","goodbye",""])
>>> ser.str.token_count()
0    2
1    1
2    0
dtype: int32