Tokenize texts in the `text_cols` of the csv `fname` in parallel using `n_workers`
tokenize_csv(
fname,
text_cols,
outname = NULL,
n_workers = 4,
rules = NULL,
mark_fields = NULL,
tok = NULL,
header = "infer",
chunksize = 50000
)
file name
text columns
outname
numeber of workers
rules
mark fields
tokenizer
header
chunk size
None