controls_txt {blocking} | R Documentation |
Controls for processing character data
Description
Controls for text data used in the blocking
function (if representation = shingles
), passed to tokenize_character_shingles.
Usage
controls_txt(
n_shingles = 2L,
n_chunks = 10L,
lowercase = TRUE,
strip_non_alphanum = TRUE
)
Arguments
n_shingles |
length of shingles (default |
n_chunks |
passed to (default |
lowercase |
should the characters be made lower-case? (default |
strip_non_alphanum |
should punctuation and white space be stripped? (default |
Value
Returns a list with parameters.
Author(s)
Maciej Beręsewicz
[Package blocking version 1.0.1 Index]