DF_CLAMP_Tokenizer is the default tokenizer designed specifically for clinical notes.
Advanced users can use the config.conf file to change the default tokenization.
To replace the default file:
Double click on config.conf file to open it
Click on the button with three dots to browse for your own file
Click on the open button
DF_ OpenNLP_Tokenizer
This is an OpenNLP tokenizer. Advanced users can use its config.conf file to change its
default model, en-token.bin.
To replace the default model:
Double click on config.conf file to open it
Click on the button with three dots to browse for your own file
Click on the open button
DF_Tokenize_by_spaces
This tokenizer uses the spaces in a sentence to separate the tokens.