Class RegexTokenizer

java.lang.Object
org.apache.commons.text.similarity.RegexTokenizer
All Implemented Interfaces:
Function<CharSequence, CharSequence[]>, CharSequenceTokenizer<CharSequence>, Tokenizer<CharSequence, CharSequence>

final class RegexTokenizer extends Object implements CharSequenceTokenizer<CharSequence>
A simple word Tokenizer that utilizes a regex to find words. It applies a regex (\w)+ over the input text to extract words from a given character sequence.

Instances of this class are immutable and are safe for use by multiple concurrent threads.

Since:
1.0