public class NGramTokenizerFactory extends java.lang.Object implements TokenizerFactory
| Constructor and Description |
|---|
NGramTokenizerFactory(TokenizerFactory tokenizerFactory,
java.lang.Integer minN,
java.lang.Integer maxN) |
| Modifier and Type | Method and Description |
|---|---|
Tokenizer |
create(java.io.InputStream toTokenize)
Create a tokenizer based on an input stream
|
Tokenizer |
create(java.lang.String toTokenize)
The tokenizer to createComplex
|
TokenPreProcess |
getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
void |
setTokenPreProcessor(TokenPreProcess preProcessor)
Sets a token pre processor to be used
with every tokenizer
|
public NGramTokenizerFactory(TokenizerFactory tokenizerFactory, java.lang.Integer minN, java.lang.Integer maxN)
public Tokenizer create(java.lang.String toTokenize)
TokenizerFactorycreate in interface TokenizerFactorytoTokenize - the string to createComplex the tokenizer withpublic Tokenizer create(java.io.InputStream toTokenize)
TokenizerFactorycreate in interface TokenizerFactorypublic void setTokenPreProcessor(TokenPreProcess preProcessor)
TokenizerFactorysetTokenPreProcessor in interface TokenizerFactorypreProcessor - the token pre processor to usepublic TokenPreProcess getTokenPreProcessor()
getTokenPreProcessor in interface TokenizerFactory