public class UimaTokenizerFactory extends java.lang.Object implements TokenizerFactory
AnalysisEngine
to
tokenize text.Constructor and Description |
---|
UimaTokenizerFactory() |
UimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer) |
UimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer,
boolean checkForLabel) |
UimaTokenizerFactory(boolean checkForLabel) |
UimaTokenizerFactory(UimaResource resource) |
UimaTokenizerFactory(UimaResource resource,
boolean checkForLabel) |
Modifier and Type | Method and Description |
---|---|
Tokenizer |
create(java.io.InputStream toTokenize)
Create a tokenizer based on an input stream
|
Tokenizer |
create(java.lang.String toTokenize)
The tokenizer to createComplex
|
static org.apache.uima.analysis_engine.AnalysisEngine |
defaultAnalysisEngine()
Creates a tokenization,/stemming pipeline
|
TokenPreProcess |
getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
UimaResource |
getUimaResource() |
void |
setTokenPreProcessor(TokenPreProcess preProcessor)
Sets a token pre processor to be used
with every tokenizer
|
public UimaTokenizerFactory() throws org.apache.uima.resource.ResourceInitializationException
org.apache.uima.resource.ResourceInitializationException
public UimaTokenizerFactory(UimaResource resource)
public UimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer)
public UimaTokenizerFactory(UimaResource resource, boolean checkForLabel)
public UimaTokenizerFactory(boolean checkForLabel) throws org.apache.uima.resource.ResourceInitializationException
org.apache.uima.resource.ResourceInitializationException
public UimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer, boolean checkForLabel)
public Tokenizer create(java.lang.String toTokenize)
TokenizerFactory
create
in interface TokenizerFactory
toTokenize
- the string to createComplex the tokenizer withpublic UimaResource getUimaResource()
public static org.apache.uima.analysis_engine.AnalysisEngine defaultAnalysisEngine()
public Tokenizer create(java.io.InputStream toTokenize)
TokenizerFactory
create
in interface TokenizerFactory
public void setTokenPreProcessor(TokenPreProcess preProcessor)
TokenizerFactory
setTokenPreProcessor
in interface TokenizerFactory
preProcessor
- the token pre processor to usepublic TokenPreProcess getTokenPreProcessor()
getTokenPreProcessor
in interface TokenizerFactory