public class PosUimaTokenizerFactory extends java.lang.Object implements TokenizerFactory
See: org.deeplearning4j.text.tokenization.tokenizer.PosUimaTokenizer
Constructor and Description |
---|
PosUimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer, java.util.Collection<java.lang.String> allowedPosTags) |
PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags) |
PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags, boolean stripNones) |
Modifier and Type | Method and Description |
---|---|
Tokenizer | create(java.io.InputStream toTokenize) - Creates a tokenizer based on an input stream |
Tokenizer | create(java.lang.String toTokenize) - Creates a tokenizer for the given string |
static org.apache.uima.analysis_engine.AnalysisEngine | defaultAnalysisEngine() |
TokenPreProcess | getTokenPreProcessor() - Returns the TokenPreProcess set for this TokenizerFactory instance |
void | setTokenPreProcessor(TokenPreProcess preProcessor) - Sets a token pre-processor to be used with every tokenizer |
public PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags, boolean stripNones)
public PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags)
public PosUimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer, java.util.Collection<java.lang.String> allowedPosTags)
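The constructors above carry no description, so the following is only a rough sketch of how `allowedPoSTags` and `stripNones` might interact. It assumes that a token whose part-of-speech tag is not in the allowed collection is replaced by a `"NONE"` placeholder, and that `stripNones` drops those placeholders; neither behavior is stated on this page. `PosFilterSketch` is a hypothetical stand-in, not the DL4J class, and the hard-coded tags replace what a UIMA `AnalysisEngine` would supply.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in, not the DL4J class: illustrates POS-based filtering only.
final class PosFilterSketch {
    // Placeholder emitted for tokens whose tag is not allowed (an assumption).
    static final String NONE = "NONE";

    private final Set<String> allowedPosTags;
    private final boolean stripNones;

    PosFilterSketch(Collection<String> allowedPosTags, boolean stripNones) {
        this.allowedPosTags = new HashSet<>(allowedPosTags);
        this.stripNones = stripNones;
    }

    // Mirrors the single-argument constructor; keeping placeholders by default is assumed.
    PosFilterSketch(Collection<String> allowedPosTags) {
        this(allowedPosTags, false);
    }

    // Each entry of tagged is {token, posTag}; a real implementation would obtain
    // the tags from an analysis engine instead of receiving them as input.
    List<String> filter(List<String[]> tagged) {
        List<String> out = new ArrayList<>();
        for (String[] pair : tagged) {
            if (allowedPosTags.contains(pair[1])) {
                out.add(pair[0]);          // allowed tag: keep the token text
            } else if (!stripNones) {
                out.add(NONE);             // disallowed tag: keep a placeholder
            }                              // disallowed tag with stripNones: drop it
        }
        return out;
    }

    public static void main(String[] args) {
        List<String[]> tagged = Arrays.asList(
                new String[] {"The", "DT"},
                new String[] {"fox", "NN"},
                new String[] {"jumps", "VBZ"});
        Collection<String> nouns = Collections.singletonList("NN");
        System.out.println(new PosFilterSketch(nouns).filter(tagged));       // [NONE, fox, NONE]
        System.out.println(new PosFilterSketch(nouns, true).filter(tagged)); // [fox]
    }
}
```

Keeping placeholders preserves token positions, which can matter for windowed models; stripping them yields a shorter sequence containing only the allowed tokens.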
public static org.apache.uima.analysis_engine.AnalysisEngine defaultAnalysisEngine()
public Tokenizer create(java.lang.String toTokenize)
create in interface TokenizerFactory
toTokenize - the string to create the tokenizer with
public Tokenizer create(java.io.InputStream toTokenize)
create in interface TokenizerFactory
public void setTokenPreProcessor(TokenPreProcess preProcessor)
setTokenPreProcessor in interface TokenizerFactory
preProcessor - the token pre-processor to use
public TokenPreProcess getTokenPreProcessor()
getTokenPreProcessor in interface TokenizerFactory
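The factory contract documented above (two `create` overloads plus a pre-processor that applies to every tokenizer) can be sketched without the DL4J or UIMA dependencies. Everything below is hypothetical: the `Sketch*` interfaces only mirror the method names on this page, the `preProcess`, `hasMoreTokens`, and `nextToken` method names are assumptions, and the whitespace splitting stands in for the real UIMA-based tokenization.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.NoSuchElementException;
import java.util.stream.Collectors;

// Hypothetical minimal interfaces mirroring the method names documented above.
interface SketchTokenPreProcess {
    String preProcess(String token);
}

interface SketchTokenizer {
    boolean hasMoreTokens();
    String nextToken();
}

interface SketchTokenizerFactory {
    SketchTokenizer create(String toTokenize);
    SketchTokenizer create(InputStream toTokenize);
    void setTokenPreProcessor(SketchTokenPreProcess preProcessor);
    SketchTokenPreProcess getTokenPreProcessor();
}

// Whitespace-splitting stand-in for a real factory implementation.
final class WhitespaceFactorySketch implements SketchTokenizerFactory {
    private SketchTokenPreProcess preProcessor; // null means tokens pass through unchanged

    @Override
    public SketchTokenizer create(String toTokenize) {
        final String trimmed = toTokenize.trim();
        final String[] parts = trimmed.isEmpty() ? new String[0] : trimmed.split("\\s+");
        final SketchTokenPreProcess pre = preProcessor; // captured when the tokenizer is created
        return new SketchTokenizer() {
            private int next = 0;

            @Override
            public boolean hasMoreTokens() {
                return next < parts.length;
            }

            @Override
            public String nextToken() {
                if (!hasMoreTokens()) {
                    throw new NoSuchElementException("no more tokens");
                }
                String token = parts[next++];
                return pre == null ? token : pre.preProcess(token);
            }
        };
    }

    @Override
    public SketchTokenizer create(InputStream toTokenize) {
        // Read the whole stream as UTF-8 text, then reuse the String overload.
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(toTokenize, StandardCharsets.UTF_8))) {
            return create(reader.lines().collect(Collectors.joining(" ")));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    @Override
    public void setTokenPreProcessor(SketchTokenPreProcess preProcessor) {
        this.preProcessor = preProcessor;
    }

    @Override
    public SketchTokenPreProcess getTokenPreProcessor() {
        return preProcessor;
    }
}
```

In this sketch the pre-processor is captured when a tokenizer is created, so changing it on the factory affects only tokenizers created afterwards. Whether the real class behaves the same way is not stated here.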