public class PosUimaTokenizerFactory extends java.lang.Object implements TokenizerFactory
{org.deeplearning4j.text.tokenization.tokenizer.PosUimaTokenizer}| Constructor and Description |
|---|
PosUimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer,
java.util.Collection<java.lang.String> allowedPosTags) |
PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags) |
PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags,
boolean stripNones) |
| Modifier and Type | Method and Description |
|---|---|
Tokenizer |
create(java.io.InputStream toTokenize)
Create a tokenizer based on an input stream
|
Tokenizer |
create(java.lang.String toTokenize)
The tokenizer to createComplex
|
static org.apache.uima.analysis_engine.AnalysisEngine |
defaultAnalysisEngine() |
TokenPreProcess |
getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
void |
setTokenPreProcessor(TokenPreProcess preProcessor)
Sets a token pre processor to be used
with every tokenizer
|
public PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags,
boolean stripNones)
public PosUimaTokenizerFactory(java.util.Collection<java.lang.String> allowedPoSTags)
public PosUimaTokenizerFactory(org.apache.uima.analysis_engine.AnalysisEngine tokenizer,
java.util.Collection<java.lang.String> allowedPosTags)
public static org.apache.uima.analysis_engine.AnalysisEngine defaultAnalysisEngine()
public Tokenizer create(java.lang.String toTokenize)
TokenizerFactorycreate in interface TokenizerFactorytoTokenize - the string to createComplex the tokenizer withpublic Tokenizer create(java.io.InputStream toTokenize)
TokenizerFactorycreate in interface TokenizerFactorypublic void setTokenPreProcessor(TokenPreProcess preProcessor)
TokenizerFactorysetTokenPreProcessor in interface TokenizerFactorypreProcessor - the token pre processor to usepublic TokenPreProcess getTokenPreProcessor()
getTokenPreProcessor in interface TokenizerFactory