public class CnnSentenceDataSetIterator
extends java.lang.Object
implements org.nd4j.linalg.dataset.api.iterator.DataSetIterator
The iterator produces the following arrays for each minibatch:
- Features are a 4d array with shape [minibatchSize, 1, maxSentenceLength, wordVectorSize] when sentencesAlongHeight = true, or [minibatchSize, 1, wordVectorSize, maxSentenceLength] when sentencesAlongHeight = false.
- Labels are a 2d array with shape [minibatchSize, numLabels].
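The two feature layouts can be illustrated with a minimal sketch in plain Java. It does not call the DL4J API: the class `FeatureShapeSketch` and its helper methods are hypothetical, and the example sizes (32 sentences, 256 words, 300-dimensional vectors, 2 labels) are assumed purely for illustration.

```java
import java.util.Arrays;

public class FeatureShapeSketch {

    /**
     * Expected feature shape for one minibatch, following the description above.
     * Hypothetical helper, not part of DL4J.
     */
    public static long[] featureShape(int minibatchSize, int maxSentenceLength,
                                      int wordVectorSize, boolean sentencesAlongHeight) {
        if (sentencesAlongHeight) {
            // Words run along the height dimension, vector elements along the width
            return new long[] {minibatchSize, 1, maxSentenceLength, wordVectorSize};
        }
        // Vector elements run along the height dimension, words along the width
        return new long[] {minibatchSize, 1, wordVectorSize, maxSentenceLength};
    }

    /** Expected label shape for one minibatch. Hypothetical helper. */
    public static long[] labelShape(int minibatchSize, int numLabels) {
        return new long[] {minibatchSize, numLabels};
    }

    public static void main(String[] args) {
        // Assumed example sizes, chosen only for illustration
        System.out.println(Arrays.toString(featureShape(32, 256, 300, true)));  // [32, 1, 256, 300]
        System.out.println(Arrays.toString(featureShape(32, 256, 300, false))); // [32, 1, 300, 256]
        System.out.println(Arrays.toString(labelShape(32, 2)));                 // [32, 2]
    }
}
```

In both layouts the second dimension is 1, so the only difference is which of the last two dimensions holds the word positions.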
Sentences and labels are provided by a LabeledSentenceProvider. Different implementations provide different ways of loading sentences/documents with labels, for example from files.
Note: Labels are sorted alphabetically before class indices are assigned. To get the assignment/mapping, use getLabels() or getLabelClassMap().
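The alphabetical label-to-index assignment described in the note can be sketched in plain Java as follows. This is an illustration only, not the library's implementation: the class `LabelIndexSketch` and its methods are hypothetical, and the sketch assumes "alphabetically" means the natural (case-sensitive) ordering of `String`.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

public class LabelIndexSketch {

    /** Returns the distinct labels in natural String order (assumed ordering). */
    public static List<String> sortedLabels(List<String> rawLabels) {
        // TreeSet removes duplicates and sorts in natural order
        return new ArrayList<>(new TreeSet<>(rawLabels));
    }

    /** Maps each label to its position in the sorted list, i.e. its class index. */
    public static Map<String, Integer> labelClassMap(List<String> rawLabels) {
        Map<String, Integer> map = new LinkedHashMap<>();
        List<String> sorted = sortedLabels(rawLabels);
        for (int i = 0; i < sorted.size(); i++) {
            map.put(sorted.get(i), i);
        }
        return map;
    }

    public static void main(String[] args) {
        // Hypothetical label set in the order a provider might report it
        List<String> raw = List.of("positive", "negative", "neutral");
        System.out.println(sortedLabels(raw));  // [negative, neutral, positive]
        System.out.println(labelClassMap(raw)); // {negative=0, neutral=1, positive=2}
    }
}
```

Under this assumption, column 0 of the label array corresponds to "negative", not to the first label the provider returned, which is why the mapping should be read from getLabels() or getLabelClassMap() rather than assumed.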
Modifier and Type | Class and Description |
---|---|
static class | CnnSentenceDataSetIterator.Builder |
static class | CnnSentenceDataSetIterator.UnknownWordHandling |
Modifier and Type | Method and Description |
---|---|
boolean | asyncSupported() |
int | batch() |
int | cursor() |
java.util.Map<java.lang.String,java.lang.Integer> | getLabelClassMap() |
java.util.List<java.lang.String> | getLabels() |
org.nd4j.linalg.dataset.api.DataSetPreProcessor | getPreProcessor() |
boolean | hasNext() |
int | inputColumns() |
org.nd4j.linalg.api.ndarray.INDArray | loadSingleSentence(java.lang.String sentence): Generally used after training to load a single sentence for prediction. |
org.nd4j.linalg.dataset.DataSet | next() |
org.nd4j.linalg.dataset.DataSet | next(int num) |
int | numExamples() |
void | remove() |
void | reset() |
boolean | resetSupported() |
void | setPreProcessor(org.nd4j.linalg.dataset.api.DataSetPreProcessor preProcessor) |
int | totalExamples() |
int | totalOutcomes() |
public org.nd4j.linalg.api.ndarray.INDArray loadSingleSentence(java.lang.String sentence)
public java.util.Map<java.lang.String,java.lang.Integer> getLabelClassMap()
public java.util.List<java.lang.String> getLabels()
Specified by: getLabels in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public boolean hasNext()
Specified by: hasNext in interface java.util.Iterator<org.nd4j.linalg.dataset.DataSet>
public org.nd4j.linalg.dataset.DataSet next()
Specified by: next in interface java.util.Iterator<org.nd4j.linalg.dataset.DataSet>
public org.nd4j.linalg.dataset.DataSet next(int num)
Specified by: next in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public int totalExamples()
Specified by: totalExamples in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public int inputColumns()
Specified by: inputColumns in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public int totalOutcomes()
Specified by: totalOutcomes in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public boolean resetSupported()
Specified by: resetSupported in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public boolean asyncSupported()
Specified by: asyncSupported in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public void reset()
Specified by: reset in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public int batch()
Specified by: batch in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public int cursor()
Specified by: cursor in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public int numExamples()
Specified by: numExamples in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public void setPreProcessor(org.nd4j.linalg.dataset.api.DataSetPreProcessor preProcessor)
Specified by: setPreProcessor in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public org.nd4j.linalg.dataset.api.DataSetPreProcessor getPreProcessor()
Specified by: getPreProcessor in interface org.nd4j.linalg.dataset.api.iterator.DataSetIterator
public void remove()
Specified by: remove in interface java.util.Iterator<org.nd4j.linalg.dataset.DataSet>