org.datavec.spark.functions.pairdata

Interface Summary
Interface	Description
PathToKeyConverter	Used to match up files based on their file names, for PairSequenceRecordReaderBytesFunction. For example, given the files "/features_0.csv" and "/labels_0.csv", both map to the same key: "0".
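
The key-matching idea described for PathToKeyConverter can be sketched in plain Java. This is a minimal illustration, not DataVec's implementation: the `KeyFromPath` interface and the `DIGITS` converter are hypothetical stand-ins, and keeping only the digits of the file name is an assumed strategy chosen to reproduce the "/features_0.csv" and "/labels_0.csv" example.

```java
public class PathKeySketch {
    /** Hypothetical stand-in for a path-to-key converter; not the DataVec interface itself. */
    interface KeyFromPath {
        String getKey(String path);
    }

    /** Assumed strategy: drop directories and extension, then keep only the digits. */
    static final KeyFromPath DIGITS = path -> {
        // File name without its directories
        String name = path.substring(path.lastIndexOf('/') + 1);
        // File name without its extension, so digits in the extension are ignored
        int dot = name.lastIndexOf('.');
        if (dot > 0) {
            name = name.substring(0, dot);
        }
        return name.replaceAll("[^0-9]", "");
    };

    public static void main(String[] args) {
        // Both paths are expected to yield the same key
        System.out.println(DIGITS.getKey("/features_0.csv"));
        System.out.println(DIGITS.getKey("/labels_0.csv"));
    }
}
```

Two files that share a key are treated as belonging to the same example, which is what lets a features file be matched with its labels file.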

Class Summary
Class	Description
BytesPairWritable	A Hadoop writable class for a pair of byte arrays, plus the original URIs (as Strings) of the files they came from
MapToBytesPairWritableFunction	A function to read files (assuming exactly 2 per input) from a PortableDataStream and combine the contents into a BytesPairWritable
PairSequenceRecordReaderBytesFunction	Converts two sets of binary data (in the form of a BytesPairWritable) to DataVec format data (`Tuple2<List<List<Writable>>,List<List<Writable>>>`) using two SequenceRecordReaders.
PathToKeyConverterFilename	Convert the path to a key by taking the full file name (excluding the file extension and directories)
PathToKeyConverterNumber	A PathToKeyConverter that generates a key based on the file name.
PathToKeyFunction	Given a Tuple2, where the first value is the full path, map this to a Tuple3 where the first value is a key (using a `PathToKeyConverter`), second is an index, and third is the original data stream
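
How a key and an index together allow two file collections to be paired up can be sketched with plain Java collections instead of Spark. Everything here is illustrative: `PairByKeySketch`, `key`, and `pair` are hypothetical names, the digit-based key is an assumed strategy, and the sketch assumes the index identifies which of the two collections a path came from (0 for the first, 1 for the second).

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class PairByKeySketch {
    /** Hypothetical key function: the digits of the file name, ignoring directories and extension. */
    static String key(String path) {
        String name = path.substring(path.lastIndexOf('/') + 1);
        int dot = name.lastIndexOf('.');
        if (dot > 0) {
            name = name.substring(0, dot);
        }
        return name.replaceAll("[^0-9]", "");
    }

    /** Groups paths by key; slot 0 holds the path from the first list, slot 1 the path from the second. */
    static Map<String, String[]> pair(List<String> first, List<String> second) {
        Map<String, String[]> pairs = new TreeMap<>();
        for (String p : first) {
            pairs.computeIfAbsent(key(p), k -> new String[2])[0] = p;
        }
        for (String p : second) {
            pairs.computeIfAbsent(key(p), k -> new String[2])[1] = p;
        }
        return pairs;
    }

    public static void main(String[] args) {
        // The two lists are deliberately in different orders; matching relies on the key only
        Map<String, String[]> pairs = pair(
                Arrays.asList("/features_0.csv", "/features_1.csv"),
                Arrays.asList("/labels_1.csv", "/labels_0.csv"));
        pairs.forEach((k, v) -> System.out.println(k + " -> " + Arrays.toString(v)));
    }
}
```

Because matching depends only on the key, the order in which files are listed does not matter; a key with a missing partner would leave one slot of its pair as `null`.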
