Class ColumnEncoderRecode
- java.lang.Object
  - org.apache.sysds.runtime.transform.encode.ColumnEncoder
    - org.apache.sysds.runtime.transform.encode.ColumnEncoderRecode
 
 
All Implemented Interfaces:
Externalizable, Serializable, Comparable<ColumnEncoder>, Encoder

public class ColumnEncoderRecode extends ColumnEncoder

See Also:
- Serialized Form
 
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.sysds.runtime.transform.encode.ColumnEncoder:
- ColumnEncoder.EncoderType
 
Field Summary
Fields (Modifier and Type, Field):
- static boolean SORT_RECODE_MAP
Fields inherited from class org.apache.sysds.runtime.transform.encode.ColumnEncoder:
- APPLY_ROW_BLOCKS_PER_COLUMN, BUILD_ROW_BLOCKS_PER_COLUMN
 
Constructor Summary
- ColumnEncoderRecode()
- ColumnEncoderRecode(int colID)
Method Summary
Modifier and Type, Method: Description
- void allocateMetaData(FrameBlock meta): Pre-allocate a FrameBlock for metadata collection.
- void build(CacheBlock in): Build the transform meta data for the given block input.
- void buildPartial(FrameBlock in): Partial build of internal data structures (e.g., in distributed Spark operations).
- void computeRCDMapSizeEstimate(CacheBlock in, int[] sampleIndices)
- static String constructRecodeMapEntry(String token, Long code): Returns the recode map entry, the concatenation of token and code with a delimiter in between.
- boolean equals(Object o)
- Callable<Object> getBuildTask(CacheBlock in)
- HashMap<String,Long> getCPRecodeMaps()
- HashSet<Object> getCPRecodeMapsPartial()
- FrameBlock getMetaData(FrameBlock meta): Construct a frame block out of the transform meta data.
- int getNumDistinctValues()
- Callable<Object> getPartialBuildTask(CacheBlock in, int startRow, int blockSize, HashMap<Integer,Object> ret)
- Callable<Object> getPartialMergeBuildTask(HashMap<Integer,?> ret)
- HashMap<String,Long> getRcdMap()
- int hashCode()
- void initMetaData(FrameBlock meta): Construct the recode maps from the given input frame for all columns registered for recode.
- void mergeAt(ColumnEncoder other): Merges another encoder, of a compatible type, in after a certain position.
- void prepareBuildPartial(): Allocates internal data structures for partial build.
- void readExternal(ObjectInput in): Redirects the default Java serialization via Externalizable to our default Hadoop writable serialization for efficient broadcast/RDD deserialization.
- void sortCPRecodeMaps()
- static String[] splitRecodeMapEntry(String value): Splits a recode map entry into its token and code.
- void writeExternal(ObjectOutput out): Redirects the default Java serialization via Externalizable to our default Hadoop writable serialization for efficient broadcast/RDD serialization.

Methods inherited from class org.apache.sysds.runtime.transform.encode.ColumnEncoder:
apply, apply, build, build, compareTo, getApplyTasks, getBuildTasks, getColID, getColMapping, getEstMetaSize, getEstNumDistincts, getSparseRowsWZeros, isApplicable, isApplicable, setColID, setEstMetaSize, setEstNumDistincts, shiftCol, updateIndexRanges
 
Method Detail

constructRecodeMapEntry
public static String constructRecodeMapEntry(String token, Long code)
Returns the recode map entry, which is the concatenation of token and code with a delimiter in between.
Parameters:
- token - the token that is part of the recode map
- code - the code assigned to the token
Returns:
- the concatenation of token and code with the delimiter in between
 
splitRecodeMapEntry
public static String[] splitRecodeMapEntry(String value)
Splits a recode map entry into its token and code.
Parameters:
- value - concatenation of token and code with the delimiter in between
Returns:
- string array of token and code
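
The following is a minimal, self-contained sketch of the construct/split contract described above. It does not call SystemDS: the class name RecodeEntrySketch and the delimiter character are assumptions made for illustration, and splitting at the last delimiter is one plausible way to let tokens that contain the delimiter round-trip.

```java
// Sketch only: mirrors the documented contract, not the SystemDS source.
final class RecodeEntrySketch {
    // Hypothetical delimiter; the real one is defined inside SystemDS.
    static final char DELIM = '\u00b7';

    // token + delimiter + code, as stated in the documented return value.
    static String construct(String token, Long code) {
        return token + DELIM + code;
    }

    // Split at the LAST delimiter so that a token which itself contains
    // the delimiter still round-trips.
    static String[] split(String entry) {
        int pos = entry.lastIndexOf(DELIM);
        if (pos < 0)
            throw new IllegalArgumentException("no delimiter in: " + entry);
        return new String[] { entry.substring(0, pos), entry.substring(pos + 1) };
    }

    public static void main(String[] args) {
        String entry = construct("red", 3L);
        String[] parts = split(entry);
        // parts[0] is the token, parts[1] the code as a string
        System.out.println(parts[0] + " -> " + Long.parseLong(parts[1]));
    }
}
```

The second array element is still a string, so callers would parse it (for example with Long.parseLong) before using it as a code.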
 
sortCPRecodeMaps
public void sortCPRecodeMaps()

computeRCDMapSizeEstimate
public void computeRCDMapSizeEstimate(CacheBlock in, int[] sampleIndices)

build
public void build(CacheBlock in)
Description copied from interface: Encoder
Build the transform meta data for the given block input. This call modifies and keeps meta data as encoder state.
Parameters:
- in - input frame block
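
To illustrate what building recode meta data for one column amounts to, here is a hedged sketch that maps each distinct token to a numeric code. It works on a plain String[] rather than a CacheBlock; the class name RecodeBuildSketch, the 1-based codes in first-seen order, and the skipping of null or empty tokens are assumptions, not confirmed behavior of this class.

```java
import java.util.HashMap;

// Sketch only: a stand-in for the build step on a single column.
final class RecodeBuildSketch {
    // Assign each distinct token the next free code, starting at 1
    // (1-based codes and skipping null/empty tokens are assumptions).
    static HashMap<String, Long> build(String[] column) {
        HashMap<String, Long> rcdMap = new HashMap<>();
        for (String token : column) {
            if (token == null || token.isEmpty())
                continue;
            // putIfAbsent leaves the code of an already-seen token untouched
            rcdMap.putIfAbsent(token, (long) rcdMap.size() + 1);
        }
        return rcdMap;
    }

    public static void main(String[] args) {
        HashMap<String, Long> m = build(new String[] { "red", "blue", "red", null, "green" });
        System.out.println(m.size() + " distinct values, red=" + m.get("red"));
    }
}
```

The resulting map plays the role of the encoder state that later calls such as getRcdMap() or getNumDistinctValues() would expose.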
 
getBuildTask
public Callable<Object> getBuildTask(CacheBlock in)
Overrides:
- getBuildTask in class ColumnEncoder

getPartialBuildTask
public Callable<Object> getPartialBuildTask(CacheBlock in, int startRow, int blockSize, HashMap<Integer,Object> ret)
Overrides:
- getPartialBuildTask in class ColumnEncoder

getPartialMergeBuildTask
public Callable<Object> getPartialMergeBuildTask(HashMap<Integer,?> ret)
Overrides:
- getPartialMergeBuildTask in class ColumnEncoder
 
prepareBuildPartial
public void prepareBuildPartial()
Description copied from class: ColumnEncoder
Allocates internal data structures for partial build.
Specified by:
- prepareBuildPartial in interface Encoder
Overrides:
- prepareBuildPartial in class ColumnEncoder
 
buildPartial
public void buildPartial(FrameBlock in)
Description copied from class: ColumnEncoder
Partial build of internal data structures (e.g., in distributed Spark operations).
Specified by:
- buildPartial in interface Encoder
Overrides:
- buildPartial in class ColumnEncoder
Parameters:
- in - input frame block
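
A partial build followed by a merge can be pictured as collecting distinct tokens per row block and then combining them into one recode map. The sketch below borrows the startRow/blockSize shape of getPartialBuildTask and the HashSet<Object> type of getCPRecodeMapsPartial(), but everything else is an assumption: the class name PartialRecodeSketch, keying partial results by start row, and assigning 1-based codes only after the union.

```java
import java.util.HashMap;
import java.util.HashSet;

// Sketch only: partial distinct sets per row block, merged afterwards.
final class PartialRecodeSketch {
    // Partial build: collect the distinct tokens of one row block.
    static HashSet<Object> buildPartial(String[] column, int startRow, int blockSize) {
        HashSet<Object> distinct = new HashSet<>();
        int end = Math.min(startRow + blockSize, column.length);
        for (int i = startRow; i < end; i++)
            if (column[i] != null && !column[i].isEmpty())
                distinct.add(column[i]);
        return distinct;
    }

    // Merge: union the partial sets, then assign codes (1-based, an assumption).
    static HashMap<String, Long> merge(HashMap<Integer, HashSet<Object>> partials) {
        HashMap<String, Long> rcdMap = new HashMap<>();
        for (HashSet<Object> part : partials.values())
            for (Object token : part)
                rcdMap.putIfAbsent(token.toString(), (long) rcdMap.size() + 1);
        return rcdMap;
    }

    public static void main(String[] args) {
        String[] column = { "a", "b", "a", "c", "b", "d" };
        int blockSize = 2;
        HashMap<Integer, HashSet<Object>> partials = new HashMap<>();
        for (int start = 0; start < column.length; start += blockSize)
            partials.put(start, buildPartial(column, start, blockSize));
        System.out.println("distinct after merge: " + merge(partials).size());
    }
}
```

Because the partial results are sets, the blocks can be processed in any order or in parallel; only the final code assignment depends on the iteration order of the merged tokens.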
 
mergeAt
public void mergeAt(ColumnEncoder other)
Description copied from class: ColumnEncoder
Merges another encoder, of a compatible type, in after a certain position. Resizes as necessary. ColumnEncoders are compatible with themselves, and EncoderComposite is compatible with every other ColumnEncoder. MultiColumnEncoders are compatible with every encoder.
Overrides:
- mergeAt in class ColumnEncoder
Parameters:
- other - the encoder that should be merged in
 
getNumDistinctValues
public int getNumDistinctValues()

allocateMetaData
public void allocateMetaData(FrameBlock meta)
Description copied from interface: Encoder
Pre-allocate a FrameBlock for metadata collection.
Parameters:
- meta - frame block
 
getMetaData
public FrameBlock getMetaData(FrameBlock meta)
Description copied from interface: Encoder
Construct a frame block out of the transform meta data.
Parameters:
- meta - output frame block
Returns:
- the output frame block
 
initMetaData
public void initMetaData(FrameBlock meta)
Construct the recode maps from the given input frame for all columns registered for recode.
Parameters:
- meta - frame block
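
The relationship between getMetaData and initMetaData can be read as a round trip: the recode map is written out as one entry string per distinct token, and later rebuilt from those strings. The sketch below uses a String[] as a stand-in for one FrameBlock column; the class name RecodeMetaSketch, the delimiter, and the idea that a flag like SORT_RECODE_MAP controls sorting of the entries are all assumptions.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Sketch only: a String[] stands in for one metadata column of a FrameBlock.
final class RecodeMetaSketch {
    static final char DELIM = '\u00b7'; // hypothetical delimiter

    // getMetaData-like step: one entry string per distinct token.
    static String[] toMeta(HashMap<String, Long> rcdMap, boolean sort) {
        String[] meta = new String[rcdMap.size()];
        int i = 0;
        for (Map.Entry<String, Long> e : rcdMap.entrySet())
            meta[i++] = e.getKey() + DELIM + e.getValue();
        if (sort)
            Arrays.sort(meta); // deterministic output order
        return meta;
    }

    // initMetaData-like step: rebuild the map from the entry strings.
    static HashMap<String, Long> fromMeta(String[] meta) {
        HashMap<String, Long> rcdMap = new HashMap<>();
        for (String entry : meta) {
            int pos = entry.lastIndexOf(DELIM);
            rcdMap.put(entry.substring(0, pos), Long.parseLong(entry.substring(pos + 1)));
        }
        return rcdMap;
    }

    public static void main(String[] args) {
        HashMap<String, Long> m = new HashMap<>();
        m.put("red", 1L);
        m.put("blue", 2L);
        String[] meta = toMeta(m, true);
        System.out.println("round trip ok: " + fromMeta(meta).equals(m));
    }
}
```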
 
writeExternal
public void writeExternal(ObjectOutput out) throws IOException
Description copied from class: ColumnEncoder
Redirects the default Java serialization via Externalizable to our default Hadoop writable serialization for efficient broadcast/RDD serialization.
Specified by:
- writeExternal in interface Externalizable
Overrides:
- writeExternal in class ColumnEncoder
Parameters:
- out - object output
Throws:
- IOException - if an IOException occurs
 
readExternal
public void readExternal(ObjectInput in) throws IOException
Description copied from class: ColumnEncoder
Redirects the default Java serialization via Externalizable to our default Hadoop writable serialization for efficient broadcast/RDD deserialization.
Specified by:
- readExternal in interface Externalizable
Overrides:
- readExternal in class ColumnEncoder
Parameters:
- in - object input
Throws:
- IOException - if an IOException occurs
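
As a rough illustration of the Externalizable pattern that writeExternal and readExternal rely on, the sketch below serializes a column ID and a recode map in a compact, hand-written layout. It uses only java.io and does not involve Hadoop writables; the class name RecodeStateSketch, its fields, and the wire layout (ID, size, then key/value pairs) are assumptions, not the format used by this class.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.Externalizable;
import java.io.IOException;
import java.io.ObjectInput;
import java.io.ObjectInputStream;
import java.io.ObjectOutput;
import java.io.ObjectOutputStream;
import java.util.HashMap;
import java.util.Map;

// Sketch only: custom serialization of a column ID plus recode map.
final class RecodeStateSketch implements Externalizable {
    private int colID;
    private HashMap<String, Long> rcdMap = new HashMap<>();

    public RecodeStateSketch() {} // public no-arg constructor required by Externalizable

    RecodeStateSketch(int colID, HashMap<String, Long> rcdMap) {
        this.colID = colID;
        this.rcdMap = rcdMap;
    }

    int colID() { return colID; }
    HashMap<String, Long> rcdMap() { return rcdMap; }

    @Override
    public void writeExternal(ObjectOutput out) throws IOException {
        out.writeInt(colID);
        out.writeInt(rcdMap.size());
        for (Map.Entry<String, Long> e : rcdMap.entrySet()) {
            out.writeUTF(e.getKey());
            out.writeLong(e.getValue());
        }
    }

    @Override
    public void readExternal(ObjectInput in) throws IOException {
        colID = in.readInt();
        int size = in.readInt();
        rcdMap = new HashMap<>();
        for (int i = 0; i < size; i++) {
            String key = in.readUTF();
            rcdMap.put(key, in.readLong());
        }
    }

    // Serialize to a byte array and read it back through the hooks above.
    static RecodeStateSketch roundTrip(RecodeStateSketch s) {
        try {
            ByteArrayOutputStream bos = new ByteArrayOutputStream();
            try (ObjectOutputStream oos = new ObjectOutputStream(bos)) {
                oos.writeObject(s);
            }
            try (ObjectInputStream ois = new ObjectInputStream(new ByteArrayInputStream(bos.toByteArray()))) {
                return (RecodeStateSketch) ois.readObject();
            }
        } catch (IOException | ClassNotFoundException ex) {
            throw new IllegalStateException(ex);
        }
    }

    public static void main(String[] args) {
        HashMap<String, Long> m = new HashMap<>();
        m.put("red", 1L);
        RecodeStateSketch copy = roundTrip(new RecodeStateSketch(4, m));
        System.out.println("colID=" + copy.colID() + ", entries=" + copy.rcdMap().size());
    }
}
```

Writing the fields by hand avoids the per-object overhead of default Java serialization, which is the general motivation the method descriptions above give for the redirect.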
 
 