Class ColGroupSDC
- 
- All Implemented Interfaces:
- Serializable,- Cloneable
 
 public class ColGroupSDC extends AMorphingMMColGroup Column group that sparsely encodes the dictionary values. The idea is that all values is encoded with indexes except the most common one. the most common one can be inferred by not being included in the indexes. This column group is handy in cases where sparse unsafe operations is executed on very sparse columns. Then the zeros would be materialized in the group without any overhead.- See Also:
- Serialized Form
 
- 
- 
Nested Class Summary- 
Nested classes/interfaces inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroupAColGroup.CompressionType
 
- 
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description AColGroupbinaryRowOpLeft(BinaryOperator op, double[] v, boolean isRowSafe)Perform a binary row operation.AColGroupbinaryRowOpRight(BinaryOperator op, double[] v, boolean isRowSafe)Perform a binary row operation.CM_COV_ObjectcentralMoment(CMOperator op, int nRows)Central Moment instruction executed on a column group.voidcomputeColSums(double[] c, int nRows)Compute the column sumbooleancontainsValue(double pattern)Detect if the column group contains a specific value.longestimateInMemorySize()Get the upper bound estimate of in memory allocation for the column group.AColGroupextractCommon(double[] constV)org.apache.sysds.runtime.compress.colgroup.AColGroup.ColGroupTypegetColGroupType()AColGroup.CompressionTypegetCompType()Obtain the compression type.doublegetCost(ComputationCostEstimator e, int nRows)Get the computation cost associated with this column group.int[]getCounts(int[] counts)ADictionarygetDictionary()longgetExactSizeOnDisk()Returns the exact serialized size of column group.doublegetIdx(int r, int colIdx)Get the value at a colGroup specific row/column index position.longgetNumberNonZeros(int nRows)Get the number of nonZeros contained in this column group.voidreadFields(DataInput in)Deserialize column group from data input.AColGroupreplace(double pattern, double replace)Make a copy of the column group values, and replace all values that match pattern with replacement value.AColGrouprexpandCols(int max, boolean ignore, boolean cast, int nRows)Expand the column group to multiple columns.AColGroupscalarOperation(ScalarOperator op)Perform the specified scalar operation directly on the compressed column group, without decompressing individual cells if possible.AColGroupsubtractDefaultTuple()StringtoString()AColGroupunaryOperation(UnaryOperator op)voidwrite(DataOutput out)Serializes column group to data output.- 
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.AMorphingMMColGroupleftMultByAColGroup, leftMultByMatrixNoPreAgg, tsmmAColGroup
 - 
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroupValuecopy, decompressToDenseBlock, decompressToSparseBlock, forceMatrixBlockDictionary, getCachedCounts, getCounts, getNumValues, rightMultByMatrix
 - 
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroupCompressedgetMax, getMin, preAggRows, tsmm, unaryAggregateOperations, unaryAggregateOperations
 - 
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroupcolSum, decompressToDenseBlock, decompressToSparseBlock, get, getColIndices, getNumCols, shiftColIndices, sliceColumn, sliceColumns
 
- 
 
- 
- 
- 
Method Detail- 
getCompTypepublic AColGroup.CompressionType getCompType() Description copied from class:AColGroupObtain the compression type.- Specified by:
- getCompTypein class- AColGroup
- Returns:
- How the elements of the column group are compressed.
 
 - 
getColGroupTypepublic org.apache.sysds.runtime.compress.colgroup.AColGroup.ColGroupType getColGroupType() 
 - 
getIdxpublic double getIdx(int r, int colIdx)Description copied from class:AColGroupGet the value at a colGroup specific row/column index position.
 - 
getDictionarypublic ADictionary getDictionary() - Overrides:
- getDictionaryin class- AColGroupValue
 
 - 
computeColSumspublic void computeColSums(double[] c, int nRows)Description copied from class:AColGroupCompute the column sum- Overrides:
- computeColSumsin class- AColGroupValue
- Parameters:
- c- The array to add the column sum to.
- nRows- The number of rows in the column group.
 
 - 
getCountspublic int[] getCounts(int[] counts) - Specified by:
- getCountsin class- AColGroupValue
 
 - 
getNumberNonZerospublic long getNumberNonZeros(int nRows) Description copied from class:AColGroupGet the number of nonZeros contained in this column group.- Overrides:
- getNumberNonZerosin class- AColGroupValue
- Parameters:
- nRows- The number of rows in the column group, this is used for groups that does not contain information about how many rows they have.
- Returns:
- The nnz.
 
 - 
estimateInMemorySizepublic long estimateInMemorySize() Description copied from class:AColGroupGet the upper bound estimate of in memory allocation for the column group.- Overrides:
- estimateInMemorySizein class- AColGroupValue
- Returns:
- an upper bound on the number of bytes used to store this ColGroup in memory.
 
 - 
scalarOperationpublic AColGroup scalarOperation(ScalarOperator op) Description copied from class:AColGroupPerform the specified scalar operation directly on the compressed column group, without decompressing individual cells if possible.- Specified by:
- scalarOperationin class- AColGroup
- Parameters:
- op- operation to perform
- Returns:
- version of this column group with the operation applied
 
 - 
unaryOperationpublic AColGroup unaryOperation(UnaryOperator op) - Specified by:
- unaryOperationin class- AColGroup
 
 - 
binaryRowOpLeftpublic AColGroup binaryRowOpLeft(BinaryOperator op, double[] v, boolean isRowSafe) Description copied from class:AColGroupPerform a binary row operation.- Specified by:
- binaryRowOpLeftin class- AColGroup
- Parameters:
- op- The operation to execute
- v- The vector of values to apply, should be same length as dictionary length.
- isRowSafe- True if the binary op is applied to an entire zero row and all results are zero
- Returns:
- A updated column group with the new values.
 
 - 
binaryRowOpRightpublic AColGroup binaryRowOpRight(BinaryOperator op, double[] v, boolean isRowSafe) Description copied from class:AColGroupPerform a binary row operation.- Specified by:
- binaryRowOpRightin class- AColGroup
- Parameters:
- op- The operation to execute
- v- The vector of values to apply, should be same length as dictionary length.
- isRowSafe- True if the binary op is applied to an entire zero row and all results are zero
- Returns:
- A updated column group with the new values.
 
 - 
writepublic void write(DataOutput out) throws IOException Description copied from class:AColGroupSerializes column group to data output.- Overrides:
- writein class- AColGroupValue
- Parameters:
- out- data output
- Throws:
- IOException- if IOException occurs
 
 - 
readFieldspublic void readFields(DataInput in) throws IOException Description copied from class:AColGroupDeserialize column group from data input.- Overrides:
- readFieldsin class- AColGroupValue
- Parameters:
- in- data input
- Throws:
- IOException- if IOException occurs
 
 - 
getExactSizeOnDiskpublic long getExactSizeOnDisk() Description copied from class:AColGroupReturns the exact serialized size of column group. This can be used for example for buffer preallocation.- Overrides:
- getExactSizeOnDiskin class- AColGroupValue
- Returns:
- exact serialized size for column group
 
 - 
replacepublic AColGroup replace(double pattern, double replace) Description copied from class:AColGroupMake a copy of the column group values, and replace all values that match pattern with replacement value.- Overrides:
- replacein class- AColGroupValue
- Parameters:
- pattern- The value to look for
- replace- The value to replace the other value with
- Returns:
- A new Column Group, reusing the index structure but with new values.
 
 - 
extractCommonpublic AColGroup extractCommon(double[] constV) - Specified by:
- extractCommonin class- AMorphingMMColGroup
 
 - 
subtractDefaultTuplepublic AColGroup subtractDefaultTuple() 
 - 
centralMomentpublic CM_COV_Object centralMoment(CMOperator op, int nRows) Description copied from class:AColGroupCentral Moment instruction executed on a column group.- Overrides:
- centralMomentin class- AColGroupValue
- Parameters:
- op- The Operator to use.
- nRows- The number of rows contained in the ColumnGroup.
- Returns:
- A Central Moment object.
 
 - 
rexpandColspublic AColGroup rexpandCols(int max, boolean ignore, boolean cast, int nRows) Description copied from class:AColGroupExpand the column group to multiple columns. (one hot encode the column group)- Overrides:
- rexpandColsin class- AColGroupValue
- Parameters:
- max- The number of columns to expand to and cutoff values at.
- ignore- If zero and negative values should be ignored.
- cast- If the double values contained should be cast to whole numbers.
- nRows- The number of rows in the column group.
- Returns:
- A new column group containing max number of columns.
 
 - 
getCostpublic double getCost(ComputationCostEstimator e, int nRows) Description copied from class:AColGroupGet the computation cost associated with this column group.
 - 
containsValuepublic boolean containsValue(double pattern) Description copied from class:AColGroupDetect if the column group contains a specific value.- Overrides:
- containsValuein class- AColGroupValue
- Parameters:
- pattern- The value to look for.
- Returns:
- boolean saying true if the value is contained.
 
 - 
toStringpublic String toString() - Overrides:
- toStringin class- AColGroupValue
 
 
- 
 
-