Class ColGroupUncompressed
- java.lang.Object
- 
- org.apache.sysds.runtime.compress.colgroup.AColGroup
- 
- org.apache.sysds.runtime.compress.colgroup.ColGroupUncompressed
 
 
- 
- All Implemented Interfaces:
- Serializable
 
 public class ColGroupUncompressed extends AColGroup Column group type for columns that are stored as dense arrays of doubles. Uses a MatrixBlock internally to store the column contents.- See Also:
- Serialized Form
 
- 
- 
Nested Class Summary- 
Nested classes/interfaces inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroupAColGroup.CompressionType
 
- 
 - 
Constructor SummaryConstructors Constructor Description ColGroupUncompressed(MatrixBlock data)Constructor for allocating a single uncompressed column group.
 - 
Method SummaryAll Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description AColGroupbinaryRowOpLeft(BinaryOperator op, double[] v, boolean isRowSafe)Perform a binary row operation.AColGroupbinaryRowOpRight(BinaryOperator op, double[] v, boolean isRowSafe)Perform a binary row operation.CM_COV_ObjectcentralMoment(CMOperator op, int nRows)Central Moment instruction executed on a column group.voidcomputeColSums(double[] c, int nRows)Compute the column sumbooleancontainsValue(double pattern)Detect if the column group contains a specific value.AColGroupcopy()Get a copy of this column group note this is only a shallow copy.static AColGroupcreate(int[] colIndexes, MatrixBlock rawBlock, boolean transposed)Main constructor for Uncompressed ColGroup.voiddecompressToDenseBlock(DenseBlock db, int rl, int ru, int offR, int offC)Decompress into the DenseBlock.voiddecompressToSparseBlock(SparseBlock ret, int rl, int ru, int offR, int offC)Decompress into the SparseBlock.longestimateInMemorySize()Get the upper bound estimate of in memory allocation for the column group.org.apache.sysds.runtime.compress.colgroup.AColGroup.ColGroupTypegetColGroupType()AColGroup.CompressionTypegetCompType()Obtain the compression type.doublegetCost(ComputationCostEstimator e, int nRows)Get the computation cost associated with this column group.MatrixBlockgetData()Access for superclasslonggetExactSizeOnDisk()Returns the exact serialized size of column group.doublegetIdx(int r, int colIdx)Get the value at a colGroup specific row/column index position.doublegetMax()Short hand method for getting maximum value contained in this column group.doublegetMin()Short hand method for getting minimum value contained in this column group.longgetNumberNonZeros(int nRows)Get the number of nonZeros contained in this column group.intgetNumValues()Obtain number of distinct tuples in contained sets of values associated with this column group.voidleftMultByAColGroup(AColGroup lhs, MatrixBlock result)Left side matrix multiplication with a column group that is transposed.voidleftMultByMatrix(MatrixBlock matrix, MatrixBlock result, int rl, int ru)voidleftMultByMatrixNoPreAgg(MatrixBlock matrix, MatrixBlock result, int rl, int ru, int cl, int cu)Left multiply with this column group.voidreadFields(DataInput in)Deserialize column group from data input.AColGroupreplace(double pattern, double replace)Make a copy of the column group values, and replace all values that match pattern with replacement value.AColGrouprexpandCols(int max, boolean ignore, boolean cast, int nRows)Expand the column group to multiple columns.AColGrouprightMultByMatrix(MatrixBlock right)Right matrix multiplication with this column group.AColGroupscalarOperation(ScalarOperator op)Perform the specified scalar operation directly on the compressed column group, without decompressing individual cells if possible.StringtoString()voidtsmm(MatrixBlock ret, int nRows)Do a transposed self matrix multiplication on the left side t(x) %*% x.voidtsmmAColGroup(AColGroup lhs, MatrixBlock result)Matrix multiply with this other column group, but: 1.voidunaryAggregateOperations(AggregateUnaryOperator op, double[] result, int nRows, int rl, int ru)Unary Aggregate operator, since aggregate operators require new object output, the output becomes an uncompressed matrix.AColGroupunaryOperation(UnaryOperator op)voidwrite(DataOutput out)Serializes column group to data output.- 
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.AColGroupcolSum, decompressToDenseBlock, decompressToSparseBlock, get, getColIndices, getNumCols, shiftColIndices, sliceColumn, sliceColumns
 
- 
 
- 
- 
- 
Constructor Detail- 
ColGroupUncompressedpublic ColGroupUncompressed(MatrixBlock data) Constructor for allocating a single uncompressed column group.- Parameters:
- data- matrix block
 
 
- 
 - 
Method Detail- 
createpublic static AColGroup create(int[] colIndexes, MatrixBlock rawBlock, boolean transposed) Main constructor for Uncompressed ColGroup.- Parameters:
- colIndexes- Indices (relative to the current block) of the columns that this column group represents.
- rawBlock- The uncompressed block; uncompressed data must be present at the time that the constructor is called
- transposed- Says if the input matrix raw block have been transposed.
- Returns:
- AColGroup.
 
 - 
getCompTypepublic AColGroup.CompressionType getCompType() Description copied from class:AColGroupObtain the compression type.- Specified by:
- getCompTypein class- AColGroup
- Returns:
- How the elements of the column group are compressed.
 
 - 
getColGroupTypepublic org.apache.sysds.runtime.compress.colgroup.AColGroup.ColGroupType getColGroupType() 
 - 
getDatapublic MatrixBlock getData() Access for superclass- Returns:
- direct pointer to the internal representation of the columns
 
 - 
estimateInMemorySizepublic long estimateInMemorySize() Description copied from class:AColGroupGet the upper bound estimate of in memory allocation for the column group.- Overrides:
- estimateInMemorySizein class- AColGroup
- Returns:
- an upper bound on the number of bytes used to store this ColGroup in memory.
 
 - 
decompressToDenseBlockpublic void decompressToDenseBlock(DenseBlock db, int rl, int ru, int offR, int offC) Description copied from class:AColGroupDecompress into the DenseBlock. (no NNZ handling)- Specified by:
- decompressToDenseBlockin class- AColGroup
- Parameters:
- db- Target DenseBlock
- rl- Row to start decompression from
- ru- Row to end decompression at
- offR- Row offset into the target to decompress
- offC- Column offset into the target to decompress
 
 - 
decompressToSparseBlockpublic void decompressToSparseBlock(SparseBlock ret, int rl, int ru, int offR, int offC) Description copied from class:AColGroupDecompress into the SparseBlock. (no NNZ handling) Note this method is allowing to calls to append since it is assumed that the sparse column indexes are sorted afterwards- Specified by:
- decompressToSparseBlockin class- AColGroup
- Parameters:
- ret- Target SparseBlock
- rl- Row to start decompression from
- ru- Row to end decompression at
- offR- Row offset into the target to decompress
- offC- Column offset into the target to decompress
 
 - 
getIdxpublic double getIdx(int r, int colIdx)Description copied from class:AColGroupGet the value at a colGroup specific row/column index position.
 - 
leftMultByMatrixNoPreAggpublic void leftMultByMatrixNoPreAgg(MatrixBlock matrix, MatrixBlock result, int rl, int ru, int cl, int cu) Description copied from class:AColGroupLeft multiply with this column group.- Specified by:
- leftMultByMatrixNoPreAggin class- AColGroup
- Parameters:
- matrix- The matrix to multiply with on the left
- result- The result to output the values into, always dense for the purpose of the column groups parallelizing
- rl- The row to begin the multiplication from on the lhs matrix
- ru- The row to end the multiplication at on the lhs matrix
- cl- The column to begin the multiplication from on the lhs matrix
- cu- The column to end the multiplication at on the lhs matrix
 
 - 
leftMultByMatrixpublic void leftMultByMatrix(MatrixBlock matrix, MatrixBlock result, int rl, int ru) 
 - 
scalarOperationpublic AColGroup scalarOperation(ScalarOperator op) Description copied from class:AColGroupPerform the specified scalar operation directly on the compressed column group, without decompressing individual cells if possible.- Specified by:
- scalarOperationin class- AColGroup
- Parameters:
- op- operation to perform
- Returns:
- version of this column group with the operation applied
 
 - 
unaryOperationpublic AColGroup unaryOperation(UnaryOperator op) - Specified by:
- unaryOperationin class- AColGroup
 
 - 
binaryRowOpLeftpublic AColGroup binaryRowOpLeft(BinaryOperator op, double[] v, boolean isRowSafe) Description copied from class:AColGroupPerform a binary row operation.- Specified by:
- binaryRowOpLeftin class- AColGroup
- Parameters:
- op- The operation to execute
- v- The vector of values to apply, should be same length as dictionary length.
- isRowSafe- True if the binary op is applied to an entire zero row and all results are zero
- Returns:
- A updated column group with the new values.
 
 - 
binaryRowOpRightpublic AColGroup binaryRowOpRight(BinaryOperator op, double[] v, boolean isRowSafe) Description copied from class:AColGroupPerform a binary row operation.- Specified by:
- binaryRowOpRightin class- AColGroup
- Parameters:
- op- The operation to execute
- v- The vector of values to apply, should be same length as dictionary length.
- isRowSafe- True if the binary op is applied to an entire zero row and all results are zero
- Returns:
- A updated column group with the new values.
 
 - 
unaryAggregateOperationspublic void unaryAggregateOperations(AggregateUnaryOperator op, double[] result, int nRows, int rl, int ru) Description copied from class:AColGroupUnary Aggregate operator, since aggregate operators require new object output, the output becomes an uncompressed matrix.- Specified by:
- unaryAggregateOperationsin class- AColGroup
- Parameters:
- op- The operator used
- result- The output matrix block
- nRows- The total number of rows in the Column Group
- rl- The Starting Row to do aggregation from
- ru- The last Row to do aggregation to (not included)
 
 - 
readFieldspublic void readFields(DataInput in) throws IOException Description copied from class:AColGroupDeserialize column group from data input.- Overrides:
- readFieldsin class- AColGroup
- Parameters:
- in- data input
- Throws:
- IOException- if IOException occurs
 
 - 
writepublic void write(DataOutput out) throws IOException Description copied from class:AColGroupSerializes column group to data output.- Overrides:
- writein class- AColGroup
- Parameters:
- out- data output
- Throws:
- IOException- if IOException occurs
 
 - 
getExactSizeOnDiskpublic long getExactSizeOnDisk() Description copied from class:AColGroupReturns the exact serialized size of column group. This can be used for example for buffer preallocation.- Overrides:
- getExactSizeOnDiskin class- AColGroup
- Returns:
- exact serialized size for column group
 
 - 
getMinpublic double getMin() Description copied from class:AColGroupShort hand method for getting minimum value contained in this column group.
 - 
getMaxpublic double getMax() Description copied from class:AColGroupShort hand method for getting maximum value contained in this column group.
 - 
tsmmpublic final void tsmm(MatrixBlock ret, int nRows) Description copied from class:AColGroupDo a transposed self matrix multiplication on the left side t(x) %*% x. but only with this column group. This gives better performance since there is no need to iterate through all the rows of the matrix, but the execution can be limited to its number of distinct values. Note it only calculate the upper triangle
 - 
copypublic AColGroup copy() Description copied from class:AColGroupGet a copy of this column group note this is only a shallow copy. Meaning only the object wrapping index structures, column indexes and dictionaries are copied.
 - 
containsValuepublic boolean containsValue(double pattern) Description copied from class:AColGroupDetect if the column group contains a specific value.- Specified by:
- containsValuein class- AColGroup
- Parameters:
- pattern- The value to look for.
- Returns:
- boolean saying true if the value is contained.
 
 - 
getNumberNonZerospublic long getNumberNonZeros(int nRows) Description copied from class:AColGroupGet the number of nonZeros contained in this column group.- Specified by:
- getNumberNonZerosin class- AColGroup
- Parameters:
- nRows- The number of rows in the column group, this is used for groups that does not contain information about how many rows they have.
- Returns:
- The nnz.
 
 - 
leftMultByAColGrouppublic void leftMultByAColGroup(AColGroup lhs, MatrixBlock result) Description copied from class:AColGroupLeft side matrix multiplication with a column group that is transposed.- Specified by:
- leftMultByAColGroupin class- AColGroup
- Parameters:
- lhs- The left hand side Column group to multiply with, the left hand side should be considered transposed.
- result- The result matrix to insert the result of the multiplication into
 
 - 
tsmmAColGrouppublic void tsmmAColGroup(AColGroup lhs, MatrixBlock result) Description copied from class:AColGroupMatrix multiply with this other column group, but: 1. Only output upper triangle values. 2. Multiply both ways with "this" being on the left and on the right. It should be guaranteed that the input is not the same as the caller of the method. The second step is achievable by treating the initial multiplied matrix, and adding its values to the correct locations in the output.- Specified by:
- tsmmAColGroupin class- AColGroup
- Parameters:
- lhs- The other Column group to multiply with
- result- The result matrix to put the results into
 
 - 
rightMultByMatrixpublic AColGroup rightMultByMatrix(MatrixBlock right) Description copied from class:AColGroupRight matrix multiplication with this column group. This method can return null, meaning that the output overlapping group would have been empty.- Specified by:
- rightMultByMatrixin class- AColGroup
- Parameters:
- right- The MatrixBlock on the right of this matrix multiplication
- Returns:
- The new Column Group or null that is the result of the matrix multiplication.
 
 - 
getNumValuespublic int getNumValues() Description copied from class:AColGroupObtain number of distinct tuples in contained sets of values associated with this column group. If the column group is uncompressed the number or rows is returned.- Specified by:
- getNumValuesin class- AColGroup
- Returns:
- the number of distinct sets of values associated with the bitmaps in this column group
 
 - 
replacepublic AColGroup replace(double pattern, double replace) Description copied from class:AColGroupMake a copy of the column group values, and replace all values that match pattern with replacement value.
 - 
computeColSumspublic void computeColSums(double[] c, int nRows)Description copied from class:AColGroupCompute the column sum- Specified by:
- computeColSumsin class- AColGroup
- Parameters:
- c- The array to add the column sum to.
- nRows- The number of rows in the column group.
 
 - 
centralMomentpublic CM_COV_Object centralMoment(CMOperator op, int nRows) Description copied from class:AColGroupCentral Moment instruction executed on a column group.- Specified by:
- centralMomentin class- AColGroup
- Parameters:
- op- The Operator to use.
- nRows- The number of rows contained in the ColumnGroup.
- Returns:
- A Central Moment object.
 
 - 
rexpandColspublic AColGroup rexpandCols(int max, boolean ignore, boolean cast, int nRows) Description copied from class:AColGroupExpand the column group to multiple columns. (one hot encode the column group)- Specified by:
- rexpandColsin class- AColGroup
- Parameters:
- max- The number of columns to expand to and cutoff values at.
- ignore- If zero and negative values should be ignored.
- cast- If the double values contained should be cast to whole numbers.
- nRows- The number of rows in the column group.
- Returns:
- A new column group containing max number of columns.
 
 - 
getCostpublic double getCost(ComputationCostEstimator e, int nRows) Description copied from class:AColGroupGet the computation cost associated with this column group.
 
- 
 
-