This is an old version of ROCm documentation. Read the latest ROCm release documentation to stay informed of all our developments.

Class List

Class List#

hipCUB: Class List
Class List
Here are the classes, structs, unions and interfaces with brief descriptions:
[detail level 123]
 Nhipcub
 CDeviceHistogram
 CDeviceMergeSort
 CDevicePartition
 CDeviceRadixSort
 CDeviceReduce
 CDeviceRunLengthEncode
 CDeviceScan
 CDeviceSegmentedRadixSort
 CDeviceSegmentedReduce
 CDeviceSegmentedSort
 CDeviceSelect
 CDeviceSpmv
 CSpmvParams< Signed integer type for sequence offsets
 CGridBarrierLifetimeGridBarrierLifetime extends GridBarrier to provide lifetime management of the temporary device storage needed for cooperation
 CCachingDeviceAllocator
 CBlockDescriptor
 CTotalBytes
 CBlockAdjacentDifference
 CBlockDiscontinuity
 CBlockExchange
 CBlockLoad
 CBlockMergeSortStrategyGeneralized merge sort algorithm
 CTempStorage\smemstorage{BlockMergeSort}
 CBlockMergeSortThe BlockMergeSort class provides methods for sorting items partitioned across a CUDA thread block using a merge sorting method
 CBlockRadixRankBlockRadixRank provides operations for ranking unsigned integer types within a CUDA thread block
 CTempStorage\smemstorage{BlockScan}
 CBlockRadixRankMatch
 CTempStorage\smemstorage{BlockScan}
 CBlockRadixSort
 Cblock_raking_layoutBlockRakingLayout provides a conflict-free shared memory layout abstraction for 1D raking across thread block data
 CTempStorageAlias wrapper allowing storage to be unioned
 CBlockRunLengthDecodeThe BlockRunLengthDecode class supports decoding a run-length encoded array of items. That is, given the two arrays run_value[N] and run_lengths[N], run_value[i] is repeated run_lengths[i] many times in the output array. Due to the nature of the run-length decoding algorithm ("decompression"), the output size of the run-length decoded array is runtime-dependent and potentially without any upper bound. To address this, BlockRunLengthDecode allows retrieving a "window" from the run-length decoded array. The window's offset can be specified and BLOCK_THREADS * DECODED_ITEMS_PER_THREAD (i.e., referred to as window_size) decoded items from the specified window will be returned
 CTempStorage
 CBlockScan
 CBlockShuffle
 CBlockStore
 CRadixSortTwiddleTwiddling keys for radix sort
 CBaseDigitExtractorBase struct for digit extractor. Contains common code to provide special handling for floating-point -0.0
 CBFEDigitExtractorA wrapper type to extract digits. Uses the BFE intrinsic to extract a key from a digit
 CShiftDigitExtractorA wrapper type to extract digits. Uses a combination of shift and bitwise and to extract digits
 CGridBarrierGridBarrier implements a software global barrier among thread blocks within a hip grid
 CGridEvenShareGridEvenShare is a descriptor utility for distributing input among CUDA thread blocks in an "even-share" fashion. Each thread block gets roughly the same number of input tiles
 CGridQueueGridQueue is a descriptor utility for dynamic queue management
 CCacheModifiedInputIterator
 CCacheModifiedOutputIterator
 CDiscardOutputIteratorA discard iterator
 CTexObjInputIterator
 CTexRefInputIterator
 CEquality
 CInequality
 CInequalityWrapper
 CSum
 CMax
 CMin
 CArgMax
 CArgMin
 CIf
 CIsPointer
 CIsVolatile
 CRemoveQualifiers
 CPowerOfTwo
 CLog2
 CDoubleBuffer
 CInt2Type
 CUninitializedA storage-backing wrapper that allows types with non-trivial constructors to be aliased in unions
 CWarpExchange
 CTempStorage
 CWarpLoad
 CTempStorage
 CWarpMergeSortThe WarpMergeSort class provides methods for sorting items partitioned across a CUDA warp using a merge sorting method
 CWarpReduce
 CWarpScan
 CWarpStore
 CTempStorage
 CBlockHistogram
 CBlockReduce