rocblas_ztrmm_outofplace_strided_batched Interface Reference

rocblas_ztrmm_outofplace_strided_batched Interface Reference#

HIPFORT API Reference: hipfort_rocblas::rocblas_ztrmm_outofplace_strided_batched Interface Reference
hipfort_rocblas::rocblas_ztrmm_outofplace_strided_batched Interface Reference

BLAS Level 3 API. More...

Public Member Functions

integer(kind(rocblas_status_success)) function rocblas_ztrmm_outofplace_strided_batched_ (handle, side, uplo, transa, diag, m, n, alpha, A, lda, stride_a, B, ldb, stride_b, C, ldc, stride_c, batch_count)
 
integer(kind(rocblas_status_success)) function rocblas_ztrmm_outofplace_strided_batched_full_rank (handle, side, uplo, transa, diag, m, n, alpha, A, lda, stride_a, B, ldb, stride_b, C, ldc, stride_c, batch_count)
 
integer(kind(rocblas_status_success)) function rocblas_ztrmm_outofplace_strided_batched_rank_0 (handle, side, uplo, transa, diag, m, n, alpha, A, lda, stride_a, B, ldb, stride_b, C, ldc, stride_c, batch_count)
 
integer(kind(rocblas_status_success)) function rocblas_ztrmm_outofplace_strided_batched_rank_1 (handle, side, uplo, transa, diag, m, n, alpha, A, lda, stride_a, B, ldb, stride_b, C, ldc, stride_c, batch_count)
 

Detailed Description

BLAS Level 3 API.

trmm_outofplace_strided_batched performs one of the strided_batched matrix-matrix operations

C_i := alpha*op( A_i )*B_i, or C_i := alpha*B_i*op( A_i ) for i = 0, 1, ... batch_count -1

where alpha is a scalar, B_i is an m by n matrix, A_i is a unit, or non-unit, upper or lower triangular matrix and op( A_i ) is one of

op( A_i ) = A_i   or   op( A_i ) = A_i^T   or   op( A_i ) = A_i^H.

Note that trmm_outofplace_strided_batched can provide in-place functionality in the same way as trmm_strided_batched by passing in the same address for both matrices B and C.

Parameters
[in]handle[rocblas_handle] handle to the rocblas library context queue.
[in]side[rocblas_side] Specifies whether op(A_i) multiplies B_i from the left or right as follows: rocblas_side_left: C_i := alpha*op( A_i )*B_i. rocblas_side_right: C_i := alpha*B_i*op( A_i ).
[in]uplo[rocblas_fill] Specifies whether the matrix A is an upper or lower triangular matrix as follows: rocblas_fill_upper: A is an upper triangular matrix. rocblas_fill_lower: A is a lower triangular matrix.
[in]transA[rocblas_operation] Specifies the form of op(A_i) to be used in the matrix multiplication as follows: rocblas_operation_none: op(A_i) = A_i. rocblas_operation_transpose: op(A_i) = A_i^T. rocblas_operation_conjugate_transpose: op(A_i) = A_i^H.
[in]diag[rocblas_diagonal] Specifies whether or not A_i is unit triangular as follows: rocblas_diagonal_unit: A_i is assumed to be unit triangular. rocblas_diagonal_non_unit: A_i is not assumed to be unit triangular.
[in]m[rocblas_int] m specifies the number of rows of B_i. m >= 0.
[in]n[rocblas_int] n specifies the number of columns of B_i. n >= 0.
[in]alphaalpha specifies the scalar alpha. When alpha is zero then A_i is not referenced and B_i need not be set before entry.
[in]ADevice pointer to the first matrix A_0 on the GPU. Each A_i is of dimension ( lda, k ), where k is m when side == rocblas_side_left and is n when side == rocblas_side_right.

When uplo == rocblas_fill_upper the leading k by k upper triangular part of the array A must contain the upper triangular matrix and the strictly lower triangular part of A is not referenced.

When uplo == rocblas_fill_lower the leading k by k lower triangular part of the array A must contain the lower triangular matrix and the strictly upper triangular part of A is not referenced.

Note that when diag == rocblas_diagonal_unit the diagonal elements of A_i are not referenced either, but are assumed to be unity.

Parameters
[in]lda[rocblas_int] lda specifies the first dimension of A. if side == rocblas_side_left, lda >= max( 1, m ), if side == rocblas_side_right, lda >= max( 1, n ).
[in]stride_A[rocblas_stride] stride from the start of one matrix (A_i) and the next one (A_i+1)
[in]BDevice pointer to the first matrix B_0 on the GPU.
[in]ldb[rocblas_int] ldb specifies the first dimension of B_i. ldb >= max( 1, m ).
[in]stride_B[rocblas_stride] stride from the start of one matrix (B_i) and the next one (B_i+1)
[out]CDevice pointer to the first matrix C_0 on the GPU.
[in]ldc[rocblas_int] ldc specifies the first dimension of C_i. ldc >= max( 1, m).
[in]stride_C[rocblas_stride] stride from the start of one matrix (C_i) and the next one (C_i+1)
[in]batch_count[rocblas_int] number of instances i in the batch.

Member Function/Subroutine Documentation

◆ rocblas_ztrmm_outofplace_strided_batched_()

integer(kind(rocblas_status_success)) function hipfort_rocblas::rocblas_ztrmm_outofplace_strided_batched::rocblas_ztrmm_outofplace_strided_batched_ ( type(c_ptr), value  handle,
integer(kind(rocblas_side_left)), value  side,
integer(kind(rocblas_fill_upper)), value  uplo,
integer(kind(rocblas_operation_none)), value  transa,
integer(kind(rocblas_diagonal_non_unit)), value  diag,
integer(c_int), value  m,
integer(c_int), value  n,
complex(c_double_complex)  alpha,
type(c_ptr), value  A,
integer(c_int), value  lda,
integer(c_int64_t), value  stride_a,
type(c_ptr), value  B,
integer(c_int), value  ldb,
integer(c_int64_t), value  stride_b,
type(c_ptr), value  C,
integer(c_int), value  ldc,
integer(c_int64_t), value  stride_c,
integer(c_int), value  batch_count 
)

◆ rocblas_ztrmm_outofplace_strided_batched_full_rank()

integer(kind(rocblas_status_success)) function hipfort_rocblas::rocblas_ztrmm_outofplace_strided_batched::rocblas_ztrmm_outofplace_strided_batched_full_rank ( type(c_ptr)  handle,
integer(kind(rocblas_side_left))  side,
integer(kind(rocblas_fill_upper))  uplo,
integer(kind(rocblas_operation_none))  transa,
integer(kind(rocblas_diagonal_non_unit))  diag,
integer(c_int)  m,
integer(c_int)  n,
complex(c_double_complex)  alpha,
complex(c_double_complex), dimension(:,:), target  A,
integer(c_int)  lda,
integer(c_int64_t)  stride_a,
complex(c_double_complex), dimension(:,:), target  B,
integer(c_int)  ldb,
integer(c_int64_t)  stride_b,
complex(c_double_complex), dimension(:,:), target  C,
integer(c_int)  ldc,
integer(c_int64_t)  stride_c,
integer(c_int)  batch_count 
)

◆ rocblas_ztrmm_outofplace_strided_batched_rank_0()

integer(kind(rocblas_status_success)) function hipfort_rocblas::rocblas_ztrmm_outofplace_strided_batched::rocblas_ztrmm_outofplace_strided_batched_rank_0 ( type(c_ptr)  handle,
integer(kind(rocblas_side_left))  side,
integer(kind(rocblas_fill_upper))  uplo,
integer(kind(rocblas_operation_none))  transa,
integer(kind(rocblas_diagonal_non_unit))  diag,
integer(c_int)  m,
integer(c_int)  n,
complex(c_double_complex)  alpha,
complex(c_double_complex), target  A,
integer(c_int)  lda,
integer(c_int64_t)  stride_a,
complex(c_double_complex), target  B,
integer(c_int)  ldb,
integer(c_int64_t)  stride_b,
complex(c_double_complex), target  C,
integer(c_int)  ldc,
integer(c_int64_t)  stride_c,
integer(c_int)  batch_count 
)

◆ rocblas_ztrmm_outofplace_strided_batched_rank_1()

integer(kind(rocblas_status_success)) function hipfort_rocblas::rocblas_ztrmm_outofplace_strided_batched::rocblas_ztrmm_outofplace_strided_batched_rank_1 ( type(c_ptr)  handle,
integer(kind(rocblas_side_left))  side,
integer(kind(rocblas_fill_upper))  uplo,
integer(kind(rocblas_operation_none))  transa,
integer(kind(rocblas_diagonal_non_unit))  diag,
integer(c_int)  m,
integer(c_int)  n,
complex(c_double_complex)  alpha,
complex(c_double_complex), dimension(:), target  A,
integer(c_int)  lda,
integer(c_int64_t)  stride_a,
complex(c_double_complex), dimension(:), target  B,
integer(c_int)  ldb,
integer(c_int64_t)  stride_b,
complex(c_double_complex), dimension(:), target  C,
integer(c_int)  ldc,
integer(c_int64_t)  stride_c,
integer(c_int)  batch_count 
)

The documentation for this interface was generated from the following file: