rocsolver_ssytf2_batched Interface Reference

rocsolver_ssytf2_batched Interface Reference#

HIPFORT API Reference: hipfort_rocsolver::rocsolver_ssytf2_batched Interface Reference
hipfort_rocsolver::rocsolver_ssytf2_batched Interface Reference

SYTF2_BATCHED computes the factorization of a batch of symmetric indefinite matrices using Bunch-Kaufman diagonal pivoting. More...

Public Member Functions

integer(kind(rocblas_status_success)) function rocsolver_ssytf2_batched_ (handle, uplo, n, A, lda, ipiv, strideP, myInfo, batch_count)
 
integer(kind(rocblas_status_success)) function rocsolver_ssytf2_batched_full_rank (handle, uplo, n, A, lda, ipiv, strideP, myInfo, batch_count)
 
integer(kind(rocblas_status_success)) function rocsolver_ssytf2_batched_rank_0 (handle, uplo, n, A, lda, ipiv, strideP, myInfo, batch_count)
 
integer(kind(rocblas_status_success)) function rocsolver_ssytf2_batched_rank_1 (handle, uplo, n, A, lda, ipiv, strideP, myInfo, batch_count)
 

Detailed Description

SYTF2_BATCHED computes the factorization of a batch of symmetric indefinite matrices using Bunch-Kaufman diagonal pivoting.

(This is the unblocked version of the algorithm).

The factorization has the form

\[ \begin{array}{cl} A_i = U_i D_i U_i^T & \: \text{or}\newline A_i = L_i D_i L_i^T & \end{array} \]

where \(U_i\) or \(L_i\) is a product of permutation and unit upper/lower triangular matrices (depending on the value of uplo), and \(D_i\) is a symmetric block diagonal matrix with 1-by-1 and 2-by-2 diagonal blocks \(D_i(k)\).

Specifically, \(U_i\) and \(L_i\) are computed as

\[ \begin{array}{cl} U_i = P_i(n) U_i(n) \cdots P_i(k) U_i(k) \cdots & \: \text{and}\newline L_i = P_i(1) L_i(1) \cdots P_i(k) L_i(k) \cdots & \end{array} \]

where \(k\) decreases from \(n\) to 1 (increases from 1 to \(n\)) in steps of 1 or 2, depending on the order of block \(D_i(k)\), and \(P_i(k)\) is a permutation matrix defined by \(ipiv_i[k]\). If we let \(s\) denote the order of block \(D_i(k)\), then \(U_i(k)\) and \(L_i(k)\) are unit upper/lower triangular matrices defined as

\[ U_i(k) = \left[ \begin{array}{ccc} I_{k-s} & v & 0 \newline 0 & I_s & 0 \newline 0 & 0 & I_{n-k} \end{array} \right] \]

and

\[ L_i(k) = \left[ \begin{array}{ccc} I_{k-1} & 0 & 0 \newline 0 & I_s & 0 \newline 0 & v & I_{n-k-s+1} \end{array} \right]. \]

If \(s = 1\), then \(D_i(k)\) is stored in \(A_i[k,k]\) and \(v\) is stored in the upper/lower part of column \(k\) of \(A_i\). If \(s = 2\) and uplo is upper, then \(D_i(k)\) is stored in \(A_i[k-1,k-1]\), \(A_i[k-1,k]\), and \(A_i[k,k]\), and \(v\) is stored in the upper parts of columns \(k-1\) and \(k\) of \(A_i\). If \(s = 2\) and uplo is lower, then \(D_i(k)\) is stored in \(A_i[k,k]\), \(A_i[k+1,k]\), and \(A_i[k+1,k+1]\), and \(v\) is stored in the lower parts of columns \(k\) and \(k+1\) of \(A_i\).

Parameters
[in]handlerocblas_handle.
[in]uplorocblas_fill.
Specifies whether the upper or lower part of the matrices A_i are stored. If uplo indicates lower (or upper), then the upper (or lower) part of A_i is not used.
[in]nrocblas_int. n >= 0.
The number of rows and columns of all matrices A_i in the batch.
[in,out]Aarray of pointers to type. Each pointer points to an array on the GPU of dimension lda*n.
On entry, the symmetric matrices A_i to be factored. On exit, the block diagonal matrices D_i and the multipliers needed to compute U_i or L_i.
[in]ldarocblas_int. lda >= n.
Specifies the leading dimension of matrices A_i.
[out]ipivpointer to rocblas_int. Array on the GPU of dimension n.
The vector of pivot indices. Elements of ipiv are 1-based indices. For 1 <= k <= n, if ipiv_i[k] > 0 then rows and columns k and ipiv_i[k] were interchanged and D_i[k,k] is a 1-by-1 diagonal block. If, instead, ipiv_i[k] = ipiv_i[k-1] < 0 and uplo is upper (or ipiv_i[k] = ipiv_i[k+1] < 0 and uplo is lower), then rows and columns k-1 and -ipiv_i[k] (or rows and columns k+1 and -ipiv_i[k]) were interchanged and D_i[k-1,k-1] to D_i[k,k] (or D_i[k,k] to D_i[k+1,k+1]) is a 2-by-2 diagonal block.
[in]strideProcblas_stride.
Stride from the start of one vector ipiv_i to the next one ipiv_(i+1). There is no restriction for the value of strideP. Normal use case is strideP >= n.
[out]infopointer to rocblas_int. Array of batch_count integers on the GPU.
If info[i] = 0, successful exit for factorization of A_i. If info[i] = j > 0, D_i is singular. D_i[j,j] is the first diagonal zero.
[in]batch_countrocblas_int. batch_count >= 0.
Number of matrices in the batch.

Member Function/Subroutine Documentation

◆ rocsolver_ssytf2_batched_()

integer(kind(rocblas_status_success)) function hipfort_rocsolver::rocsolver_ssytf2_batched::rocsolver_ssytf2_batched_ ( type(c_ptr), value  handle,
integer(kind(rocblas_fill_upper)), value  uplo,
integer(c_int), value  n,
type(c_ptr)  A,
integer(c_int), value  lda,
type(c_ptr), value  ipiv,
integer(c_int64_t), value  strideP,
integer(c_int)  myInfo,
integer(c_int), value  batch_count 
)

◆ rocsolver_ssytf2_batched_full_rank()

integer(kind(rocblas_status_success)) function hipfort_rocsolver::rocsolver_ssytf2_batched::rocsolver_ssytf2_batched_full_rank ( type(c_ptr)  handle,
integer(kind(rocblas_fill_upper))  uplo,
integer(c_int)  n,
real(c_float), dimension(:,:,:), target  A,
integer(c_int)  lda,
integer(c_int), dimension(:), target  ipiv,
integer(c_int64_t)  strideP,
integer(c_int)  myInfo,
integer(c_int)  batch_count 
)

◆ rocsolver_ssytf2_batched_rank_0()

integer(kind(rocblas_status_success)) function hipfort_rocsolver::rocsolver_ssytf2_batched::rocsolver_ssytf2_batched_rank_0 ( type(c_ptr)  handle,
integer(kind(rocblas_fill_upper))  uplo,
integer(c_int)  n,
real(c_float), target  A,
integer(c_int)  lda,
integer(c_int), target  ipiv,
integer(c_int64_t)  strideP,
integer(c_int)  myInfo,
integer(c_int)  batch_count 
)

◆ rocsolver_ssytf2_batched_rank_1()

integer(kind(rocblas_status_success)) function hipfort_rocsolver::rocsolver_ssytf2_batched::rocsolver_ssytf2_batched_rank_1 ( type(c_ptr)  handle,
integer(kind(rocblas_fill_upper))  uplo,
integer(c_int)  n,
real(c_float), dimension(:), target  A,
integer(c_int)  lda,
integer(c_int), dimension(:), target  ipiv,
integer(c_int64_t)  strideP,
integer(c_int)  myInfo,
integer(c_int)  batch_count 
)

The documentation for this interface was generated from the following file: