CUDA Transpose Plan
Type | Visibility | Attributes | Name | Initial | |||
---|---|---|---|---|---|---|---|
type(dtfft_backend_t), | public | :: | backend | = | DTFFT_BACKEND_MPI_DATATYPE |
GPU backend |
|
type(backend_helper), | public | :: | helper |
Backend helper |
|||
logical, | public | :: | is_z_slab |
Z-slab optimization flag (for 3D transforms) |
|||
integer(kind=int64), | public | :: | min_buffer_size |
Minimal buffer size for transposition |
|||
type(dtfft_stream_t), | private | :: | stream |
CUDA stream |
|||
type(c_ptr), | private | :: | aux |
Auxiliary memory |
|||
real(kind=real32), | private, | pointer | :: | paux(:) |
Pointer to auxiliary memory |
||
logical, | private | :: | is_aux_alloc |
Is auxiliary memory allocated |
|||
type(transpose_handle_cuda), | private, | allocatable | :: | fplans(:) |
Forward transposition plans |
||
type(transpose_handle_cuda), | private, | allocatable | :: | bplans(:) |
Backward transposition plans |
Create transposition plan
Creates transposition plans
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(abstract_transpose_plan), | intent(inout) | :: | self |
Transposition class |
||
integer(kind=int32), | intent(in) | :: | dims(:) |
Global sizes of the transform requested |
||
type(MPI_Comm), | intent(in) | :: | base_comm_ |
Base communicator |
||
type(dtfft_effort_t), | intent(in) | :: | effort |
|
||
type(MPI_Datatype), | intent(in) | :: | base_dtype |
Base MPI_Datatype |
||
integer(kind=int64), | intent(in) | :: | base_storage |
Number of bytes needed to store single element |
||
type(MPI_Comm), | intent(out) | :: | cart_comm |
Cartesian communicator |
||
type(MPI_Comm), | intent(out) | :: | comms(:) |
Array of 1d communicators |
||
type(pencil), | intent(out) | :: | pencils(:) |
Data distributing meta |
Error code
Executes transposition
Executes single transposition
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(abstract_transpose_plan), | intent(inout) | :: | self |
Transposition class |
||
type(c_ptr), | intent(in) | :: | in |
Incoming pointer |
||
type(c_ptr), | intent(in) | :: | out |
Result pointer |
||
type(dtfft_transpose_t), | intent(in) | :: | transpose_type |
Type of transpose |
Returns backend id
Returns plan GPU backend
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(abstract_transpose_plan), | intent(in) | :: | self |
Transposition class |
Allocates memory based on selected backend
Allocates memory based on selected backend
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(abstract_transpose_plan), | intent(inout) | :: | self |
Transposition class |
||
type(MPI_Comm), | intent(in) | :: | comm |
MPI communicator |
||
integer(kind=int64), | intent(in) | :: | alloc_bytes |
Number of bytes to allocate |
||
type(c_ptr), | intent(out) | :: | ptr |
Pointer to the allocated memory |
||
integer(kind=int32), | intent(out) | :: | error_code |
Error code |
Frees memory allocated with mem_alloc
Frees memory allocated with mem_alloc
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(abstract_transpose_plan), | intent(inout) | :: | self |
Transposition class |
||
type(c_ptr), | intent(in) | :: | ptr |
Pointer to the memory to free |
||
integer(kind=int32), | intent(out) | :: | error_code |
Error code |
Creates CUDA transpose plan
Creates CUDA transpose plan
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(transpose_plan_cuda), | intent(inout) | :: | self |
GPU transpose plan |
||
integer(kind=int32), | intent(in) | :: | dims(:) |
Global sizes of the transform requested |
||
integer(kind=int32), | intent(in) | :: | transposed_dims(:,:) |
Transposed dimensions |
||
type(MPI_Comm), | intent(in) | :: | base_comm |
Base communicator |
||
integer(kind=int32), | intent(in) | :: | comm_dims(:) |
Number of processors in each dimension |
||
type(dtfft_effort_t), | intent(in) | :: | effort |
How thoroughly |
||
type(MPI_Datatype), | intent(in) | :: | base_dtype |
Base MPI_Datatype |
||
integer(kind=int64), | intent(in) | :: | base_storage |
Number of bytes needed to store single element |
||
logical, | intent(in) | :: | is_custom_cart_comm |
is custom Cartesian communicator provided by user |
||
type(MPI_Comm), | intent(out) | :: | cart_comm |
Cartesian communicator |
||
type(MPI_Comm), | intent(out) | :: | comms(:) |
Array of 1d communicators |
||
type(pencil), | intent(out) | :: | pencils(:) |
Data distributing meta |
Executes single transposition
Executes single transposition
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(transpose_plan_cuda), | intent(inout) | :: | self |
Transposition class |
||
real(kind=real32), | intent(inout) | :: | in(:) |
Incoming buffer |
||
real(kind=real32), | intent(inout) | :: | out(:) |
Resulting buffer |
||
type(dtfft_transpose_t), | intent(in) | :: | transpose_type |
Type of transpose to execute |
Destroys CUDA transpose plan
Destroys transposition plans
Type | Intent | Optional | Attributes | Name | ||
---|---|---|---|---|---|---|
class(transpose_plan_cuda), | intent(inout) | :: | self |
Transposition class |