Executes kernel
| Type | Intent | Optional | Attributes | Name | ||
|---|---|---|---|---|---|---|
| class(abstract_kernel), | intent(inout) | :: | self |
Abstract kernel |
||
| type(c_ptr), | intent(in) | :: | in |
Source buffer, can be device or host pointer |
||
| type(c_ptr), | intent(in) | :: | out |
Target buffer, can be device or host pointer |
||
| type(dtfft_stream_t), | intent(in) | :: | stream |
Stream to execute on, used only for device pointers |
||
| integer(kind=int32), | intent(in), | optional | :: | neighbor |
Source rank for pipelined unpacking |
|
| type(c_ptr), | intent(in), | optional | :: | aux |
Target buffer, can be device or host pointer |
|
| integer(kind=int32), | intent(inout), | optional | :: | csize |
Compressed buffer size |
|
| integer(kind=int32), | intent(inout), | optional | :: | csizes(:) |
Multiple compression sizes. This should only be used with CUDA backends |
|
| logical, | intent(in), | optional | :: | skip_compression |
Skip compression/decompression stage. Should be used when packing/unpacking from itself. |
|
| integer(kind=int32), | intent(in), | optional | :: | skip_rank |
Skip compression/decompression for specific rank when neighbor is not specified. |
|
| logical, | intent(in), | optional | :: | sync |
Sync stream after packing/compression. Should be used only for fused backends |