Compiles kernel and caches it. Returns compiled kernel.
| Type | Intent | Optional | Attributes | Name | ||
|---|---|---|---|---|---|---|
| integer(kind=int32), | intent(in) | :: | dims(:) |
Local dimensions to process |
||
| type(kernel_type_t), | intent(in) | :: | kernel_type |
Type of kernel to build |
||
| type(dtfft_effort_t), | intent(in) | :: | effort |
How thoroughly |
||
| integer(kind=int64), | intent(in) | :: | base_storage |
Number of bytes needed to store single element |
||
| type(device_props), | intent(in) | :: | props |
GPU architecture properties |
||
| integer(kind=int32), | intent(out) | :: | tile_size |
Size of the tile in shared memory |
||
| integer(kind=int32), | intent(out) | :: | block_rows |
Number of rows in each block processed by each thread |
||
| type(CUfunction), | intent(out) | :: | kernel |
Compiled kernel to return |
||
| logical, | intent(in), | optional | :: | force_effort |
Should effort be forced or not |
|
| integer(kind=int32), | intent(in), | optional | :: | neighbor_data(:) |
Neighbor data for pipelined kernels |