This function evaluates the performance of a kernel configuration based on various architectural and problem-specific parameters.
| Type | Intent | Optional | Attributes | Name | ||
|---|---|---|---|---|---|---|
| integer(kind=int32), | intent(in) | :: | dims(:) |
Problem dimensions |
||
| integer(kind=int32), | intent(in) | :: | tile_dim |
Tile dimension |
||
| integer(kind=int32), | intent(in) | :: | other_dim |
Other dimension (not tiled) |
||
| type(kernel_type_t), | intent(in) | :: | kernel_type |
Type of kernel_type to evaluate |
||
| type(kernel_config), | intent(in) | :: | config |
Kernel configuration |
||
| type(device_props), | intent(in) | :: | props |
GPU architecture properties |
||
| integer(kind=int64), | intent(in) | :: | base_storage |
Number of bytes needed to store single element |
||
| integer(kind=int32), | intent(in), | optional | :: | neighbor_data(:) |
Neighboring data dimensions for pipelined kernels |
Performance score