find_valid_combination Subroutine

private subroutine find_valid_combination(base_tile, base_rows)

Adjusts the tile size and the number of rows used for narrow matrices so that the resulting combination is compatible with the warp size. Both arguments are updated in place.

Arguments

Type                  Intent   Optional   Attributes   Name        Description
integer(kind=int32)   inout                            base_tile   Tile size
integer(kind=int32)   inout                            base_rows   Number of rows
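
This page does not show the body of the routine, so the following is only a minimal sketch of what "compatible with the warp size" could mean, not the library's actual implementation. It assumes a warp size of 32 and assumes the compatibility rule is that base_tile * base_rows must be a multiple of the warp size. The module name, the _sketch suffix, the WARP_SIZE constant, and the downward search order are all hypothetical.

```fortran
module valid_combination_sketch
  use iso_fortran_env, only: int32
  implicit none
  private
  public :: find_valid_combination_sketch

  ! Assumption: NVIDIA warp size; the real code may obtain it elsewhere.
  integer(int32), parameter :: WARP_SIZE = 32_int32

contains

  subroutine find_valid_combination_sketch(base_tile, base_rows)
    integer(int32), intent(inout) :: base_tile  !< Tile size
    integer(int32), intent(inout) :: base_rows  !< Number of rows
    integer(int32) :: tile, rows

    ! Assumed rule: tile * rows must be a multiple of WARP_SIZE.
    ! Search downward from the requested values, so the first match is
    ! the largest tile (and, for that tile, the largest row count)
    ! that satisfies the rule.
    do tile = base_tile, 1, -1
      do rows = base_rows, 1, -1
        if (mod(tile * rows, WARP_SIZE) == 0) then
          base_tile = tile
          base_rows = rows
          return
        end if
      end do
    end do
    ! No valid pair found: both arguments are left unchanged.
  end subroutine find_valid_combination_sketch

end module valid_combination_sketch
```

Under these assumptions, a request of base_tile = 16, base_rows = 3 becomes (16, 2), since 16 * 3 = 48 is not a multiple of 32 but 16 * 2 = 32 is. A request that is already valid, such as (16, 8), is returned unchanged.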


Called by

transpose_handle_cuda%create -> nvrtc_kernel%create -> get_kernel -> generate_candidates -> find_valid_combination