Attach `opencl.kernels` metadata #2869

victor-eds · 2024-11-28T15:58:31Z

It is codegen's responsibility to add the OpenCL `opencl.kernels` metadata to the LLVM module. We currently fail to do so, which prevents further optimizations driven by kernel metadata, such as setting a constant work-group size. Add the metadata in a similar way to https://github.com/intel/llvm/blob/a024380e33151b8d1e0d3abb937ff089ec1f24ba/llvm/test/CodeGen/AMDGPU/sampler-resource-id.ll#L44:

!opencl.kernels = !{!0, !1, !2}

!0 = !{ptr @test_0, !10, !20, !30, !40, !50}
!10 = !{!"kernel_arg_addr_space", i32 0, i32 1}
!20 = !{!"kernel_arg_access_qual", !"none", !"none"}
!30 = !{!"kernel_arg_type", !"sampler_t", !"int*"}
!40 = !{!"kernel_arg_base_type", !"sampler_t", !"int*"}
!50 = !{!"kernel_arg_type_qual", !"", !""}

!1 = !{ptr @test_1, !11, !21, !31, !41, !51}
!11 = !{!"kernel_arg_addr_space", i32 0, i32 0, i32 1}
!21 = !{!"kernel_arg_access_qual", !"none", !"none", !"none"}
!31 = !{!"kernel_arg_type", !"sampler_t", !"sampler_t", !"int*"}
!41 = !{!"kernel_arg_base_type", !"sampler_t", !"sampler_t", !"int*"}
!51 = !{!"kernel_arg_type_qual", !"", !"", !""}

!2 = !{ptr @test_2, !12, !22, !32, !42, !52}
!12 = !{!"kernel_arg_addr_space", i32 0, i32 0, i32 0, i32 1}
!22 = !{!"kernel_arg_access_qual", !"none", !"none", !"none", !"none"}
!32 = !{!"kernel_arg_type", !"sampler_t", !"sampler_t", !"sampler_t", !"int*"}
!42 = !{!"kernel_arg_base_type", !"sampler_t", !"sampler_t", !"sampler_t", !"int*"}
!52 = !{!"kernel_arg_type_qual", !"", !"", !"", !""}

Which metadata nodes to add is still to be decided. At a minimum, the `opencl.kernels` named metadata and the nodes that reference the kernels should be added.
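As a rough illustration of that minimum, here is a small Python sketch that prints the textual IR for an `opencl.kernels` node whose operands only reference the kernels. The helper name `opencl_kernels_metadata` is hypothetical and not part of the backend. The sketch also assumes metadata IDs can start at `!0`, which would not hold in a module that already contains numbered metadata.

```python
def opencl_kernels_metadata(kernel_names):
    """Return textual LLVM IR for a minimal `opencl.kernels` named metadata.

    Hypothetical helper for illustration only: each kernel gets one node
    holding just a reference to the kernel function, with no argument info.
    """
    # Assumption: IDs !0..!N-1 are free; a real module may already use them.
    ids = [f"!{i}" for i in range(len(kernel_names))]
    lines = [f"!opencl.kernels = !{{{', '.join(ids)}}}", ""]
    for node_id, name in zip(ids, kernel_names):
        # One node per kernel, containing only the kernel reference.
        lines.append(f"{node_id} = !{{ptr @{name}}}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(opencl_kernels_metadata(["test_0", "test_1", "test_2"]))
```

The per-argument nodes from the example above (`kernel_arg_addr_space`, `kernel_arg_type`, and so on) are deliberately left out, since whether they are needed is still an open question here.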

We can do this by attaching the required metadata to the function as a triton_gen attribute before lowering to the LLVM dialect, and then handling that attribute in https://github.com/intel/intel-xpu-backend-for-triton/blob/main/third_party/intel/lib/Target/LLVMIR/Dialect/TritonGEN/TritonGENToLLVMIRTranslation.cpp.

victor-eds · 2024-12-09T13:30:22Z

Blocked by a backend bug. This may not be needed; check again after the bug is fixed.

victor-eds added the enhancement, good first issue, and performance labels Nov 28, 2024

victor-eds mentioned this issue Nov 28, 2024

[XPU][TritonGPUToLLVM] Use reqd_work_group_size #2845

Draft

vlad-penkin added this to the 0.3 [Triton] Language and Runtime milestone Dec 2, 2024

victor-eds linked a pull request Dec 4, 2024 that will close this issue

[XPU][LLVMIR] Generate opencl.kernels named metadata #2923

Draft

victor-eds self-assigned this Dec 4, 2024
