Don't use implicitly `elapsed_time` in autotuner #3036

anmyachev · 2024-12-17T22:24:38Z

The main idea of this pull request is not to use elapsed_time that enable profiling mode for sycl queues, as this is not needed for profiling with PyTorch and PTI.

CI runs:

https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/12390476117 (legacy profiler)
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/12390481323 (upstream profiler)
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/12392648088 (legacy profiler - 1a1c98e)
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/12392654868 (upstream profiler - 1a1c98e)

Signed-off-by: Anatoly Myachev <[email protected]>

anmyachev · 2024-12-18T16:06:05Z

@whitneywhtsang we can try the changes in #2484 on DLE runner, but we need to cherry-pick 2a4b818 into Pavel's branch

benchmarks/triton_kernels_benchmark/gemm_benchmark.py

benchmarks/triton_kernels_benchmark/gemm_postop_addmatrix_benchmark.py

benchmarks/triton_kernels_benchmark/gemm_postop_gelu_benchmark.py

benchmarks/triton_kernels_benchmark/gemm_preop_exp_benchmark.py

whitneywhtsang · 2024-12-18T17:29:04Z

@whitneywhtsang we can try the changes in #2484 on DLE runner, but we need to cherry-pick 2a4b818 into Pavel's branch

Let's cherry-pick this PR to ptdb-dle-runner.

Co-authored-by: Whitney Tsang <[email protected]>

anmyachev · 2024-12-18T17:37:52Z

@whitneywhtsang we can try the changes in #2484 on DLE runner, but we need to cherry-pick 2a4b818 into Pavel's branch

Let's cherry-pick this PR to ptdb-dle-runner.

ok, but let's use 2a4b818 (last commit in #2484) which compatible with changes on Pavel's branch

Signed-off-by: Anatoly Myachev <[email protected]>

This reverts commit 2a4b818.

anmyachev added 2 commits December 17, 2024 22:24

Don't use implicitly 'elapsed_time' in autotuner

5878f0d

Signed-off-by: Anatoly Myachev <[email protected]>

pass a function for autotuner via 'do_bench' param

1a1c98e

Signed-off-by: Anatoly Myachev <[email protected]>

anmyachev linked an issue Dec 18, 2024 that may be closed by this pull request

Don't use implicitly elapsed_time in autotuner when profiling with PyTorch and PTI #3039

Open

revert warmup/rep changes

c316093

Signed-off-by: Anatoly Myachev <[email protected]>

anmyachev marked this pull request as ready for review December 18, 2024 13:51

anmyachev requested a review from whitneywhtsang December 18, 2024 13:52

anmyachev added a commit that referenced this pull request Dec 18, 2024

try changes from #3036

2a4b818

Signed-off-by: Anatoly Myachev <[email protected]>

whitneywhtsang reviewed Dec 18, 2024

View reviewed changes

Apply suggestions from code review

c21c92a

Co-authored-by: Whitney Tsang <[email protected]>

whitneywhtsang pushed a commit that referenced this pull request Dec 18, 2024

try changes from #3036

231b07a

Signed-off-by: Anatoly Myachev <[email protected]>

anmyachev added a commit that referenced this pull request Dec 18, 2024

Revert "try changes from #3036"

0d66c8e

This reverts commit 2a4b818.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't use implicitly `elapsed_time` in autotuner #3036

Don't use implicitly `elapsed_time` in autotuner #3036

anmyachev commented Dec 17, 2024 •

edited

Loading

anmyachev commented Dec 18, 2024

whitneywhtsang commented Dec 18, 2024

anmyachev commented Dec 18, 2024 •

edited

Loading

Don't use implicitly elapsed_time in autotuner #3036

Are you sure you want to change the base?

Don't use implicitly elapsed_time in autotuner #3036

Conversation

anmyachev commented Dec 17, 2024 • edited Loading

anmyachev commented Dec 18, 2024

whitneywhtsang commented Dec 18, 2024

anmyachev commented Dec 18, 2024 • edited Loading

Don't use implicitly `elapsed_time` in autotuner #3036

Don't use implicitly `elapsed_time` in autotuner #3036

anmyachev commented Dec 17, 2024 •

edited

Loading

anmyachev commented Dec 18, 2024 •

edited

Loading