
[GPU] updates to build some selected kernels in separate batches #24499

Merged: 5 commits into openvinotoolkit:master on May 21, 2024

Conversation

e-ddykim (Contributor) commented May 14, 2024:

Details:

  • This PR updates the kernels_cache to build selected kernels in separate batches (see the sketch after this list).
    • This is a temporary workaround to resolve a performance degradation that occurs when certain kernels are built together with other kernels in the same batch.
    • Currently, the selected kernels include gemm_tiled_opt.
    • Impacted scenario: Qwen INT4 first-token latency for >1K inputs on MTL (Meteor Lake).
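
To make the idea concrete, here is a minimal sketch of the batching logic under this change. The names Batch, make_batches, and max_batch_size are illustrative assumptions, not OpenVINO identifiers; the actual implementation lives in the GPU plugin's kernels_cache and differs in detail. Kernels whose base name appears in special_kernels are placed in dedicated batches instead of being grouped with others.

#include <algorithm>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a compilation batch in kernels_cache.
struct Batch {
    std::vector<std::string> sources;  // kernel sources compiled together
};

// Kernels listed here are built alone; per this PR the list holds gemm_tiled_opt.
static const std::vector<std::string> special_kernels = {"gemm_tiled_opt"};

std::vector<Batch> make_batches(const std::vector<std::pair<std::string, std::string>>& kernels,
                                std::size_t max_batch_size) {
    std::vector<Batch> batches;
    Batch shared;
    for (const auto& [base_kernel_name, source] : kernels) {
        bool separate = std::count(special_kernels.begin(), special_kernels.end(),
                                   base_kernel_name) > 0;
        if (separate) {
            batches.push_back(Batch{{source}});  // dedicated batch for this kernel
            continue;
        }
        shared.sources.push_back(source);
        if (shared.sources.size() >= max_batch_size) {  // flush a full shared batch
            batches.push_back(std::move(shared));
            shared = Batch{};
        }
    }
    if (!shared.sources.empty())
        batches.push_back(std::move(shared));
    return batches;
}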

Tickets:

  • GSD-8910

@e-ddykim e-ddykim added the WIP (work in progress, do not merge) labels May 14, 2024
@e-ddykim e-ddykim requested review from a team as code owners May 14, 2024 06:33
@github-actions github-actions bot added the "category: GPU" (OpenVINO GPU plugin) label May 14, 2024
return unique_kernel_name.substr(0, pos);
};

auto get_target_batch = [&]() -> batch_program& {
A reviewer (Contributor) commented:
Does the issue really happen due to multiple instances of the same kernel in the batch, or is it just related to batch size? As I remember, if the program source is too large, then IGC may produce a worse binary.

e-ddykim (Contributor Author) replied:
It's not clear what the root cause is as of now, but it does not look like it is due to program source size. In my test, the issue was gone when I commented out just one line.
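
For context, the get_target_batch helper in the diff above decides which batch a kernel lands in. A hedged sketch of that routing follows, written as a free function rather than the lambda in the diff; batch_program here is a stand-in struct with an assumed holds_special_kernel flag, not the real OpenVINO type.

#include <string>
#include <vector>

// Stand-in for the plugin's batch_program; the real type carries much more state.
struct batch_program {
    std::vector<std::string> sources;
    bool holds_special_kernel = false;  // assumption: marks a dedicated batch
};

// Returns the batch a kernel should be appended to: the last open shared batch
// normally, or a freshly created one when the kernel must be built alone.
batch_program& get_target_batch(std::vector<batch_program>& batches, bool need_separate_batch) {
    if (batches.empty() || need_separate_batch || batches.back().holds_special_kernel) {
        batches.emplace_back();
        batches.back().holds_special_kernel = need_separate_batch;
    }
    return batches.back();
}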

@e-ddykim e-ddykim changed the title from "[GPU] updates to build similar kernels in separate batches" to "[GPU] updates to build some selected kernels in separate batches" May 19, 2024
@e-ddykim e-ddykim added this to the 2024.2 milestone May 20, 2024
@e-ddykim e-ddykim removed the WIP (work in progress, do not merge) labels May 20, 2024
// check if the current kernel name is in special_kernels
auto target_base_kernel_name = get_base_kernel_name(entry_point);
if (std::count(special_kernels.begin(), special_kernels.end(), target_base_kernel_name) > 0)
return true;
A reviewer (Contributor) commented:
If the current entry has gemm_tiled_opt => it will need_separate_batch: is this the intention?
(The current behavior seems so.)
If so, why not simply check:
if (entry_point.find("gemm_tiled_opt") != std::string::npos)
=> need_separate_batch?

e-ddykim (Contributor Author) replied:
Oh! I updated it as you suggested. Thank you!
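
The adopted check reduces to a single substring test on the entry point, replacing the base-name extraction plus std::count lookup shown above. As a standalone sketch (the real code sits inside kernels_cache; need_separate_batch is the assumed helper name):

#include <string>

// Any entry point containing "gemm_tiled_opt" is built in its own batch.
bool need_separate_batch(const std::string& entry_point) {
    return entry_point.find("gemm_tiled_opt") != std::string::npos;
}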

yeonbok (Contributor) commented May 20, 2024:

I believe this will be reverted once the driver issue is resolved. Could you please add the ticket numbers to the PR?

e-ddykim (Contributor Author) replied:

> I believe this will be reverted once the driver issue is resolved. Could you please add the ticket numbers to the PR?

I added it. Thank you.

@yeonbok yeonbok added this pull request to the merge queue May 21, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks May 21, 2024
@yeonbok yeonbok added this pull request to the merge queue May 21, 2024
Merged via the queue into openvinotoolkit:master with commit 415ba28 May 21, 2024
100 checks passed
Labels: category: GPU (OpenVINO GPU plugin), Code Freeze