Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

xe: sdpa: fix several errors related to out of bound accesses in sdpa #2497

Merged
merged 8 commits into from
Jan 27, 2025

Conversation

umar456
Copy link
Contributor

@umar456 umar456 commented Jan 24, 2025

Description

This PR addresses several out of bound access errors related to second token shapes(Q = 1 K/V = SEQ_LEN+1).

Additionally this PR fixes issues related to compilation errors when using common zero points masks are used without scales when quantizing.

@umar456 umar456 added bug A confirmed library bug platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel labels Jan 24, 2025
@umar456 umar456 requested review from a team as code owners January 24, 2025 00:43
@github-actions github-actions bot added the component:tests Codeowner: @oneapi-src/onednn-arch label Jan 24, 2025
@umar456 umar456 force-pushed the uarshad/fix_sdpa_config_2nd_token branch from 3952a04 to cd354b5 Compare January 24, 2025 00:45
@umar456 umar456 force-pushed the uarshad/fix_sdpa_config_2nd_token branch from cd354b5 to 9a94ead Compare January 24, 2025 00:49
@umar456
Copy link
Contributor Author

umar456 commented Jan 24, 2025

make test
disable device_cpu
disable benchdnn_all
enable benchdnn_graph

--case=complex_fusion/mha/sdpa-compressed-kv-int4-gs32.json

# llama-2-7b-chat
--in-shapes=0:1x32x128x384*abdc+1:1x32x1x384+2:1x32x1x384+3:1x32x384x128+5:1x1x384x384+6:1x32x384x128+7:1x32x384x1+8:1x32x384x1,\
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: better to reset dt here, to align with other tests, and avoid misunderstanding.

@umar456 umar456 force-pushed the uarshad/fix_sdpa_config_2nd_token branch from 9a94ead to fcded4f Compare January 27, 2025 20:35
@umar456
Copy link
Contributor Author

umar456 commented Jan 27, 2025

make test
disable device_cpu
disable benchdnn_all
enable benchdnn_graph

@umar456 umar456 merged commit 08b5c01 into main Jan 27, 2025
11 of 21 checks passed
@umar456 umar456 deleted the uarshad/fix_sdpa_config_2nd_token branch January 27, 2025 23:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug A confirmed library bug component:tests Codeowner: @oneapi-src/onednn-arch platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants