Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PR #19112: [GPU] GEMM fusion: support more broadcasts. #19247

Merged
merged 1 commit into from
Nov 15, 2024

Conversation

copybara-service[bot]
Copy link

@copybara-service copybara-service bot commented Nov 12, 2024

PR #19112: [GPU] GEMM fusion: support more broadcasts.

Imported from GitHub PR #19112

Support broadcasts involving 1-sized fragments of dimensions like [1,n] -> broadcast -> [1,m,n].
Copybara import of the project:

--
f024453 by Ilia Sergachev [email protected]:

[GPU] GEMM fusion analysis: support broadcasts of trivially-sized dimensions.

--
86ba22e by Ilia Sergachev [email protected]:

[GPU] Triton GEMM emitter: support broadcasts of trivially-sized dimensions.

--
2ca1165 by Ilia Sergachev [email protected]:

address feedback

--
308cf80 by Ilia Sergachev [email protected]:

add another test

--
8f38c5c by Ilia Sergachev [email protected]:

fix tensor pointer advancement

Merging this change closes #19112

FUTURE_COPYBARA_INTEGRATE_REVIEW=#19112 from openxla:gemm_fusion_support_more_broadcasts 8f38c5c

@copybara-service copybara-service bot force-pushed the test_695335360 branch 4 times, most recently from 8101e3e to b8afc19 Compare November 15, 2024 03:57
Imported from GitHub PR #19112

Support broadcasts involving 1-sized fragments of dimensions like [1,n] -> broadcast -> [1,m,n].
Copybara import of the project:

--
f024453 by Ilia Sergachev <[email protected]>:

[GPU] GEMM fusion analysis: support broadcasts of trivially-sized dimensions.

--
86ba22e by Ilia Sergachev <[email protected]>:

[GPU] Triton GEMM emitter: support broadcasts of trivially-sized dimensions.

--
2ca1165 by Ilia Sergachev <[email protected]>:

address feedback

--
308cf80 by Ilia Sergachev <[email protected]>:

add another test

--
8f38c5c by Ilia Sergachev <[email protected]>:

fix tensor pointer advancement

Merging this change closes #19112

COPYBARA_INTEGRATE_REVIEW=#19112 from openxla:gemm_fusion_support_more_broadcasts 8f38c5c
PiperOrigin-RevId: 696745026
@copybara-service copybara-service bot merged commit d3bbe1f into main Nov 15, 2024
1 check passed
@copybara-service copybara-service bot deleted the test_695335360 branch November 15, 2024 04:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant