Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Tracking Issue] SparseTIR Compatibility with Software Pipelining and Async Copy #61

Open
yzh119 opened this issue Nov 3, 2022 · 0 comments

Comments

@yzh119
Copy link
Member

yzh119 commented Nov 3, 2022

The pipeline and async copy feature in CUDA 11 can overlap data movement and computation.

TVM has already supported compiler passes such as InjectSoftwarePipeline/InjectDoubleBuffer/etc (https://github.com/apache/tvm-rfcs/blob/main/rfcs/0077-async-pipeline.md), however, we have not made these passes compatible with sparse workloads, which limits the performance of SparseTIR generated kernels.

@yzh119 yzh119 changed the title [Tracking Issue] SparseTIR Compatibility with Software Pipelining [Tracking Issue] SparseTIR Compatibility with Software Pipelining and Async Copy Nov 10, 2022
@yzh119 yzh119 moved this to TODO in SparseTIR Nov 15, 2022
@yzh119 yzh119 added this to SparseTIR Nov 15, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: TODO
Development

No branches or pull requests

1 participant