Conversation

@johnnynunez (Contributor)

Thor and Spark

@MasterJH5574 (Member)

Hi @johnnynunez, thanks for contributing! We found that the change in https://github.com/apache/tvm/pull/18300/files is the only one we need for CUDA 13. Everything else is compatible with CUDA 13, including flashinfer-python.

We removed AOT FlashInfer support a while ago, so the CMake config USE_FLASHINFER has no actual effect. I can help take over the PR and clean up those configs in the codebase.

@johnnynunez (Contributor, Author) commented on Sep 10, 2025

> Hi @johnnynunez, thanks for contributing! We found that the change in https://github.com/apache/tvm/pull/18300/files is the only one we need for CUDA 13. Everything else is compatible with CUDA 13, including flashinfer-python.
>
> We removed AOT FlashInfer support a while ago, so the CMake config USE_FLASHINFER has no actual effect. I can help take over the PR and clean up those configs in the codebase.

Thanks! It's because I was using Thor and Spark, and that FlashInfer version is compatible with CUDA 13.

@MasterJH5574 (Member)

> It's because I was using Thor and Spark, and that FlashInfer version is compatible with CUDA 13.

@johnnynunez Got it. We can bump it then. I am currently working on shipping our CUDA 13 Python package, and will update this PR after finishing that.

@johnnynunez (Contributor, Author)

> > It's because I was using Thor and Spark, and that FlashInfer version is compatible with CUDA 13.
>
> @johnnynunez Got it. We can bump it then. I am currently working on shipping our CUDA 13 Python package, and will update this PR after finishing that.

Thank you! CUDA 13 is the baseline for those devices.

@MasterJH5574 merged commit a690e94 into mlc-ai:main on Sep 15, 2025.
@MasterJH5574 (Member)

Hi @johnnynunez, I just updated this PR and got it merged. Unfortunately, FlashInfer 0.3.1 doesn't work with the latest tvm and mlc-llm main because of our recent rename of ffi::NDArray to ffi::Tensor. We have updated the FlashInfer side accordingly (flashinfer-ai/flashinfer@0828553) and expect the fix to be included in the next FlashInfer release.

For now, if you want to use FlashInfer, you will need to clone the FlashInfer GitHub repo and build it from source, following https://docs.flashinfer.ai/installation.html#python-package.
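
For reference, here is a minimal hedged Python check for whether the installed flashinfer-python postdates 0.3.1. The helper name and the "> 0.3.1" threshold are assumptions based on "the next FlashInfer release" above, and a from-source build may not be detected correctly by this:

```python
# Hedged sketch (not from this thread): guess whether the installed
# flashinfer-python postdates 0.3.1, the last release that still used
# ffi::NDArray on the TVM FFI side. The "> 0.3.1" threshold is an
# assumption based on "the next FlashInfer release" mentioned above.
from importlib.metadata import PackageNotFoundError, version

from packaging.version import Version


def flashinfer_has_tensor_rename() -> bool:  # hypothetical helper name
    """Best-effort check: releases after 0.3.1 are expected to include
    flashinfer-ai/flashinfer@0828553 (ffi::NDArray -> ffi::Tensor)."""
    try:
        installed = Version(version("flashinfer-python"))
    except PackageNotFoundError:
        return False  # flashinfer-python is not installed
    return installed > Version("0.3.1")


if __name__ == "__main__":
    print("compatible with latest tvm main:", flashinfer_has_tensor_rename())
```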
