Skip to content

Commit 4732b1c

Browse files
committed
Force CUTE_HOST_RTC definition to be HOST_DEVICE because I could not
define this value from the compiler Signed-off-by: Phillip Kuznetsov <[email protected]>
1 parent 47a61d5 commit 4732b1c

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

cpp/tensorrt_llm/cutlass_extensions/include/cutlass_extensions/arch/copy_traits_sm90_multimem.hpp

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,10 @@
2121
#include <cute/numeric/integral_ratio.hpp>
2222
#include <cute/tensor.hpp>
2323

24+
// Override other definitions so that these calls can be used on device. Otherwise they are not propagated properly via
25+
// define flags in the compiler.
26+
#define CUTE_HOST_RTC CUTE_HOST_DEVICE
27+
2428
namespace cute
2529
{
2630
// Utility for unpacking tensor into registers for multimem CopyOp

0 commit comments

Comments
 (0)