
Conversation

@relativityhd commented Sep 4, 2025

Add trigonometric functions (atan2 etc.)

Adds the following functions for all Floats (a usage sketch follows the list):

  • Hyperbolic Sine (sinh)
  • Hyperbolic Cosine (cosh)
  • Inverse Sine (asin)
  • Inverse Hyperbolic Sine (asinh)
  • Inverse Cosine (acos)
  • Inverse Hyperbolic Cosine (acosh)
  • Inverse Tangent (atan)
  • Inverse Hyperbolic Tangent (atanh)
  • 2-argument arctangent (atan2)
  • Degrees (degrees)
  • Radians (radians)
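
As a usage sketch (not from the PR): a cube kernel calling one of the new functions. The F::atan2 call syntax is an assumption on my part, not confirmed API:

    #[cube]
    fn polar_angle<F: Float>(y: F, x: F) -> F {
        // Two-argument arctangent: the angle of the point (x, y) over the
        // full [-π, π] range, unlike plain atan(y / x).
        F::atan2(y, x)
    }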

Open Questions / Missing parts

  • MLIR implementation
  • Non f64 / f32 correctness
  • Metal atomic / safe operations

MLIR implementation

I have set up a placeholder for now, with the code that I assumed would work left commented out.
It seems that support for these functions needs to be added in the tracel-llvm repository, but I have no clue where.

Non f64 / f32 correctness

In some places there is only limited support for the "special" float types; for powf, which I used as a template, a conversion to regular floats was needed.
I am unsure at which points this conversion is necessary and where I can find out whether it is.
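
As an aside, a minimal sketch of the cast-compute-cast pattern in question, modeled on the tanh formatter quoted in the next section; the function name and the emitted half(...)/float(...) casts are illustrative assumptions, not the repo's actual code:

    use std::fmt::{self, Formatter};

    // Hypothetical helper: widen both operands to f32, compute, then narrow
    // the result back to the half-precision type.
    fn compile_instruction_atan2_half(
        f: &mut Formatter<'_>,
        lhs: &str,
        rhs: &str,
    ) -> fmt::Result {
        write!(f, "half(atan2(float({lhs}), float({rhs})))")
    }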

Metal safe operations

While using the existing sin, cos, tanh, and powf functions as examples for adding the new ones, I stumbled across the Metal implementation of tanh:

    fn compile_instruction_tanh_scalar<T: Component<Self>>(
        f: &mut std::fmt::Formatter<'_>,
        input: T,
    ) -> std::fmt::Result {
        write!(f, "safe_tanh_scalar({input})")
    }

I couldn't find anything about this "safe" version in the Metal documentation, but I am clearly not an expert.
For which functions is a "safe" implementation needed, and which are fine without one?
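
For what it's worth, safe_tanh does not appear to be a Metal API; it is presumably a helper emitted by cubecl-cpp to work around fast-math tanh producing NaN for large inputs. A sketch of the likely idea (the clamp bound is an assumption):

    fn safe_tanh(x: f32) -> f32 {
        // tanh saturates to ±1 well before |x| = 10, so clamping the input
        // keeps the underlying fast-math implementation away from the range
        // where its intermediates can overflow to NaN.
        x.clamp(-10.0, 10.0).tanh()
    }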

Validate your PR with burn.

It is important that you make sure that you don't introduce any bugs in burn.

Instructions

  • [ ] Create a new branch or fork of the burn repo
  • [ ] Update the main Cargo.toml with this PR hash.
  • [ ] Fix any broken tests or compilation errors in burn.
  • [ ] Submit a PR in burn with your fixes and link it here.

@relativityhd (Author)

cargo xtask validate just takes too long on my machine; I aborted it after 10 minutes. I ran cargo test -p cubecl-wgpu and cargo test -p cubecl-core instead, and they seem to pass.

@wingertge (Collaborator)

I wonder whether it might be a good idea to pull all the trigonometry functions (including the existing ones) into a new enum in the IR; that's a lot of added operations, and I think it's now enough to warrant its own category. We should at some point also think about applying a similar separation to the compilers themselves, but that's a larger rework that wouldn't go into this PR.

I'm currently looking into the MLIR functions for that stuff; the bindings are auto-generated, I believe, so if the functions don't exist, they might be in a different namespace or need a different way to handle them.

As for the float types, casting to float for unsupported float types (i.e. F16, BF16) is reasonable. There do appear to be double versions of the functions, so that will work natively.

The inline review below is anchored on this hunk in the MLIR backend:

    let result = self.append_operation_with_result(operation);
    self.insert_variable(out, result);
    }
    Arithmetic::Sinh(_sinh) => {
Collaborator

The relevant functions are actually in the dialect::ods::math module (not dialect::ods::llvm). So you'll need to import that and adjust as appropriate.
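
A sketch of what that change could look like; the module path is taken from this comment, but the builder call shape is an assumption based on melior's generated ODS ops, so treat the commented lines as pseudocode:

    use melior::dialect::ods::math;

    // Arithmetic::Sinh(_) => {
    //     let operation = math::sinh(context, result_type, input, location).into();
    //     let result = self.append_operation_with_result(operation);
    //     self.insert_variable(out, result);
    // }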

Author

I implemented everything. Now I get this error message when running tests:

    error: cannot be converted to LLVM IR: missing LLVMTranslationDialectInterface registration for dialect for op: math.atan2

However, according to code hints everything seems fine...

Collaborator

I'll have to tag in @marcantoinem here because I don't understand much about the MLIR backend. I'm guessing the dialect needs to be registered somewhere, but I'm not sure about the details.

Contributor

Good catch, I didn't see that there is a dialect other than arith for more complex mathematical operations; I was using LLVM intrinsics instead. The error you got is because the pass that lowers the math dialect to LLVM intrinsics is not registered. To fix it you just need to add pass_manager.add_pass(pass::conversion::create_math_to_llvm()); in src/compiler/module.rs.
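
In context, the suggested registration might look like this (a sketch; the actual pass setup in src/compiler/module.rs will differ):

    use melior::pass::{self, PassManager};

    fn register_passes(pass_manager: &PassManager) {
        // Lower `math` dialect ops (math.atan2, math.acos, ...) to LLVM
        // intrinsics before the final translation to LLVM IR.
        pass_manager.add_pass(pass::conversion::create_math_to_llvm());
    }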

Author

Locally, I still get the same error, even after running cargo clean, deleting the target directory, and passing --features mlir-dump to cargo test -p cubecl-cpu. I also checked whether the line was reached, and it seems it was, but nothing happened (maybe because I missed another cache?).
I pushed a commit with the added line; maybe it runs on your machine?

Contributor

I tried your code and I get the same error; the pass seems to do nothing. I haven't used it anywhere else in the MLIR compiler, so it must be a problem with the MLIR C API not running this pass correctly. For the moment you can use LLVM intrinsics, I think; it will need more investigation to find out why registering math_to_llvm doesn't work. The only problem with using LLVM intrinsics instead of the math dialect is portability if we ever want to port the MLIR compiler to GPU, but that is not a concern right now.

Author

The arc-* operations are not available as LLVM intrinsics, so I have disabled them for now until the math dialect becomes available.

Commit: precision for some of the new trigonometric operations
@relativityhd commented Sep 5, 2025

@wingertge

I wonder whether it might be a good idea to pull all the trigonometry functions (including the existing ones) into a new enum in the IR; that's a lot of added operations, and I think it's now enough to warrant its own category.

Do I need to implement this in this PR, or will it be part of another refactor?

As for the float types, casting to float for unsupported float types (i.e. F16, BF16) is reasonable. There do appear to be double versions of the functions, so that will work natively.

Can you please double-check my implementation? I don't really understand at which point the right functions are written to the shader for each dialect...

@wingertge
Copy link
Collaborator

@wingertge

I wonder whether it might be a good idea to pull all the trigonometry functions (including the existing ones) into a new enum in the IR; that's a lot of added operations, and I think it's now enough to warrant its own category.

Do I need to implement this in this PR, or will it be part of another refactor?

I think that would be better to do after all the outstanding work has been merged, so I'll do it separately.
One more thing I'd add retroactively: since there are no intrinsics for converting between degrees and radians, I think it would be better to implement those as cube functions in cubecl-std rather than in the compilers. The implementation would be trivial:

    #[cube]
    pub fn to_degrees<F: Float>(val: F) -> F {
        // Multiply by 180/π to convert radians to degrees.
        val * F::new(180.0 / core::f32::consts::PI)
    }

It would keep the logic in one place rather than implementing it separately for each compiler.
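
The inverse conversion would follow the same pattern (same sketch caveats as above):

    #[cube]
    pub fn to_radians<F: Float>(val: F) -> F {
        // Multiply by π/180 to convert degrees to radians.
        val * F::new(core::f32::consts::PI / 180.0)
    }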

@wingertge (Collaborator)

Can you please double-check my implementation? I don't really understand at which point the right functions are written to the shader for each dialect...

CUDA is already correct; I'm not sure what the WGSL behaviour even is, because it's an experimental extension. It might just support f16 overloads on those functions by default, but I can't test it yet because of some issues with features on Vulkan.
For SPIR-V I think this would be best implemented by a preprocessor, but those are poorly documented, so I think the easiest way forward is to open an issue to track that it still needs to be done, and I'll deal with it myself.

@relativityhd (Author)

there are no intrinsics for converting between degrees and radians

There are intrinsics for WGSL and SPIR-V (at least in rspirv-ext), and Rust's f32 and f64 also have to_degrees and to_radians methods. As a user, I would probably look for these functions on the Float trait rather than in cubecl-std. That's why I think having them there would still make sense.

I removed the dialect-specific compile_instruction_xxx_scalar calls and instead hardcoded the operations directly in cubecl-cpp, since CUDA, HIP, and Metal should all convert implicitly to the right type and share the same syntax.
I also added the to_degrees and to_radians functions to cubecl-std, so that you can decide which variant fits best / should survive a refactor.
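
For illustration, the hardcoded emission amounts to something like this; the function name is mine, not the actual cubecl-cpp item:

    fn emit_atan2(lhs: &str, rhs: &str) -> String {
        // CUDA, HIP, and Metal all accept this same call syntax; implicit
        // conversions take care of the operand types.
        format!("atan2({lhs}, {rhs})")
    }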

@relativityhd (Author)

Before things get stale, I'm marking this as ready for review.

Currently, 21 tests are failing: 20 of them are related to the missing CPU implementation of the inverse trigonometric functions (arc-*), and one is that to_degrees fails for f16:

    Values differ more than epsilon: actual=89.9375, expected=90, difference=0.0625, epsilon=0.020004272
    index: 1
    actual: [0.0, 89.9375, 179.875, -89.9375, -179.875]
    expected: [0.0, 90.0, 180.0, -90.0, -180.0]

I can't do more for either problem: the first needs to be fixed by @marcantoinem, I guess, and the second needs a design decision on whether to_degrees (and to_radians) should be present for Float types at all and, if so, whether it is acceptable to loosen the test epsilons for them.

If I should do more, let me know.

@relativityhd marked this pull request as ready for review on September 9, 2025.
@wingertge (Collaborator)

Currently, 21 tests are failing: 20 of them are related to the missing CPU implementation of the inverse trigonometric functions (arc-*), and one is that to_degrees fails for f16:

That epsilon looks too tight for f16 (89.9375 is exactly one f16 ulp below 90, since the f16 spacing in [64, 128) is 2^-4 = 0.0625). There might be another bug in the precision-aware comparison, like when I forgot an abs and made the tolerance way too tight for negative values. This one is positive, but the tolerance still seems excessively strict, so something needs to be adjusted there. Maybe we can modify all tests to specify an epsilon, since it can depend on the exact operation.

Commit: Change the epsilon for to_degrees() to 0.3, which checks out with the f16 maximum error for our valid tests.
Commit: Move to_degrees and to_radians there
@relativityhd (Author)

I changed the runtime tests for unary ops so that epsilon is now an optional parameter of the macro, defaulting to 0.02, and set the epsilon for the to_degrees test to 0.3. This test is now passing.
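
For reference, the optional-epsilon pattern can be expressed with a second macro arm; this is only a sketch, and the real macro's name and arguments differ:

    macro_rules! test_unary {
        // Without an explicit epsilon, fall back to the 0.02 default.
        ($name:ident, $op:expr, $input:expr, $expected:expr) => {
            test_unary!($name, $op, $input, $expected, 0.02);
        };
        ($name:ident, $op:expr, $input:expr, $expected:expr, $epsilon:expr) => {
            #[test]
            fn $name() {
                let actual: f32 = $op($input);
                let diff = (actual - $expected).abs();
                assert!(diff <= $epsilon, "difference={diff}, epsilon={}", $epsilon);
            }
        };
    }

    // Default epsilon for most ops; an explicit 0.3 for the to_degrees case.
    test_unary!(test_sin_zero, f32::sin, 0.0, 0.0);
    test_unary!(test_to_degrees, f32::to_degrees, core::f32::consts::FRAC_PI_2, 90.0, 0.3);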

I moved the to_degrees and to_radians functions to their own trigonometric module and added documentation and tests.
I also asked an LLM to add more trigonometric functions so that this module is not so empty... I validated everything; I think most functions are useful to some extent, and all tests pass except for one function on the CPU backend, because atan2 is not correctly implemented there and is therefore also blocked by @marcantoinem's MLIR magic :D
I see that this was somewhat unnecessary, so if you want, I can delete it.

@relativityhd (Author)

Since I used a lot of PI, PI*2, etc., especially in the tests, I wondered whether it would be useful to expose constants on the Float trait. I know that Rust's f32 and f64 already have many constants under ::consts, and that a lot of constants are already implemented for the cubecl f16 type.
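
A minimal sketch of what that could look like; this is not the actual cubecl API, and new simply mirrors the F::new(f32) constructor used elsewhere in this PR:

    use core::f32::consts;

    pub trait Float: Sized {
        fn new(val: f32) -> Self;

        // Default methods, so every implementor gets the constants for free.
        fn pi() -> Self {
            Self::new(consts::PI)
        }
        fn two_pi() -> Self {
            Self::new(consts::TAU)
        }
    }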

@relativityhd (Author)

Another thing: I wondered whether this function signature here would prevent creating fully precise f64 values, since it only accepts an f32.

@nathanielsimard (Member)

The code looks fine to me, but the tests don't pass on CI. @relativityhd We may also want to remove some functions from the trigo module; I'm unsure they are all necessary.

@relativityhd (Author)

@nathanielsimard Yes, the CI is failing because, as I mentioned, this PR is blocked by the MLIR melior project: the math dialect there won't register correctly, as @marcantoinem noted.
The tests are failing because of the empty implementations of the functions that depend on the melior math module.

We may also want to remove some functions from the trigo module; I'm unsure they are all necessary.

Sure, just tell me which ones I should remove.

@nathanielsimard (Member)

Sure, just tell me which ones I should remove.

The ones that are not strictly necessary. Using if statements is also very bad on most GPU hardware, since divergent branches serialize execution within a warp, so we can remove the functions that contain if statements.
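
To illustrate the point, a branchless formulation is usually preferred where possible, e.g. for a sign helper (a sketch, not taken from the PR):

    fn sign_branchless(x: f32) -> f32 {
        // The comparisons lower to selects/flags instead of divergent jumps.
        ((x > 0.0) as i32 - (x < 0.0) as i32) as f32
    }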

@relativityhd (Author)

Sorry for the long wait; my vacation is over and my time is limited again...

I've removed the unnecessary trig functions from the std, keeping only hypot, to_radians, and to_degrees, since they are also present on Rust's f32.
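
For reference, the kept hypot can be written in the same style as to_degrees above; this assumes sqrt is available on Float like the other operations in this PR, and a production hypot would also rescale its inputs to avoid overflow in x * x:

    #[cube]
    pub fn hypot<F: Float>(x: F, y: F) -> F {
        // Naive Euclidean norm: sqrt(x² + y²).
        F::sqrt(x * x + y * y)
    }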

This PR is still blocked by the MLIR melior project. I still get the following error when testing (I've removed the dummy code and enabled ods math support again, as @marcantoinem described):

    error: cannot be converted to LLVM IR: missing `LLVMTranslationDialectInterface` registration for dialect for op: math.acos

@nathanielsimard What repository is responsible for this? Maybe I can check and fix it there. I am quite confused about the different LLVM repositories and packages used.

@nathanielsimard (Member)

@nathanielsimard What repository is responsible for this? Maybe I can check and fix it there. I am quite confused about the different LLVM repositories and packages used.

https://github.com/tracel-ai/tracel-llvm :)

