You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The data is logits and targets from one hot encoded MNIST and a poorly fitted logistic regression.
If you subset the data and take a few thousand subsamples, it works well. but for some reasons, the results are crazy on the MPS if nondeterminism is allowed. And without it, is is still super strange to get a 'nan'. This should really have a finite answer all the time. Which the cpu computation gives.
I'd be happy to supply the pt file, but im not allowed to attach it.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Hello. Consider the code below, running on a MacBrook Pro with M1 mps accelleration.
with the supplied data file. The output on my machine is
The data is logits and targets from one hot encoded MNIST and a poorly fitted logistic regression.
If you subset the data and take a few thousand subsamples, it works well. but for some reasons, the results are crazy on the MPS if nondeterminism is allowed. And without it, is is still super strange to get a 'nan'. This should really have a finite answer all the time. Which the cpu computation gives.
I'd be happy to supply the pt file, but im not allowed to attach it.
Beta Was this translation helpful? Give feedback.
All reactions