Possible bug RPC Server - Kimi K2 - Apple Silicon #15898
alejandrods
started this conversation in
General
I'm encountering an issue with distributed inference over RPC, running the IQ4_NL quantization of Kimi-K2-Instruct-0905 across 2 Mac Studios (512GB each). The model loads successfully, but it generates only repeated zeros ("0000000...").
Command I ran on the host:
Command I ran on the RPC server:
What do you think the problem might be? Thank you!