Skip to content

Conversation

@fanqiNO1
Copy link
Contributor

@fanqiNO1 fanqiNO1 commented Sep 12, 2024

If the LLM is too big to be loaded in a single GPU, we need device_map = 'auto' to avoid OOM.

According to the issue #715.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant