Skip to content

[v0.3.3] Release Tracker #3097

@WoosukKwon

Description

@WoosukKwon

ETA: Feb 29th - Mar 1st

Major changes

  • StarCoder2 support
  • Performance optimization and LoRA support for Gemma
  • Performance optimization for MoE kernel
  • 2/3/8-bit GPTQ support
  • [Experimental] AWS Inferentia2 support

PRs to be merged before the release

Metadata

Metadata

Assignees

No one assigned

    Labels

    releaseRelated to new version release

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions