[v0.3.3] Release Tracker

**ETA**: Feb 29th - Mar 1st

## Major changes

* StarCoder2 support
* Performance optimization and LoRA support for Gemma
* Performance optimization for MoE kernel
* 2/3/8-bit GPTQ support
* [Experimental] AWS Inferentia2 support

## PRs to be merged before the release

- [x] #2330 #2223
- [ ] ~~#2761~~
- [x] #2819 
- [x] #3087 #3099
- [x] #3089