Skip to content

Commit b278258

Browse files
authored
Update 02_inf-simulator.md (#40)
Signed-off-by: Abhishek koserwal <[email protected]>
1 parent 886631c commit b278258

File tree

1 file changed

+8
-3
lines changed

1 file changed

+8
-3
lines changed

docs/architecture/Components/02_inf-simulator.md

Lines changed: 8 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -90,16 +90,21 @@ For more details see the [vLLM documentation](https://docs.vllm.ai/en/stable/get
9090

9191
## Working with docker image
9292

93+
### Clone the llm-d-inference-sim repository
94+
```bash
95+
git clone https://github.com/llm-d/llm-d-inference-sim.git
96+
```
97+
9398
### Building
9499
To build a Docker image of the vLLM Simulator, run:
95100
```bash
96-
make build-llm-d-inference-sim-image
101+
make image-build
97102
```
98103

99104
### Running
100105
To run the vLLM Simulator image under Docker, run:
101106
```bash
102-
docker run --rm --publish 8000:8000 ai-aware-router/llm-d-inference-sim:0.0.1 /ai-aware-router/llm-d-inference-sim --port 8000 --model "Qwen/Qwen2.5-1.5B-Instruct" --lora "tweet-summary-0,tweet-summary-1"
107+
docker run --rm --publish 8000:8000 ghcr.io/llm-d/llm-d-inference-sim:dev --port 8000 --model "Qwen/Qwen2.5-1.5B-Instruct" --lora "tweet-summary-0,tweet-summary-1"
103108
```
104109
**Note:** The above command exposes the simulator on port 8000, and serves the Qwen/Qwen2.5-1.5B-Instruct model.
105110

@@ -108,7 +113,7 @@ docker run --rm --publish 8000:8000 ai-aware-router/llm-d-inference-sim:0.0.1 /a
108113
### Building
109114
To build the vLLM simulator, run:
110115
```bash
111-
make build-llm-d-inference-sim
116+
make build
112117
```
113118

114119
### Running

0 commit comments

Comments
 (0)