Pull requests: llm-d/llm-d-inference-sim
feat: generate response length based on a histogram when max_tokens is defined in the request
#169 opened Aug 25, 2025 by mayabar
Change time-to-first-token parameter to be based on number of request tokens #137
#165 opened Aug 24, 2025 by pancak3
10 of 11 tasks
feat: add max-num-batched-tokens configuration and implement request handling constraints (#83)
#97 opened Jul 15, 2025 by mohitpalsingh
6 tasks done