Skip to content

Conversation

davidkyle
Copy link
Member

Request routing logic was added to enable per node rate limiting with the Elastic Inference Service but that code remained behind a feature flag and was never enabled. EIS now implements rate limiting make the code redundant.

@davidkyle davidkyle marked this pull request as ready for review September 12, 2025 12:12
@elasticsearchmachine elasticsearchmachine added the Team:ML Meta label for the ML team label Sep 12, 2025
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/ml-core (Team:ML)

* @param divisor - divisor to divide the initial requests per time unit by
*/
public synchronized void updateTokensPerTimeUnit(Integer divisor) {
private synchronized void updateTokensPerTimeUnit(Integer divisor) {
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jonathan-buttner is this code and the related updateRateLimitDivisor() functions still required?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No I'm pretty sure we only added for the node pinning so we should be able to remove it

Copy link
Contributor

@jonathan-buttner jonathan-buttner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:ml Machine learning >refactoring Team:ML Meta label for the ML team v9.2.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants