feat: nvidia triton embedding integration #19226
Conversation
...embeddings/llama-index-embeddings-nvidia-triton/llama_index/embeddings/nvidia_triton/base.py
This PR is stale because it has been open 50 days with no activity. Remove stale label or comment or this will be closed in 10 days.
I believe the asynchronous embedding retrieval is now well implemented and the integration is ready to be merged.
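For context, asynchronous retrieval in LlamaIndex embedding integrations is usually exercised through the `aget_text_embedding` family of methods inherited from `BaseEmbedding`. The following is a minimal sketch of how that might look for this integration; the exported class name `NvidiaTritonEmbedding` and the constructor arguments `server_url` and `model_name` are assumptions inferred from the package path referenced above, not confirmed by this PR:

```python
import asyncio

# Assumed import path, based on the module referenced in this PR's review
# (llama_index/embeddings/nvidia_triton/base.py); the exported class name
# "NvidiaTritonEmbedding" is a guess.
from llama_index.embeddings.nvidia_triton import NvidiaTritonEmbedding


async def main() -> None:
    embed_model = NvidiaTritonEmbedding(
        server_url="localhost:8001",      # assumed address of the Triton server
        model_name="my_embedding_model",  # assumed name of the deployed model
    )
    # aget_text_embedding is part of LlamaIndex's standard async embedding API.
    vector = await embed_model.aget_text_embedding("Hello, Triton!")
    print(len(vector))


asyncio.run(main())
```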
This PR is stale because it has been open 50 days with no activity. Remove stale label or comment or this will be closed in 10 days.
This PR was closed because it has been stalled for 10 days with no activity. |
Description
This integration allows LlamaIndex to use embedding models hosted on a Triton Inference Server. It uses `tritonclient` to communicate with the server.
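In rough terms, the integration would be used like any other LlamaIndex embedding class. Here is a minimal synchronous sketch; the class name `NvidiaTritonEmbedding` and its constructor arguments are assumptions inferred from the package path, not confirmed by this PR:

```python
# Minimal usage sketch; class name, export, and constructor arguments
# are assumptions, not the confirmed API of this package.
from llama_index.embeddings.nvidia_triton import NvidiaTritonEmbedding

embed_model = NvidiaTritonEmbedding(
    server_url="localhost:8001",      # assumed address of the Triton server
    model_name="my_embedding_model",  # assumed name of the model deployed on Triton
)

# get_text_embedding is the standard synchronous entry point on
# LlamaIndex embedding classes.
vector = embed_model.get_text_embedding("Hello, Triton!")
print(len(vector))
```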
New Package?
Did I fill in the `tool.llamahub` section in the `pyproject.toml` and provide a detailed README.md for my new integration or package?
Version Bump?
Did I bump the version in the `pyproject.toml` file of the package I am updating? (Except for the `llama-index-core` package)
Type of Change
Please delete options that are not relevant.
How Has This Been Tested?
Your pull request will likely not be merged unless it is covered by some form of impactful unit testing.
Suggested Checklist:
I ran `uv run make format; uv run make lint` to appease the lint gods