VESSL AI LLMProvider integration #17414

Open · wants to merge 15 commits into main

Conversation

@nsd9696 commented Jan 3, 2025

Description

  • Integrates the VESSL AI LLMProvider for use in LlamaIndex.
  • The VESSL AI provider is served through vLLM and exposes an OpenAI-compatible API.
  • Users can serve their own Hugging Face model by providing 1) a model name or 2) a VESSL YAML file, or 3) connect to a pre-served VESSL LLM service endpoint.
  • Example
from llama_index.llms.vesslai import VesslAILLM

llm = VesslAILLM()

# 1. Serve with an HF model name
llm.serve(
    service_name="llama-index-vesslai",
    model_name="mistralai/Mistral-7B-Instruct-v0.3",
    hf_token="HF_TOKEN",
    api_key="openai-api-key",
)

# 2. Serve with a YAML file
llm.serve(
    service_name="llama-index-vesslai",
    yaml_path="/users/own/vessl/service.yaml",
    api_key="openai-api-key",
)

# 3. Connect to a pre-served endpoint
llm.connect(
    served_model_name="mistralai/Mistral-7B-Instruct-v0.3",
    endpoint="https://model-service-gateway-abc.oregon.google-cluster.vessl.ai/v1",
)

resp = llm.complete("Who is Paul Graham?")

Fixes # (issue)

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

  • Yes
  • No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

  • Yes
  • No

Type of Change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Your pull request will likely not be merged unless it is covered by some form of impactful unit testing.

  • I added new unit tests to cover this change
  • I believe this change is already covered by existing unit tests

Suggested Checklist:

  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have added Google Colab support for the newly added notebooks.
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I ran make format; make lint to appease the lint gods


@dosubot (bot) added the size:XL label (This PR changes 500-999 lines, ignoring generated files) on Jan 3, 2025
Collaborator

no need to commit this file

Collaborator

should write an actual readme (see other llms, should show the install and basic usage)

Author

I checked the sources of other llms, and most of them seem to be the same as "python_sources()". What specific details need to be included?

Collaborator

Sorry, I left the comment on the wrong file, this was intended for the README.md file 😅


        self.organization_name = organization_name

    def serve(
Collaborator

Curious about the decision to do serve and connect outside of the __init__() function? Do your users often switch this after the llm object is created? In most llama-index LLMs, you would just do llm = VesslAILLM(...) and from there you can use it directly.

Collaborator

It's fine either way tbh, was just curious

Author

Thank you for the feedback. Using VESSL requires authentication through configure, so I wanted to handle that during initialization and explicitly separate serving and connecting the llm_provider afterward. We have discussed this flow internally at VESSL, and it seems fine.
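
For illustration, the flow described above might look roughly like this (a simplified standalone sketch, not the PR's actual code; it assumes the vessl SDK exposes a configure() entry point for authentication):

import vessl  # assumption: the VESSL Python SDK


class VesslAILLM:
    def __init__(self, organization_name: str | None = None, **kwargs) -> None:
        # Authenticate once at construction time; serving/connecting is
        # deferred to explicit serve()/connect() calls.
        vessl.configure()  # assumed auth entry point in the vessl SDK
        self.organization_name = organization_name

    def serve(self, service_name: str, **kwargs) -> None:
        """Provision a vLLM-backed service, then target its endpoint."""
        ...

    def connect(self, served_model_name: str, endpoint: str) -> None:
        """Attach to an already-running OpenAI-compatible endpoint."""
        ...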

llm = VesslAILLM()

#1 Serve hf model name
llm.serve(
Collaborator

Thoughts on making serve and connect async? Seems like this could be a blocking operation with wait_for_gateway_enabled?
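
For illustration, one low-effort way to get an async variant without reworking the internals would be to push the blocking call onto a worker thread (a sketch; aserve is a hypothetical name, not part of this PR):

import asyncio


async def aserve(llm, **serve_kwargs):
    # Run the blocking serve() (including wait_for_gateway_enabled)
    # in a worker thread so the event loop stays responsive.
    await asyncio.to_thread(llm.serve, **serve_kwargs)

# Usage:
#   await aserve(llm, service_name="llama-index-vesslai", yaml_path="service.yaml")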

Collaborator

Let's put example notebooks in docs/docs/examples/llms/

print(f"The service {service_name} is currently rolling out.")
if _request_abort_rollout(service_name):
print("Waiting for the existing rollout to be aborted...")
time.sleep(30)
Collaborator

ouch. Another vote to have async imo
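
If serve did become a coroutine, the fixed 30-second sleep above could turn into a non-blocking poll, roughly like this (a sketch; _is_rollout_aborted is a hypothetical status check, not part of this PR):

import asyncio


async def _wait_for_abort(service_name: str, interval: float = 5.0, timeout: float = 300.0) -> None:
    # Poll the rollout status instead of sleeping a fixed 30 seconds,
    # yielding to the event loop between checks.
    deadline = asyncio.get_running_loop().time() + timeout
    while not _is_rollout_aborted(service_name):  # hypothetical helper
        if asyncio.get_running_loop().time() > deadline:
            raise TimeoutError(f"Abort of rollout for {service_name} timed out")
        await asyncio.sleep(interval)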

Collaborator

There's quite a lot of code; is any of it testable? (You'd have to mock out the API calls, though.)
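
For example, the constructor and connect path could be exercised with the network mocked out, along these lines (a sketch; the patch target and attribute names are assumptions about the package layout, not its actual API):

from unittest import mock

from llama_index.llms.vesslai import VesslAILLM


def test_connect_sets_endpoint():
    # Patch the SDK auth call so the constructor needs no real credentials.
    with mock.patch("llama_index.llms.vesslai.base.vessl.configure"):  # assumed target
        llm = VesslAILLM()
        llm.connect(
            served_model_name="mistralai/Mistral-7B-Instruct-v0.3",
            endpoint="https://example.vessl.ai/v1",
        )
        # Attribute name below is illustrative.
        assert llm.served_model_name == "mistralai/Mistral-7B-Instruct-v0.3"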
