This repository has been archived by the owner on Oct 11, 2024. It is now read-only.

Add tokenizer #394

Merged
merged 9 commits into isolate-oai-server-process from add-tokenizer on Jul 31, 2024

Conversation

robertgshaw2-redhat
Collaborator

SUMMARY:

  • add endpoints to request ModelConfig, SchedulerConfig, LoRAConfig, ParallelConfig
  • factor out tokenizer group creation function to be a utility function
  • create tokenizer_group on client side
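
For context, a rough sketch of the client-side flow these changes enable. The names below (engine_client, the getter methods, init_tokenizer_from_configs) are illustrative assumptions, not necessarily the exact identifiers in this diff:

async def build_local_tokenizer_group(engine_client):
    # `engine_client` stands in for the client talking to the engine process;
    # the getters mirror the configs listed in the summary above.
    model_config = await engine_client.get_model_config()
    scheduler_config = await engine_client.get_scheduler_config()
    parallel_config = await engine_client.get_parallel_config()
    lora_config = await engine_client.get_lora_config()

    # Build the tokenizer group on the client side with the factored-out
    # utility (function name assumed for illustration).
    return init_tokenizer_from_configs(
        model_config=model_config,
        scheduler_config=scheduler_config,
        parallel_config=parallel_config,
        lora_config=lora_config,
    )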

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small and essential subset of CI tests that quickly catch errors. You can run additional CI tests on top of the default ones by unblocking the steps in your fast-check build in the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

  • Comment /ready on the PR
  • Add ready label to the PR
  • Enable auto-merge.

🚀

@@ -924,6 +925,14 @@ async def get_model_config(self) -> ModelConfig:
        else:
            return self.engine.get_model_config()

    async def get_parallel_config(self) -> ParallelConfig:
        """Get the parallel configuration of the vLLM engine."""
        if self.engine_use_ray:

Collaborator

these ifs are outta control, the ray engine should totally be a separate VLLMBackend 😉

...a change for another day, or week

Collaborator Author

Good idea
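
To illustrate the suggestion above, a minimal sketch (class and method names are assumptions, not the real vLLM backends): instead of branching on self.engine_use_ray inside every getter, the Ray path could live in its own backend that exposes the same interface.

class LocalBackend:
    def __init__(self, engine):
        self.engine = engine

    async def get_model_config(self):
        # Local engine: plain method call.
        return self.engine.get_model_config()


class RayBackend:
    def __init__(self, engine_actor):
        self.engine_actor = engine_actor

    async def get_model_config(self):
        # Ray actor handle: .remote() returns an ObjectRef, which is awaitable.
        return await self.engine_actor.get_model_config.remote()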

@@ -924,6 +925,14 @@ async def get_model_config(self) -> ModelConfig:
        else:
            return self.engine.get_model_config()

    async def get_parallel_config(self) -> ParallelConfig:

Collaborator

can these new methods go into the VLLMBackend protocol as well?

Collaborator Author

I don't think they should be in the protocol, because implementations of the Protocol don't have to provide these, and most of the time they won't.

Collaborator

ah yeah I see these are only on the AsyncLLMEngine, 🌶️
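
For context on the trade-off discussed above, a minimal sketch (an illustrative subset, not the real VLLMBackend definition): a method declared on a typing.Protocol becomes part of the structural contract every backend is checked against, which is why getters that only AsyncLLMEngine provides stay off the protocol.

from typing import Protocol

from vllm.config import ModelConfig


class VLLMBackend(Protocol):  # illustrative subset, not the real definition
    async def get_model_config(self) -> ModelConfig:
        ...

    # get_parallel_config / get_scheduler_config / get_lora_config are left
    # out on purpose: declaring them here would make them part of the
    # contract for every backend, but only AsyncLLMEngine implements them.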

@@ -119,6 +119,7 @@ async def test_single_completion(client: openai.AsyncOpenAI, model_name: str,
    choice = completion.choices[0]
    assert len(choice.text) >= 5
    assert choice.finish_reason == "length"
    print(completion.usage)

Collaborator

print!

request_id=generate_request.request_id,
lora_request=generate_request.lora_request,
trace_headers=generate_request.trace_headers,
prompt_adapter_request=generate_request.prompt_adapter_request)

Collaborator

🤦 yeah this would do it lol

robertgshaw2-redhat merged commit f5f0b45 into isolate-oai-server-process on Jul 31, 2024
1 check passed
robertgshaw2-redhat deleted the add-tokenizer branch on July 31, 2024 at 22:02