add dynamic clients for all APIs #348
Conversation
```diff
@@ -239,7 +247,9 @@ async def chat_completion(
     response_format: Optional[ResponseFormat] = None,
     stream: Optional[bool] = False,
     logprobs: Optional[LogProbConfig] = None,
-) -> Union[ChatCompletionResponse, ChatCompletionResponseStreamChunk]: ...
+) -> Union[
```
FINALLY! We actually type-hint it the way it is supposed to be.
This might become an issue with our OpenAPI generator, but we will fix that downstream. Our source must always be correct.
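For reference, here is a sketch of what the fully typed protocol method could look like (my reconstruction: the import paths and the continuation of the truncated `Union[` in the hunk above are assumptions, not the PR's actual code):

```python
from typing import AsyncIterator, Optional, Protocol, Union

# Assumption: these types are importable from llama_stack.apis.inference;
# the exact module paths may differ.
from llama_stack.apis.inference import (
    ChatCompletionResponse,
    ChatCompletionResponseStreamChunk,
    LogProbConfig,
    ResponseFormat,
)


class Inference(Protocol):
    async def chat_completion(
        self,
        # earlier parameters of the full signature elided, as in the hunk above
        response_format: Optional[ResponseFormat] = None,
        stream: Optional[bool] = False,
        logprobs: Optional[LogProbConfig] = None,
    ) -> Union[
        ChatCompletionResponse,
        # Assumption: the truncated `Union[` continues with the streaming
        # iterator type, matching the "type-hint it properly" comment.
        AsyncIterator[ChatCompletionResponseStreamChunk],
    ]: ...
```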
```python
return APIClient
```

```python
async def example(model: str = None):
```
I will be deleting this code later since someone is going to stumble on this and use it inadvertently again
(force-pushed from 18ae0d9 to 386372d)
docs/resources/llama-stack-spec.html (Outdated)
"content": { | ||
"text/event-stream": { | ||
"schema": { | ||
"$ref": "#/components/schemas/AgentTurnResponseStreamChunk" | ||
"$ref": "#/components/schemas/Turn" |
this looks bad though, uh oh, need to check
What does this PR do?
We have frequently bit-rotten (`apis/<api>/client.py`) files. They have two drawbacks:
This PR is the first step towards killing these hand-written implementations. It dynamically creates Client classes for each API protocol and registers appropriate methods based on type introspection.
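A rough sketch of the idea (my illustration, with assumed names such as `create_api_client`; not the PR's actual code): walk a protocol's methods with `inspect` and attach a generated async HTTP stub for each one to a dynamically created client class.

```python
import inspect

import httpx


def create_api_client(protocol: type, base_url: str) -> type:
    """Build a client class for an API protocol by introspecting its methods."""

    def make_method(name: str, sig: inspect.Signature):
        async def method(self, *args, **kwargs):
            # Bind the caller's arguments against the protocol signature so
            # the request payload uses the declared parameter names.
            bound = sig.bind(self, *args, **kwargs)
            bound.apply_defaults()
            payload = {k: v for k, v in bound.arguments.items() if k != "self"}
            async with httpx.AsyncClient(base_url=self.base_url) as http:
                resp = await http.post(f"/{name}", json=payload)
                resp.raise_for_status()
                # A real implementation would deserialize into the method's
                # annotated return type and handle `text/event-stream` replies.
                return resp.json()

        method.__name__ = name
        return method

    def __init__(self):
        self.base_url = base_url

    namespace = {"__init__": __init__}
    for name, member in inspect.getmembers(protocol, predicate=inspect.isfunction):
        if not name.startswith("_"):
            namespace[name] = make_method(name, inspect.signature(member))

    return type(f"{protocol.__name__}Client", (), namespace)
```

The real implementation also has to honor the return annotations, including the streaming `AsyncIterator` case, which is where the stricter type hints above start paying off.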
Test Plan
First, I ran an ollama server (`ollama run llama3.2:3b-instruct-fp16`) and then started a Llama Stack server using `--template ollama` on port 5003.

Then I set up the following yaml for testing:
Then I ran the following set of tests:
Then I modified the config.yaml to be:
And ran the following tests:
This test did not fully pass, due to an unexpected model response from the ollama 3b-instruct llama model w.r.t. tool calling, but the tests which didn't exercise tool calling passed.
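As a closing illustration (hypothetical code building on the `create_api_client` sketch above; the actual test yaml and commands are not shown in this excerpt), a dynamically generated client could be pointed at the stack from the test plan like so:

```python
import asyncio

from llama_stack.apis.inference import Inference  # assumed import path


async def main() -> None:
    # Port 5003 matches the Llama Stack server from the test plan.
    InferenceClient = create_api_client(Inference, "http://localhost:5003")
    client = InferenceClient()
    response = await client.chat_completion(
        model="llama3.2:3b-instruct-fp16",
        messages=[{"role": "user", "content": "What is 2 + 2?"}],
        stream=False,
    )
    print(response)


asyncio.run(main())
```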