
llama stack distributions / templates / docker refactor #266

Merged 32 commits from ollama_docker into main on Oct 21, 2024

Conversation

@yanxi0830 (Contributor) commented on Oct 18, 2024

Test

--list-templates


ollama docker

```
llama-stack/llama_stack/distribution/docker/ollama$ ls
compose.yaml  ollama-run.yaml

llama-stack/llama_stack/distribution/docker/ollama$ docker compose up
```

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Oct 18, 2024
@yanxi0830 marked this pull request as ready for review on October 18, 2024 02:38
@yanxi0830 changed the title from "docker compose ollama scripts" to "llama stack distributions / templates / docker refactor" on Oct 19, 2024
@terrytangyuan (Contributor) commented on Oct 19, 2024

Is it possible to get #178 merged before further refactoring? Otherwise we'll end up fixing a lot of conflicts again. cc @ashwinb

@ashwinb (Contributor) commented on Oct 21, 2024

> Is it possible to get #178 merged before further refactoring? Otherwise we'll end up fixing a lot of conflicts again. cc @ashwinb

@terrytangyuan Done! I just merged #178.

ashwinb and others added 8 commits October 21, 2024 10:46
…ce (#270)

PR #201 made several changes while trying to fix issues with the stream=False branches of the inference and agents APIs. As part of this, it made a change that was slightly gratuitous: making chat_completion() and its brethren "def" instead of "async def".

The rationale was that this allowed callers within llama-stack to use it as:

```
async for chunk in api.chat_completion(params)
```

However, it caused unnecessary confusion for several folks. Given that clients (e.g., llama-stack-apps) use the SDK methods (which are completely isolated) anyway, this choice was not ideal. Let's revert so the call now looks like:

```
async for chunk in await api.chat_completion(params)
```

Bonus: Added a completion() implementation for the meta-reference provider. Technically should have been another PR :)
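
For context, here is a minimal runnable sketch of why the await appears at the call site. The Api class and token strings below are hypothetical stand-ins, not llama-stack's actual implementation:

```
# Sketch only: an "async def" method that returns an async generator.
# Calling it produces a coroutine, so the caller must await it before
# iterating -- hence "async for chunk in await api.chat_completion(params)".
import asyncio
from typing import AsyncIterator


class Api:  # hypothetical stand-in for the inference API
    async def chat_completion(self, params: dict) -> AsyncIterator[str]:
        async def gen() -> AsyncIterator[str]:
            for token in ("hello", " ", "world"):
                yield token

        return gen()


async def main() -> None:
    api = Api()
    # The await resolves the coroutine to an async generator;
    # async for then consumes the streamed chunks.
    async for chunk in await api.chat_completion({"prompt": "hi"}):
        print(chunk, end="")


asyncio.run(main())
```

Under the earlier plain-"def" variant, chat_completion(params) returned the async generator directly, which is why no await was needed at the call site.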
This PR adds a vLLM inference provider for OpenAI-compatible vLLM servers.
Trying out readthedocs
@yanxi0830 merged commit 23210e8 into main on Oct 21, 2024
4 checks passed
@yanxi0830 deleted the ollama_docker branch on October 29, 2024 17:05