
Divanovic/llama tg demo #13105

Merged
merged 3 commits into main from divanovic/llama-tg-demo on Sep 30, 2024

Conversation

djordje-tt
Contributor

Ticket

#13102
#11730

Problem description

Enable prefill+decode demo
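
The demo exercises the standard two-phase LLM inference flow: a prefill pass over the full prompt that populates the KV cache, followed by token-by-token decode. Below is a minimal sketch of that flow; the method names `prefill_forward` and `decode_forward` and the greedy sampling are illustrative assumptions, not the demo's actual API.

```python
# Hedged sketch of a prefill+decode loop. `model`, `prefill_forward`, and
# `decode_forward` are hypothetical stand-ins for the demo's real interfaces.
import torch


def generate(model, prompt_tokens: torch.Tensor, max_new_tokens: int) -> list[int]:
    # Prefill: run the whole prompt in one pass and build the KV cache.
    logits, kv_cache = model.prefill_forward(prompt_tokens)
    next_token = int(torch.argmax(logits[:, -1, :], dim=-1))
    generated = [next_token]

    # Decode: produce one token per step, reusing and extending the KV cache.
    for _ in range(max_new_tokens - 1):
        logits, kv_cache = model.decode_forward(
            torch.tensor([[next_token]]), kv_cache
        )
        next_token = int(torch.argmax(logits[:, -1, :], dim=-1))
        generated.append(next_token)
    return generated
```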

What's changed

  • Added support for the prefill+decode demo
  • Removed the dependency on the T3K model_config and created a TG-specific one
  • Moved all component configs out of the individual component files into model_config, making the components cleaner and more readable (see the sketch below)
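
As a rough illustration of the config refactor described above, the sketch below gathers per-component settings into one TG model_config object instead of leaving them scattered across component files. All class and field names here are hypothetical and do not mirror the actual tt-metal code.

```python
# Hypothetical sketch only: shows the idea of a single TG model_config that owns
# every component's settings, rather than each component file defining its own.
from dataclasses import dataclass


@dataclass(frozen=True)
class AttentionConfig:
    num_heads: int
    head_dim: int


@dataclass(frozen=True)
class MLPConfig:
    hidden_dim: int
    ffn_dim: int


@dataclass(frozen=True)
class TGModelConfig:
    """Central config handed to components; they no longer build their own."""
    attention: AttentionConfig
    mlp: MLPConfig
    max_seq_len: int


def get_tg_model_config() -> TGModelConfig:
    # Placeholder Llama-style dimensions, not taken from the PR.
    return TGModelConfig(
        attention=AttentionConfig(num_heads=32, head_dim=128),
        mlp=MLPConfig(hidden_dim=4096, ffn_dim=14336),
        max_seq_len=8192,
    )


if __name__ == "__main__":
    cfg = get_tg_model_config()
    # Each component receives only its own sub-config, keeping it readable.
    print(cfg.attention, cfg.mlp)
```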

Checklist

@djordje-tt djordje-tt added the prefill (LLM models have a prefill mode and its optimization is usually separated from decode mode), llama3, and llm_tg labels Sep 30, 2024
@djordje-tt djordje-tt self-assigned this Sep 30, 2024
@djordje-tt djordje-tt force-pushed the divanovic/llama-tg-demo branch from a47d70f to 767ebba Compare September 30, 2024 10:46
@djordje-tt djordje-tt force-pushed the divanovic/llama-tg-demo branch from 767ebba to 4b95982 Compare September 30, 2024 16:35
@djordje-tt djordje-tt force-pushed the divanovic/llama-tg-demo branch 4 times, most recently from 0503b41 to b38ee94 Compare September 30, 2024 17:13
@djordje-tt djordje-tt force-pushed the divanovic/llama-tg-demo branch 2 times, most recently from dbd354b to b38ee94 Compare September 30, 2024 19:35
@djordje-tt djordje-tt force-pushed the divanovic/llama-tg-demo branch from 52c07af to 52a7562 Compare September 30, 2024 19:42
@djordje-tt djordje-tt merged commit b23d07e into main Sep 30, 2024
6 checks passed
@djordje-tt djordje-tt deleted the divanovic/llama-tg-demo branch September 30, 2024 19:49
Labels
llama3, llm_tg, prefill (LLM models have a prefill mode and its optimization is usually separated from decode mode)