[ExecuTorch][Llama] Decouple input sequence length from kv cache context length #7556

test-llama-runner-mac (fp32, xnnpack+kv+custom)  /  macos-job

succeeded Jan 29, 2025 in 12m 36s