[Excutorch][Llama] Decouple input sequence length from kv cache context length #7556

test-models-macos (cmake, mv3, xnnpack-quantization-delegation, macos-m1-stable, 90)  /  macos-job

succeeded Jan 29, 2025 in 10m 31s