Skip to content

v1.2.2

Latest
Compare
Choose a tag to compare
@XprobeBot XprobeBot released this 08 Feb 09:28
· 1 commit to main since this release
ac97a13

What's new in 1.2.2 (2025-02-08)

These are the changes in inference v1.2.2.

New features

Bug fixes

  • BUG: fix llama-cpp when some quantizations have multiple parts by @qinxuye in #2786
  • BUG: Use Cache class instead of raw tuple for transformers continuous batching, compatible with latest transformers by @ChengjieLi28 in #2820

Documentation

New Contributors

Full Changelog: v1.2.1...v1.2.2