Skip to content

Release 0.4.8

Latest
Compare
Choose a tag to compare
@jhen0409 jhen0409 released this 09 Jan 08:38

0.4.8 (2025-01-09)

Bug Fixes

  • log: implement Android logging in rn-llama.hpp as opposed to printf (#106) (fb3896e)

Features

  • android: enable runtime repacking for Q4_0 quantization on aarch64 (#105) (758157b)
  • expose n_ubatch and dynamically adjust ntokens for bench (#104) (9c25ec4)
  • mock: add model metadata & mock data for tokenize / embedding (3296388)
  • sync llama.cpp (#108) (b539012)