Release 0.2.0-rc.0
Pre-release
0.2.0-rc.0 (2023-08-29)
Bug Fixes
- cpp: incorrect prob convert if the str is already a known token (d296d09)
- ios: do manual release due to disabled ARC in pod (6623e96)
- ios: incorrect sent_token_probs_index (ac7c554)
- send rest of content on stop (8a4b863)
Features
- add detokenize method (#12) (13b0a15)
- example: create dir for model cache (7cde772)
- initial android support (#10) (36f608e)
Breaking changes
- sync llama.cpp
  - A new file format has been introduced: GGUF
  - Tokenizer fixes and API change
  - GGUFv2 adds support for 64-bit sizes + backwards compatible
  - Added support for Falcon models
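
Because the move to GGUF is a breaking change, it can help to check which format a model file uses before loading it. The snippet below is a minimal sketch and not part of this library: `read_gguf_header` is a hypothetical helper, and the layout it assumes (a 4-byte `GGUF` magic, a little-endian 32-bit version, then tensor and key/value counts that are 32-bit in v1 and 64-bit from v2 on) is taken from the upstream GGUF description and should be verified against it.

```python
import struct

GGUF_MAGIC = b"GGUF"  # assumed magic bytes at the start of a GGUF file


def read_gguf_header(data: bytes):
    """Hypothetical helper: parse the leading bytes of a model file.

    Returns (version, tensor_count, kv_count), or None if the data
    does not start with the GGUF magic or is too short.
    """
    if len(data) < 8 or data[:4] != GGUF_MAGIC:
        return None
    (version,) = struct.unpack_from("<I", data, 4)
    # Assumption: v1 stores the two counts as 32-bit values,
    # v2 and later store them as 64-bit values.
    fmt = "<II" if version == 1 else "<QQ"
    if len(data) < 8 + struct.calcsize(fmt):
        return None
    tensor_count, kv_count = struct.unpack_from(fmt, data, 8)
    return version, tensor_count, kv_count


def is_gguf_file(path: str) -> bool:
    """Return True if the file at `path` appears to be a GGUF file."""
    with open(path, "rb") as f:
        return read_gguf_header(f.read(24)) is not None
```

A file for which `is_gguf_file` returns False is probably in an older format and would need to be converted or re-downloaded as GGUF before use with this release.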