v0.2.1.post1
Highlights
This patch fix allows for AutoEmbeddings to properly default to SentenceTransformerEmbeddings
which was being by-passed in the previous release.
Furthermore, because of reconstructable splitting, numerous smaller sentences were making it through to the SemanticChunker. To subvert the issue, this fix introduces a min_chunk_size
which takes in the minimum tokens that need to be in a chunk. This solves the issues in the tests.
What's Changed
- [Fix] AutoEmbeddings not loading
all-minilm-l6-v2
but loadsAll-MiniLM-L6-V2
by @bhavnicksm in #57 - [Fix] Add fix for #55 by @bhavnicksm in #58
- [Refactor] Add min_chunk_size parameter to SemanticChunker and SentenceChunker by @bhavnicksm in #60
- [Update] Bump version to 0.2.1.post1 and require Python 3.9 or higher by @bhavnicksm in #62
Full Changelog: v0.2.1...v0.2.1.post1