Releases: marqo-ai/marqo
Releases · marqo-ai/marqo
Release 0.0.10
0.0.10
New features
- Generic model support (#179). Create an index with your favourite SBERT-type models from HuggingFace! Read about usage here
- Visual search update 2. (#214). Search-time image reranking and open-vocabulary localization, based on users' queries, is now available with the Owl-ViT model. Locate the part of the image corresponding to your query! Read about usage here
- Visual search update 1. (#214). Better image patching. In addition to faster-rcnn, you can now use yolox or attention based (DINO) region proposal as a patching method at indexing time. This allows localization as the sub patches of the image can be searched. Read about usage here.
Check out this article about how this update makes image search awesome.
Bug fixes
- Fixed imports and outdated Python client usage in Wikipedia demo (#216)
Contributor shout-outs
- Thank you to @georgewritescode for debugging and updating the Wikipedia demo
- Thank you to our 1.8k stargazers and 60+ forkers!
Release images can be found on Docker hub
Release 0.0.9
0.0.9
Optimisations
- Set k to limit to for Marqo-os search queries (#219)
- Reduced the amount of metadata returned from Marqo-os, on searches (#218)
Non-breaking data model changes
- Set default kNN m value to 16 (#222)
Bug fixes
- Better error messages when downloading an image fails (#198)
- Bug where filtering wouldn't work on fields with spaces (#213), resolving #115
Release images can be found on Docker hub
Release 0.0.8
0.0.8
New features
- Get indexes endpoint:
GET /indexes
(#181). Use this endpoint to inspect existing Marqo indexes. Read about usage here. - Non-tensor fields(#161). During the indexing phase, mark fields as non-tensor to prevent tensors being created for them. This helps speed up indexing and reduce storage for fields where keyword search is good enough. For example: email, name and categorical fields. These fields can still be used for filtering. Read about usage here.
- Configurable preloaded models(#155). Specify which machine learning models to load as Marqo starts. This prevents a delay during initial search and index commands after Marqo starts. Read about usage here.
- New example and article: use Marqo to provide context for up-to-date GPT3 news summary generation (#171, #174). Special thanks to @iain-mackie for this example.
Bug fixes and minor changes
- Updated developer guide (#164)
- Updated requirements which prevented Marqo being built as an arm64 image (#173)
- Backend updated to use marqo-os:0.0.3 (#183)
- Default request timeout has been increased from 2 to 75 seconds (#184)
Contributor shout-outs
- For work on the GPT3 news summary generation example: @iain-mackie
- For contributing the non-tensor fields feature: @jeadie
- Thank you to our users who raise issues and give us valuable feeback
- Thank you to our 1.4k+ star gazers and 50+ forkers!
Release images can be found on Docker hub
Release 0.0.7
0.0.7
Bug fixes and minor changes
- 429 (too many request errors) are propagated from Marqo-os to the user properly (#150)
0.0.6
Also included in this release
New features
- Health check endpoint:
GET /health
. An endpoint that can be used to inspect the status of Marqo and Marqo's backend (Marqo-os)
(#128). Read about usage here. - Marqo can be launched with environment variables that define limits around maximum number of fields per index, maximum document size and the maximum number of documents that can be retrieved
(#135). Read about usage here. - README translations:
- Chinese 🇨🇳 (by @wanliAlex, #133)
- Polish 🇵🇱 (by @MichalLuck, #136)
- Ukrainian 🇺🇦 (by @dmyzlata, #138)
- French 🇫🇷 (by @rym-oualha, #147)
Breaking API changes
- The home
/
json response has been updated. If you have logic that reads the endpoint root, please update it (#128). - The Python client's
add_documents()
andupdate_documents()
batch_size
parameter has been replaced byserver_batch_size
andclient_batch_size
parameters
(py-marqo#27), (py-marqo#28)
Non-breaking data model changes
- Each text field just creates a top level Marqo-os text field, without any keywords
(#135) - Very large fields get their tensor_facet keywords ignored, rather than Marqo-OS preventing the doc being indexed
(#135) - Tensor facets can no longer have _id as a filtering field
(#135)
Bug fixes and minor changes
- FastAPI runs with better concurrency (#128)
- Get documents by IDs and lexical search and no longer returns vectors if expose_facets isn't specified
- Fixed batching bug in Python client
(py-marqo#28)
Caveats
- If a large request to add_documents or update_documents results in a document adding fields such that the index field limit is exceeded, the entire operation will fail (without resilience). Mitigate this sending
add_documents
andupdate_documents
requests with smaller batches of documents. - For optimal indexing of large volumes of images, we recommend that the images are hosted on the same region and cloud provider as Marqo.
Contributor shout-outs
- For their translation work: @rym-oualha, @dmyzlata, @wanliAlex, @dmyzlata, @MichalLuck
- For raising issues and helping with READMEs: @kdewald, @llermaly, @namit343
- Thank you to our 900+ star gazers and 30+ forkers
The image can be found on Docker