Tutorial for ml inference with cohere rerank model #3398

brianf-aws · 2025-01-16T01:45:45Z

Description

Currently there are tutorials that exist using the rerank pipeline. However its possible to do the same operation with the ML_inference processor with the by_field rerank.

Check List

New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

mingshl · 2025-01-16T06:05:39Z

Hi @brianf-aws, I am thinking we should have a new folder call ml_inference that can host all use cases combing using ml_inference processors.

dhrubo-os · 2025-01-16T06:19:25Z

Hi @brianf-aws, I am thinking we should have a new folder call ml_inference that can host all use cases combing using ml_inference processors.

And in that folder we can have sub folder for managed service vs opensource so that the blueprint doesn't seem too long.

@brianf-aws Could you also take update from main. Seems like the integ test doesn't run. I know for blueprint integ test doesn't matter.

I just wanted to check if codecov is activate properly or not.

mingshl · 2025-01-17T00:09:10Z

Hi @brianf-aws, I am thinking we should have a new folder call ml_inference that can host all use cases combing using ml_inference processors.

maybe the file path can be docs/tutorials/ml_Inference/rerank/ml_Inference_with_Cohere_Rerank_model.md

brianf-aws · 2025-01-17T00:14:19Z

Sorry @mingshl and @dhrubo-os I don't think that would benefit the community that much currently in the directory folder we have 4 documents that mention the ml_inference processor.

### I'm in /ml-commons/docs
 docs % echo pwd  | grep -i -r -l --include="*.md" -e "ml inference processor" -e "ML processor" -e "ml inference ingest processor" -e "ml_inference" .
./tutorials/rerank/ml_Inference_with_Cohere_Rerank_model.md
./tutorials/semantic_search/asymmetric_embedding_model.md
./remote_inference_blueprints/amazon_comprehend_connector_blueprint.md
./remote_inference_blueprints/amazon_textract_connector_blueprint.md

We will make it harder for users to find answers. Consider a community user that wants to rerank documents using our project. they probably will be surprised to find out that they need to look at a directory called ml_inference/rerank.

I think perhaps the best case would be to create a file pointing to these tutorials that show how significant the ML inference processor is.

mingshl · 2025-01-17T00:16:43Z

./remote_inference_blueprints/amazon_comprehend_connector_blueprint.md
./remote_inference_blueprints/amazon_textract_connector_blueprint.md

In the blueprint that use ml_inference processor that was just to demo one of the usage of the model. That's totally fine.

but these two can belong to ml_inference folder

./tutorials/rerank/ml_Inference_with_Cohere_Rerank_model.md
./tutorials/semantic_search/asymmetric_embedding_model.md

what do you think? @ylwu-amzn @dhrubo-os

dhrubo-os · 2025-01-17T00:40:55Z

./remote_inference_blueprints/amazon_comprehend_connector_blueprint.md
./remote_inference_blueprints/amazon_textract_connector_blueprint.md

In the blueprint that use ml_inference processor that was just to demo one of the usage of the model. That's totally fine.

but these two can belong to ml_inference folder

./tutorials/rerank/ml_Inference_with_Cohere_Rerank_model.md ./tutorials/semantic_search/asymmetric_embedding_model.md

what do you think? @ylwu-amzn @dhrubo-os

I see this as a similar issue when we write a long class and then later we start having confusion where to put this class as this serves multiple responsibilities.

Take ./tutorials/semantic_search/asymmetric_embedding_model.md as an example.

Currently, asymmetric_embedding_model.md tackles both generating asymmetric embeddings and performing semantic search with the ML-Inference processor. To improve clarity and maintainability, I suggest splitting it into two focused tutorials: one for generating embeddings and another for applying them in semantic search. This would make each tutorial easier to follow, reusable in other contexts, and simpler to maintain.

Signed-off-by: Brian Flores <[email protected]>

brianf-aws · 2025-01-27T19:47:02Z

Hey, @mingshl I refactored to introduce a ml_inference folder in this commit 3986b94. Apologies for the late change

mingshl

nice! keep this going!!!

* added tutorial for ml inference with cohere rerank model Signed-off-by: Brian Flores <[email protected]> * Refactor placement of tutorials using ML Inference processor Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]> (cherry picked from commit 53bc3ec)

* added tutorial for ml inference with cohere rerank model Signed-off-by: Brian Flores <[email protected]> * Refactor placement of tutorials using ML Inference processor Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]> (cherry picked from commit 53bc3ec) Co-authored-by: Brian Flores <[email protected]>

brianf-aws requested review from b4sjoo, dhrubo-os, mingshl, jngz-es, model-collapse, rbhavna, ylwu-amzn, zane-neo, Zhangxunmt, austintlee, HenryL27 and xinyual as code owners January 16, 2025 01:45

brianf-aws force-pushed the ml-inference-rerank-cohere branch from 41ecf8d to 6da5b54 Compare January 16, 2025 23:54

brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval January 16, 2025 23:56 — with GitHub Actions Failure

brianf-aws added 2 commits January 27, 2025 11:38

added tutorial for ml inference with cohere rerank model

0c9be14

Signed-off-by: Brian Flores <[email protected]>

Refactor placement of tutorials using ML Inference processor

3986b94

Signed-off-by: Brian Flores <[email protected]>

brianf-aws force-pushed the ml-inference-rerank-cohere branch from 6da5b54 to 3986b94 Compare January 27, 2025 19:41

brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval January 27, 2025 19:43 — with GitHub Actions Failure

mingshl approved these changes Jan 27, 2025

View reviewed changes

brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval January 27, 2025 21:36 — with GitHub Actions Failure

jngz-es approved these changes Jan 28, 2025

View reviewed changes

jngz-es merged commit 53bc3ec into opensearch-project:main Jan 28, 2025
6 of 8 checks passed

jngz-es added the backport 2.x label Jan 28, 2025

opensearch-trigger-bot bot mentioned this pull request Jan 28, 2025

[Backport 2.x] Tutorial for ml inference with cohere rerank model #3448

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tutorial for ml inference with cohere rerank model #3398

Tutorial for ml inference with cohere rerank model #3398

brianf-aws commented Jan 16, 2025

mingshl commented Jan 16, 2025

dhrubo-os commented Jan 16, 2025

mingshl commented Jan 17, 2025

brianf-aws commented Jan 17, 2025

mingshl commented Jan 17, 2025

dhrubo-os commented Jan 17, 2025

brianf-aws commented Jan 27, 2025

mingshl left a comment

Tutorial for ml inference with cohere rerank model #3398

Tutorial for ml inference with cohere rerank model #3398

Conversation

brianf-aws commented Jan 16, 2025

Description

Check List

mingshl commented Jan 16, 2025

dhrubo-os commented Jan 16, 2025

mingshl commented Jan 17, 2025

brianf-aws commented Jan 17, 2025

mingshl commented Jan 17, 2025

dhrubo-os commented Jan 17, 2025

brianf-aws commented Jan 27, 2025

mingshl left a comment

Choose a reason for hiding this comment