Skip to content

Latest commit

 

History

History

Hybrid-Search

Hybrid Search

This sample shows how to combine Fulltext search in Azure SQL database with BM25 ranking and cosine similarity ranking to do hybrid search.

In this sample the local model multi-qa-MiniLM-L6-cos-v1 to generate embeddings. The Python script ./python/hybrid_search.py shows how to

Make sure to setup the database for this sample using the ./python/00-setup-database.sql script. Database can be either an Azure SQL DB or a SQL Server database. Once the database has been created, you can run the ./python/hybrid_search.py script to do the hybrid search:

First, set up the virtual environment and install the required packages:

python -m venv .venv

Activate the virtual environment and then install the required packages:

pip install -r requirements.txt

Create an environment file .env with the connection string to Azure SQL database. You can use the .env.sample as a starting point. The sample .env file shows how to use Entra ID to connect to the database, which looks like:

MSSQL='Driver={ODBC Driver 18 for SQL Server};Server=tcp:<server>,1433;Database=<database>;Encrypt=yes;TrustServerCertificate=no;Connection Timeout=30;LongAsMax=yes;'

If you want to use SQL Authentication the connection string would instead look like the following:

MSSQL='Driver={ODBC Driver 18 for SQL Server};Server=tcp:<server>,1433;Database=<database>;UID=<user>;PWD=<password>;Encrypt=yes;TrustServerCertificate=yes;Connection Timeout=30;LongAsMax=yes;'

Then run the script:

python hybrid_search.py