This reference architecture implements a RAG chat engine on top of Amazon FSx for ONTAP (FSxN) data whose access permissions are predefined through ACLs with Active Directory.
The deployment uses Terraform to create all the required AWS and application resources in your AWS account. You can read about it here.
The embedding process takes the documents stored on the FSxN filesystem and embeds them, together with their Access Control Lists, into the OpenSearch vector DB. You can read about the process in more detail here.
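To make the idea of storing a document together with its ACL more concrete, here is a minimal sketch of what one indexed record could look like. The class name `IndexedChunk` and the field names `text`, `embedding`, and `allowed_sids` are assumptions chosen for illustration; they are not taken from this project's actual index schema.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class IndexedChunk:
    """One document chunk plus the SIDs allowed to read its source file.

    Hypothetical structure: the real index mapping may differ.
    """

    text: str
    embedding: list[float]
    allowed_sids: list[str] = field(default_factory=list)

    def to_document(self) -> dict:
        # Deduplicate and sort the SIDs so the stored record is deterministic.
        return {
            "text": self.text,
            "embedding": self.embedding,
            "allowed_sids": sorted(set(self.allowed_sids)),
        }
```

Keeping the SIDs on the same record as the vector means a permission check can be applied at query time without going back to the filesystem.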
The retrieval engine runs as a Lambda function that receives a prompt and model parameters, then uses RAG to generate answers from the embeddings of the FSxN data. It also filters data access based on the SID (ACL) provided. You can read about the process in more detail here.
The chat bot is a simple chat UI that simplifies access to the retrieval engine. You can read more about the chat bot here.
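The SID-based filtering step can be sketched as follows. This is an illustrative in-memory version only: the function name `filter_hits_by_sid` and the `allowed_sids` field are hypothetical, and the actual Lambda may apply the filter inside the OpenSearch query instead.

```python
from __future__ import annotations


def filter_hits_by_sid(hits: list[dict], user_sids: list[str]) -> list[dict]:
    """Keep only retrieved chunks that at least one of the caller's SIDs may read.

    Each hit is assumed (hypothetically) to carry an "allowed_sids" list.
    Hits without that field are dropped, so missing ACL data never grants access.
    """
    allowed = set(user_sids)
    return [
        hit
        for hit in hits
        if allowed.intersection(hit.get("allowed_sids", []))
    ]
```

Only the chunks that pass this check would be handed to the model as context, so a user never receives answers built from documents they cannot open on the filesystem.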
The following must be installed on the host:
- AWS CLI
- Terraform
- Docker Engine
Use the following commands to create the environment:
terraform init
terraform apply --auto-approve
Use the following command to tear down the environment:
terraform destroy --auto-approve