Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

datasets #1

Open
MAyang38 opened this issue Dec 17, 2024 · 2 comments
Open

datasets #1

MAyang38 opened this issue Dec 17, 2024 · 2 comments

Comments

@MAyang38
Copy link

Hello, I would like to know where to download the data of CRSB-Embeddings-MPNET.json in your tests/adaptiveRAG_CAG_speed_test.ipynb file

@heydaari
Copy link
Owner

heydaari commented Dec 18, 2024

Hi, Thanks for your contribution.
Due to the limitation of Github itself, we couldn't upload a large file like CRSB-Embeddings-MPNET.json, which has size of about 400 MB.
To reproduce the results, you can download the original CRSB dataset from HuggingFace at this link and embed the dataset using Sentence Transformers and this model at here

@MAyang38
Copy link
Author

Hi, Thanks for your contribution. Due to the limitation of Github itself, we couldn't upload a large file like CRSB-Embeddings-MPNET.json, which has size of about 400 MB. To reproduce the results, you can download the original CRSB dataset from HuggingFace at this link and embed the dataset using Sentence Transformers and this model at here

Thank you for replying so quickly. You have been very helpful. I will try

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants