- Pyterrier (pip install --upgrade git+https://github.com/terrier-org/pyterrier.git#egg=python-terrier)
- OpenNIR (pip install --upgrade git+https://github.com/Georgetown-IR-Lab/OpenNIR)
- Sentence Transformers (pip install -U sentence-transformers)
- Fastrank (pip install fastrank)
- Natural Language Toolkit (Optional because the results were cached in dataset_sentiment.json)
- Basic files: dataset_full.csv, qrels.csv, query.csv
- Cache: dataset_sentiment.json, trained_bert55.tar.gz
- If you want to run the whole project, please go to Project_code.ipynb and run all blocks. It will redraw all plots and recalculate all evaluation metrics. The last block of this notebook is interactive, which means you can input your queries there and get retrieval results.
- If you only want to run the interactive part, please go to demo.py and run it. But please remember that we didn't find a good way to store pipelines, so it may take 30 seconds on GPU or 5-10 minutes on CPU for the training process. After the training is over, you can interactive with it, input your queries there and get retrieval results.