SI650 Final Project (Lechen Zhang)

How to run the code:

First, install the following dependencies:

PyTerrier (pip install --upgrade git+https://github.com/terrier-org/pyterrier.git#egg=python-terrier)
OpenNIR (pip install --upgrade git+https://github.com/Georgetown-IR-Lab/OpenNIR)
Sentence Transformers (pip install -U sentence-transformers)
Fastrank (pip install fastrank)
Natural Language Toolkit (pip install nltk; optional, because the results were cached in dataset_sentiment.json)
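
Before opening the notebook, it can help to confirm that the packages listed above are visible to your Python environment. The following is a minimal sketch, not part of the project code. The import names (`pyterrier`, `onir`, `sentence_transformers`, `fastrank`, `nltk`) and the helper `missing_packages` are assumptions made for illustration, so adjust them if a package exposes a different module name.

```python
from importlib.util import find_spec

# Display name -> import name. The import names are assumptions;
# change them if your installed package uses a different module name.
REQUIRED = {
    "PyTerrier": "pyterrier",
    "OpenNIR": "onir",
    "Sentence Transformers": "sentence_transformers",
    "Fastrank": "fastrank",
}
# NLTK is optional because the results are cached in dataset_sentiment.json.
OPTIONAL = {"Natural Language Toolkit": "nltk"}


def missing_packages(modules):
    """Return the display names whose top-level module cannot be located."""
    return [name for name, module in modules.items() if find_spec(module) is None]


if __name__ == "__main__":
    for name in missing_packages(REQUIRED):
        print(f"Missing required package: {name}")
    for name in missing_packages(OPTIONAL):
        print(f"Missing optional package: {name}")
```

The check only looks up each module without importing it, so it runs quickly and does not start any heavy initialization.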

To run the whole project, open Project_code.ipynb and run all cells. This redraws all plots and recalculates all evaluation metrics. The last cell of the notebook is interactive: you can enter your own queries there and get retrieval results.
To run only the interactive part, run demo.py. Note that we did not find a good way to store the pipelines, so they are trained again on each run, which may take about 30 seconds on GPU or 5-10 minutes on CPU. Once training is over, you can interact with it: enter your queries and get retrieval results.
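
The interactive part follows a simple read-query, print-results pattern. The sketch below illustrates that pattern in isolation and is not the code in demo.py. The function `query_loop`, its parameters, and the `retrieve` callable are hypothetical names, and `retrieve` stands in for whatever trained pipeline produces a ranked list of results.

```python
def query_loop(retrieve, read=input, write=print, top_k=10):
    """Read queries until a blank line or end of input, printing the top hits.

    `retrieve` is a hypothetical callable that maps a query string to a
    ranked list of results; it is a placeholder for a trained pipeline.
    """
    while True:
        try:
            query = read("Query (blank to quit): ").strip()
        except EOFError:
            break  # input stream closed, e.g. Ctrl-D
        if not query:
            break  # blank line ends the session
        for rank, doc in enumerate(retrieve(query)[:top_k], start=1):
            write(f"{rank}. {doc}")


def toy_retrieve(query):
    """Stand-in retriever used only to illustrate the loop."""
    return [f"{query} result {i}" for i in range(1, 4)]
```

Calling `query_loop(toy_retrieve)` starts the prompt. Passing `read` and `write` as parameters is a design choice that lets the loop be exercised without a terminal.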
