Natural-Language-Inference

In this repository we are trying to solve GLUE benchmark (mainly MRPC dataset) by fine-tuning RoBERTa model. We also used STS benchmark for pretraining the model and getting better results on the target dataset.

MRPC dataset:

The Microsoft Research Paraphrase Corpus (Dolan & Brockett, 2005) is a corpus of sentence pairs automatically extracted from online news sources, with human annotations for whether the sentences in the pair are semantically equivalent.

Split	Count
train	3,668
validation	408
test	1,725

Example from the training split:

{'idx': 58,
 'label': 1,
 'sentence1': 'Several of the questions asked by the audience in the fast-paced forum were new to the candidates .',
 'sentence2': 'Several of the audience questions were new to the candidates as well .'
 }

Trained models

All models are based on RoBERTa model. Few layers were added, then they were fine-tuned on one of the above-mentioned datasets. For more details about the models architecture check the project presentation.

Models training and evaluation

In order facilitate generation of our results, we provide all necessary details:

Requirements

All required libraries are included in requirements.txt file. A virtual environment could be created using this file (e.g. using Pycharm) to guarantee that there are no conflicts.

Training on STS Benchmark

You can train the model with the default parameters by running:

python "training RoBERTa-based model on STS benchmark.py"

For more details about parameters run:

python "training RoBERTa-based model on STS benchmark.py" -h

Training on GLEU/MRPC

You can train the model with the default parameters by running:

python "training RoBERTa-based model on MRPC.py"

For more details about parameters run:

python "training RoBERTa-based model on MRPC.py" -h

Fine tuning the model on MRPC after training it on STS benchmark

In order to increase the model accuracy on MRPC dataset, we trained it first on STS benchmark then contined training on MRPC. You can train this model with our default parameters by running:

python "fine-tuning on MRPC after training on STS.py"

For more details about parameters run:

python "fine-tuning on MRPC after training on STS.py" -h

NOTE: before running this part, you must have already trained the first model and saved its weights

Testing model on MRPC

NOTE: Please consider first downloading our trained models from this external link or train your own models. In the latter case you might need to change the name of weights file.

To test the model trained on MRPC data only run:

python "test final model.py" --f_model MRPC_model.pkl --t_model MRPC

To test the model pretrained on STS then fine-tuned on MRPC run:

python "test final model.py" --f_model MRPC_after_STS_model.pkl --t_model MIXED

Results:

\	Training on MRPC only	Training on STS then fine-tuning on MRPC
Train Accuracy	92.15%	99.45%
Validation Accuracy	85.54%	88.48%
Test Accuracy	85.68%	88.23%

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
models		models
searchEngine		searchEngine
stsbenchmark		stsbenchmark
.gitignore		.gitignore
Presention.pptx		Presention.pptx
README.md		README.md
STSDataHelper.py		STSDataHelper.py
fine-tuning on MRPC after training on STS.py		fine-tuning on MRPC after training on STS.py
glueDataHelper.py		glueDataHelper.py
models.py		models.py
requirements.txt		requirements.txt
test final model.py		test final model.py
trainModels.py		trainModels.py
training RoBERTa-based model on MRPC.py		training RoBERTa-based model on MRPC.py
training RoBERTa-based model on STS benchmark.py		training RoBERTa-based model on STS benchmark.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Natural-Language-Inference

MRPC dataset:

Trained models

Models training and evaluation

Requirements

Training on STS Benchmark

Training on GLEU/MRPC

Fine tuning the model on MRPC after training it on STS benchmark

Testing model on MRPC

Results:

About

Releases

Packages

Languages

cclx/Natural-Language-Inference-NLI-

Folders and files

Latest commit

History

Repository files navigation

Natural-Language-Inference

MRPC dataset:

Trained models

Models training and evaluation

Requirements

Training on STS Benchmark

Training on GLEU/MRPC

Fine tuning the model on MRPC after training it on STS benchmark

Testing model on MRPC

Results:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages