Unsupervised Question Answering by Cloze Translation

Notes

Introduction

Due to recent advances human performance is within reach QA dataset. But

The paper attempts to explore if quality extractive QA data can be generated using only the unlabelled data in an un-supervised setting. The approach used by the paper uses context, question and answer triplet to train a model.

Proposed method to generate QA dataset:

Sample a paragraph in target domain
Sample candidate answers within the context, usingpretrained NER
Extract fill-in-the-blank cloze question
Convert cloze question into natural questions using unsupervised close-to-normal question translator.

Conversion of cloze question into natural question is the most challenging step. So it is tackled by using seq2seq model to map between natural and cloze question using online back-translation and denoising auto-encoding.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cloze-translation.md

cloze-translation.md

Unsupervised Question Answering by Cloze Translation

Notes

Introduction

Proposed method to generate QA dataset:

Files

cloze-translation.md

Latest commit

History

cloze-translation.md

File metadata and controls

Unsupervised Question Answering by Cloze Translation

Notes

Introduction

Proposed method to generate QA dataset: