The Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset consisting of questions posed by crowdworkers on a set of Wikipedia articles. The answer to every question is a segment of text, or span, from the corresponding reading passage. There are 100,000+ question-answer pairs on 500+ articles.
Download: train-v1.1.json, dev-v1.1.json
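Because every answer is a span of its passage, each answer in the v1.1 files carries both its text and a character offset (`answer_start`) into the passage. The sketch below walks that nested layout and checks the span property. It is only an illustration: `SAMPLE` is a tiny hand-written stand-in in the v1.1 layout, not real dataset content, and `iter_examples` and `is_span` are hypothetical helper names, not part of this repository.

```python
# Hand-written stand-in for the SQuAD v1.1 layout (NOT real dataset content).
CONTEXT = "SQuAD was released by Stanford in 2016."
SAMPLE = {
    "version": "1.1",
    "data": [{
        "title": "Example",
        "paragraphs": [{
            "context": CONTEXT,
            "qas": [{
                "id": "example-0",
                "question": "Who released SQuAD?",
                "answers": [{
                    "text": "Stanford",
                    # Character offset of the answer inside the context.
                    "answer_start": CONTEXT.index("Stanford"),
                }],
            }],
        }],
    }],
}


def iter_examples(squad):
    """Yield (question, context, answer_text, answer_start) tuples."""
    for article in squad["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                for answer in qa["answers"]:
                    yield (qa["question"], context,
                           answer["text"], answer["answer_start"])


def is_span(context, text, start):
    """True if `text` appears in `context` exactly at offset `start`."""
    return context[start:start + len(text)] == text


examples = list(iter_examples(SAMPLE))
```

To run the same check on the real files, load one with `json.load(open("train-v1.1.json"))` and pass the result to `iter_examples` in place of `SAMPLE`.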
Keras: 2.0.4
Python: 2.7 & 3.x
TensorFlow: 1.0.1
Inspired by: Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi
University of Washington, Allen Institute for Artificial Intelligence
Chau's BiDAF Model (Single) -- 10 epochs -- EM: 50.141 - F1: 80.981
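For readers unfamiliar with the two scores, here is a minimal sketch of how Exact Match (EM) and token-level F1 are commonly computed for SQuAD v1.1. It follows the normalization used by the official evaluation script as I understand it (lowercase, strip punctuation, drop English articles, collapse whitespace); the function names are illustrative, and this is not necessarily the code that produced the numbers above.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, strip punctuation, drop articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, truth):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(truth))


def f1_score(prediction, truth):
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    truth_tokens = normalize(truth).split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(truth_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = float(overlap) / len(pred_tokens)
    recall = float(overlap) / len(truth_tokens)
    return 2 * precision * recall / (precision + recall)
```

A dataset-level score is then the per-question value (taking the best match when a question has several reference answers) averaged over all questions and scaled to 0-100.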