Data and Tools
English TRAIN+DEV data v3.2
- Data for all English subtasks v3.2 is here
- It includes a TRAIN/DEV split with reliable double-checked DEV
- Subtask A (6,398 questions + 40,288 comments) + unannotated (189,941 questions + 1,894,456 comments)
- Subtask B (317 original questions + 3,169 related questions)
- Subtask C (317 original questions + 3,169 related questions + 31,690 comments)
Arabic TRAIN+DEV data v1.3
- The Arabic TRAIN+DEV data v1.3 can be found here
- It includes a TRAIN/DEV split with reliable double-checked DEV (1,281 original questions and 37,795 potentially related question-answer pairs) + unannotated (163,383 question-answer pairs)
Scorer v2.2 and random baselines (English and Arabic)
- Can be found here
*** TEST INPUT DATA ***