Data and Tools
Submissions should be submitted on CodaLab.
This download contains the data, split into training and development data:
Test data
- scoring scripts: scorer.zip
- development data key: dev_key.zip
- test tweets: rumoureval2017-test.tar.bz2
- gold standard test data: subtaskA.json subtaskB.json
Two news stories are included, which form the input for both subtasks. You can use the scorer over the development to validate your system. Final submissions are to be made on the CodaLab site.
The dataset used for training and development testing is from this PLoS article:
- Zubiaga A, Liakata M, Procter R, Wong Sak Hoi G, Tolmie P (2016) Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads. PLoS ONE 11(3): e0150989. doi:10.1371/journal.pone.0150989
The original data can be found here:
For the open variant of task B, the following wikipedia dump may be used: