Data and Tools

Trial data

Download here.

See readme.txt (part of the archive) for a detailed description of the trial data.


Train data

Download here. (updated 1st October 2014, 10 AM UK time)

The updated version also contains a Python script for scoring your data; see the script for further info.

See readme.txt (part of the archive) for a detailed description of the train data. Note that the archive contains two different datasets in two directories: Wingspread and Microcheck. Each dataset is described in the readme.txt file in its own directory.

If you have any questions regarding the data, do not hesitate to contact us via the Google mailing list.


Test data

The test data can be downloaded from here.


Your submission template

Download here.



A new version of the scorer is available here (10/12/2015).

Changes mainly apply to Task 3; see diff here.



Download here.

Contact Info

  • Vít Baisa (Masaryk University, Brno, CZ),
  • Jane Bradbury (University of Wolverhampton, UK),
  • Ismaïl El Maarouf (University of Wolverhampton, UK),
  • Patrick Hanks (University of Wolverhampton, UK),
  • Adam Kilgarriff (Lexical Computing Ltd, UK),
  • Octavian Popescu (FBK, Trento, IT)


  • September 29th: Train data has been updated!
  • August 19th: Train data has been released!
  • June 3rd: Trial data has been released!
  • June 5th: A Google group for discussion has been created; you can send us an email using the address: