Data and Tools < SemEval-2015 Task 2

Data and Tools

English subtask:
- (Aug. 15) Trial data with details of test datasets, as well as the training data (all data released in STS 2012, 2013 and 2014). It also includes the evaluation script.
- (Jan. 22) Raw annotations and the Perl scripts that generate the final gold standard files.
- (Jan. 22) Test data with gold standard annotations. It also includes the evaluation script and the task baseline.
Spanish subtask:
- Please use 2014 data for development and training.
- (Jan. 22) Test data with gold standard annotations.
Intepretable STS subtask:
- (Oct. 16) Annotation guidelines made available.
- NEW (Nov. 10) Final train data (including NEW evaluation script) made available.
  Evaluation script has been updated as follows:
  1. Bug affecting alignments which had multiple types fixed.
  2. Special case for the evaluation including types and score:
  - no type penalty between tags {SPE1, SPE2, REL, SIMI} when both scores are (0-2]
  - no type penalty between EQUI and SIMI/SPE with score 4.
- (Jan. 22) Test data with gold standard annotations. It also includes the evaluation script, gold labels (.wa files) and the task baseline.