Data and Tools
1. For subtasks A and B
The training and the development data are the same as for SemEval-2013 task 2:
- training (=trial)
- development -- can be used for training as well
We also provide as test-time development the test sets from SemEval-2013 task 2 and from SemEval-2014 Task 9:
- SemEval-2013 Task 2 development-test: (i) Twitter-2013 and (ii) SMS-2013 messages (CANNOT be used for training!)
- SemEval-2014 Task 9 development-test: (i) Twitter-2013, (ii) SMS-2013 messages, (iii) Twitter-2014, (iv) Twitter-2014-sracasm, and (v) Live Journal-2014 (CANNOT be used for training!)
You need a download script to obtain the data from Twitter:
- 2013 download script
- 2014 download script + index checker (please, use this!)
We further provide scorers:
- development scorer 1 (same as for SemEval-2013 task 2)
- development scorer 2 (same as for SemEval-2014 Task 9)
We also provide format checkers:
- Uploaded December 15, 2014 (the results should be submitted not later than Dec. 22, 2014 AND also not later than 7 days after the test data download)
2. For subtasks C and D
- trial data
- training data
- Test Data -- Uploaded December 15, 2014 (the results should be submitted not later than Dec. 22, 2014 AND also not later than 7 days after the test data download)
We also provide format checkers:
3. For subtask E:
- Task Description -- Updated September 23, 2014.
- New version of Trial Data - some erroneous annotations have been removed and the scores are re-calculated (Can be used for training as well. No additional training data will be provided.) -- Updated October 1, 2014.
- Trial Data (old version) -- Updated May 29, 2014.
- Evaluation Script -- Uploaded September 3, 2014.
- Test Data -- Uploaded December 5, 2014 (the results should be submitted not later than Dec. 14, 2014 AND also not later than 7 days after the test data download)
- README for test data -- Uploaded December 8, 2014.
- Gold Labels for Test Data -- Uploaded December 30, 2014.