SemEval-2014 Task 10: Results

English STS results

Team Name	deft-forum	deft-news	headlines	images	OnWN	tweet-news	Weighted mean	Rank
Bielefeld_SC-run1	0.2109	0.4315	0.3208	0.3676	0.3667	0.4146	0.3538	35
Bielefeld_SC-run2	0.2108	0.4307	0.3112	0.3558	0.3607	0.4087	0.3470	36
BUAP-EN-run1	0.4557	0.6855	0.6888	0.6966	0.6539	0.7706	0.6715	19
DLS@CU-run1	0.4828	0.7657	0.7646	0.8214	0.7227	0.7639	0.7337	7
DLS@CU-run2	0.4828	0.7657	0.7646	0.8214	0.8589	0.7639	0.7610	1
FBK-TR-run1	0.3219	0.5231	0.5469	0.6009	0.6615	0.4625	0.5348	28
FBK-TR-run2	0.1670	0.4214	0.4854	0.5211	0.5725	0.3588	0.4413	31
FBK-TR-run3	0.3051	0.4046	0.4712	0.4891	0.5512	0.4378	0.4588	30
IBM_EG-run1	0.4742	0.7431	0.7371	0.8012	0.7603	0.7296	0.7220	8
IBM_EG-run2	0.4645	0.6412	0.7102	0.7471	0.7322	0.6960	0.6841	15
LIPN-run1	0.4544	0.6402	0.6527	0.8094	-	0.5507	0.5083	29
LIPN-run2	0.0843	-	-	-	-	-	0.0101	38
Meerkat_Mafia-Hulk	0.4495	0.7850	0.7571	0.7896	0.7872	0.7571	0.7349	6
Meerkat_Mafia-pairingWords	0.4711	0.7628	0.7597	0.8013	0.8745	0.7793	0.7605	2
Meerkat_Mafia-SuperSaiyan	0.4918	0.7712	0.7666	0.7676	0.8022	0.7651	0.7410	5
NTNU-run1	0.4369	0.7138	0.7219	0.8000	0.8348	0.4109	0.6631	21
NTNU-run2	0.5084	0.7656	0.7525	0.8129	0.7767	0.7921	0.7491	4
NTNU-run3	0.5305	0.7813	0.7837	0.8343	0.8502	0.6755	0.7549	3
RTM-DCU-run1*	0.4341	0.6974	0.6199	0.6995	0.8058	0.6882	0.6706	20
RTM-DCU-run2*	0.3965	0.6811	0.6125	0.6656	0.7992	0.6691	0.6513	23
RTM-DCU-run3*	0.3078	0.5562	0.6301	0.6475	0.8004	0.5531	0.6076	27
SemantiKLUE-run1	0.3373	0.6077	0.7283	0.7833	0.8482	0.6319	0.6874	14
SemantiKLUE-run2	0.3486	0.6429	0.7332	0.7728	0.8550	0.6403	0.6935	13
StanfordNLP-run1	0.3186	0.6347	0.6361	0.7583	0.6269	0.6685	0.6270	24
StanfordNLP-run2	0.3035	0.6791	0.6208	0.7149	0.6250	0.6362	0.6101	26
StanfordNLP-run3	0.3423	0.6503	0.6021	0.7540	0.6087	0.6380	0.6137	25
UMCC_DLSI_SemSim-run1	0.4752	0.6619	0.6318	0.7421	0.8127	0.6753	0.6823	16
UMCC_DLSI_SemSim-run2	0.4689	0.6622	0.6255	0.7390	0.8140	0.6536	0.6756	18
UMCC_DLSI_SemSim-run3	0.2826	0.3854	0.2669	0.4359	0.6028	0.2780	0.3815	33
UNAL-NLP-run1	0.5043	0.7205	0.7616	0.8071	0.7823	0.6145	0.7113	12
UNAL-NLP-run2	0.3826	0.7305	0.7645	0.7706	0.8268	0.4028	0.6573	22
UNAL-NLP-run3	0.4607	0.7216	0.7605	0.7782	0.8426	0.6583	0.7209	9
UNED-run22_p_np	0.1043	0.3148	0.0374	0.3243	0.5086	0.4898	0.3097	37
UNED-runS5K_10_np	0.1181	0.5059	0.0570	0.4981	0.4880	0.5794	0.3791	34
UNED-runS5K_3_np	0.0941	0.5644	0.0177	0.6070	0.5765	0.6700	0.4307	32
UoW-run1	0.3419	0.7512	0.7535	0.7763	0.7990	0.7368	0.7143	11
UoW-run2	0.3419	0.5875	0.7535	0.7877	0.7990	0.6281	0.6817	17
UoW-run3	0.3419	0.7634	0.7535	0.7877	0.7990	0.7529	0.7207	10

One team, RTM-DCU, submitted non-uniform confidence scores. The following table shows its results and rankings when the submitted confidence scores are used:

Team Name	deft-forum	deft-news	headlines	images	OnWN	tweet-news	Weighted mean	Rank
RTM-DCU-run1*	0.4181	0.6846	0.6216	0.6981	0.8331	0.6870	0.6729	19
RTM-DCU-run2*	0.3831	0.6739	0.6094	0.6629	0.8260	0.6691	0.6534	23
RTM-DCU-run3*	0.2731	0.5526	0.6330	0.6441	0.8246	0.5683	0.6110	26

Notes:

- : Not submitted.

* : Post-deadline submission.
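
The "Weighted mean" column can be reproduced from the per-dataset scores if each dataset is weighted by its number of sentence pairs. The sketch below assumes pair counts of 450, 300, 750, 750, 750 and 750. These counts are not stated on this page, and the names `PAIR_COUNTS` and `weighted_mean` are illustrative. It also assumes that a dataset marked "-" contributes a score of 0, which is consistent with the LIPN rows.

```python
# Assumed number of sentence pairs per English test set (not stated on this page).
PAIR_COUNTS = {
    "deft-forum": 450,
    "deft-news": 300,
    "headlines": 750,
    "images": 750,
    "OnWN": 750,
    "tweet-news": 750,
}


def weighted_mean(scores):
    """Average per-dataset scores, weighting each by its assumed pair count.

    A dataset that is missing or None (shown as "-" in the table)
    is assumed to contribute a score of 0.
    """
    total_pairs = sum(PAIR_COUNTS.values())
    weighted_sum = sum(
        (scores.get(name) or 0.0) * count for name, count in PAIR_COUNTS.items()
    )
    return weighted_sum / total_pairs


# Rows copied from the table above.
dls_cu_run2 = {
    "deft-forum": 0.4828,
    "deft-news": 0.7657,
    "headlines": 0.7646,
    "images": 0.8214,
    "OnWN": 0.8589,
    "tweet-news": 0.7639,
}
lipn_run2 = {"deft-forum": 0.0843}  # all other datasets not submitted

print(round(weighted_mean(dls_cu_run2), 4))
print(round(weighted_mean(lipn_run2), 4))
```

Under these assumptions both rows agree with the reported weighted means (0.7610 and 0.0101) to four decimal places.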

Spanish STS results

Team Name	System type	Wikipedia	News	Weighted correlation	Rank
Bielefeld_SC-run1	unsupervised*	0.2632	0.55445	0.43708	22
Bielefeld_SC-run2	unsupervised*	0.26458	0.55455	0.4377	21
BUAP-run1	supervised	0.5504	0.6785	0.62688	17
BUAP-run2	unsupervised	0.63964	0.76369	0.7137	14
RTM-DCU-run1	supervised	0.42164	0.70003	0.58784	18
RTM-DCU-run2	supervised	0.36886	0.62527	0.52194	20
RTM-DCU-run3	supervised	0.42424	0.64113	0.55373	19
LIPN-run1	supervised	0.65194	0.82554	0.75558	11
LIPN-run2	supervised	0.71647	0.8316	0.7852	6
LIPN-run3	supervised	0.71618	0.80857	0.77134	10
Meerkat_Mafia-run1	unsupervised	0.6682	0.78517	0.73803	13
Meerkat_Mafia-run2	unsupervised	0.74305	0.84542	0.80417	2
Meerkat_Mafia-run3	supervised	0.73815	0.82247	0.78849	5
TeamZ-run1	supervised	0.6102	0.71654	0.67369	15
TeamZ-run2	supervised	0.60425	0.70974	0.66723	16
UMCC_DLSI-run1	supervised	0.74084	0.82539	0.79132	4
UMCC_DLSI-run2	supervised	0.78021	0.82539	0.80718	1
UNAL-NLP-run1	weakly supervised	0.78036	0.81541	0.80129	3
UNAL-NLP-run2	supervised	0.75659	0.78293	0.77232	9
UNAL-NLP-run3	supervised	0.68936	0.79648	0.75331	12
UoW-run1	supervised	0.74826	0.80008	0.7792	7
UoW-run2	supervised	0.74826	0.80008	0.7792	8

Notes:
1. The main evaluation column is "Weighted correlation". The "Rank" column gives the rank of each submission when ordered by that value.
2. * denotes a system that used Wikipedia to build its model for the Wikipedia test dataset.
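
As a rough illustration of how the ranking follows from the two dataset scores, the sketch below recomputes the weighted correlation and sorts runs by it. It assumes 324 Wikipedia pairs and 480 News pairs. These counts are not given on this page, and `weighted_correlation` and `rank_runs` are hypothetical helper names.

```python
# Assumed number of sentence pairs per Spanish test set (not stated on this page).
WIKIPEDIA_PAIRS = 324
NEWS_PAIRS = 480


def weighted_correlation(wikipedia, news):
    """Combine the two dataset scores, weighting each by its assumed pair count."""
    total_pairs = WIKIPEDIA_PAIRS + NEWS_PAIRS
    return (wikipedia * WIKIPEDIA_PAIRS + news * NEWS_PAIRS) / total_pairs


def rank_runs(runs):
    """Return (rank, name, score) tuples ordered by descending weighted correlation."""
    scored = [
        (name, weighted_correlation(wiki, news)) for name, (wiki, news) in runs.items()
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return [(rank, name, score) for rank, (name, score) in enumerate(scored, start=1)]


# Three rows copied from the table above, as (Wikipedia, News) scores.
top_runs = {
    "Meerkat_Mafia-run2": (0.74305, 0.84542),
    "UMCC_DLSI-run2": (0.78021, 0.82539),
    "UNAL-NLP-run1": (0.78036, 0.81541),
}

for rank, name, score in rank_runs(top_runs):
    print(rank, name, round(score, 5))
```

With these assumed counts the three runs come out in the same order as in the table. Runs with identical scores, such as UoW-run1 and UoW-run2, would simply keep their input order in this sketch.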
