WAW Corpus

The WAW Corpus is a bilingual Arabic-English translation and interpretation corpus. It comprises recordings from three international conferences: WISE 2013 (World Innovation Summit for Education), ARC’14 (Qatar Foundation’s Annual Research Conference), and WISH 2014 (World Innovation Summit for Health). The recordings contain both the original speaker and the interpreter, and are accompanied by their transcripts and translations.
Related publications

  • I. Temnikova, A. Abdelali, S. Hedaya, S. Vogel, and A. Al Daher, “Interpreting Strategies Annotation in the WAW Corpus,” in Proceedings of the First Workshop on Human-Informed Translation and Interpreting Technology (HiT-IT), Varna, Bulgaria, 2017, pp. 36–43.
    title={Interpreting Strategies Annotation in the WAW Corpus},
    author={Temnikova, Irina and Abdelali, Ahmed and Hedaya, Samy and Vogel, Stephan and Al Daher, Aishah},
    booktitle={Proceedings of the First Workshop on Human-Informed Translation and Interpreting Technology (HiT-IT)},
    address={Varna, Bulgaria},

WAW Corpus (Copyright 2015, QCRI, a member of Qatar Foundation. All Rights Reserved) is licensed under the Apache License, Version 2.0 (the “License”); you may not use it except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0.
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.