Constructing a Turkish-English parallel treebank
Künye
Yıldız, O. T., Solak, E., Görgün, O. & Ehsani, R. (2014). Constructing a Turkish-English parallel treebank. Paper present at the 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference, 2, 112-117.Özet
In this paper, we report our preliminary efforts in building an English-Turkish parallel treebank corpus for statistical machine translation. In the corpus, we manually generated parallel trees for about 5,000 sentences from Penn Treebank. English sentences in our set have a maximum of 15 tokens, including punctuation. We constrained the translated trees to the reordering of the children and the replacement of the leaf nodes with appropriate glosses. We also report the tools that we built and used in our tree translation task.