English-Turkish parallel treebank with morphological annotations and its use in tree-based SMT
dc.authorid | 0000-0001-7754-2033 | |
dc.authorid | 0000-0001-5838-4615 | |
dc.authorid | 0000-0002-8448-9987 | |
dc.contributor.author | Görgün, Onur | en_US |
dc.contributor.author | Yıldız, Olcay Taner | en_US |
dc.contributor.author | Solak, Ercan | en_US |
dc.contributor.author | Ehsani, Razieh | en_US |
dc.date.accessioned | 2016-08-11T17:35:14Z | |
dc.date.available | 2016-08-11T17:35:14Z | |
dc.date.issued | 2016 | |
dc.department | Işık Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | en_US |
dc.department | Işık University, Faculty of Engineering, Department of Computer Engineering | en_US |
dc.description.abstract | In this paper, we report our tree-based statistical translation study from English to Turkish. We describe our data generation process and report the initial results of tree-based translation under a simple model. For corpus construction, we used the Penn Treebank on the English side. We manually translated about 5K trees from English to Turkish under grammar constraints, with adaptations to accommodate the agglutinative nature of Turkish morphology. We used a permutation model for subtrees together with a word-to-word mapping. We report BLEU scores under simple choices of inference algorithms. | en_US |
dc.description.version | Publisher's Version | en_US |
dc.identifier.citation | Görgün, O., Yıldız, O. T., Solak, E. & Ehsani, R. (2016). English-Turkish parallel treebank with morphological annotations and its use in tree-based SMT. Paper presented at the Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2016), 510-516. doi:10.5220/0005653905100516 | en_US |
dc.identifier.doi | 10.5220/0005653905100516 | |
dc.identifier.endpage | 516 | |
dc.identifier.isbn | 9789897581731 | |
dc.identifier.scopus | 2-s2.0-84969940458 | |
dc.identifier.scopusquality | N/A | |
dc.identifier.startpage | 510 | |
dc.identifier.uri | https://hdl.handle.net/11729/1112 | |
dc.identifier.uri | http://dx.doi.org/10.5220/0005653905100516 | |
dc.indekslendigikaynak | Scopus | en_US |
dc.institutionauthor | Yıldız, Olcay Taner | en_US |
dc.institutionauthor | Solak, Ercan | en_US |
dc.institutionauthor | Ehsani, Razieh | en_US |
dc.institutionauthorid | 0000-0001-5838-4615 | |
dc.institutionauthorid | 0000-0002-8448-9987 | |
dc.language.iso | en | en_US |
dc.peerreviewed | Yes | en_US |
dc.publicationstatus | Published | en_US |
dc.publisher | SciTePress | en_US |
dc.relation.ispartof | Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2016) | en_US |
dc.relation.publicationcategory | Conference Item - International - Institutional Faculty Member | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Machine translation | en_US |
dc.subject | Tree-based translation | en_US |
dc.subject | Inference engines | en_US |
dc.subject | Pattern recognition | en_US |
dc.subject | Corpus construction | en_US |
dc.subject | Data generation | en_US |
dc.subject | Inference algorithm | en_US |
dc.subject | Machine translations | en_US |
dc.subject | Simple modeling | en_US |
dc.subject | Statistical translation | en_US |
dc.subject | Tree-based | en_US |
dc.subject | Tree-based SMT | en_US |
dc.subject | Trees (mathematics) | en_US |
dc.title | English-Turkish parallel treebank with morphological annotations and its use in tree-based SMT | en_US |
dc.type | Conference Object | en_US |
dspace.entity.type | Publication |