A multilayer annotated corpus for Turkish
Citation
Yıldız, O. T., Ak, K., Ercan, G., Topsakal, O. & Asmazoğlu, C. (2018). A multilayer annotated corpus for Turkish. Paper presented at the 2018 2nd International Conference on Natural Language and Speech Processing (ICNLSP), 21-26. doi:10.1109/ICNLSP.2018.8374369
Abstract
In this paper, we present the first multilayer annotated corpus for Turkish, which is a low-resourced agglutinative language. Our dataset consists of 9,600 sentences translated from the Penn Treebank Corpus. Annotated layers contain syntactic and semantic information including morphological disambiguation of words, named entity annotation, shallow parse, sense annotation, and semantic role label annotation.
Related Items
Related items shown by title, author, curator, and subject.
- An all-words sense annotated Turkish corpus
Akçakaya, Sinan; Yıldız, Olcay Taner (IEEE, 2018-06-06)
This paper reports our efforts in constructing a sense-labeled Turkish corpus with respect to the Turkish Language Institution's dictionary, using the traditional method of manual tagging. We tagged a pre-built parallel ...
- Parallel proposition bank construction for Turkish
Ak, Koray (Işık Üniversitesi, 2019-04-02)
PropBank is the bank of propositions, which contains a hand-annotated corpus of predicate-argument information and semantic roles or arguments. It aims to provide an extensive dataset for enhancing NLP applications such as ...
- Construction of a Turkish proposition bank
Ak, Koray; Toprak, Cansu; Esgel, Volkan; Yıldız, Olcay Taner (Tubitak Scientific & Technical Research Council Turkey, 2018)
This paper describes our approach to developing the Turkish PropBank by adopting the semantic role-labeling guidelines of the original PropBank and using the translation of the English Penn-TreeBank as a resource. We discuss ...