dc.contributor.author: Yıldız, Olcay Taner [en_US]
dc.contributor.author: Avar, Begüm [en_US]
dc.contributor.author: Ercan, Gökhan [en_US]
dc.date.accessioned: 2020-04-14T06:59:53Z
dc.date.available: 2020-04-14T06:59:53Z
dc.date.issued: 2019-09
dc.identifier.citation: Yıldız, O. T., Avar, B. & Ercan, G. (2019). An open, extendible, and fast Turkish morphological analyzer. Paper presented at the International Conference Recent Advances in Natural Language Processing, RANLP, 1364-1372. doi:10.26615/978-954-452-056-4_156 [en_US]
dc.identifier.isbn: 9789544520557
dc.identifier.issn: 1313-8502
dc.identifier.uri: https://hdl.handle.net/11729/2300
dc.identifier.uri: http://dx.doi.org/10.26615/978-954-452-056-4_156
dc.description.abstract: In this paper, we present a two-level morphological analyzer for Turkish which consists of five main components: a finite state transducer, a rule engine for suffixation, a lexicon, a trie data structure, and an LRU cache. We use Java to implement the finite state machine logic and the rule engine, and XML to describe the finite state transducer rules of the Turkish language, which makes the morphological analyzer both easily extendible and easily applicable to other languages. Empowered with a comprehensive lexicon of 54,000 bare-forms, including 19,000 proper nouns, our morphological analyzer is amongst the most reliable analyzers produced so far. The analyzer is compared with Turkish morphological analyzers in the literature. By using an LRU cache and a trie data structure, the system can analyze 100,000 words per second, which enables users to analyze huge corpora in a few hours. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: Incoma Ltd [en_US]
dc.relation.isversionof: 10.26615/978-954-452-056-4_156
dc.rights: info:eu-repo/semantics/openAccess [en_US]
dc.subject: Computational linguistics [en_US]
dc.subject: Data structures [en_US]
dc.subject: Deep learning [en_US]
dc.subject: Engines [en_US]
dc.subject: Finite state transducers [en_US]
dc.subject: Java language [en_US]
dc.subject: Morphological analyzer [en_US]
dc.subject: Natural language processing systems [en_US]
dc.subject: Proper nouns [en_US]
dc.subject: Rule engine [en_US]
dc.subject: Semantics [en_US]
dc.subject: Speech recognition [en_US]
dc.subject: Text processing [en_US]
dc.subject: Transducers [en_US]
dc.subject: Trie data structures [en_US]
dc.subject: Turkish language [en_US]
dc.subject: XML languages [en_US]
dc.title: An open, extendible, and fast Turkish morphological analyzer [en_US]
dc.type: conferenceObject [en_US]
dc.description.version: Publisher's Version [en_US]
dc.relation.journal: International Conference Recent Advances in Natural Language Processing, RANLP [en_US]
dc.contributor.department: Işık Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü [en_US]
dc.contributor.department: Işık University, Faculty of Engineering, Department of Computer Engineering [en_US]
dc.contributor.authorID: 0000-0001-5838-4615
dc.contributor.authorID: 0000-0002-2782-8217
dc.identifier.volume: 2019
dc.identifier.startpage: 1364
dc.identifier.endpage: 1372
dc.peerreviewed: Yes [en_US]
dc.publicationstatus: Published [en_US]
dc.relation.publicationcategory: Conference Item - International - Institutional Faculty Member [en_US]
dc.contributor.institutionauthor: Yıldız, Olcay Taner [en_US]
dc.contributor.institutionauthor: Ercan, Gökhan [en_US]
dc.relation.index: Scopus [en_US]
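
The abstract in this record credits an LRU cache and a trie data structure for the analyzer's reported throughput. The following is a minimal Java sketch of those two structures, not the authors' implementation: the class names `LruCache` and `LexiconTrie`, the method `prefixesOf`, and the idea of caching one analysis string per surface form are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical LRU cache built on LinkedHashMap's access-order mode. */
class LruCache<K, V> extends LinkedHashMap<K, V> {
    private final int capacity;

    LruCache(int capacity) {
        super(16, 0.75f, true); // true = keep entries in access order, oldest first
        this.capacity = capacity;
    }

    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
        return size() > capacity; // evict the least recently used entry
    }
}

/** Hypothetical character trie over lexicon bare-forms. */
class LexiconTrie {
    private static final class Node {
        final Map<Character, Node> children = new HashMap<>();
        boolean isWord;
    }

    private final Node root = new Node();

    /** Stores one bare-form in the trie. */
    void add(String bareForm) {
        Node node = root;
        for (char c : bareForm.toCharArray()) {
            node = node.children.computeIfAbsent(c, k -> new Node());
        }
        node.isWord = true;
    }

    /** Returns every stored bare-form that is a prefix of surfaceForm, shortest first. */
    List<String> prefixesOf(String surfaceForm) {
        List<String> result = new ArrayList<>();
        Node node = root;
        for (int i = 0; i < surfaceForm.length(); i++) {
            node = node.children.get(surfaceForm.charAt(i));
            if (node == null) {
                break; // no stored bare-form continues along this path
            }
            if (node.isWord) {
                result.add(surfaceForm.substring(0, i + 1));
            }
        }
        return result;
    }
}
```

In this sketch the trie yields candidate roots for a surface form in a single left-to-right pass, and the cache lets a repeated surface form skip analysis entirely, which is one plausible reason frequent words in a large corpus would be cheap to process.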

