Noun Phrase Chunking for Turkish Using a Dependency Parser

Kutlu M., ÇİÇEKLİ İ.

30th International Symposium on Computer and Information Sciences (ISCIS), London, United Kingdom, 21-24 September 2015, vol. 363, pp. 381-391 (Full-Text Paper)

Publication Type: Conference Paper / Full-Text Paper
Volume: 363
DOI: 10.1007/978-3-319-22635-4_35
City of Publication: London
Country of Publication: United Kingdom
Pages: pp. 381-391
Hacettepe University Affiliated: Yes

Abstract

Noun phrase chunking is a form of shallow parsing that supports many natural language processing tasks. In this paper, we propose a noun phrase chunking system for Turkish texts. We use a weighted constraint dependency parser to represent the relationships between sentence components and to determine noun phrases. The dependency parser uses a set of hand-crafted rules whose constraints can combine morphological and semantic information. Because of this flexibility, the rules are suitable for handling complex noun phrase structures. The developed dependency parser can easily be used for shallow parsing of all phrase types by changing the rule set it employs. The lack of reliable human-tagged datasets is a significant problem for natural language processing research on Turkish. Therefore, we constructed a noun phrase dataset for Turkish. According to our evaluation results, our noun phrase chunker gives promising results on this dataset.
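The idea of reading noun phrases off a dependency analysis can be illustrated with a minimal sketch. It is not the weighted constraint parser described in the abstract: it assumes a finished, well-formed, projective dependency tree is already available and only shows the chunk-extraction step. The `Token` fields, the relation labels in `NP_INTERNAL`, and the example sentence are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass

# Hypothetical relation labels that keep a dependent inside its head's noun phrase.
NP_INTERNAL = {"MODIFIER", "DETERMINER", "POSSESSOR", "CLASSIFIER"}


@dataclass
class Token:
    form: str   # surface form
    pos: str    # coarse part of speech, e.g. "Noun", "Adj", "Verb"
    head: int   # index of the head token, or -1 for the sentence root
    rel: str    # dependency relation to the head


def chunk_root(tokens, i):
    """Follow NP-internal links upward until a token that is not a modifier."""
    while tokens[i].rel in NP_INTERNAL and tokens[i].head >= 0:
        i = tokens[i].head
    return i


def noun_phrases(tokens):
    """Group tokens by chunk root and keep the groups headed by a noun."""
    members = {}
    for i in range(len(tokens)):
        root = chunk_root(tokens, i)
        if tokens[root].pos == "Noun":
            members.setdefault(root, []).append(i)
    chunks = []
    for root in sorted(members):
        idxs = members[root]
        # Assumes projectivity, so each chunk is a contiguous span.
        span = range(min(idxs), max(idxs) + 1)
        chunks.append(" ".join(tokens[j].form for j in span))
    return chunks


# Hypothetical analysis of "Küçük çocuk kırmızı topu attı"
# ("The small child threw the red ball").
sentence = [
    Token("Küçük", "Adj", 1, "MODIFIER"),
    Token("çocuk", "Noun", 4, "SUBJECT"),
    Token("kırmızı", "Adj", 3, "MODIFIER"),
    Token("topu", "Noun", 4, "OBJECT"),
    Token("attı", "Verb", -1, "ROOT"),
]

print(noun_phrases(sentence))
```

Because Turkish noun phrases are typically head-final, each chunk in this sketch ends at its head noun; the subject and object relations are treated as chunk boundaries, so the verb never joins a noun phrase.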