Modernization of Old Turkish Texts


Ozkan E., ERCAN G.

26th IEEE Signal Processing and Communications Applications Conference (SIU), İzmir, Türkiye, 2-5 May 2018

  • Publication Type: Conference Paper / Full-Text Paper
  • Volume:
  • DOI: 10.1109/siu.2018.8404308
  • City of Publication: İzmir
  • Country of Publication: Türkiye
  • Affiliated with Hacettepe University: Yes

Abstract

As a language changes over time, texts written in earlier periods come to contain many words that are no longer in use, which makes them difficult for present-day readers to understand. The goal of text simplification is to increase the readability and understandability of a text while preserving its meaning. This study aims to reduce the complexity of texts written in Republican-period Turkish using text simplification methods. First, a parallel dataset is built from Nutuk; then a statistical machine translation model is trained. The results are evaluated with the BLEU metric, which is used to evaluate machine translation systems. This work reduces the complexity of old texts and thereby broadens their potential audience.
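
The abstract names BLEU as the evaluation metric without describing how it is computed. The following is a minimal sketch of single-reference, sentence-level BLEU (clipped n-gram precision with a brevity penalty), not the evaluation setup used in the paper. The function names `ngrams` and `bleu` and the toy token lists are assumptions made for illustration; they are not data from Nutuk or from the study.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n=4):
    """Unsmoothed single-reference BLEU for one tokenized sentence pair.

    Returns 0.0 if any n-gram order up to max_n has no match.
    """
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        ref_counts = ngrams(reference, n)
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = sum(cand_counts.values())
        if overlap == 0 or total == 0:
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    if len(candidate) > len(reference):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference) / len(candidate))
    # Geometric mean of the n-gram precisions, scaled by the penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


# Toy placeholder tokens (hypothetical, not taken from the dataset).
system_output = ["a", "b", "c", "d", "e"]
modern_reference = ["a", "b", "c", "d", "f"]
score = bleu(system_output, modern_reference)
```

In practice, corpus-level BLEU aggregates n-gram counts over all sentence pairs before taking the precisions, and established toolkits apply standardized tokenization, so scores from this sketch are only indicative.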