Image Captioning in Turkish with Subword Units

Kuyu M., Erdem A., Erdem E.

26th IEEE Signal Processing and Communications Applications Conference (SIU), İzmir, Türkiye, 2 - 05 Mayıs 2018, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası:
Doi Numarası: 10.1109/siu.2018.8404431
Basıldığı Şehir: İzmir
Basıldığı Ülke: Türkiye
Hacettepe Üniversitesi Adresli: Evet

Özet

Automatically describing images with natural sentences, also known as image captioning, is a challenging research problem at the intersection of computer vision and natural language processing which has recently become very popular in the literature. With the advances in deep learning, recently proposed image captioning approaches are all based on deep artificial neural networks. However, most of these methods focus on the English language, which greatly restricts their use for Turkish. Turkish is an agglutinative language and suffixes might change the meaning of a word entirely, hence an image captioning approach specifically designed for Turkish should consider the characteristics of the language. In this study, we propose such an image captioning model, which utilizes subword units. Our experimental results show that this model provides results which are much better than the word-based model.