SentiWordNet for New Language: Automatic Translation Approach


UÇAN A. , Naderalvojoud B., Sezer E. A. , Sever H.

12th International Conference on Signal-Image Technology and Internet-Based Systems (SITIS), Naples, Italy, 28 November - 01 December 2016, pp.308-315 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/sitis.2016.57
  • City: Naples
  • Country: Italy
  • Page Numbers: pp.308-315

Abstract

This paper proposes an automatic translation approach to create a sentiment lexicon for a new language from available English resources. In this approach, an automatic mapping is generated from a sense-level resource to a word-level by applying a triple unification process. This process produces a single polarity score for each term by incorporating all sense polarities. The major idea is to deal with the sense ambiguity during the lexicon transfer and provide a general sentiment lexicon for languages like Turkish which do not have a freely available machine-readable dictionary. On the other hand, the translation quality is critical in the lexicon transfer due to the ambiguity problem. Thus, this paper also proposes a multiple bilingual translation approach to find the most appropriate equivalents for the source language terms. In this approach, three parallel, series and hybrid algorithms are used to integrate the translation results. Finally, three lexicons are achieved for the target language with different sizes. The performance of three lexicons is evaluated in the lexicon-based sentiment classification task and compared with the results achieved by the supervised approach. According to experimental results, the proposed approach can produce reliable sentiment lexicons for the target language.