ANALYSIS OF DATA PRE-PROCESSING METHODS FOR SENTIMENT ANALYSIS OF REVIEWS


Creative Commons License

PARLAR T., ÖZEL S. A. , Song F.

COMPUTER SCIENCE-AGH, cilt.20, ss.123-141, 2019 (ESCI İndekslerine Giren Dergi) identifier identifier

  • Cilt numarası: 20 Konu: 1
  • Basım Tarihi: 2019
  • Doi Numarası: 10.7494/csci.2019.20.1.3097
  • Dergi Adı: COMPUTER SCIENCE-AGH
  • Sayfa Sayısı: ss.123-141

Özet

The goals of this study are to analyze the effects of data pre-processing methods for sentiment analysis and determine which of these pre-processing methods (and their combinations) are effective for English as well as for an agglutinative language like Turkish. We also try to answer the research question of whether there are any differences between agglutinative and non-agglutinative languages in terms of pre-processing methods for sentiment analysis. We find that the performance results for the English reviews are generally higher than those for the Turkish reviews due to the differences between the two languages in terms of vocabularies, writing styles, and agglutinative property of the Turkish language.