ANALYSIS OF DATA PRE-PROCESSING METHODS FOR SENTIMENT ANALYSIS OF REVIEWS


Creative Commons License

PARLAR T., ÖZEL S. A. , Song F.

COMPUTER SCIENCE-AGH, vol.20, no.1, pp.123-141, 2019 (Journal Indexed in ESCI) identifier identifier

  • Publication Type: Article / Article
  • Volume: 20 Issue: 1
  • Publication Date: 2019
  • Doi Number: 10.7494/csci.2019.20.1.3097
  • Title of Journal : COMPUTER SCIENCE-AGH
  • Page Numbers: pp.123-141

Abstract

The goals of this study are to analyze the effects of data pre-processing methods for sentiment analysis and determine which of these pre-processing methods (and their combinations) are effective for English as well as for an agglutinative language like Turkish. We also try to answer the research question of whether there are any differences between agglutinative and non-agglutinative languages in terms of pre-processing methods for sentiment analysis. We find that the performance results for the English reviews are generally higher than those for the Turkish reviews due to the differences between the two languages in terms of vocabularies, writing styles, and agglutinative property of the Turkish language.