Deep Learning-based Sentiment Analysis of Facebook Data: The Case of Turkish Users


Coban O., ÖZEL S. A., Inan A.

COMPUTER JOURNAL, vol.64, no.3, pp.473-499, 2021 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 64 Issue: 3
  • Publication Date: 2021
  • DOI: 10.1093/comjnl/bxaa172
  • Journal Name: COMPUTER JOURNAL
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, ABI/INFORM, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC, Metadex, MLA - Modern Language Association Database, zbMATH, Civil Engineering Abstracts
  • Pages: pp.473-499
  • Keywords: online social networks, Facebook, sentiment analysis, machine learning, deep learning, AGREEMENT, RELIABILITY
  • Affiliated with Çukurova University: Yes

Abstract

Sentiment analysis (SA) is an essential task for many domains where it is crucial to know users' public opinion about events, products, brands, politicians and so on. Existing works on SA have concentrated on English texts including Twitter feeds and user reviews on hotels, movies and products. On the other hand, Facebook, as an online social network (OSN), has attracted quite limited attention from the research community. Within this limited body of work, SA studies on Turkish text obtained from OSNs are extremely scarce. In this paper, our aim is to perform SA on public Facebook data collected from Turkish user accounts. Our study differs from existing studies in terms of the data set scale, the natural language of the texts in the data set and the extent of experimental analyses that include both machine learning and deep learning techniques. We extensively report not only the results of different learning models for SA but also the statistical distribution of metadata of user activities across various user attributes (e.g. gender and age). Our experimental results indicate that recurrent neural networks achieve the best accuracy (i.e. 0.916) with word embeddings. To the best of our knowledge, this is the best result for SA on Facebook data in the context of the Turkish language.