ADAPTING TEXT CATEGORIZATION FOR MANIFEST BASED ANDROID MALWARE DETECTION


Creative Commons License

Coban O., ÖZEL S. A.

COMPUTER SCIENCE-AGH, cilt.20, sa.3, ss.383-405, 2019 (ESCI) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 20 Sayı: 3
  • Basım Tarihi: 2019
  • Doi Numarası: 10.7494/csci.2019.20.3.3285
  • Dergi Adı: COMPUTER SCIENCE-AGH
  • Derginin Tarandığı İndeksler: Emerging Sources Citation Index (ESCI), Scopus
  • Sayfa Sayıları: ss.383-405
  • Çukurova Üniversitesi Adresli: Evet

Özet

Malware is a shorthand of malicious software that are created with the intent of damaging hardware systems, stealing data, and causing a mess to make money, protest something, or even make war between governments. Malware is often spread by downloading some applications for your hardware from some download platforms. It is highly probable to face with a malware while you try to load some applications for your smart phones nowadays. Therefore it is very important that some tools are needed to detect malware before loading them to the hardware systems. There are mainly three different approaches to detect malware: i) static. ii) dynamic, and iii) hybrid. Static approach analyzes the suspicious program without executing it. Dynamic approach, on the other hand. executes the program in a controlled environment and obtains information from operating system during runtime. Hybrid approach, as its name implies. is the combination of these two approaches. Although static approach may seem to have some disadvantages, it is highly preferred because of its lower cost. In this paper, our aim is to develop a static malware detection system by using text categorization techniques. To reach our goal, we apply text mining techniques like feature extraction by using bag-of-words, n-grams, etc. from manifest content of suspicious programs, then apply text classification methods to detect malware. Our experimental results revealed that our approach is capable of detecting malicious applications with an accuracy between 94.0% and 99.3%.