Coban O., ÖZEL S. A.

COMPUTER SCIENCE-AGH, vol.20, no.3, pp.383-405, 2019 (Journal Indexed in ESCI)

  • Publication Type: Article
  • Volume: 20 Issue: 3
  • Publication Date: 2019
  • DOI Number: 10.7494/csci.2019.20.3.3285
  • Title of Journal: COMPUTER SCIENCE-AGH
  • Page Numbers: pp.383-405


Malware, short for malicious software, is software created with the intent of damaging hardware systems, stealing data, or causing disruption, whether to make money, to protest, or even to wage war between governments. Malware often spreads through applications downloaded from distribution platforms, and users today are highly likely to encounter malware when installing applications on their smartphones. Tools that detect malware before it is installed on a device are therefore essential. There are three main approaches to malware detection: i) static, ii) dynamic, and iii) hybrid. The static approach analyzes a suspicious program without executing it. The dynamic approach, on the other hand, executes the program in a controlled environment and collects information from the operating system at runtime. The hybrid approach, as its name implies, combines the two. Although the static approach has some disadvantages, it is widely preferred because of its lower cost. In this paper, we aim to develop a static malware detection system using text categorization techniques. To this end, we apply text mining techniques such as bag-of-words and n-gram feature extraction to the manifest content of suspicious programs, and then apply text classification methods to detect malware. Our experimental results show that the approach detects malicious applications with an accuracy between 94.0% and 99.3%.
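
The pipeline outlined in the abstract (n-gram features extracted from manifest text, then a text classifier) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the tokenizer, the choice of multinomial Naive Bayes as the classifier, the helper names (`tokenize`, `ngram_features`, `NaiveBayes`), and the four toy manifest snippets, which are invented and far too few to say anything about real accuracy.

```python
import math
import re
from collections import Counter


def tokenize(manifest_text):
    """Lowercase the manifest text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9_]+", manifest_text.lower())


def ngram_features(manifest_text, n_max=2):
    """Count word n-grams for n = 1..n_max (n = 1 is plain bag-of-words)."""
    tokens = tokenize(manifest_text)
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


class NaiveBayes:
    """Multinomial Naive Bayes with Laplace smoothing (an assumed classifier)."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = {c: Counter() for c in self.class_counts}
        for doc, label in zip(docs, labels):
            self.feature_counts[label].update(ngram_features(doc))
        # The vocabulary is the union of all n-grams seen during training.
        self.vocab = set()
        for counts in self.feature_counts.values():
            self.vocab.update(counts)
        return self

    def predict(self, doc):
        features = ngram_features(doc)
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for feat, freq in features.items():
                if feat in self.vocab:  # ignore n-grams never seen in training
                    score += freq * math.log((counts[feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Invented toy "manifest" snippets, for illustration only.
train_docs = [
    "uses-permission android.permission.SEND_SMS uses-permission android.permission.READ_CONTACTS",
    "uses-permission android.permission.SEND_SMS uses-permission android.permission.RECEIVE_BOOT_COMPLETED",
    "uses-permission android.permission.INTERNET activity MainActivity",
    "uses-permission android.permission.CAMERA activity GalleryActivity",
]
train_labels = ["malware", "malware", "benign", "benign"]

model = NaiveBayes().fit(train_docs, train_labels)
prediction = model.predict("uses-permission android.permission.SEND_SMS")
print(prediction)
```

Because the analysis reads only the manifest text, nothing is executed, which is what makes the approach static. A realistic experiment would require a labeled corpus of real application manifests and a held-out evaluation set.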