ADAPTING TEXT CATEGORIZATION FOR MANIFEST BASED ANDROID MALWARE DETECTION

Coban, Onder; ÖZEL, SELMA

doi:10.7494/csci.2019.20.3.3285

ADAPTING TEXT CATEGORIZATION FOR MANIFEST BASED ANDROID MALWARE DETECTION

Atıf İçin Kopyala

Coban O., ÖZEL S. A.

COMPUTER SCIENCE-AGH, cilt.20, sa.3, ss.383-405, 2019 (ESCI)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 20 Sayı: 3
Basım Tarihi: 2019
Doi Numarası: 10.7494/csci.2019.20.3.3285
Dergi Adı: COMPUTER SCIENCE-AGH
Derginin Tarandığı İndeksler: Emerging Sources Citation Index (ESCI), Scopus
Sayfa Sayıları: ss.383-405
Çukurova Üniversitesi Adresli: Evet

Özet

Malware is a shorthand of malicious software that are created with the intent of damaging hardware systems, stealing data, and causing a mess to make money, protest something, or even make war between governments. Malware is often spread by downloading some applications for your hardware from some download platforms. It is highly probable to face with a malware while you try to load some applications for your smart phones nowadays. Therefore it is very important that some tools are needed to detect malware before loading them to the hardware systems. There are mainly three different approaches to detect malware: i) static. ii) dynamic, and iii) hybrid. Static approach analyzes the suspicious program without executing it. Dynamic approach, on the other hand. executes the program in a controlled environment and obtains information from operating system during runtime. Hybrid approach, as its name implies. is the combination of these two approaches. Although static approach may seem to have some disadvantages, it is highly preferred because of its lower cost. In this paper, our aim is to develop a static malware detection system by using text categorization techniques. To reach our goal, we apply text mining techniques like feature extraction by using bag-of-words, n-grams, etc. from manifest content of suspicious programs, then apply text classification methods to detect malware. Our experimental results revealed that our approach is capable of detecting malicious applications with an accuracy between 94.0% and 99.3%.