EMPLOYING FUZZY C-MEANS FOR DNA TRANSCRIPTION FACTOR BINDING SITE IDENTIFICATION

İBRİKÇİ, TURGAY; Karabulut, Mustafa

doi:10.1142/s0218126610005925

İBRİKÇİ T., Karabulut M.

JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, vol.19, no.1, pp.15-30, 2010 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 19 Issue: 1
Publication Date: 2010
DOI: 10.1142/s0218126610005925
Journal Name: JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Page Numbers: pp.15-30
Affiliated with Çukurova University: Yes

Abstract

DNA motif discovery is an important task because it helps explain how transcription is regulated during protein synthesis. This paper introduces a novel method for DNA motif finding that takes a machine-learning approach, using the well-known Fuzzy C-Means clustering algorithm. The method is explained in detail and tested on DNA sequences extracted from the genomes of Saccharomyces cerevisiae and Escherichia coli. Experimental results suggest that the algorithm is efficient at finding statistically interesting features in the DNA sequences. A comparison with the well-known motif-finding tools MEME and MDScan, which are built on statistical and word-enumerative models, shows the advantages of the proposed method over the existing tools and the promise of the machine-learning approach.
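
The abstract does not say how sequences are encoded or how Fuzzy C-Means is configured, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes fixed-length k-mers, a one-hot encoding of bases, the textbook Fuzzy C-Means updates with fuzzifier m = 2, and toy sequences invented for the example. All function names (`one_hot`, `kmers`, `fuzzy_c_means`, `consensus`) are hypothetical.

```python
import random

BASES = "ACGT"

def one_hot(kmer):
    """Encode a DNA k-mer as a flat vector with four slots per position."""
    vec = []
    for base in kmer:
        vec.extend(1.0 if base == b else 0.0 for b in BASES)
    return vec

def kmers(seq, k):
    """Return every overlapping window of length k in seq."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def fuzzy_c_means(points, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Textbook Fuzzy C-Means; returns (centers, memberships)."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Start from random memberships, each row normalized to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = []
    for _ in range(iters):
        # Center update: mean of the points weighted by membership^m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            total = sum(w)
            centers.append([
                sum(w[i] * points[i][d] for i in range(n)) / total
                for d in range(dim)
            ])
        # Membership update: proportional to (squared distance)^(-1/(m-1)).
        new_u = []
        for p in points:
            inv = [max(sq_dist(p, ctr), 1e-12) ** (-1.0 / (m - 1.0))
                   for ctr in centers]
            total = sum(inv)
            new_u.append([v / total for v in inv])
        shift = max(abs(new_u[i][j] - u[i][j])
                    for i in range(n) for j in range(c))
        u = new_u
        if shift < tol:
            break
    return centers, u

def consensus(center):
    """Read a cluster center back as a k-mer: heaviest base per position."""
    return "".join(
        BASES[max(range(4), key=lambda b: center[i + b])]
        for i in range(0, len(center), 4)
    )

if __name__ == "__main__":
    # Toy sequences made up for illustration; not data from the paper.
    seqs = ["ACGTTGACTCAAGT", "GGTGACTCATCCTA"]
    words = [w for s in seqs for w in kmers(s, 6)]
    centers, u = fuzzy_c_means([one_hot(w) for w in words], c=3)
    for ctr in centers:
        print(consensus(ctr))
```

In this sketch each k-mer belongs to every cluster with a degree between 0 and 1, so a window that only partly matches a recurring pattern still contributes to that cluster's center. How candidate motifs are scored and selected from the clusters is left out here because the abstract gives no detail on it.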