RECOGNITION OF NON-SPEECH SOUNDS USING MEL-FREQUENCY CEPSTRUM COEFFICIENTS AND DYNAMIC TIME WARPING METHOD

Disken G., İBRİKÇİ T.

23nd Signal Processing and Communications Applications Conference (SIU), Malatya, Türkiye, 16 - 19 Mayıs 2015, ss.144-147, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası:
Doi Numarası: 10.1109/siu.2015.7130277
Basıldığı Şehir: Malatya
Basıldığı Ülke: Türkiye
Sayfa Sayıları: ss.144-147
Çukurova Üniversitesi Adresli: Evet

Özet

With the developing technology, speech recognition systems are getting more space in our daily lives. Sounds in our environment are not only pure speech. Because of this, it is important for cochlear implants, unmanned vehicles and security systems to be able to recognize other sounds. In this work, Mel-frequency cepstrum coefficients, one of the most widely used methods for feature extraction in speech recognition, applied to various nature and animal sounds. Because each sound does not have the same duration, dynamic time warping, one of the methods used in speech recognition, is preferred to classify the feature vectors. The difference in durations of sounds affects the lengths of the feature vectors. With dynamic time warping method, one can overcome these differences. One reference record and 10 test records obtained from 10 different sound sources. True classification rate is found as 88%.