Arama Sonuçları

Listeleniyor 1 - 5 / 5
  • Yayın
    A new speech modeling method: SYMPES
    (IEEE, 2006) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this paper, the new method of speech modeling which is called SYMPES is introduced and it is compared with the commercially available methods. It is shown that for the same compression ratio or better, SYMPES yields considerably better hearing quality over the coders such as G.726 at 16 Kbps and voice excited LPC-10E of 2.4Kbps.
  • Yayın
    A new algorithm for high speed speech and audio coding
    (IEEE, 2007) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this work, a new mathematical modeling approach is proposed for the representation of the speech and audio signals. This approach is based on the generation of the so called Predefined Signature Sequence (PSS) and Predefined Envelope Sequence (PES) Sets. After the generation process of the PSS and PES sets, they are clustered by effective k-means clustering algorithm and the PSS and PES are redefined by using the centroids of the clusters. By using this approach, the drawbacks such as the size of the sets, speed of the reconstruction process (computational complexity) which arise in our proposed methods previously are highly eliminated. In spite of these improvements, the initial results proved that, the quality of the reconstructed signals remains within the limitations of the acceptable hearing quality.
  • Yayın
    A new coding method for speech and audio signals
    (IEEE, 2005) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this paper a new representation or modeling method of speech signals is introduced. The proposed method is based on the generation of the so-called Predefined Signature S={S R } and Envelope vector E={E K } Sets (PSEVS). These vector sets are speaker and language independent. In this method, once the speech signals are divided into frames with selected lengths, then each frame signal piece X i is reconstructed by means of the mathematical form of X i =C i E K S R . In this representation, C i is called the frame coefficient, S R and E K are the vectors properly assigned from the PSEVS respectively. It is shown that the proposed method provides fast reconstruction and substantial compression ratio with acceptable hearing quality.
  • Yayın
    A novel noise robust and low bit rate speech coding algorithm
    (IEEE, 2009) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this work, a new noise robust and variable length frame based speech modeling method is introduced. This method consists of three major steps which includes noise removal algorithm, coding and encoding algorithms, respectively. Coding and encoding parts are developed based on SYMPES (SYsteMatic Procedure for Predefined Envelope and Signature sequence sets). These sets have been developed in two types which represent voiced and unvoiced parts of the speech signals separately in order to obtain more efficient coding strategy and higher compression ratio while preserving the perceptual quality of the speech signals. As an extension of our previous works our new framework is not only consider the coding of the clean speech signals but also noisy speech signals. The new noise robust module suppresses the noise and delivers the clean speech signal to the newly designed modeling part. The modeling part promises higher compression ratios by switching to the more appropriate type of predefined sets take into account the voiced and unvoiced frames.
  • Yayın
    A novel fast algorithm for speech and audio coding
    (IEEE, 2007) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this work a new mathematical modeling approach is proposed for the representation of the speech and audio signals. This approach is based on the generation of the so called Predefined Signature Sequence (PSS) and Predefined Envelope Sequence (PES) Sets. After the generation process of the PSS and PES sets, they are clustered by effective k-means clustering algorithm and the PSS and PES are redefined by using these centroids. By using this approach, the drawbacks by means of the size of the sets, speed of the reconstruction process (computational complexity) which arise in the proposed methods previously are highly eliminated. In spite of these improvements, the initial results proved that, the quality of the reconstructed signals remains within the limitations of the acceptable hearing quality.