Arama Sonuçları

Listeleniyor 1 - 10 / 15
  • Yayın
    Decoder-side super-resolution and frame interpolation for improved H.264 video coding
    (IEEE, 2013) Ateş, Hasan Fehmi
    In literature decoder-side motion estimation is shown to improve video coding efficiency of both H.264 and HEVC standards. In this paper we introduce enhanced skip and direct modes for H.264 coding using decoder-side super-resolution (SR) and frame interpolation. P-and B-frames are downsampled and H.264 encoded at lower resolution (LR). Then reconstructed LR frames are super-resolved using decoder-side motion estimation. Alternatively for B-frames, bidirectional true motion estimation is performed to synthesize a B-frame from its reference frames. For P-frames, bicubic interpolation of the LR frame is used as an alternative to SR reconstruction. A rate-distortion optimal mode selection algorithm determines for each MB which of the two reconstructions to use as skip/direct mode prediction. Simulations indicate an average of 1.04 dB PSNR improvement or 23.0% bitrate reduction at low bitrates when compared to H.264 standard. Average PSNR gains reach as high as 3.95 dB depending on the video content and frame rate.
  • Yayın
    Tel file geometrilerinin sıradüzensel küme bölüntüleme ile spektral kodlaması
    (IEEE, 2007-09-04) Konur, Umut; Bayazıt, Uluğ; Ateş, Hasan Fehmi; Gürgen, Sadık Fikret
    Çalışmamızda bir dönüşümle elde edilen spektral katsayılar kullanılarak betimlenen tel file geometri bilgisi, katsayılara bütün bit düzlemlerinde en doğru öncelikler atanarak sıradüzensel bir küme bölüntüleme algoritmasıyla aşamalı biçimde kodlanmaktadır. Kullanılan spektral dönüşüm [8]’de önerilmekte ve geometri bilgisinin topolojiden belirlenen birimdik bir doğuray üzerine düşümlenerek katsayıların elde edilmesi ilkesine dayanmaktadır. Kodlamada kullanılan küme bölüntüleme yöntemi, üç ayrı uzamsal koordinata ait farklı katsayıların bitlerine her bit düzleminde doğru önceliği tanımakta ve katsayıların bit düzlemlerindeki sıfırları birleşik kodladığı için dolaylı bit atamasını başararak tamamen gömülü bir yapıyı sağlayabilmektedir. Yaygın düzensiz tel filelerle yapılan deneylerde önerilen yöntemin hız-bozunum başarımı, [8]’deki kodlama yönteminin hız-bozunum başarımına göre açık bir üstünlük sağlamaktadır.
  • Yayın
    Low complexity inter-mode selection for H.264
    (IEEE, 2006) Ba, Seydou Nourou; Altunbaşak, Yücel; Ateş, Hasan Fehmi
    The coding efficiency of the H.264/AVC standard enables the transmission of high quality video over bandwidth limited networks. Due to the use of multiple Macroblock (MB) partitions, the Motion estimation module has extremely high complexity that makes it unpractical for most real-time applications on resource-limited platforms such as hand held devices. In this paper we propose a novel algorithm that significantly reduces the encoding complexity while maintaining high rate distortion performance. The proposed method reduces the Motion estimation (ME) computational complexity by accurately predicting the optimal MB partitions and restricting the number of candidate modes based on a-priori probabilities computed from spatio-temporal information. The experimental results show that the speed up of UmHexagonS [1] (one of the most efficient ME algorithms) can be doubled while maintaining the coding efficiency of Full Search.
  • Yayın
    Wavelet image coding using the spherical representation
    (IEEE, 2005) Ateş, Hasan Fehmi; Orchard, Michael T.
    In this paper, we introduce the "spherical representation", which provides a new adaptive framework for modeling and coding the image information in wavelet subbands. Based on this representation, a practical coding algorithm is developed. This coder uses local energy as a direct measure to differentiate between parts of the wavelet subband and to decide how to allocate the available bitrate. As local energy becomes available at finer resolutions, i.e. in smaller size windows, the coder automatically updates its decisions about how to spend the bitrate. We use a hierarchical set of variables to specify and code the local energy up to the highest resolution, i.e. the energy of individual wavelet coefficients. The overall scheme is nonredundant, meaning that the subband information is conveyed using this equivalent set of variables without the need for any side parameters. Despite its simplicity, the algorithm produces PSNR results that are competitive with the state-of-art coders in literature.
  • Yayın
    Decoder side true motion estimation for very low bitrate b-frame coding
    (IEEE, 2011) Ateş, Hasan Fehmi; Çizmeci, Burak
    In H.264 standard, coding of motion vectors constitutes a significant portion of total bitrate especially at low bitrate regimes. This is because differential coding of motion vectors is inefficient when the bit budget is very low. In this paper, we propose a novel estimation and coding algorithm for motion vectors of B-frames at very low bitrates. In this method, the encoder selects the optimal motion vector from a limited set of candidate vectors that are determined at the decoder side using true motion estimation. Since these candidate vector sets are fixed by the decoder for each macroblock, there is no need for explicit coding of motion information, which reduces the bitrate required for coding. Also, true motion vector estimates are used for improved direct mode coding in B-frames. The algorithm provides an average of 0.68 dB PSNR gain for B-frames when compared to the reference H.264 results at the same bitrates. Simulation results also indicate significant improvement in visual quality of the compressed B-frames.
  • Yayın
    A new speech modeling method: SYMPES
    (IEEE, 2006) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this paper, the new method of speech modeling which is called SYMPES is introduced and it is compared with the commercially available methods. It is shown that for the same compression ratio or better, SYMPES yields considerably better hearing quality over the coders such as G.726 at 16 Kbps and voice excited LPC-10E of 2.4Kbps.
  • Yayın
    A new algorithm for high speed speech and audio coding
    (IEEE, 2007) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this work, a new mathematical modeling approach is proposed for the representation of the speech and audio signals. This approach is based on the generation of the so called Predefined Signature Sequence (PSS) and Predefined Envelope Sequence (PES) Sets. After the generation process of the PSS and PES sets, they are clustered by effective k-means clustering algorithm and the PSS and PES are redefined by using the centroids of the clusters. By using this approach, the drawbacks such as the size of the sets, speed of the reconstruction process (computational complexity) which arise in our proposed methods previously are highly eliminated. In spite of these improvements, the initial results proved that, the quality of the reconstructed signals remains within the limitations of the acceptable hearing quality.
  • Yayın
    A new coding method for speech and audio signals
    (IEEE, 2005) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this paper a new representation or modeling method of speech signals is introduced. The proposed method is based on the generation of the so-called Predefined Signature S={S R } and Envelope vector E={E K } Sets (PSEVS). These vector sets are speaker and language independent. In this method, once the speech signals are divided into frames with selected lengths, then each frame signal piece X i is reconstructed by means of the mathematical form of X i =C i E K S R . In this representation, C i is called the frame coefficient, S R and E K are the vectors properly assigned from the PSEVS respectively. It is shown that the proposed method provides fast reconstruction and substantial compression ratio with acceptable hearing quality.
  • Yayın
    A novel noise robust and low bit rate speech coding algorithm
    (IEEE, 2009) Güz, Ümit; Gürkan, Hakan; Yarman, Bekir Sıddık Binboğa
    In this work, a new noise robust and variable length frame based speech modeling method is introduced. This method consists of three major steps which includes noise removal algorithm, coding and encoding algorithms, respectively. Coding and encoding parts are developed based on SYMPES (SYsteMatic Procedure for Predefined Envelope and Signature sequence sets). These sets have been developed in two types which represent voiced and unvoiced parts of the speech signals separately in order to obtain more efficient coding strategy and higher compression ratio while preserving the perceptual quality of the speech signals. As an extension of our previous works our new framework is not only consider the coding of the clean speech signals but also noisy speech signals. The new noise robust module suppresses the noise and delivers the clean speech signal to the newly designed modeling part. The modeling part promises higher compression ratios by switching to the more appropriate type of predefined sets take into account the voiced and unvoiced frames.
  • Yayın
    H.264 video kodlamada B-çerçeveler için kodçözücü tarafında aday devinim vektör seçimi
    (IEEE, 2012-04-18) Ateş, Hasan Fehmi; Gaurav, Rahul
    H.264 standardında devinim vektör farklarının kodlanması sebebiyle özellikle düşük bit hızlarında nesne sınırlarında devinimdeki ani değişiklikler harcanan bit miktarlarını artırmaktadır. Bu bildiride B-çerçevelerde kod çözücü desteği ile verimli devinim vektör kodlama için özgün bir yöntem sunulmuştur. Bu yöntemde kod çözücü gerçek devinim kestirimi kullanarak az sayıda aday vektör içeren bir vektör kümesi belirler. Devinim kestirim doğrulu günün iyileştirilmesi amacıyla bu aday vektörler etrafında kısıtlı bir arama yapılır. Bu aramaya en iyi olma ihtimali düşük vektörler dahil edilmeleyerek aday vektor alt-kümesinin küçük tutulması sağlanır. Sonuç¸ta her makroblok için aday vektör kümeleri kod çözücü tarafından belirlendiği için, belirtik bir şekilde devinim bilgisinin kodlanmasına gerek kalmamakta ve bu da kodlama için gerekli bit hızını düşürmektedir. Algoritmanın aynı bit hızlarında referans H.264 sonuçlarına göre 0.39 dB PSNR kazancı sağladığı gösterilmiştir. Ayrıca sıkıştırılmış B-çerçevelerin görsel kalitesinde kayda değer bir iyileşme gözlenmistir.