Eran8.pdf (327.59 kB)
Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC
conference contribution
posted on 2010-07-16, 10:11 authored by Xiongwen Li, Eran Edirisinghe, Helmut BezThe use of shape-adaptive transforms is a popular approach for coding arbitrarily shaped objects in image/video coding
due to their adaptability at object edges and low complexity. In this respect shape adaptive DCT (SA-DCT) and shape
adaptive DWT (SA-DWT) have been proposed in previous literature. The Integer Transform (IT), a derivative of the
4x4 DCT, has been adopted in the latest H.264/AVC standard for coding image blocks in residual data (texture). The
associated integer arithmetic guarantees fast and accurate coding/decoding. In this paper, we propose a novel Shape
Adaptive Integer Transform (SA-IT) which can be effectively used in future for enabling arbitrary shaped object coding
in H.264. Though Integer Transforms are a derivative of 4x4 DCTs, in H.264, to maintain integer arithmetic capability,
the post-and pre-scaling factors of transform process are integrated into the forward and inverse quantiser stages
respectively for reducing the total number of multiplications and avoiding the loss of accuracy. Thus SA-IT considerably
differs from SA-DCT and calls for novel design and implementation considerations based on combining those merits of
both SA-DCT and IT. We provide theoretical proofs and support them with experimental justifications.
History
School
- Science
Department
- Computer Science
Citation
LI, X., EDIRISINGHE, E.A.and BEZ, H.E., 2006. Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC. IN: Apostolopoulos, J.G. and Said, A. (eds.) Visual Communications and Image Processing 2006, Proc. of SPIE-IS&T Electronic Imaging, 6077, 60770C, 11pp.Publisher
© 2006 SPIEVersion
- VoR (Version of Record)
Publication date
2006Notes
Copyright 2006 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic electronic or print reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited. This paper can also be found at: http://dx.doi.org/10.1117/12.642320Language
- en