High-Performance Hardware Implementation of The H
High-Performance Hardware Implementation of The H
264
SIMPLIFIED 8X8 TRANSFORMATION AND QUANTIZATION
II - 1138
3. HARDWARE PROTOTYPE The architecture is composed of two main stages. The
first one contains two blocks; the Transform block, which
A block diagram of the proposed architecture is shown in is composed of the three stages of the fast butterfly
Figure 1. The architecture uses 8x8 parallel blocks, QP, a operations mentioned in Section 2, and the QP-Processing
synchronizing clock, and an enabling signal (Input Valid) block, which is responsible for calculating the
as inputs. It outputs the quantized transform coefficients intermediate variables needed for quantization, such as f,
and the signal Output Valid. A block diagram of the qbits, and (P0 – P5), which are the values of the
architecture is shown in Figure 1. multiplication factors at the six different groups of
positions in the matrix as shown in Table 1. Finally, the
Quantization process takes place in the second main stage
Input Block of the design. This is done by performing the addition and
Quant.
(X00-X77) 8x8 Trans. multiplication operations in the Arithmetic block, and
Forward Coefficients finally the shifting operations in the Shifter block.
Trans. &
Quant.
QP
4. SIMULATIONS AND RESULTS
Stage 2
Stage 3
(X00- (W00-
X77) W77)
A 14.598 ns critical path is estimated by the synthesis
tool. Since at steady state, the architecture outputs a whole
8x8 encoded block with each clock pulse, therefore the
Arithmetic
II - 1139
Television (HDTV) frame of a 720 u 1280 pixels Transactions on Circuits and Systems For Video Technology, Vol. 13,
No. 7, pp. 560-576, July 2003.
resolution, and a 60 frames/sec frame rate is 0.21 ms, [7] “Emerging H.264 Standard: Overview and TMS320DM642-Based
which is about 79 times less than the 16.6 ms time Solutions for Real-Time Video Applications,” A white paper. [Online].
required for continuous motion. Hence, the introduced Available:
architecture satisfies the real-time constraints for SD, HD, http://www.ubvideo.com, December 2002.
[8] K. Denolf, C. Blanch, G. Lafruit, and A. Bormans, “Initial memory
and even higher resolution video formats. complexity analysis of the AVC codec,” IEEE Workshop on Signal
Processing Systems, 2002 (SIPS’02), pp. 222-227, October 2002.
5. CONCLUSION [9] T. Stockhammer, M. M. Hannuksela, T. Wiegand, “H.264/AVC in
wireless environments,” IEEE Transactions on Circuits and Systems For
Video Technology, Vol. 13, No. 7, pp. 657-673, July 2003.
A high-performance architecture for prototyping the [10] M. Horowitz, A. Joch, F. Kossentini, A. Hallapuro, “H.264/AVC
simplified 8x8 transformation and quantization, which Baseline Profile Decoder Complexity Analysis,” IEEE Transactions on
was recently adopted by the H.264 standard is developed. Circuits and Systems For Video Technology, Vol. 13, No. 7, pp. 704-
The architecture is designed to perform pipelined 716, July 2003.
[11] H. S. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky,
operations, which drastically reduces the required memory “Low-Complexity Transform and Quantization in H.264/AVC,” IEEE
resources and accesses, avoids any stall states, and Transactions on Circuits and Systems For Video Technology, Vol. 13,
dramatically improves the throughput of the architecture. No. 7, pp. 598-603, July 2003.
It satisfies the real-time constraints required by different [12] M. Wien, “Clean-up and improved design consistency for ABT,”
Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, doc. JVT
high-resolution digital video applications. The system is –E025.
simulated using ModelSim 5.4®, and synthesized using [13] S. Gordon, D. Marpe, and T. Wiegand, “Simplified Use of 8x8
Synplify Pro 7.1®, targeting the FPGA device XC2V4000 Transform – Proposal,” Joint Video Team (JVT) of ISO/IEC MPEG and
(BF957 package) from the Virtex-II family of Xilinx©. ITU-T VCEG, doc. JVT –J029.
[14] S. Gordon, D. Marpe, and T. Wiegand, “Simplified Use of 8x8
The ideal way to utilize the proposed design is to embed it Transform – Updated Proposal & Results,” Joint Video Team (JVT) of
to an overall system, where the reference software runs on ISO/IEC MPEG and ITU-T VCEG, doc. JVT –K028, Munich, Germany,
a Digital Signal Processor (DSP), and the computationally March 2004.
extensive operations performed in the hardware block. [15] S. Gordon, “Simplified Use of 8x8 Transform,” Joint Video Team
(JVT) of ISO/IEC MPEG and ITU-T VCEG, doc. JVT –I022, San
Diego, USA, September 2003.
6. ACKNOWLEDGEMENT [16] I. Amer, W. Badawy, and G. Jullien, “Towards MPEG-4 Part 10
System On Chip: A VLSI Prototype For Context-Based Adaptive
The authors would like to thank Advanced Technology Variable Length Coding (CAVLC),” accepted in IEEE Workshop on
Signal Processing Systems, Austin, Texas, USA, October 2004.
Information Processing Systems (ATIPS), Alberta [17] I. Amer, W. Badawy, and G. Jullien, “A VLSI Prototype for
Ingenuity Fund (AIF), Natural Science and Engineering Hadamard Transform with Application to MPEG-4 Part 10,” accepted in
Research Council of Canada (NSERC), Canadian IEEE International Conference on Multimedia and Expo, Taipei,
Microelectronics Corporation (CMC), Micronet R&D, Taiwan, June 2004.
[18] I. Amer, W. Badawy, and G. Jullien, “Hardware Prototyping for The
Alberta Informatics Circle of Research Excellence H.264 4x4 Transformation,” proceedings of IEEE International
(iCORE), Canada Foundation for Innovation (CFI), and Conference on Acoustics, Speech, and Signal Processing, Montreal,
the Department of Electrical and Computer Engineering at Quebec, Canada, Vol. 5, pp. 77-80, May 2004.
the University of Calgary for supporting this research. [19] I. Amer, W. Badawy, and G. Jullien, “A SystemC Model for the
MPEG-4 Part 10 4x4 DCT-like Transformation and Quantization,”
ISO/IEC JTC1/SC29/WG11 M10830, Redmond, USA, July 2004.
7. REFERENCES [20] I. Amer, W. Badawy, and G. Jullien, “A Hardware Block for the
MPEG-4 Part 10 4x4 Transformation and Quantization,” ISO/IEC
[1] A. M. Tekalp, Digital Video Processing, Prentice-Hall, Inc., New JTC1/SC29/WG11 M10829, Redmond, USA, July 2004.
Jersey, USA, 1995. [21] I. Amer, W. Badawy, and G. Jullien, “A SystemC model for 4x4
[2] “ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC,” Draft Text of Final Hadamard Transform and Quantization with application to MPEG-4 Part
Draft International Standard for Advanced Video Coding, [Online]. 10,” ISO/IEC JTC1/SC29/WG11 M10828, Redmond, USA, July 2004.
Available: [22] I. Amer, W. Badawy, and G. Jullien, “A Hardware Block for 4x4
http://www.chiariglione.org/mpeg/working_documents.htm, March Hadamard Transform and Quantization in MPEG-4 Part 10,” ISO/IEC
2003. JTC1/SC29/WG11 M10827, Redmond, USA, July 2004.
[3] I. E. G. Richardson, “H.264/MPEG-4 Part 10: Transform & [23] I. Amer, W. Badawy, and G. Jullien, “A SystemC model for 2x2
Quantization,” A white paper. [Online]. Available: Hadamard Transform and Quantization with Application to MPEG–4
http://www.vcodex.com, March 2003. Part 10,” ISO/IEC JTC1/SC29/WG11 M10826, Redmond, USA, July 04.
[4] I. E. G. Richardson, H.264 and MPEG-4 Video Compression: Video [24] I. Amer, W. Badawy, and G. Jullien, “A Hardware Block for 2x2
Coding for Next-generation Multimedia, John Wiley & Sons Ltd., Hadamard Transform and Quantization with Application to MPEG–4
Sussex, England, December 2003. Part 10,” ISO/IEC JTC1/SC29/WG11 M10825, Redmond, USA, July 04.
[5] G. Sullivan, P. Topiwala, and A. Luthra, “The H.264 Advanced [25] I. Amer, W. Badawy, and G. Jullien, “An IP Block for MPEG-4 Part
Video Coding Standard: Overview and Introduction to the Fidelity 10 Context-Based Adaptive Variable Length Coding (CAVLC),”
Range Extensions,” SPIE Conference on Application of Digital Image ISO/IEC JTC1/SC29/WG11 M10824, Redmond, USA, July 2004.
Processing XXVII, Colorado, USA, August 2004. [26] I. Amer, W. Badawy, and G. Jullien, “A Proposed Hardware
[6] T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, Reference Model for Spatial Transformation and Quantization in H.264,”
“Overview of the H.264/AVC Video Coding Standard,” IEEE accepted in Journal of Visual Communication and Image Representation
Special Issue on Emerging H.264/AVC Video Coding Standard.
II - 1140