| 研究生: |
林恆瑤 Lin, Heng-Yao |
|---|---|
| 論文名稱: |
H.264/AVC轉換域框內編解碼器之快速演算法及硬體架構設計 Fast Algorithms and VLSI Implementations of H.264/AVC Transform-Domain Intra Codec |
| 指導教授: |
劉濱達
Liu, Bin-Da 楊家輝 Yang, Jar-Ferr |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2009 |
| 畢業學年度: | 97 |
| 語文別: | 英文 |
| 論文頁數: | 95 |
| 中文關鍵詞: | H.264/AVC 、框內編碼 、轉換域 、快速演算法 、條件可調性非等長編碼 |
| 外文關鍵詞: | context-based adaptive variable length coding (C, H.264/AVC, intra prediction, Transform domain, fast algorithm |
| 相關次數: | 點閱:212 下載:4 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
H.264/AVC為最新的數位視訊壓縮標準,藉由引進不同的編碼特點以獲得更高的壓縮效率。本論文完成於H.264/AVC視訊框內編解碼器之巨大區塊管線化硬體架構,來增進執行效能。轉換域之框內預測與內涵可適非等長編碼(CAVLC)之快速硬體架構為主要研究重點。
針對演算法及硬體架構做最佳化設計,我們首先設計框內決策機制演算法。基於離散餘弦轉換的特性,將畫面內既有的資訊做轉換,進而從中偵測出目標方塊紋路的方向性來過濾不必要的預測方向。利用絕對和之整數離散餘弦轉換誤差法(SAITD)的特性,在整個框內編碼的效能可獲保持下,計算速度得到相當大的改進。與現今的相關研究比較,本演算法可減少約60%的編碼時間,而影像品質均有較優的表現。在硬體設計上,此演算法可完全融入框內編碼器中,並非只是單純的前置處理單元。至於最佳預測殘值,則可以直接由預測過程中得到,而不必重新計算。
接著,我們分析內涵可適非等長編碼(CAVLC)之關鍵路徑,透過提前執行條件偵測,來縮短關鍵路徑之延遲,並將內涵可適非等長編碼電路分割成兩部份,進行管線化架構設計,以提升整體執行效能。我們使用表分解(table partitioning)及字首提前解碼(prefix predecoding)二種低功率的設計方法,來減少解碼表中不必要的功率消耗。並使用結合解碼表及單一輸出緩衝器之方法,在不影響解碼效能下更進一步減少硬體面積及功率消耗。跟傳統H.264解碼效能相比之結果,本論文所提出的架構能減少約40%功率消耗,並提升三倍解碼速度。
最後,本H.264/AVC視訊框內編解碼器經TSMC 0.18 μm製程技術合成後,最高操作頻率達100 MHz,可即時處理每秒30張16 SIF (1408×960)解析度的影像。
H.264/AVC is the latest digital video coding standard that achieves very high data compression by using several new coding features. In this dissertation, a highly efficient VLSI architecture with macroblock pipeline structure is proposed to increase the coding speed for intra frame codec in H.264/AVC. The major researches include fast implementations of transform-based intra prediction and context adaptive variable length coder (CAVLC).
For optimized algorithm and architecture design in intra prediction, a fast mode decision algorithm is represented based on inherent features of discrete cosine transform (DCT). The primary texture direction of the target block can be determined by using the transformed coefficients of the block. Only a few candidate modes are chosen for the cost calculation, which is based on the error model in the sum of absolute integer-transformed differences (SAITD). Experimental results show that the proposed intra prediction algorithm has lower Peak Signal to Noise Ratio (PSNR) degradation and bit-rate increment with the reduction of the coding time by 60% compared to other recent designs. For hardware implementation, the proposed intra prediction algorithm is integrated into intra prediction procedure rather than being a preprocessing units. The residual blocks of the best prediction mode can be obtained directly after the cost generator without re-computing in the intra prediction procedure.
In the entropy codec module, the critical path in the CAVLC decoder is first analyzed and then reduced by forwarding the adaptive detection for succeeding symbols. With a shortened critical path, the CAVLC architecture is further divided into two segments, which can be easily implemented by a pipeline structure. Consequently, the overall performance is effectively improved. In the hardware implementation, we suggested two efficient approaches, table partitioning and prefix predecoding, to reduce the unnecessary power consumption in decoding the look up table (LUT) of VLC codes. Moreover, a combined LUT and single output buffer have been adopted to further reduce the area as well as power consumption without affecting the decoding performance. Experimental results show that the proposed architecture surpassing other recent designs can approximately reduce power consumption by 40% and achieve three times decoding speed in comparison to the original decoding procedure suggested in the H.264 standard.
Finally, the H.264/AVC transform-based intra codec with TSMC 0.18 μm CMOS technology is realized, where the maximum operation frequency can achieve at 100 MHz, which can easily support the real-time requirements for video resolutions up to the 16 SIF (1408×960) at 30 frames/sec video format.
[1] “Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec.H.264 jISO/IEC 14496-10 AVC),” in Joint Video Team, Mar. 2003, Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, JVT-G050.
[2] I. E. G. Richardson, H.264 and MPEG-4 Video Compression – Video Coding for Next-Generation Multimedia. Chichester, UK: John Wiley & Sons, 2003.
[3] B. Meng, O. C. Au, C. W. Wong, and H. K. Lam, “Efficient intra-prediction mode selection for 4x4 blocks in H.264,” in Proc. IEEE Int. Conf. on Multimedia and Expo, Jul. 2003, pp. 521-524.
[4] Y. W. Huang, B. Y. Hsieh, T. C. Chen, and L. G. Chen, “Analysis, fast algorithm, and VLSI architecture design for H.264/AVC intra-frame coder,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 3, pp. 378–401, Mar. 2005.
[5] C. C. Cheng and T. S. Chang, “Fast three step intra prediction algorithm for 4x4 blocks in H.264,” in Proc. IEEE Int. Symp. Circuits Syst., Kobe, Japan, May 2005, pp. 1509-1512.
[6] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast mode decision algorithm for intraprediction in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 7, pp. 813–822, Jul. 2005.
[7] M. Jafari and S. Kasaei, “Fast intra-prediction mode decision in H.264 advanced video coding,” in Proc. IEEE ICCS, Oct. 2006, pp. 1-6.
[8] C. L. Hsu, M. H. Ho, and J. J. Hong, “An efficient algorithm for intra-prediction in H.264,” in Proc. IEEE ICCE, Jan. 2006, pp. 35-36.
[9] J. W. Chen, C. H. Chang, C. C. Lin, Y. H. Ouyang, J. I. Guo, and J. S. Wang, “A condition-based intra prediction algorithm for H.264/AVC,” in Proc. IEEE Int. Conf. on Multimedia and Expo, Jul. 2006, pp. 1077 – 1080.
[10] J. C. Wang, J. F. Wang, J. F. Yang, and J. T. Chen, “A fast mode decision algorithm and its VLSI design for H.264/AVC intra-prediction,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 10, pp. 1414–1422, Oct. 2007.
[11] A. C. Tsai, A. Paul, J. C. Wang, and J. F. Wang, “Intensity gradient technique for efficient intra-prediction in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, pp. 694-698, May 2008.
[12] K. Bharanitharan, B. D. Liu, J. F. Yang and W. C. Tsai, “A low complexity detection of discrete cross differences for fast H.264/AVC intra prediction,” IEEE Transactions on Multimedia, vol. 10, no. 7, pp. 1250–1260, Nov. 2008.
[13] Y. C. Wei and J. F. Yang, “Transformed-domain intra mode decision in H.264/AVC encoder,” in Proc. TENCON, Nov. 2006, pp. 1-4.
[14] T. Tsukuba, I. Nagayoshi, T. Hanamura, and H. Tominaga, “H.264 fast intra-prediction mode decision based on frequency characteristic,” in Proc. European Signal Processing Conf., Sep. 2005, pp. 1549-1552.
[15] T. Hattori and K. Ichige, “Intra-prediction mode decision in H.264/AVC using DCT coefficients,” in Proc. Int. Symp. Intelligent Signal Processing & Comm. Syst., Dec. 2006, pp. 135-138.
[16] G. Hwang, J. Park, B. Jung, K. Choi, Y. Joo, Y. Oh, and B. Jeon, “Efficient fast intra mode decision using transform coefficients,” in Proc. Int. Conf. Advanced Comm. Technol., Feb. 2007, pp. 399-402.
[17] Z. Wang, J. Yang, Q. Peng, Z. Ma, and C. Zhu, “A fast transform domain based algorithm for H.264/AVC intra prediction,” in Proc. IEEE Int. Conf. Multimedia Expo, Jul. 2007, pp. 1563-1566.
[18] H. Li, K. N. Ngan, and Z. Wei, “Fast and efficient method for block edge classification and its application in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 6, pp. 756–768, Jun. 2008.
[19] Y. N. Sairam, N. Ma, and N. Sinha, “A novel partial prediction algorithm for fast 44 intra prediction mode decision in H.264/AVC,” in Proc. IEEE DCC, Mar. 2008, pp. 232-241.
[20] C. W. Tien, H. Y. Lin, B. D. Liu, and J. F. Yang, “Transform-domain partial prediction algorithm for intra prediction in H.264/AVC,” in Proc. IEEE Int. Symp. Circuits Syst., Taipei, Taiwan, May 2009.
[21] K. Suh, S. Park, and H. Cho, “An efficient hardware architecture of intra prediction and TQ/IQIT module for H.264 encoder,” ETRI J., vol. 27, no. 5, pp. 511-524, Oct. 2005.
[22] C. W. Ku, C. C. Cheng, G. S. Yu, M. C. Tsai, and T. S. Chang, “A high-definition H.264/AVC intra-frame codec IP for digital video and still camera applications,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 8, pp. 917–928, Aug. 2006.
[23] D. W. Li, C. W. Ku, C. C. Cheng, Y. K. Lin, and T. S. Chan, “A 61MHz 72K gates 1280×720 30FPS H.264 intra encoder,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 2, pp. 801-804.
[24] Y. K. Lin, Ch. W. Ku, D. W. Li, and T. S. Chang, “A 140-MHz 94 K gates HD1080p 30-frames/s intra-only profile H.264 encoder,” IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 3, pp. 432–436, Mar. 2009.
[25] B. Jeon, J. Park, and J. Jeong, “Huffman coding of DCT coefficients using dynamic codeword assignment and adaptive codebook selection,” Signal Process.-Image Commun., vol. 12, pp. 253-262, June 1998.
[26] G. Lakhani, “Optimal Huffman coding of DCT blocks,” IEEE Trans. Circuits Syst. Video Technol., vol. 14, pp. 522-527, Apr. 2004.
[27] S. M. Lei and M. T. Sun, “An entropy coding system for digital HDTV applications,” IEEE Trans. Circuits Syst. Video Technol., vol. 1, pp. 147-155, Mar. 1991.
[28] D. S. Ma, J. F. Yang, and J. Y. Lee, “Programmable and parallel variable-length decoder for video systems,” IEEE Trans. Consum. Electron., vol. 39, pp. 448-454, Aug. 1993.
[29] B. J. Shieh, Y. S. Lee, and C. Y. Lee, “A new approach of group-based VLC Codec system with full table programmability,” IEEE Trans. Circuits Syst. Video Technol., vol. 11, pp. 210-221, Feb. 2001.
[30] R. Hashemian, “Design and hardware implementation of a memory efficient Huffman decoding,” IEEE Trans. Consum. Electron., vol. 40, pp. 345-352, Aug. 1994.
[31] J. Nikara, S. Vassiliadis, J. Takala, P. Liuha, “Multiple-symbol parallel decoding for variable length codes,” IEEE Trans. Very Large Scale Integr. Syst., vol. 12, pp. 676-685, July 2004.
[32] H. Y. Kang, K. A. Jeong, J. Y. Bae, Y. S. Lee, and S. H. Lee, “MPEG4 AVC/H.264 decoder with scalable bus architecture and dual memory controller,” in Proc. IEEE Int. Symp. Circuits Syst., May 2004, pp. 23-26.
[33] S. H. Wang, W. H. Peng, Y. He, G. Y. Lin, C. Y. Lin, S. C. Chang, C. N. Wang, and T. Chiang, “A platform-based MPEG-4 advanced video coding (AVC) decoder with block level pipelining,” in Proc. IEEE Int. Conf. Inform. Commun. Security, Dec. 2003, pp. 51-55.
[34] D. Wu, W. Gao, M. Hu, and Z. Ji, “A VLSI architecture design of CAVLC decoder,” in Proc. IEEE Int. Conf. ASIC, Oct. 2003, vol. 2, pp. 692-695.
[35] H. C. Chang, C. C. Lin, and J. I. Guo, “A novel low-cost high-performance VLSI architecture for MPEG-4 AVC/H.264 CAVLC decoding,” in Proc. IEEE Int. Symp. Circuits Syst., May 2005, pp. 6110-6113.
[36] Y. M. Lin and P. Y. Chen, “An efficient implementation of CAVLC for H.264/AVC,” in Proc. Int. Conf, Innovative Comput. Inform. Contr., Aug. 2006, pp. 601-604.
[37] M. Alle, J. Biswas, and S. K. Nandy, “High performance VLSI architecture design for H.264 CAVLC decoder,” in Proc. Int. Conf. Appl. Syst. Architect. Process., Sept. 2006, pp. 317-322.
[38] Y. H. Moon, G. Y. Kim, and J. H. Kim, “An efficient decoding of CAVLC in H.264/AVC video coding standard,” IEEE Trans. Consum. Electron., vol. 51, pp. 933-938, Aug. 2005.
[39] Y.-H. Kim, Y.-J. Yoo, J. Shin, B. Choi, and J. Paik, “Memory-efficient H.264/AVC CAVLC for fast decoding,” IEEE Trans. Consum. Electron., vol. 52, pp. 943-952, Aug. 2006.
[40] S.-Y. Tseng and T.-W. Hsieh, “A pattern-search method for H.264/AVC CAVLC decoding,” in Proc. IEEE Int. Conf. Multimedia Expo, July 2006, pp. 1073-1076.
[41] G.-S. Yu and T.-S. Chang, “A zero-skipping multi-symbol CAVLC decoder for MPEG-4 AVC/H.264,” in Proc. Int. Symp. Circuits Syst., May 2006, pp. 5583-5586.
[42] Y.-N. Wen, G.-L. Wu, S.-J. Chen, and Y.-H. Hu, “Multiple-symbol parallel CAVLC decoder for H.264/AVC,” in Proc. IEEE Asia Pacific Conf. Circuits Syst., Dec. 2006, pp.1240-1243.
[43] Information Technology-Generic Coding of Audio-Visual Objects Part 2: Visual, ISO/IEC 14496-2 (MPEG-4 Video), ISO/IEC, 1999.
[44] Video Coding for Low Bitrate Communication, ITU-T Recommendation H.263, 1995, Version 1, Version 2, Sep. 1997.
[45] K. R. Rao and P. Yip, Discrete Cosine Transform: Algorithms, Advantages, Applications. London, UK: Academic Press, 1990.
[46] C. H. Tseng, H. M. Wang, and J. F. Yang, “Improved and fast algorithms for intra 4x4 mode decision in H.264/AVC,” in Proc. IEEE Int. Symp. Circuits Syst., Kobe, Japan, May 2005, pp. 2128-2131.
[47] C. H. Tseng, H. M. Wang, and J. F. Yang, “Enhanced intra-4×4 mode decision for H.264/AVC coders,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 8, pp. 1027–1032, Aug. 2006.
[48] G. Bjontegaard, “Calculation of average PSNR differences between RD curves,” presented at the 13th VCEG-M33 Meeting, Austin, TX, Apr. 2001.
[49] Z. Y. Cheng, C. H. Chen, B. D. Liu, and J. F. Yang, “High throughput 2-D transform architectures for H.264 advanced video coders,” in Proc. IEEE APCCAS, Tainan, Taiwan, Dec. 2004, pp. 1141-1144.
[50] T. C. Wang, Y. W. Huang, H. C. Fang, and L. G. Chen, “Parallel 4x4 2D transform and inverse transform architecture for MPEG-4 AVC/H.264,” in Proc. IEEE Int. Symp. Circuits Syst., May. 2003, pp. 800-803.
[51] E. Sahin and I. Hamzaoglu, “An efficient hardware architecture for H.264 intra prediction algorithm,” in Proc. Design, Automat. and Test in Europe Conf. and Exhib., DATE 2007, Apr. 2007, pp. 183-188.
[52] G. Jin, J. S. Jung, and H. J. Lee, “An efficient pipelined architecture for H.264/AVC intra frame processing,” in Proc. IEEE ISCAS, May 2007, pp. 1605-1608.
[53] S. B. Wang, X. L. Zhanq, Y. Yao, and Z. Wang, “H.264 intra prediction architecture optimization,” in Proc. IEEE ICME, July 2007, pp. 1571-1574.
[54] S. Molloy and R. Jain, “Low power VLSI architectures for variable-length encoding and decoding,” in Proc. 40th Midwest Symp. Circuits and Syst., Aug. 1997, vol. 2, pp. 997-1000.
[55] S. H. Cho, T. Xanthopoulos, and A. P. Chandrakasan, “A low power variable length decoder for MPEG-2 based on nonuniform fine-grain table partitioning,” IEEE Trans. VLSI Syst., vol. 7, pp. 249-257, June 1999.