簡易檢索 / 詳目顯示

研究生: 林恆瑤
Lin, Heng-Yao
論文名稱: H.264/AVC轉換域框內編解碼器之快速演算法及硬體架構設計
Fast Algorithms and VLSI Implementations of H.264/AVC Transform-Domain Intra Codec
指導教授: 劉濱達
Liu, Bin-Da
楊家輝
Yang, Jar-Ferr
學位類別: 博士
Doctor
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2009
畢業學年度: 97
語文別: 英文
論文頁數: 95
中文關鍵詞: H.264/AVC框內編碼轉換域快速演算法條件可調性非等長編碼
外文關鍵詞: context-based adaptive variable length coding (C, H.264/AVC, intra prediction, Transform domain, fast algorithm
相關次數: 點閱:212下載:4
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • H.264/AVC為最新的數位視訊壓縮標準,藉由引進不同的編碼特點以獲得更高的壓縮效率。本論文完成於H.264/AVC視訊框內編解碼器之巨大區塊管線化硬體架構,來增進執行效能。轉換域之框內預測與內涵可適非等長編碼(CAVLC)之快速硬體架構為主要研究重點。
    針對演算法及硬體架構做最佳化設計,我們首先設計框內決策機制演算法。基於離散餘弦轉換的特性,將畫面內既有的資訊做轉換,進而從中偵測出目標方塊紋路的方向性來過濾不必要的預測方向。利用絕對和之整數離散餘弦轉換誤差法(SAITD)的特性,在整個框內編碼的效能可獲保持下,計算速度得到相當大的改進。與現今的相關研究比較,本演算法可減少約60%的編碼時間,而影像品質均有較優的表現。在硬體設計上,此演算法可完全融入框內編碼器中,並非只是單純的前置處理單元。至於最佳預測殘值,則可以直接由預測過程中得到,而不必重新計算。
    接著,我們分析內涵可適非等長編碼(CAVLC)之關鍵路徑,透過提前執行條件偵測,來縮短關鍵路徑之延遲,並將內涵可適非等長編碼電路分割成兩部份,進行管線化架構設計,以提升整體執行效能。我們使用表分解(table partitioning)及字首提前解碼(prefix predecoding)二種低功率的設計方法,來減少解碼表中不必要的功率消耗。並使用結合解碼表及單一輸出緩衝器之方法,在不影響解碼效能下更進一步減少硬體面積及功率消耗。跟傳統H.264解碼效能相比之結果,本論文所提出的架構能減少約40%功率消耗,並提升三倍解碼速度。
    最後,本H.264/AVC視訊框內編解碼器經TSMC 0.18 μm製程技術合成後,最高操作頻率達100 MHz,可即時處理每秒30張16 SIF (1408×960)解析度的影像。

    H.264/AVC is the latest digital video coding standard that achieves very high data compression by using several new coding features. In this dissertation, a highly efficient VLSI architecture with macroblock pipeline structure is proposed to increase the coding speed for intra frame codec in H.264/AVC. The major researches include fast implementations of transform-based intra prediction and context adaptive variable length coder (CAVLC).
    For optimized algorithm and architecture design in intra prediction, a fast mode decision algorithm is represented based on inherent features of discrete cosine transform (DCT). The primary texture direction of the target block can be determined by using the transformed coefficients of the block. Only a few candidate modes are chosen for the cost calculation, which is based on the error model in the sum of absolute integer-transformed differences (SAITD). Experimental results show that the proposed intra prediction algorithm has lower Peak Signal to Noise Ratio (PSNR) degradation and bit-rate increment with the reduction of the coding time by 60% compared to other recent designs. For hardware implementation, the proposed intra prediction algorithm is integrated into intra prediction procedure rather than being a preprocessing units. The residual blocks of the best prediction mode can be obtained directly after the cost generator without re-computing in the intra prediction procedure.
    In the entropy codec module, the critical path in the CAVLC decoder is first analyzed and then reduced by forwarding the adaptive detection for succeeding symbols. With a shortened critical path, the CAVLC architecture is further divided into two segments, which can be easily implemented by a pipeline structure. Consequently, the overall performance is effectively improved. In the hardware implementation, we suggested two efficient approaches, table partitioning and prefix predecoding, to reduce the unnecessary power consumption in decoding the look up table (LUT) of VLC codes. Moreover, a combined LUT and single output buffer have been adopted to further reduce the area as well as power consumption without affecting the decoding performance. Experimental results show that the proposed architecture surpassing other recent designs can approximately reduce power consumption by 40% and achieve three times decoding speed in comparison to the original decoding procedure suggested in the H.264 standard.
    Finally, the H.264/AVC transform-based intra codec with TSMC 0.18 μm CMOS technology is realized, where the maximum operation frequency can achieve at 100 MHz, which can easily support the real-time requirements for video resolutions up to the 16 SIF (1408×960) at 30 frames/sec video format.

    Abstract i List of Tables ix List of Figures x CHAPTER 1 Introduction 1 1.1 Background and Motivation 1 1.2 Organization of the Dissertation 5 CHAPTER 2 Basic Concepts of the H.264/AVC Intra Codec 6 2.1 Overview of H.264/AVC Standard 6 2.2 Intra Prediction Modes 8 2.3 Mode Decision 9 2.4 Transform and Quantization 10 2.5 Entropy Coding 13 CHAPTER 3 Transform-based Intra Mode Decision Algorithm 16 3.1 Characteristic of Transform Function 16 3.2 Luma 4×4 Intra Prediction 18 3.2.1 Detection of DC mode 20 3.2.2 Detection of AC modes 21 3.2.3 Most probable mode 23 3.2.4 Summary 24 3.3 Luma 16×16 Intra Prediction 25 3.4 Chroma 8×8 Intra Prediction 27 3.5 SAITD Technique 29 3.6 Simulation Results 30 3.6.1 Threshold value determination 31 3.6.2 RDO enabled 33 3.6.3 RDO disabled 39 CHAPTER 4 Hardware-Oriented CAVLC Decoder Algorithm Optimization 42 4.1 Real-Time Requirement 43 4.2 Original Level Decoding Process 45 4.3 Critical Path Improvement in Decoding Level 48 4.3.1 Normal mode with suffixLength > 0 49 4.3.2 Normal mode with suffixLength = 0 51 4.3.3 Escape_code mode 52 4.3.4 TrailingOnes mode 52 4.3.5 Summary 53 4.4 Throughput Improvement in Decoding Sign of T1s 54 4.5 Simulation Results 55 CHAPTER 5 VLSI Architecture Design of Transform-based Intra Codec 57 5.1 Transform-based Intra Prediction Architecture 58 5.1.1 Integer transform 59 5.1.2 Prediction generator 63 5.1.3 Cost generation and mode decision 67 5.1.4 Reconstruction loop 67 5.1.5 Register array 69 5.1.6 System controller 69 5.1.7 Timing schedule 73 5.2 CAVLC Encoder 74 5.3 CAVLC Decoder 75 5.3.1 Low power LUTs decoders 75 5.3.2 Combined LUTs decoder 78 5.3.3 Output buffer 80 5.3.4 Power consumption analysis 82 5.4 Synthesis Results and Comparisons 83 CHAPTER 6 Conclusions and Future Work 85 6.1 Conclusions 85 6.2 Future Work 87 References 89 Publication List 94

    [1] “Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec.H.264 jISO/IEC 14496-10 AVC),” in Joint Video Team, Mar. 2003, Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, JVT-G050.
    [2] I. E. G. Richardson, H.264 and MPEG-4 Video Compression – Video Coding for Next-Generation Multimedia. Chichester, UK: John Wiley & Sons, 2003.
    [3] B. Meng, O. C. Au, C. W. Wong, and H. K. Lam, “Efficient intra-prediction mode selection for 4x4 blocks in H.264,” in Proc. IEEE Int. Conf. on Multimedia and Expo, Jul. 2003, pp. 521-524.
    [4] Y. W. Huang, B. Y. Hsieh, T. C. Chen, and L. G. Chen, “Analysis, fast algorithm, and VLSI architecture design for H.264/AVC intra-frame coder,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 3, pp. 378–401, Mar. 2005.
    [5] C. C. Cheng and T. S. Chang, “Fast three step intra prediction algorithm for 4x4 blocks in H.264,” in Proc. IEEE Int. Symp. Circuits Syst., Kobe, Japan, May 2005, pp. 1509-1512.
    [6] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast mode decision algorithm for intraprediction in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 7, pp. 813–822, Jul. 2005.
    [7] M. Jafari and S. Kasaei, “Fast intra-prediction mode decision in H.264 advanced video coding,” in Proc. IEEE ICCS, Oct. 2006, pp. 1-6.
    [8] C. L. Hsu, M. H. Ho, and J. J. Hong, “An efficient algorithm for intra-prediction in H.264,” in Proc. IEEE ICCE, Jan. 2006, pp. 35-36.
    [9] J. W. Chen, C. H. Chang, C. C. Lin, Y. H. Ouyang, J. I. Guo, and J. S. Wang, “A condition-based intra prediction algorithm for H.264/AVC,” in Proc. IEEE Int. Conf. on Multimedia and Expo, Jul. 2006, pp. 1077 – 1080.
    [10] J. C. Wang, J. F. Wang, J. F. Yang, and J. T. Chen, “A fast mode decision algorithm and its VLSI design for H.264/AVC intra-prediction,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 10, pp. 1414–1422, Oct. 2007.
    [11] A. C. Tsai, A. Paul, J. C. Wang, and J. F. Wang, “Intensity gradient technique for efficient intra-prediction in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, pp. 694-698, May 2008.
    [12] K. Bharanitharan, B. D. Liu, J. F. Yang and W. C. Tsai, “A low complexity detection of discrete cross differences for fast H.264/AVC intra prediction,” IEEE Transactions on Multimedia, vol. 10, no. 7, pp. 1250–1260, Nov. 2008.
    [13] Y. C. Wei and J. F. Yang, “Transformed-domain intra mode decision in H.264/AVC encoder,” in Proc. TENCON, Nov. 2006, pp. 1-4.
    [14] T. Tsukuba, I. Nagayoshi, T. Hanamura, and H. Tominaga, “H.264 fast intra-prediction mode decision based on frequency characteristic,” in Proc. European Signal Processing Conf., Sep. 2005, pp. 1549-1552.
    [15] T. Hattori and K. Ichige, “Intra-prediction mode decision in H.264/AVC using DCT coefficients,” in Proc. Int. Symp. Intelligent Signal Processing & Comm. Syst., Dec. 2006, pp. 135-138.
    [16] G. Hwang, J. Park, B. Jung, K. Choi, Y. Joo, Y. Oh, and B. Jeon, “Efficient fast intra mode decision using transform coefficients,” in Proc. Int. Conf. Advanced Comm. Technol., Feb. 2007, pp. 399-402.
    [17] Z. Wang, J. Yang, Q. Peng, Z. Ma, and C. Zhu, “A fast transform domain based algorithm for H.264/AVC intra prediction,” in Proc. IEEE Int. Conf. Multimedia Expo, Jul. 2007, pp. 1563-1566.
    [18] H. Li, K. N. Ngan, and Z. Wei, “Fast and efficient method for block edge classification and its application in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 6, pp. 756–768, Jun. 2008.
    [19] Y. N. Sairam, N. Ma, and N. Sinha, “A novel partial prediction algorithm for fast 44 intra prediction mode decision in H.264/AVC,” in Proc. IEEE DCC, Mar. 2008, pp. 232-241.
    [20] C. W. Tien, H. Y. Lin, B. D. Liu, and J. F. Yang, “Transform-domain partial prediction algorithm for intra prediction in H.264/AVC,” in Proc. IEEE Int. Symp. Circuits Syst., Taipei, Taiwan, May 2009.
    [21] K. Suh, S. Park, and H. Cho, “An efficient hardware architecture of intra prediction and TQ/IQIT module for H.264 encoder,” ETRI J., vol. 27, no. 5, pp. 511-524, Oct. 2005.
    [22] C. W. Ku, C. C. Cheng, G. S. Yu, M. C. Tsai, and T. S. Chang, “A high-definition H.264/AVC intra-frame codec IP for digital video and still camera applications,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 8, pp. 917–928, Aug. 2006.
    [23] D. W. Li, C. W. Ku, C. C. Cheng, Y. K. Lin, and T. S. Chan, “A 61MHz 72K gates 1280×720 30FPS H.264 intra encoder,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 2, pp. 801-804.
    [24] Y. K. Lin, Ch. W. Ku, D. W. Li, and T. S. Chang, “A 140-MHz 94 K gates HD1080p 30-frames/s intra-only profile H.264 encoder,” IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 3, pp. 432–436, Mar. 2009.
    [25] B. Jeon, J. Park, and J. Jeong, “Huffman coding of DCT coefficients using dynamic codeword assignment and adaptive codebook selection,” Signal Process.-Image Commun., vol. 12, pp. 253-262, June 1998.
    [26] G. Lakhani, “Optimal Huffman coding of DCT blocks,” IEEE Trans. Circuits Syst. Video Technol., vol. 14, pp. 522-527, Apr. 2004.
    [27] S. M. Lei and M. T. Sun, “An entropy coding system for digital HDTV applications,” IEEE Trans. Circuits Syst. Video Technol., vol. 1, pp. 147-155, Mar. 1991.
    [28] D. S. Ma, J. F. Yang, and J. Y. Lee, “Programmable and parallel variable-length decoder for video systems,” IEEE Trans. Consum. Electron., vol. 39, pp. 448-454, Aug. 1993.
    [29] B. J. Shieh, Y. S. Lee, and C. Y. Lee, “A new approach of group-based VLC Codec system with full table programmability,” IEEE Trans. Circuits Syst. Video Technol., vol. 11, pp. 210-221, Feb. 2001.
    [30] R. Hashemian, “Design and hardware implementation of a memory efficient Huffman decoding,” IEEE Trans. Consum. Electron., vol. 40, pp. 345-352, Aug. 1994.
    [31] J. Nikara, S. Vassiliadis, J. Takala, P. Liuha, “Multiple-symbol parallel decoding for variable length codes,” IEEE Trans. Very Large Scale Integr. Syst., vol. 12, pp. 676-685, July 2004.
    [32] H. Y. Kang, K. A. Jeong, J. Y. Bae, Y. S. Lee, and S. H. Lee, “MPEG4 AVC/H.264 decoder with scalable bus architecture and dual memory controller,” in Proc. IEEE Int. Symp. Circuits Syst., May 2004, pp. 23-26.
    [33] S. H. Wang, W. H. Peng, Y. He, G. Y. Lin, C. Y. Lin, S. C. Chang, C. N. Wang, and T. Chiang, “A platform-based MPEG-4 advanced video coding (AVC) decoder with block level pipelining,” in Proc. IEEE Int. Conf. Inform. Commun. Security, Dec. 2003, pp. 51-55.
    [34] D. Wu, W. Gao, M. Hu, and Z. Ji, “A VLSI architecture design of CAVLC decoder,” in Proc. IEEE Int. Conf. ASIC, Oct. 2003, vol. 2, pp. 692-695.
    [35] H. C. Chang, C. C. Lin, and J. I. Guo, “A novel low-cost high-performance VLSI architecture for MPEG-4 AVC/H.264 CAVLC decoding,” in Proc. IEEE Int. Symp. Circuits Syst., May 2005, pp. 6110-6113.
    [36] Y. M. Lin and P. Y. Chen, “An efficient implementation of CAVLC for H.264/AVC,” in Proc. Int. Conf, Innovative Comput. Inform. Contr., Aug. 2006, pp. 601-604.
    [37] M. Alle, J. Biswas, and S. K. Nandy, “High performance VLSI architecture design for H.264 CAVLC decoder,” in Proc. Int. Conf. Appl. Syst. Architect. Process., Sept. 2006, pp. 317-322.
    [38] Y. H. Moon, G. Y. Kim, and J. H. Kim, “An efficient decoding of CAVLC in H.264/AVC video coding standard,” IEEE Trans. Consum. Electron., vol. 51, pp. 933-938, Aug. 2005.
    [39] Y.-H. Kim, Y.-J. Yoo, J. Shin, B. Choi, and J. Paik, “Memory-efficient H.264/AVC CAVLC for fast decoding,” IEEE Trans. Consum. Electron., vol. 52, pp. 943-952, Aug. 2006.
    [40] S.-Y. Tseng and T.-W. Hsieh, “A pattern-search method for H.264/AVC CAVLC decoding,” in Proc. IEEE Int. Conf. Multimedia Expo, July 2006, pp. 1073-1076.
    [41] G.-S. Yu and T.-S. Chang, “A zero-skipping multi-symbol CAVLC decoder for MPEG-4 AVC/H.264,” in Proc. Int. Symp. Circuits Syst., May 2006, pp. 5583-5586.
    [42] Y.-N. Wen, G.-L. Wu, S.-J. Chen, and Y.-H. Hu, “Multiple-symbol parallel CAVLC decoder for H.264/AVC,” in Proc. IEEE Asia Pacific Conf. Circuits Syst., Dec. 2006, pp.1240-1243.
    [43] Information Technology-Generic Coding of Audio-Visual Objects Part 2: Visual, ISO/IEC 14496-2 (MPEG-4 Video), ISO/IEC, 1999.
    [44] Video Coding for Low Bitrate Communication, ITU-T Recommendation H.263, 1995, Version 1, Version 2, Sep. 1997.
    [45] K. R. Rao and P. Yip, Discrete Cosine Transform: Algorithms, Advantages, Applications. London, UK: Academic Press, 1990.
    [46] C. H. Tseng, H. M. Wang, and J. F. Yang, “Improved and fast algorithms for intra 4x4 mode decision in H.264/AVC,” in Proc. IEEE Int. Symp. Circuits Syst., Kobe, Japan, May 2005, pp. 2128-2131.
    [47] C. H. Tseng, H. M. Wang, and J. F. Yang, “Enhanced intra-4×4 mode decision for H.264/AVC coders,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 8, pp. 1027–1032, Aug. 2006.
    [48] G. Bjontegaard, “Calculation of average PSNR differences between RD curves,” presented at the 13th VCEG-M33 Meeting, Austin, TX, Apr. 2001.
    [49] Z. Y. Cheng, C. H. Chen, B. D. Liu, and J. F. Yang, “High throughput 2-D transform architectures for H.264 advanced video coders,” in Proc. IEEE APCCAS, Tainan, Taiwan, Dec. 2004, pp. 1141-1144.
    [50] T. C. Wang, Y. W. Huang, H. C. Fang, and L. G. Chen, “Parallel 4x4 2D transform and inverse transform architecture for MPEG-4 AVC/H.264,” in Proc. IEEE Int. Symp. Circuits Syst., May. 2003, pp. 800-803.
    [51] E. Sahin and I. Hamzaoglu, “An efficient hardware architecture for H.264 intra prediction algorithm,” in Proc. Design, Automat. and Test in Europe Conf. and Exhib., DATE 2007, Apr. 2007, pp. 183-188.
    [52] G. Jin, J. S. Jung, and H. J. Lee, “An efficient pipelined architecture for H.264/AVC intra frame processing,” in Proc. IEEE ISCAS, May 2007, pp. 1605-1608.
    [53] S. B. Wang, X. L. Zhanq, Y. Yao, and Z. Wang, “H.264 intra prediction architecture optimization,” in Proc. IEEE ICME, July 2007, pp. 1571-1574.
    [54] S. Molloy and R. Jain, “Low power VLSI architectures for variable-length encoding and decoding,” in Proc. 40th Midwest Symp. Circuits and Syst., Aug. 1997, vol. 2, pp. 997-1000.
    [55] S. H. Cho, T. Xanthopoulos, and A. P. Chandrakasan, “A low power variable length decoder for MPEG-2 based on nonuniform fine-grain table partitioning,” IEEE Trans. VLSI Syst., vol. 7, pp. 249-257, June 1999.

    下載圖示 校內:2012-07-23公開
    校外:2012-07-23公開
    QR CODE