| 研究生: |
趙億智 Chao, Yi-Chih |
|---|---|
| 論文名稱: |
多標準反轉換與H.264/AVC視訊解碼核心架構設計 Architecture Designs for H.264/AVC Decoder Kernel with Multi-Standard Inverse Transforms |
| 指導教授: |
劉濱達
Liu, Bin-Da |
| 共同指導教授: |
楊家輝
Yang, Jar-Ferr |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2011 |
| 畢業學年度: | 99 |
| 語文別: | 英文 |
| 論文頁數: | 109 |
| 中文關鍵詞: | H.264/AVC 、VC-1 、AVS 、VLSI 、CAVLC 、量化 、離散餘弦轉換 (DCT) |
| 外文關鍵詞: | H.264/AVC, VC-1, AVS, VLSI, CAVLC, Quantization, Discrete Cosine Transform (DCT) |
| 相關次數: | 點閱:104 下載:1 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
ITU-T VCEG和ISO/IEC MPEG所提出的H.264/AVC、微軟的VC-1和中國的AVS為目前最熱門視訊壓縮標準。為了解碼H.264/AVC 標準中的殘值,整個架構需要CAVLC解碼器、反量化器、反轉換器。在硬體設計上,因為這些元件的解碼速度不同,在元件之間的介面必須很小心的設計來緩衝它們之間的資料傳輸。為了有效的實現 CAVLC 解碼器和反量化器之間的介面,本論文分析了在H.264/AVC 標準中的解碼流程。在適當的安排CAVLC 解碼器和反量化器的解碼流程後,所提出的架構只需要一個反量化器,而且更進一步將它整合到CAVLC 解碼器中來減少緩衝器的面積。另一方面,此研究也針對H.264/AVC視訊標準提出了低成本的反轉換架構。
目前有越來越多的消費性產品能支援多種視訊壓縮格式,如何去設計出能支援多種標準的硬體是很重要的。為了實現低成本的硬體設計,本論文也提出了一個應用於多個標準的整合性反轉換架構,此架構能有效地被使用於H.264/AVC,VC-1以及AVS解碼器中。利用轉換矩陣本身的對稱性質,就可以有效率地簡化原本繁複的矩陣乘法運算,藉由此結果,在硬體設計上並不需要使用到乘法器即可完成所需的反轉換運算。在此架構中,只需要使用移位器以及加/減法器,即可快速地完成H.264/AVC,VC-1以及AVS所定義的一維反轉換運算。
在模擬之後顯示,所提出的H.264/AVC解碼核心之架構所需的面積是14.1 k的邏輯閘,而最大的操作頻率可以到達130 MHz。這樣的處理速度可以達到即時處理4:2:0 的格式、每秒30 張、畫面解析度為4VGA 大小的影像。另一方面,所提出的多標準反轉換架構所需的邏輯閘數目為8,983,此邏輯閘數量遠少於沒有使用硬體共享的架構。當與個別標準的硬體設計相比時,此架構能減少約一半的邏輯閘數量。
The video coding systems such as ITU-T VCEG and MPEG H.264/AVC, Microsoft VC-1, and Chinese AVS are currently popular standards for multimedia applications. In H.264/AVC, the required components for decoding the residual data consist of CAVLC decoder, inverse quantizer, and inverse transform. Since the decoding speed of these components is varied, the interface should be designed carefully to buffer the data among them. To efficiently realize the interface of the CAVLC decoder and inverse quantizer, the residual decoding procedure in the H.264/AVC is analyzed. After proper arrangement of the CAVLC decoding and inverse quantization procedures, the proposed architecture requires only one inverse quantizer which is further combined into the CAVLC decoder to reduce the buffer size. Moreover, low-cost architectures for computing the multiple H.264/AVC inverse transforms in all profiles are also proposed.
Hardware designs that can support multiple standards are required for versatile media players. This dissertation also proposes a unified inverse transform architecture that can be efficiently used in MPEG and ITU-T H.264/AVC, Microsoft VC-1, and Chinese AVS decoders. For the H.264/AVC 8-point and 4-point inverse transforms, the computational complexity in the proposed architecture is similar to that defined in the H.264/AVC standard. By using the symmetry of the transform matrices, the matrix product operations of the inverse transforms in VC-1 and AVS are efficiently decomposed to only use shifters, adders, and subtractors. All the computations are verified and designed using a hardware unit to achieve a low-cost hardware kernel. The proposed multiple-transform architecture contains fast 1-D transforms and rounding operations for the computation of H.264/AVC, VC-1, and AVS 8-point and 4-point inverse transforms.
Simulation results show that the total implemented gate counts is 14.1 k and the maximum operation frequency is 130 MHz in the proposed H.264/AVC decoding kernel design. It can support the real-time requirement for the 4VGA @30 fps video resolution in 4:2:0 formats. The total number of gates for the proposed multi-standard inverse transform architecture is 8,983, which is much less than that required for architectures without hardware sharing. Compared to individual designs, the proposed architecture reduces the number of logic gates by a factor of two with a penalty of 20% in data throughput.
[1] Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s – Part2: Video, ISO/IEC 11172, 1993.
[2] Information Technology – Generic Coding of Moving Pictures and Associated Audio Information: Video, ISO/IEC 13818-2 and ITU-T Rec. H.262, 1996.
[3] Information Technology – Coding of Audio-Visual Objects – Part2: Visual, ISO/IEC 14496-2, 1999.
[4] Video Codec for Audiovisual Services at px64 kbits/s, ITU-T Rec. H.261 v1, 1990.
[5] Video Coding for Low Bit Rate Communication, ITU-T Rec. H.263, 1998.
[6] ITU-T Recommendation H.264 & ISO/IEC 14496-10 AVC, Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification, May 2003.
[7] T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. K. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 560-576, July 2003.
[8] ITU-T Recommendation H.264 & ISO/IEC 14496-10 AVC, Advanced Video Coding for Generic Audiovisual Services, version 3, 2005.
[9] I. E. G. Richardson, H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia, Chichester, UK: John Wiley & Sons, 2003.
[10] Standard for Television: VC-1 Compressed Video Bitstream Format and Decoding Process, SMPTE 421M-2006.
[11] H. Kalva and J. B. Lee, “The VC-1 video coding standard,” IEEE Trans. Multimedia, vol. 14, pp. 88-91, Oct.-Dec. 2007.
[12] W. Gao, C. Reader, F.Wu, Y. He, L. Yu, H. Lu, S. Yang, T. Huang, and X. Pan, “AVS—the Chinese next-generation video coding standard,” in Proc. Nat. Assoc. of Broadcasters (NAB) Conf., Apr. 2004.
[13] B. Jeon, J. Park, and J. Jeong, “Huffman coding of DCT coefficients using dynamic codeword assignment and adaptive codebook selection,” Signal Process. Image Commun., vol. 12, pp. 253-262, June 1998.
[14] G. Lakhani, “Optimal Huffman coding of DCT blocks, ” IEEE Trans. Circuits Syst. Video Technol., vol. 14, pp. 522-527, Apr. 2004.
[15] D. Wu, W. Gao, M. Z. Hu, and Z. Z. Ji, “A VLSI architecture design of CAVLC decoder,” in Proc. IEEE Int. Conf. ASIC, Oct. 2003, pp. 692-695.
[16] H. C. Chang, C. C. Lin, and J. I. Guo, “A novel low-cost high-performance VLSI architecture for MPEG-4 AVC/H.264 CAVLC decoding,” in Proc. IEEE ISCAS, May 2005, pp. 6110-6112.
[17] Y. H. Moon, G. Y. Kim, and J. H. Kim, “An efficient decoding of CAVLC in H.264/AVC video coding standard,” IEEE Trans. Consum. Electron., vol. 51, pp. 933-938, Aug. 2005.
[18] Y. H. Kim, Y. J. Yoo, J. Shin, B. Choi, and J. Paik, “Memory-efficient H.264/AVC CAVLC for fast decoding,” IEEE Trans. Consum. Electron., vol. 52, pp. 943-952, Aug. 2006.
[19] S. Y. Tseng and T. W. Hsieh, “A pattern-search method for H.264/AVC CAVLC decoding,” in Proc. IEEE ICME, July 2006, pp. 1073-1076.
[20] Y. M. Lin and P. Y. Chen, “An efficient implementation of CAVLC for H.264/AVC,” in Proc. IEEE ICICIC, Aug. 2006, pp. 601-604.
[21] M. Alle, J. Biswas, and S. K. Nandy, “High performance VLSI architecture design for H.264 CAVLC decoder,” in Proc. IEEE ASAP, Sept. 2006, pp. 317-322.
[22] B. S. Choi and J. Y. Lee, “Implementation of area efficient H.264/AVC CAVLC decoder,” in Proc. IEEE ICICDT, May 2009, pp. 135-138.
[23] J. Nikara, S. Vassiliadis, J. Takala, and P. Liuha, “Multiple-symbol parallel decoding for variable length codes,” IEEE Trans. VLSI Syst., vol. 12, pp. 676-685, July 2004.
[24] G. S. Yu and T. S. Chang, “A zero-skipping multi-symbol CAVLC decoder for MPEG-4 AVC/H.264,” in Proc. IEEE ISCAS, May 2006, pp. 1240-1243.
[25] Y. N. Wen, G. L. Wu, S. J. Chen, and Y. H. Hu, “Multiple-symbol parallel CAVLC decoder for H.264/AVC,” in Proc. IEEE APCCAS, Dec. 2006, pp. 1240-1243.
[26] G. G. Lee, C. C. Lo, Y. C. Chen, H. Y. Lin, and M. J. Wang, “High-throughput low-cost VLSI architecture for AVC/H.264 CAVLC decoding,” IET Image Process., vol. 4, pp. 81-91, Apr. 2010.
[27] T. H. Tsa, D. L. Fang, and Y. N. Pan, “A hybrid CAVLD architecture design with low complexity and low power considerations,” in Proc. IEEE ICME, July 2007, pp. 1910-1913.
[28] H. Y. Lin, Y. H. Lu, B. D. Liu, and J. F. Yang, “Low power design of H.264 CAVLC decoder,” in Proc. IEEE ISCAS, May 2006, pp. 2689-2692.
[29] H. Y. Lin, Y. H. Lu, B. D. Liu, and J. F. Yang, “A highly efficient VLSI architecture for H.264/AVC CAVLC decoder,” IEEE Trans. Multimedia, vol. 10, pp. 31-42, Jan. 2008.
[30] T. H. Tsai, T. L. Fang, and Y. N. Pan, “A novel design of CAVLC decoder with low power and high throughput considerations,” IEEE Trans. Circuits Syst. Video Technol., vol. 21, pp. 311-319, Mar. 2011.
[31] B. Y. Lee and K. K. Ryoo, “A design of high-performance pipelined architecture for H.264/AVC CAVLC decoder and low-power implementation,” IEEE Trans. Consum. Electron., vol. 56, pp. 2781-2789, Nov. 2010
[32] Y. H. Moon, I. K. Eom, and S. W. Ha, “An improved coeff_token variable length decoding method for low power design of H.264/AVC CAVLC decoder,” in Proc. IEEE ICIP, Oct. 2008 , pp. 2840-2843.
[33] T. H. Tsai and D. L. Fang, “A novel design of CAVLC decoder with low power consideration,” in Proc. IEEE ASSCC, Nov. 2007, pp. 196-199
[34] C. C. Lo, S. T. Tsai, and M. D. Shieh, “Reconfigurable architecture for entropy decoding and inverse transform in H.264,” IEEE Trans. Consum. Electron., vol. 56, pp. 1670-1676, Aug. 2010.
[35] X. L. Zhang, Z. S. Luo, and R. H. Zhang, “An improved group-based optimizing of CAVLC decoder,” in Proc. IEEE IASP, Apr. 2009, pp. 200-203.
[36] H. S. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky, “Low complexity transform and quantization in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 598-603, July 2003.
[37] T. C. Wang, Y. W. Huang, H. C. Fang, and L. G. Chen, “Parallel 4 × 4 2D transform and inverse transform architecture for MPEG-4 AVC/H.264,” in Proc. IEEE ISCAS, May 2003, pp. 800-803.
[38] L. Z. Liu, L. Qiu, M. T. Rong, and L. Jiang, “A 2-D forward/inverse integer transform processor of H.264 based on highly-parallel architecture,” in Proc. IEEE IWSOC, July 2004, pp. 158-161.
[39] R. Kordasiewicz and S. Shirani, “Hardware implementation of the optimized transform and quantization blocks of H.264,” in Proc. IEEE CCECE, May 2004, pp. 943-946.
[40] Z. Y. Cheng, C. H. Chen, B. D. Liu, and J. F. Yang, “High throughput 2-D transform architectures for H.264 advanced video coders,” in Proc. IEEE APCCAS, Dec. 2004, pp. 1141-1144.
[41] C. P. Fan, “Fast 2-dimensional 4 × 4 forward integer transform implementation for H.264/AVC,” IEEE Trans. Circuits Syst. Ⅱ, vol. 53, pp. 174-177, Mar. 2006.
[42] L. Agostini, R. Porto, J. Guntzel, and I. S. Silva, “High throughput multitransform and multiparallelism IP for H.264/AVC video compression standard,” in Proc. IEEE ISCAS, May 2006, pp. 5419-5422.
[43] K. H. Chen, J. I. Guo, and J. S. Wang, “A high-performance direct 2-D transform coding IP design for MPEG-4AVC/H.264,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, pp. 472-483, Aug. 2006.
[44] M. Nadeem, S. Wong, and G. Kuzmanov, “Inverse integer transform in H.264/AVC intra-frame encoder,” in Proc. IEEE DELTA, Jan. 2011, pp. 228-233.
[45] T. T. T. Do and T. M. Le, “High throughput area-efficient SoC-based forward/inverse integer transforms for H.264/AVC,” in Proc. IEEE ISCAS, May 2010, pp. 4113-4116.
[46] H. Y. Lin, Y. C. Chao, B. D. Liu, and J. F. Yang, “Combined 2-D transform and quantization architecture for H.264 video coders,” in Proc. IEEE ISCAS, May 2005, pp. 1802-1805.
[47] I. Amer, W. Badawy, and G. Jullien, “A high-performance hardware implementation of the H.264 simplified 8 × 8 transformation and quantization,” in Proc. IEEE ICASSP, Mar. 2005, pp.1137-1140.
[48] M. Elhaji, A. Zitouni, S. Meftali, J. Dekeyser, and R. Tourki, “A low power and highly parallel implementation of the H.264 8 × 8 transform and quantization,” in Proc. IEEE ISSPIT, Dec. 2010, pp. 528 - 531.
[49] L. Wang, Z. Zhang, G. Teng, L. Shen, and X. Shi, “Hardware implementation of transform and quantization for AVS encoder,” in Proc. IEEE ICALIP, July 2008, pp. 843-847.
[50] S. Lee and K. Cho, “Design of transform and quantization circuit for multi-standard integrated video decoder,” in Proc. IEEE SiPS, Oct. 2007, pp. 181-186.
[51] S. Lee and K. Cho, “Architecture of transform circuit for video decoder supporting multiple standards,” Electron. Lett., vol. 44, pp. 274-275, Feb. 2008.
[52] C. P. Fan and G. A. Su, “Efficient low-cost sharing design of fast 1-D inverse integer transform algorithms for H.264/AVC and VC-1,” IEEE Signal Processing Lett., vol. 15, pp. 926-929, 2008.
[53] G. A. Su and C. P. Fan, “Low-cost hardware-sharing architecture of fast 1-D inverse transforms for H.264/AVC and AVS applications,” IEEE Trans. Circuits Syst. Ⅱ, vol. 55, pp. 1249-1253, Dec. 2008.
[54] P. H. Chen, H. M. Chen, M. C. Shie, J. C. Chen, and J. M. Chang, “An unified architecture of all transforms for H.264/AVC codec,” in Proc. IEEE 3CA, May 2010, pp. 476-479.
[55] W. Hwangbo and C. M. Kyung, “A multitransform architecture for H.264/AVC high-profile coders,” IEEE Trans. on Multimedia, pp. 157-167, vol. 12, Apr. 2010.
[56] H. Qi, Q. Huang, and W. Gao, “A Low-Cost Very Large Scale Integration Architecture for Multistandard Inverse Transform, ” IEEE Trans. Circuits Syst. Ⅱ, vol. 57, pp. 551-555, July 2010.
[57] V. Bhaskaran and K. Konstantinides, Image and Video Compression Standards: Algorithms and Architectures. Boston, MA: Kluwer Academic, 1997.