| 研究生: |
楊宗翰 Yang, Tsung-Han |
|---|---|
| 論文名稱: |
適用於H.264編碼器之多重參考畫面快速演算法及其架構設計 Adaptive Multi-Frame Algorithms and VLSI Architecture for Fast H.264/AVC Encoding |
| 指導教授: |
劉濱達
Liu, Bin-Da 楊家輝 Yang, Jar-Ferr |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2008 |
| 畢業學年度: | 96 |
| 語文別: | 英文 |
| 論文頁數: | 79 |
| 中文關鍵詞: | 多重參考畫面 、計算複雜度 、提前終止 、適應性 |
| 外文關鍵詞: | multi-frame algorithm, early termination, H.264/AVC, adaptive, computational complexity |
| 相關次數: | 點閱:81 下載:1 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
多重參考畫面提供可以提供較佳的影像品質及較低的編碼位元率,但高計算複雜度卻一直是及時處理所需面對的嚴峻考驗。本篇論文提出了一個符合H.264/AVC的視訊標準的多重參考畫面的快速演算法及硬體架構,可以由前一張參考畫面的移動向量,模式決策等資訊決定是否提前終止參考畫面的搜尋。相較於一般必須使用固定參考畫面的演算法,本篇論文根據影像的特性,可以在維持影像品質的前提下適應性的調整搜尋畫面的張數。模擬結果顯示,提出的演算法可以節省30%~78%的計算複雜度,但僅有0.02 dB的PSNR損失及位元率的小幅增加。在硬體設計上,我們採用適應性的技巧來決定是否載入需要的搜尋視窗,因此快取記憶體的大小僅為一般多重參考畫面的五分之一,並且可以降低和主記憶體間的頻寬需求。
依據上述演算法設計出其所對應的硬體架構,包含動作評估單元,模式決策單元,以及控制單元。模擬結果顯示整體架構需要128.8 k個邏輯閘,且最快的時脈速度可達到 185 MHz,故得知其在最差的情況下,可支援在 CIF (356 * 288) @30fps 解析度下即時的預測處理。
This thesis proposes an adaptive multi-frame algorithm and its VLSI architecture for H.264/AVC. Multiple reference frames can supply better video quality and less coding bit rates. The real-time process of multi-frame motion estimation is hard to be achieved for high computational complexity. The information of SAD trend, motion vector, and mode decision from the previous reference frames can be applied to decide for the early termination of the frame level search. The proposed algorithm adaptively modulates the number of reference frames according to the texture of the video sequences.
Simulation results represent that the proposed algorithm saves 30%~78% computational complexity only with a little PSNR degradation and bit rate increment. In the hardware architecture, the cache is loaded while necessary. This approach can save both system bandwidth and 80% of cache memory. The corresponding hardware architecture for the algorithm is proposed, including motion estimation, mode decision, and control units. Simulation results represent that the total number of gates for this architecture is 128.8 k and the maximum operation frequency is 185 MHz. This design conforms the real-time requirement for the CIF (356 * 288) @30fps video resolution.
[1] Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s–Part2: Video, ISO/IEC 11172, 1993.
[2] Information Technology–Generic Coding of Moving Pictures and Associated Audio Information: Video, ISO/IEC 13818-2 and ITU-T Rec. H.262, 1996.
[3] Information Technology–Coding of Audio-Visual Objects–Part2: Visual, ISO/IEC14496-2, 1999.
[4] Video Codec for Audiovisual Services at px64 kbits/s, ITU-T Rec. H.261 v1, 1990.
[5] Video Coding for Low Bit Rate Communication, ITU-T Rec. H.263, 1998.
[6] Joint Video Team, Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification, ITU-T Rec. H.264 and ISO/IEC 14496-10 AVC, May 2003.
[7] C. Y. Chen, S. Y. Chien, Y. W. Huang, T. C. Chen, T. C. Wang, and L. G. Chen, “Analysis and architecture design of variable block-size motion estimation for H.264/AVC,” IEEE Trans. Circuits Syst. I, vol. 53, pp. 578-593, Feb. 2006.
[8] L. Deng, W. Gao, M. Z. Hu, and Z. Z. Ji, “An efficient hardware implementation for motion estimation of AVC standard,” IEEE Trans. Consumer Electron., vol. 51, pp. 1360-1366, Nov. 2005.
[9] Y. W. Huang, T. C. Wang, B. Y. Hsieh, and L. G. Chen, “Hardware architecture design for variable block size motion estimation in MPEG-4 AVC/JVT/ITU-T H.264,” in Proc. IEEE ISCAS, May 2003, pp. 796-799.
[10] M. Kim, I. Huang, and S. I. Chae, “A fast VLSI architecture for VBSME in MPEG-4 AVC/H.264,” in Proc. IEEE ASP-DAC, May 2005, pp. 631-634.
[11] C. Wei, Z. Yan, M. Z. Gang, and L. Z. Qiang, “VLSI architecture design for variable-size block motion estimation in MPEG-4 AVC/H.264,” in Proc. IEEE APCCAS, Dec. 2004, pp. 617-620.
[12] C. Wei and M. Z. Gang, “A novel VLSI architecture for VBSME in MPEG-4 AVC/H.264,” in Proc. IEEE ISCAS, May 2005, pp. 1794-1797.
[13] S. Y. Yap and J. V. Mccanny, “A VLSI architecture for variable block size video motion estimation,” IEEE Trans. Circuits Syst. II, vol. 51, pp. 384-389, July 2004.
[14] S. Yang, Z. Liu, S. Goto, and T. Ikenaga “ Scalable VLSI architecture for variable block size integer motion estimation in H.264/AVC” IEICE Trans. on Fundamentals, vol. E89-A, pp. 979-988, Apr. 2006.
[15] J. Kim and T. Park, “A novel VLSI architecture for full-search variable block-size motion estimation,” in Proc. IEEE TENCON, Nov. 2007, pp. 1-4.
[16] K. L. Chung and L. C. Chang, “A new predictive search area approach for fast block motion estimation,” IEEE Trans. Image Processing, vol. 12, pp. 648-652, June 2003.
[17] M. C. Chang and J. S. Chien, “An adaptive search algorithm based on block classification for fast block motion estimation,” in Proc. IEEE ISCAS, May 2006, pp. 3982-3985.
[18] T. H. Tsai and Y. N. Pan, “A novel 3-D Predict Hexagon Search Algorithm for fast block motion estimation on H.264 video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, pp. 1542-1549, Dec. 2006.
[19] Y. H. Chen, T, C. Chen, and L. G. Chen, “Hardware oriented content-adaptive fast algorithm for variable block-size integer motion estimation in H.264,” in Proc. IEEE ISPACS, Dec. 2005, pp. 341-344.
[20] M. J. Chen, G. L. Li, Y. Y. Chiang, and C. T. Hsu, “Fast mlutiframe motion estimation algorithms by motion vector composition for the MPEG-4/AVC/H.264 standard,” IEEE Trans. Multimedia, vol. 8, pp. 478-487, June 2006.
[21] C. K. Chiang and S. H. Lai, “Fast multi-Reference frame motion estimation via downhill simplex search,” in Proc. IEEE ICME, July 2006, pp. 121-124.
[22] X. Li, E. Q. Li, and Y. K. Chen, “Fast multi-frame motion estimation algorithm with adaptive search strategies in H.264,” in Proc. IEEE ICASSP, May 2004, pp. 369-372.
[23] L. Shen, Z. Liu, Z. Zhang, and X. Shi, “An adaptive and fast H.264 multi-frame selection algorithm based on information from previous searches,” in Proc. IEEE ICME, July 2007, pp. 1591-1594.
[24] Q. Sun, X. H. Chen, X. Wu, and L. Yu, “A content-adaptive fast multiple reference frames motion estimation in H.264,” in Proc. IEEE ISCAS, May 2007, pp. 3651-3654.
[25] L. Shen, Z. Liu, Z. Zhang, and G. Wang, “An adaptive and fast multiframe selection algorithm for H.264 video coding,” IEEE Signal Processing Lett., vo14., pp. 836-839, Nov. 2006.
[26] Y. W. Huang, B. Y. Hsieh, S. Y. Chien, S. Y. Ma, and L. G. Chen, “Analysis and complexity reduction of multiple reference frames motion estimation in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, pp. 507-522, Apr. 2006.
[27] X. Su, S. Singh, and Y. Bai, “Local reference with early termination in H.264 motion estimation,” in Proc. IEEE ICME, July 2007, pp. 384-387.
[28] Q. Tang, P. Fu, H. Chen, and F. Guo, “A fast motion estimation algorithm based on the adaptive reference frame and the spatial and temporal correlations for H.264,” in Proc. IEEE IECON, Nov. 2007, pp. 2399-2402.
[29] Y. Su and M. T. Sun, “Fast multiple reference frame motion estimation for H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, pp. 447-452, Mar. 2006.
[30] H. J. Li, C. T. Hsu, and M. J. Chen, “Fast multiple reference frame selection method for motion estimation in JVT/H.264,” in Proc. IEEE APCCAS, Dec. 2004, pp. 605-608.
[31] S. E. Kim, J. K. Han, and J. G. Kim, “An efficient scheme for motion estimation using multireference frames in H.264/AVC,” IEEE Trans. Multimedia, vol. 8, pp. 457-466, June 2006.
[32] T. C. Chen, C. Y. Tsai, Y. W. Huang, and L. G. Chen, “Single reference frame multiple current macroblocks scheme for multiple reference frame motion estimation in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, pp. 242-247, Feb. 2007.
[33] S. Warrington, S. Sudharsanan, and W. Y. Chan, “Architecture for multiple reference frame variable block size motion estimation,” in Proc. IEEE ISCAS, May 2007, pp. 2894-2897.
[34] G. J. Sullivan and T. Wiegand, “Rate-distortion optimization for video compression,” IEEE Signal Process. Mag., vol. 15, no. 6, pp. 74–90, Nov. 1998.
[35] T. Wiegand and B. Girod, “Lagrangian multiplier selection in hybrid video coder control,” in Proc. IEEE Int. Conf. Image Process, Oct. 2001, pp. 542–545.
[36] P. R. Hill, T. K. Chiew, D. R. Bull, and C. N. Canagarajah, “Interpolation free sub-pixel accuracy motion estimation,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, pp. 1519-1526, Dec. 2006.
[37] Z. Y. Cheng, C. H. Chen, B. D. Liu, and J. F. Yang, “High throughput 2-D transform architecture for H.264 advanced video coders,” in Proc. IEEE APCCAS, Dec. 2004, pp. 1141-1144.