成功大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	鄭弘煒 Cheng, Hung-Wei
論文名稱：	基於位元率失真比最佳化之H.264移動估測模式之決策 Mode Decision of H.264/AVC Motion Estimation with RDO
指導教授：	蘇文鈺 Su, Wen-Yu 郭耀煌 Kuo, Yau-Hwang
學位類別：	碩士 Master
系所名稱：	電機資訊學院 - 資訊工程學系 Department of Computer Science and Information Engineering
論文出版年：	2005
畢業學年度：	93
語文別：	英文
論文頁數：	100
中文關鍵詞：	加速、模式決策、H.264/AVC 、移動估測、壓縮量失真比最佳化
外文關鍵詞：	Mode decision, H.264/AVC, Motion estimation, Rate distortion optimization, Speed up
相關次數：	點閱：105 下載：5
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

　　在本篇論文中，我們提出二步模式決策(Two-stage mode decision)演算法以及降低小數像素移動估測計算(Sub-pixel motion estimation Reduction)兩種方法來加速視訊壓縮標準H.264/AVC的執行時間，在有使用位元率失真比最佳化(Rate distortion optimization)的情況下。由於在使用壓縮量失真比最佳化時，更多的計算量必需被投入來取得在特定的壓縮比條件下最小的失真使得該方法難以應用在活動式或是嵌入式的系統上。二步模式決策被提出來減少計算的時間但同時仍保持高原有的高品質，它包含兩個步驟：第一步運用先前壓縮完成的畫面資訊來預測模式。第二步則使用貝式機率法則和倒傳遞類神經網路做進一步的選擇。此外小數像素移動估測在H.264/AVC編碼流程中也是一個相當花費時間以及記憶體的動作。忽略小數像素移動估測被提出來決定小數像素移動估測是否需要執行，這是根據整數像素移動估測中取得的最佳位置及其週遭位置來決定。根據實驗結果，二步模式決策在損失少部分的影像品質以及壓縮比的條件下減少超過一半的計算時間。忽略小數像素移動估測同樣減少了計算的時間也僅伴隨著少量的損耗。此二演算法相較於原始的H.264/AVC標準提供在相似品質之下更快速的壓縮。

　　In this paper we proposed a two-stage mode decision (TSMD) algorithm and Sub-pixel motion estimation reduction (SMER) method to speed up the execution of the H.264/AVC video encoding with rate distortion optimization (RDO). Because much more computation power is required to achieve the minimal distortion for a given bit-rate budget by RDO algorithm, it is difficult to apply H.264 in mobile devices or embedded systems. The proposed TSMD algorithm is proposed to reduce the computation time with high quality video encoding. It is including two stages: The first stage is to predict the possible encoding modes according to the previous frame’s information. The second stage is to apply the refinement mode utilizing Baye's probability rule and Back-Propagation neural network (BPN). Moreover, the sub-pixel motion estimation is also a time and memory consuming operation during the H.264/AVC encoding procedure. The SMER method is proposed to decide whether the sub-pixel motion estimation can be skipped by using the relation among the pixels around the best position after the integer-pixel motion estimation is finished. According to the experiment results, over 50% of the computation time is reduced with a slightly loss in peak signal-to-noise ratio (PSNR) and a slightly increment in bit rate when TSMD is applied. The computation time is further reduced by the SMER method with little reduction of the PSNR. These two algorithms provide faster encoding with close compression performance compared with the H.264/AVC standard reference software (JM 9.2).

Chapter 1 Introduction - 1 -
Chapter 2 The H.264/AVC Video Coding Standard - 3 -
　2.1 The H.264/AVC Encoder Standard - 3 -
　　2.1.1 Forward Coding Path of I Frame - 4 -
　　2.1.2 Forward Coding Path of P Frame - 6 -
　　2.1.3 Intra Prediction - 8 -
　　　2.1.3.1 4x4 Luma Prediction - 9 -
　　　2.1.3.2 16x16 Luma Prediction - 12 -
　　　2.1.3.3 8x8 Chroma Prediction - 12 -
　　2.1.4 Motion Estimation - 13 -
　　　2.1.4.1 Full Search - 16 -
　　　2.1.4.2 Diamond Search - 17 -
　　　2.1.4.3 Comparison of Motion Estimation Algorithms - 19 -
　2.2 Rate Distortion Optimization - 21 -
　　2.2.1 Applying RDO Efficiently - 21 -
　　2.2.2 Computation Time Analysis - 24 -
　2.3 Problem Formulation - 26 -
Chapter 3 Proposed Algorithm - 28 -
　3.1 Two Stage Mode Decision (TSMD) - 28 -
　　3.1.1 The First Stage Prediction - 29 -
　　3.1.2 The Second Stage Prediction - 32 -
　　　3.1.2.1 Probability Transition Model of the Second Stage - 32 -
　　　3.1.2.2 Probability Formulation - 33 -
　　　3.1.2.3 Analysis of the Probability Transition Model - 35 -
　　　3.1.2.4 Back Propagation Network (BPN) - 37 -
　　　　3.1.2.4.1 Training Phase - 39 -
　　　　3.1.2.4.2 Testing Phase - 41 -
　　　　3.1.2.4.3 Verification Phase - 41 -
　　3.1.3 Summarize of the Two-Stage Mode Decision - 42 -
　3.2 Sub-pixel Motion Estimation Reduction (SMER) - 44 -
　　3.2.1 Analysis of Full Search SAD Plot - 45 -
　　3.2.2 Algorithm of Sub-pixel Motion Estimation Reduction (SMER) - 46 -
Chapter 4 Experiment Result - 48 -
　4.1 Experiment Environment - 48 -
　4.2 Experiment Result - 50 -
　　4.2.1 Inside Testing Sequence - 50 -
　　　4.2.1.1 Sequence Akiyo - 51 -
　　　4.2.1.2 Sequence Carphone - 54 -
　　　4.2.1.3 Sequence Coastguard - 56 -
　　　4.2.1.4 Sequence Foreman - 59 -
　　　4.2.1.5 Sequence Mobile - 61 -
　　4.2.2 Outside Testing Sequence - 64 -
　　　4.2.2.1 Sequence Container - 64 -
　　　4.2.2.2 Sequence Hall Objects - 67 -
　　　4.2.2.3 Sequence Mother and Daughter - 69 -
　　　4.2.2.4 Sequence News - 72 -
　　　4.2.2.5 Sequence Salesman - 74 -
　　　4.2.2.6 Sequence Silent - 76 -
　　　4.2.2.7 Sequence Stefan - 79 -
Chapter 5 Conclusion and Future Work - 82 -
Reference - 84 - 
                                    

[1] B. Meng, O. C. Au, C. W. Wong, and H. K. Lam, “Efficient Intra-Prediction Algorithm in H.264,” in Proc. IEEE ICIP 2003, Barcelona, Spain, September 2003.

[2] B. Meng, O. C. Au, C. W. Wong, and H. K. Lam, “Efficient Intra-Prediction Mode Selection for 4x4 Blocks in H.264,” in Proc. IEEE ICME 2003, Baltimore, Maryland July 2003.

[3] B. Meng, and O. C. Au, “Fast Intra-Prediction Mode Selection for 4x4 Blocks in H.264,” in Proc. ICASSP 2003, Hong-Kong, April 2003.

[4] C. S. Kim, H. H. Shih and C. C. Kuo, ”Feature-Based Intra-Prediction Mode Decision for H.264,” IEEE Proceedings of International Conference Image Processing 2004, Singapore, October 2004.

[5] C. S. Kim, Q. Li, and C. C. J. Kuo, “Fast Intra-Prediction Model Selection for H.264 Codec,” SPIE International Symposium ITCOM 2003, Orlando, Florida, July, 2003.

[6] C. Y. Chang, C. H. Pan, and H. Chen, “Fast Mode Decision for P-Frames in H.264,” Picture Coding Symposium, San Francisco, December 2004.

[7] C. S. Kim, Q. Li, and C. C. J. Kuo, “Fast Intra/Inter Mode Decision for H.264 Encoding Using A Risk-Minimization Criterion,” Special Session on Video Coding, SPIE 49th Annual Meeting 2004, Denver, Colorado, Aug. 2-6, 2004.

[8] D. Wu, S. Wu, K. P. Lim, F. Pan, Z. G. Li, and, X. Lin, “Block Inter Mode Decision for Fast Encoding of H.264,” in Proc. IEEE ICASSP 2004, Montreal, Quebec, Canada.

[9] F. Pan, X. Lin, R. Susanto, K. P. Lim, Z. G. Li, G. N. Feng, D. J. Wu, and S. Wu, “Fast Mode Decision for Intra prediction,” JVT document: JVT-G013.

[10] H. S. Malvar, A. Hallapuro, and L. Kerofsky, “Low-Complexity Transform and Quantization in H.264/AVC,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, No. 7, July 2003.

[11] I. E. G. Richardson, “H.264/MPEG-4 Part 10 White Paper: Overview,” Available: http://www.vcodex.com

[12] I. E. G. Richardson, “H.264/MPEG-4 Part 10 White Paper: Intra Prediction,” Available: http://www.vcodex.com

[13] I. E. G. Richardson, “H.264/MPEG-4 Part 10 White Paper: Inter Prediction,” Available: http://www.vcodex.com

[14] I. E. G. Richardson, “H.264/MPEG-4 Part 10 White Paper: Transform & Quantization,” Available: http://www.vcodex.com

[15] I. E. G. Richardson, “H.264/MPEG-4 Part 10 White Paper: Variable Length Coding,” Available: http://www.vcodex.com

[16] J. Y. Tham, S. Ranganath, M. Ranganath, and A. A. Kassim, “A novel Unrestricted Center-Biased Diamond Search Algorithm for Block Motion Estimation,” IEEE Transactions on Circuits and System for Video Coding Technology, vol. 8, No. 4, August 1998.

[17] K. P. Lim, S. Wu, D. J. Wu, S. Rahardja, X. Lin, F. Pan, Z. G. Li, “Fast Inter Mode Selection,” JVT document: JVT-I020.

[18] L. M. Po, and W. C. Ma, “A Novel Four-Step Search Algorithm for Fast Block Motion Estimation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 6, No. 3, June 1996.

[19] M. Wien, “Variable Block-Size Transforms for H.264/AVC,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, No. 7, July 2003.

[20] P. List, A. Joch, J. Lainema, G. Bjontegaard, and M. Karczewicz, “Adaptive Deblocking Filter,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, No. 7, July 2003.

[21] R. Li, B. Zeng, and M. L. Liou, “A New Three-Step Search Algorithm for Block Motion Estimation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 4, No. 4, August 1994.

[22] S. Haykin, “Neural Networks – A Comprehensive Foundation,” 2nd Ed. Prentice Hall, ISBN: 0-13-273350-1

[23] T. Koga, K. Iinuma, A. Hirano, Y. Iijima, and T. Ishiguro, “Motion-Compensated Inter Frame Coding for Video Conferencing,” in Proc. NTC 81, pp. C9.6.1-9.6.5, New Orleans, LA, November/December 1981.

[24] T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, “Overview of the H.264/AVC Video Coding standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, No. 7, July 2003.

[25] T. Wiegand, M. Lightstone, D. Mukherjee, T. G. Campbell, and S. K. Mitra, “Rate-Distortion Optimized Mode Selection for Very Low Bit Rate Video Coding and Emerging H.263 Standard,” IEEE Transactions on Circuits and System for Video Technology, vol. 6, April. 1996.

[26] “Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC)”, ISO/IEC JTC1 /SC29/WG11, 2003, May.

校內：2007-08-04公開
校外：2007-08-04公開

簡易檢索 / 詳目顯示

相關論文