成功大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	李禮真 Li, Li-Jen
論文名稱：	運用視差資訊之多視角快速模式決策演算法 Fast Mode Decision Algorithm for Multi-view Video Coding Based on Disparity Information
指導教授：	賴源泰 Lai, Yen-Tai
學位類別：	碩士 Master
系所名稱：	電機資訊學院 - 電機工程學系 Department of Electrical Engineering
論文出版年：	2013
畢業學年度：	101
語文別：	英文
論文頁數：	57
中文關鍵詞：	多視角視訊編碼、視差資訊、快速模式決策
外文關鍵詞：	multi-view video coding, disparity information, fast mode decision
相關次數：	點閱：90 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

隨著影像技術不斷進步，在3D影像上的應用，將提供觀賞者一種新的視覺享受，而目前已列JVT (Joint Video Team) H.264延伸計畫中之3D影像壓縮技術是以多視角視訊編碼(Multi-view Video Coding, MVC)為主，MVC不僅保有原本H.264 編碼上的特性並增加視角間的參考方向來達到更好的壓縮效率，但是這樣一來卻大幅地提高計算上的複雜度，因此如何有效地降低其計算複雜度為當前研究之重要方向。
為了減少多視角視訊編碼之大量的計算量，有效地決定可能的編碼模式，本篇論文提出一個快速模式決策演算法，主要是利用視差資訊來表示其物體深度，根據不同的視差大小可將畫面分成前、中、背景三個區域，接著再針對這三個區域可能的編碼模式各別去做分析，另外，我們利用提早決定跳躍模式的方法和減少模式選擇的機制來達到更進一步的加速，本篇論文所提出的演算法是實現於JMVC4.0當中，由實驗結果得知，本篇論文提出的演算法可以加速約80%的編碼時間並且只造成少部分的PSNR值下降和位元率提高。

With the rapid development of video techniques, various applications of 3D video provide viewers a new viewing sensation. Joint Video Team (JVT) started working on the multi-view video coding (MVC) extension for H.264 video coding. The MVC extension achieves higher coding efficiency by not only inheriting from H.264 video standard but also including the inter-view correlation between neighboring views. However, the computational complexity obviously increases as well. Therefore, how to reduce the computational complexity effectively has become an important issue.
In this thesis, a fast mode decision algorithm is proposed to reduce the huge computation and decide the most appropriate mode in efficient way. We utilize the disparity information to represent the depth of objects. According to the different sizes of disparities, a frame can be classified into three regions: Near region, Middle region, and Far region. Then, we assign the appropriate modes to the three regions by analyzing the characteristics of each region, respectively. Additionally, the two methods which are the early skip mode decision and reducing the candidate modes based on RDcost are applied to remove some unnecessary inter modes to achieve better compression efficiency. Finally, the proposed algorithm is implemented in MVC reference software JMVC4.0. From experimental results, our proposed algorithm can reduce around 80% of the encoding time with negligible PSNR drop and bit-rate increasing.

Table of Contents

Abstract 
Acknowledgment 
Table of Contents 
List of Tables 
List of Figures

Chapter 1 Introduction 1
1.1 Motivation 3
1.2 Thesis Organization	4
Chapter 2 Related Works	5
2.1 Multi-view Video Coding (MVC) Structure 6
2.2 Basic Concepts for Prediction Methods in MVC 9
2.2.1 Intra Prediction Mode 9
2.2.2 Inter Prediction Mode 11
2.2.3 Inter-view Prediction Mode 18
2.3 Basic Concept of Mode Decision 19
2.4 Fast Algorithms in MVC 20
2.5 Depth Information Based Algorithm 22
Chapter 3 Proposed Methods 25
3.1 MVC Prediction Structure 25
3.2 The Relation between Disparity and Depth 27
3.3 A Fast Mode Decision in Different Regions 30
3.3.1 Segment Three Regions Based on Disparity Information	30
3.3.2 Generate Dispariy Map 31
3.3.3 The Mode Decision of Three Regions 35
3.4 Reduce Mode Selection Based on RDcost 38
3.4.1 Early Skip Mode Decision 38
3.4.2 Reduce Mode Candidates by Employing RDcost 42
3.5 The Thresholds of Disparity Activity 43
Chapter 4 Experimental Results 46
4.1 Experimental Platform 46
4.2 Experimental Results 48
4.3 Comparing with other algorithms 52
Chapter 5 Conclusions 54
References 55

                                    

[1] Y.-S. Ho, and K.-J. Oh, “Overview of Multi-view Video Coding,” IEEE International Conference on Signals and Image Processing, Multimedia Communications and Services, pp. 5-12, Jun. 2007.
[2] M. Tanimoto, “FTV (free viewpoint television) for 3D Scene Reproduction and Creation,” IEEE Conference on Computer Vision and Pattern Recognition Workshop, pp. 172-172, Jun. 2006.
[3] ISO/IEC/JTC1/SC29/WG11, “Multiview Coding Using AVC,” Bangkok, Thailand, Jan. 2006.
[4] W. Zhu, X. Tian, F. Zhou, and Y. Chen, “Fast Inter Mode Decision Based on Textural Segmentation and Correlations for Multiview Video Coding,” IEEE Transactions on Consumer Electronics, vol. 56, no. 3, pp. 1696-1704, Aug. 2010.
[5] A. Vetro, T. Wiegand, and G. J. Sullivan, “Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard,” Proceedings of the IEEE, vol. 99, issue 4, pp. 626-642, Apr. 2011.
[6] A. Vetro, P. Pandit, H. Kimata, A. Smolic and Y. K. Wang, ISO/IEC JTC1/SC29/WG11 and ITU-T Q6/SG16. “Joint Multiview Video Model (JMVM) 8.0,” Doc. JVT-AA207, Geneva, CH, Apr. 2008.
[7] H. Schwarz, D. Marpe, and T. Wiegand, “Analysis of Hierarchical B Pictures and MCTF,” IEEE International Conference on Multimedia and Expo, pp. 1929-1932, Jul. 2006.
[8] H. Schwarz, D. Marpe, and T. Wiegand, “Hierarchical B pictures,” Doc. JVT-P014, Poznan PL, Jul. 2005.
[9] X. Li, P. Amon, A. Hutter, and A. Kaup, “Adaptive Quantization Parameter Cascading for Hierarchical Video Coding,” IEEE International Symposium on Circuits and Systems (ISCAS), pp. 4197-4200, Jun. 2010.
[10]H.-Q. Zeng, K.-K. Ma, and C.-H. Cai, “Fast Mode Decision for Multiview Video Coding Using Mode Correlation,” IEEE Transactions on Circuit and Systems for Video Technology, vol. 21, no. 11, pp. 1659-1666, Nov. 2011.
[11]G. J. Sullivan, T. Wiegand, “Rate-Distortion Optimization for Video Compression,” IEEE Signal Processing Magazine, vol. 15, pp. 74-90, Nov. 1998.
[12]T.-Y. Kuo, Y.-Y. Lai, and Y.-C. Lo, “Fast Mode Decision for Non-Anchor Picture in Multiview Video Coding,” IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), pp. 1-5, 24-26 Mar. 2010.
[13]W. Zhu, W. Jiang, and Y. Chen, “A Fast Inter Mode Decision for Multiview Video Coding,” International Conference on Information Engineering and Computer Science (ICIECS), pp. 1-4, 19-20 Dec. 2009.
[14]L. Shen, Z. Liu, T. Yan, Z. Zhang, and P. An, “View-Adaptive Motion Estimation and Disparity Estimation for Low Complexity Multiview Video Coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 6, pp.925-930, Jun. 2010.
[15]Y.-H. Lin and J.-L. Wu, “A Depth Information Based Fast Mode Decision Algorithm for Color Plus Depth-Map 3D Videos,” IEEE Transactions on Broadcasting, vol. 57, no.2, pp.542-550, Jun. 2011.
[16]L. Shen, Z. Liu, S. Liu, Z. Zhang, and P. An, “Selective Disparity Estimation and Variable Size Motion Estimation Based on Motion Homogeneity for Multi-View Coding,” IEEE Transactions on Broadcasting, vol. 55, no. 4, Dec. 2009.
[17]X. Li, D. Zhao, X. Ji, Q. Wang, and W. Gao, “A Fast Inter Frame Prediction Algorithm for Multi-view Video Coding,” IEEE International Conference on Image Processing (ICIP), vol. 3, pp. Ⅲ-417-Ⅲ-420, 16-19 Sept. 2007.
[18]L.-F. Ding, P.-K. Tsung, W.-Y. Chen, S.-Y. Chien, and L.-G. Chen, “Fast Motion Estimation with Inter-view Motion Vector Prediction for Stereo and Multi-view Video Coding,” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1373-1376, Apr. 2008.
[19]Z. Peng, G. Jiang, M. Yu, and Q. Dai, “Fast Macroblock Mode Selection Algorithm for Multiview Video Coding,” EURASIP Journal on Image and Video Processing. vol. 1, pp. 1-14, Oct. 2008.

校內：2023-01-01公開
校外：不公開電子論文尚未授權公開，紙本請查館藏目錄

簡易檢索 / 詳目顯示

相關論文