簡易檢索 / 詳目顯示

研究生: 傅惠稜
Fu, Hui-Leng
論文名稱: 基於H.264/AVC解碼之自由視角影像產生
Free Viewpoint Generation from H.264/AVC Decoding Procedure
指導教授: 楊家輝
Yang, Jar-Ferr
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電腦與通信工程研究所
Institute of Computer & Communication Engineering
論文出版年: 2008
畢業學年度: 96
語文別: 英文
論文頁數: 48
中文關鍵詞: 自由視角影像修補虛擬視角內插視差
外文關鍵詞: Disparity, Virtual view interpolation, Free viewpoint generation, Inpainting
相關次數: 點閱:75下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 隨著電腦視覺與圖學的發展,產生自由視角影像序列上的研究有顯著的進步。自由視角影像讓使用者任意選擇觀賞角度。但無論以模型重建或影像為基礎的演算法,都需要大量的計算。
    本論文提出以動態影像為基礎,利用H.264/AVC解碼器之訊息以後處理的自由視角影像序列產生方法。藉由解碼器後所得的運動向量可以有效的節省計算視差所需的時間,使得自由視角影像產生速度較快。本系統包含視差影像取得、虛擬視角影像內插以及修補影像內插後產生的空洞。根據實驗結果證明,本論文所提出的系統,可針對兩個存在平行視差的影像序列,產生介於此兩真實視角之間的虛擬視角影像序列。

    Researches in both computer vision and graphics have made important steps toward generating free view video sequences. Free viewpoint video offers the viewers an opportunity to interactively change its viewpoint in the scene. Existing methods for generating new views can be broken down into two basic groups: Model-based rendering (MBR) and Image-based rendering (IBR). These two kinds of method both need a lot of computation.
    In this thesis, we develop an image-based free viewpoint sequence generation method. It is based on the post processing of H.264/AVC decoding. Through the information of motion vector after decoding, the computation complexity can be reduced. This scheme contains three parts: 1) disparity map estimation, 2) virtual view interpolation, and 3) hole filling. Experimental results show that this approach can generate virtual view between two real views that have horizontal disparity with different weighting factor through the whole sequence.

    1 Introduction 1 1.1 Background 1 1.2 Introduction to Free Viewpoint Video Generation 2 1.3 Overview of H.264/AVC Video Coding Standard 3 1.4 Motivation 6 1.5 Organization of Thesis 6 2 Overview of H.264/AVC Codec 7 2.1 H.264 Encoder 8 2.2. H.264 Decoder 9 3 Free Viewpoint Image Generation 11 3.1 Disparity Map 11 3.1.1 Matching Cost Computation 12 3.1.2 Aggregation of Cost 13 3.1.3 Disparity Computation and Optimization 14 3.1.4 Refinement of Disparities 18 3.2 Image Inpainting 19 3.2.1 Image Inpainting Based on Partial Differential Equation (PDE) 19 3.2.2 Fast Digital Image Inpainting 20 3.2.3 Texture Synthesis 21 3.2.4 Priority Texture Synthesis 21 4 Free Viewpoint Generation Scheme based on H.264/AVC Decoding 23 4.1 System structure 23 4.2 Disparity Map Generation 24 4.2.1 Disparity Generation for Intra Frame Pairs 24 4.2.1 Disparity Generation for Inter Frame Pairs 29 4.3 Free View Image Interpolation 32 4.4 Hole Filling 34 5 Experimental results 40 6 Conclusions and Future Works 44 6.1 Conclusions 44 6.2 Future works 44 References 46

    [1] Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC), Joint Video Team, March 2003. Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, JVT-G050.
    [2] T. Wiegand, G. J. Sullivan, G. Bjontegaard and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Transaction on Circuits and Systems for Video Technology, vol. 13, no.7, pp. 560-576, July 2003.
    [3] D. Scharstein and R. Szeliski, “A taxonomy and evaluation of dense two-frame stereo correspondence algorithms,” International Journal of Computer Vision, vol.47 no.1-3, pp.7-42, April 2002.
    [4] S. Birchfield and C. Tomasi, “Depth discontinuities by pixel-to-pixel stereo,” in Proceeding IEEE International Conference on Computer Vision, 1998, pp. 1073-1080.
    [5] C. L. Zitnick and T. Kanade. “A cooperative algorithm for stereo matching and occlusion detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 7, pp. 675-684, July 2000.
    [6] M. Bertalmio, G. Sapiro, V. Caselles and C. Ballester, “Image inpainting,” in Proceeding ACM SIGGRAPH Conference on Computer Graphics, 2000, pp. 417-424.
    [7] M. M. Oliveira, B. Bowen, R. McKenna and Y. S. Chang, “Fast digital image inpainting,” in Proceeding International Conference on Visualization, Imaging and Image Processing, 2001, pp. 261-266.
    [8] A. Efros and T. K. Leung, “Texture synthesis by nonparametric sampling,” in Proceeding IEEE International Conference on Computer Vision, vol. 2, 1999, pp. 1033-1038.
    [9] A. Criminisi, P. Pérez and K. Toyama, “Region filling and object removal by exemplar-based image inpainting,” IEEE Transactions on Image Processing, vol. 13, no. 9, pp. 1200-1212, September 2004.
    [10] H. Saito, S. Baba, T. Kanade, “Appearance-based virtual view generation from multicamera videos captured in the 3-D room,” IEEE Transactions on Multimedia, vol. 5, no. 3, pp. 303-316, September 2003
    [11] Y. S. Lu and Y. C. H, “Color image inpainting using texture synthesis and morphological operations”, M. Sc. Thesis, NCTU, 2006.
    [12] R. C. Gonzalez and R. E. Woods, “Digital image processing,” Addision Wesley, Reading, Massachusetts, 1992.
    [13] R. Hartley and A. Zisserman, “Multiple view geometry in computer vision,” 2nd Edition, Cambridge University Press, March 2004.
    [14] D. Marr and T. Poggio, “Cooperative computation of stereo disparity,” Science, vol. 194, no. 4262, pp. 209-236, October 1976.
    [15] D. Marr and T. Poggio, “A computational theory of human stereo vision,” in Proceedings of the Royal Society of London Series B, Biological Sciences, vol. 204, 1979, pp. 301-328.
    [16] S. Moezzi, L. C. Tai, and P. Gerard, “Virtual view generation for 3-D digital video,” IEEE MultiMedia, vol. 4, no. 1, pp. 18-26, January-March 1997.
    [17] J. Weng, N. Ahuja, and T. S. Huang “Matching two perspective views,” IEEE Transaction on Pattern Analysis and Machine Intelligence, vol. 14, no.8, pp. 806-825, August 1992.
    [18] M. Gong and Y. H. Yang, “Fast unambiguous stereo matching using reliability-based dynamic programming,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 6, pp. 998-1003, June 2005.
    [19] Z. Y. and L. Guiling, “A research on disparity estimation algorithm of stereo video coding,” Journal of electronic measurement and instrument, vol. 16, no. 1, March 2002.
    [20] L. Zhang, D. Wang, and A. Vincent, “Adaptive reconstruction of intermediate views from stereoscopic images,” IEEE Transactions on Circuits and System for Video Technology, vol. 16, no. 1, pp. 102-113, January 2006.

    下載圖示 校內:2011-07-23公開
    校外:2013-07-23公開
    QR CODE