| 研究生: |
郭品岑 Kuo, Pin-Chen |
|---|---|
| 論文名稱: |
快速H.264/MVC編碼之投射式視角預測搜尋演算法 A Projection Disparity Search Algorithm for Fast H.264/MVC Encoding |
| 指導教授: |
劉濱達
Liu, Bin-Da |
| 共同指導教授: |
楊家輝
Yang, Jar-Ferr |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2011 |
| 畢業學年度: | 99 |
| 語文別: | 英文 |
| 論文頁數: | 87 |
| 中文關鍵詞: | H.264/MVC 、投射式視角預測 、視角向量預測子 、移動向量預測子 |
| 外文關鍵詞: | disparity vector predictor, H.264/MVC, inter-view prediction, motion vector predictor, projection disparity search |
| 相關次數: | 點閱:112 下載:1 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出一種投射式視角預測搜尋演算法,是利用相機之內部參數和外部參數產生極線以及更精確的視角向量預測子以縮小搜尋視窗,並在不影響影像品質的前提下,節省H.264/MVC中視角間預測之運算複雜度,進而增進H.264/MVC的執行效能。另外,本論文亦參考影像之紋理分布提出新的移動向量預測子,此預測子不僅能在框間預測時增強物件的凝聚性,更能藉由物件內像素的群聚性減少移動估測的搜尋範圍,節省移動估測所需時間,並且重建出品質優良的影像。
本論文將所提出之投射式視角預測搜尋演算法實現於JMVC2.5中,根據模擬結果,在不同的量化參數以及不同的測試樣本中,本演算法皆可節省約70%至80%的運算量。在影像品質方面,PSNR僅下降0.01dB,位元率僅增加0.3%,此幅度在人眼所能接受的範圍之內;換言之,本論文所提出之投射式視角預測搜尋演算法能在影像效果幾乎沒有變差的前提之下,有效提升MVC的編碼效率。
H.264/MVC is the newest standard for digital video compression. Similar with H.264/AVC, H.264/MVC carries out the compression for multi-view image system to get the more compression efficiency. This thesis focuses on the computational complexity reduction in inter prediction and inter-view prediction. The main research point is to narrow the search window, and derived a more accurate disparity vector predictor and motion vector predictor for inter prediction and inter-view prediction.
Multi-view video system can apply different image for user’s eyes and allow users to have a more realistic visual experience. However, with the increasing of view, the amount of data to be processed can be also doubled. Multi-view images have not only temporal redundancies in the single view image, but also have a huge correlation between views. In order to reduce computational complexity, this thesis proposed a fast algorithm to find the redundancy between views.
According to the camera geometry, this thesis produces a more accuracy disparity vector predictor and narrows the search region of disparity estimation. For motion estimation, a new motion vector predictor can be found. Finally, this thesis implements the algorithm in JMVC. Experimental results show that the proposed algorithm can reduce 70~80% of the computational time, while maintaining high RD performance.
[1] Coding of Moving Pictures and Associated Audio for Digital Storage Media at Up to 1.5 Mbit/s–Part2: Video, ISO/IEC 11172, 1993.
[2] Information Technology–Generic Coding of Moving Pictures and Associated Audio Information: Video, ISO/IEC 13818-2 and ITU-T Rec. H.262, 1996.
[3] Information Technology–Coding of Audio-Visual Objects–Part2: Visual, ISO/IEC 14496-2, 1999.
[4] Joint Video Team, Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification, ITU-T Rec. H.264 and ISO/IEC 14496-10 AVC, May 2003.
[5] M. Tanimoto and T. Fujii, “FTV-free viewpoint television,” ISO/IEC JTC1/SC29/WG11, M8595, July 2002.
[6] T. Fujii and M. Tanimoto, “Free-viewpoint TV system based on ray-space representation,” in Proc. SPIE ITCom, Mar. 2002, pp.175-189.
[7] A. Smolic, K. Mueller, P. Merkle, T. Rein, M. Kautmer, P. Eisert, and T. Wiegand, “Free viewpoint video extraction, representation, coding, and rendering,” in Proc. IEEE ICIP, Oct. 2004, pp. 3287-3290.
[8] M. Tanimoto, “FTV (free viewpoint television) for 3D scene reproduction and creation,” in Proc. CVPRW, June 2006, pp. 172-172.
[9] M. Tanimoto, “FTV (free viewpoint television) creating ray-based image engineering,” ECTI Trans. Electrical Eng., Elect. Commun., vol 6, pp. 3-14, Feb. 2008.
[10] M. Tanimoto, “Overview of free viewpoint television,” Signal Process.-Image Commun., vol. 21, pp. 454-461, July 2006.
[11] F. Isgrò, E. Trucco, P. Kauff, and O. Schreer, “Three-dimensional image processing in the future of immersive media,” IEEE Trans. Circuits Syst. Video Technol., vol. 14, pp. 388-303, Mar. 2003.
[12] A. Smolic and P. Kauff, “Interactive 3-D video representation and coding technologies,” Proc. IEEE, vol. 93, pp. 98-110, Jan. 2005.
[13] S. Adedoyin, W.A.C. Fernando, A. Aggoun, and K.M. Kondoz, “Motion and disparity estimation with self adapted evolutionary strategy in 3D video coding,” IEEE Trans. Consum. Electron., vol. 53, pp. 1768-1775, Nov. 2007.
[14] X. Guo and Q. Huang, “Multiview video coding based on global motion model,” Lecture Notes Comput. Sci., vol. 3333, pp. 665-672, Dec. 2004.
[15] G. Li and Y. He, “A novel multiview video coding scheme based on H.264,” in Proc. ICICS-PCM, Dec. 2003, pp. 493-497.
[16] Video Codec for Audiovisual Services at px64 kbits/s, ITU-T Rec. H.261 v1, 1990.
[17] Video Coding for Low Bit-rate Communication, ITU-T Rec. H.263, 1998.
[18] A. Vetro, P. Pandit, H. Kimata, A. Smolic and Y. K. Wang, ISO/IEC JTC1/SC29/WG11 and ITU-T Q6/SG16. Joint Multiview Video Model (JMVM) 8.0, Doc. JVT-AA207, Geneva, Switzerland, Apr. 2008.
[19] Description of Core Experiments in MVC, ISO/IEC JTC1/SC29/WG11, MPEG2006/W7798, 2006.
[20] A. Kaup and U. Fecker, “Analysis of multi-reference block matching for multi-view video coding,” in Proc. WSDB, Sept. 2006, pp. 33-39.
[21] G. J. Sullivan and T. Wiegand, “Rate-distortion optimization for video compression,” IEEE Signal Process. Mag., vol. 15, pp. 74-90, Nov. 1998.
[22] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, Fast mode decision for intra prediction in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, pp. 813-822, July 2005.
[23] D. Wu, F. Pan, K. P. Lim, S. Wu, Z. G. Li, X. Lin, S. Rahardja, and C. C. Ko, “Fast inter mode decision in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, pp. 953-958, July 2005.
[24] L. F. Ding, S. Y. Chien, and L. G. Chen, “Joint prediction algorithm and architecture for stereo video hybrid coding systems,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, pp.1324-1337, Nov. 2006.
[25] X. M. Li, D. B. Zhao, X. Y. Ji, Q. Wang, and W. Gao, “A fast inter frame prediction algorithm for multi-view video coding,” in Proc. IEEE ICIP, Sep. 2007, pp. 417-420.
[26] L. S. Young, S. K. Mu, and C. K. Dong, “An object-based mode decision algorithm for multi-view video coding,” in Proc. IEEE ISM, Dec. 2008, pp. 74-81.
[27] W. Zhu, X. Tian, F. Zhou, and Y. Chen, “Fast inter mode decision based on textural segmentation and correlations for multiview video coding,” IEEE Trans. Consum. Electron., vol. 56, pp. 1696-1704,Aug. 2010.
[28] T. Wiegand, G. J. Sullivan, G. Bjøntegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 560-576, July 2003.
[29] R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision. Cambridge, U.K.: Cambridge Univ. Press, 2000.
[30] I. E. G. Richardson, H.264 and MPEG-4 Video Compression. Chichester, UK: John Wiley & Sons, 2003.
[31] Y. Kim, J. Kim, and K. Sohn, “Fast disparity and motion estimation for multi-view video coding,” IEEE Trans. Consum. Electron., vol. 53, pp. 712-719, May 2007.
[32] L.F. Ding, P. K. Tsung, W. Y. Chen, S. Y. Chien, and L. G. Chen “Fast motion estimation with inter-view motion vector prediction for stereo and multiview video coding” in Proc. IEEE ICASSP, Mar. 2008, pp. 1373-1376.
[33] X. Li, D. Zhao, S. Ma, and W. Gao, “Fast disparity and motion estimation based on correlations for multiview video coding,” IEEE Trans. Consum. Electron., vol. 54, pp. 2037-2044, Nov. 2008.
[34] T. Zahariadis, D. Kalivas, "A spiral search algorithm for fast estimation of block motion vectors," in Proc. EUSIPCO, Sept. 1996, pp. 1079-1082.
[35] X. Xu and Y. He, “Fast disparity motion estimation in MVC based on range prediction,” in Proc. IEEE ICIP, Oct. 2008, pp. 2000-2003.
[36] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast mode decision algorithm for intra prediction in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, pp. 813–822, July 2005.