成功大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	陳毓宏 Chen, Yu-Hung
論文名稱：	在H.264上依量化參數決定的模式決策演算法 A Quantisation Parameter Adaptive Mode Decision Algorithm for H.264
指導教授：	戴顯權 Tai, Shen-Chuan
學位類別：	碩士 Master
系所名稱：	電機資訊學院 - 電腦與通信工程研究所 Institute of Computer & Communication Engineering
論文出版年：	2005
畢業學年度：	93
語文別：	英文
論文頁數：	80
中文關鍵詞：	視訊、動作估計、模式決策
外文關鍵詞：	Video, H.264, AVC, Motion Estimation, Mode Decision, SDBS, QPAMD
相關次數：	點閱：150 下載：1
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

　　H.264在編碼效能上遠勝過MPEG-4 Visual，然而和舊有的標準相比，它有過高運算複雜度的問題。在這份論文中，我們提出兩個有效率的方案來降低H.264編碼端的運算量。第一個方案被稱之為以小菱形為基礎的快速搜尋演算法(SDBS) - 一個用來做整數像素動作向量估計的快速方塊匹配演算法。SDBS演算法內部隱含多種不同的搜尋機制，而且它在任何程度的空間及動作量的影片上都能保有穩定的效能。另一個方案則是被稱為依量化參數決定的模式決策演算法(QPAMD)。它是被用來判斷目前畫面間模式(Inter Mode)的預測結果是否夠好，使得接下來的畫面間模式不必再被調查而可提早結束(提早終止階段)，或者是預測出哪些畫面間預測模式是無效益的並且省略掉對它們的調查(模式選擇階段)。實驗結果顯示，在結合SDBS與QPAMD的環境下可以得到大約是244倍 (針對CIF格式影片)或147倍(針對CCIR-601格式影片)的加速，同時在編碼效能上只會有輕微的PSNR降低或少許位元率的增加。

H.264 can out-perform MPEG-4 Visual in compression efficiency but it has too higher computational complexity when compared to the older standards. In this Thesis, two efficient schemes are proposed to reduce the computation load of the H.264 encoder. The first one is called small diamond based search (SDBS), a fast block-matching algorithm, which will be applied in integer-pixel motion estimation. The SDBS algorithm consists of multiple different search strategies and could achieve stable performance over any different class of spatial detail and temporal motion videos. The other one is called quantisation parameter adaptive mode decision (QPAMD) algorithm, which is used to determine whether the prediction results of the current Inter mode is good enough to omit the investigations of the remaining Inter modes or not (in Early Termination stage), or to find the ineffective Inter predition modes and omit the investigations of them (in Mode Selection stage). The experimental results show the combination of SDBS and QPAMD could achieve a speed-up factor about 244 (for CIF format sequences) or 147 (for CCIR-601 sequences) while a slightly PSNR degradation or bitrate increase with regard to coding efficiency.

LIST OF TABLES	iii
LIST OF FIGURES	v
CHAPTER 1 INTRODUCTION	1
1	Video Compression	1
2	Problems in Implementation of H.264 Encoder	2
3	Organization of This Thesis	3
CHAPTER 2 INTRODUCTION TO H.264/AVC ENCODER	4
1	Overview of H.264/AVC Coding Standard	4
1.1	Profiles and Levels	5
1.2	Variable Block Size Motion Compensation	6
1.3	Multiple Reference Frames Motion Compensation	7
2	Overview of the Motion Estimation Algorithms for H.264	7
2.1	Initial Search Point Prediction for Fast Search	8
2.2	Fast Integer Pixel Motion Estimation – UMHexagonS	10
2.2.1	Unsymmertical-Cross Search	10
2.2.2	Uneven Multi-Hexagonal-Grid Search	12
2.2.3	Extended Hexagon-based Search (EHS)	13
2.2.4	Early Termination for UMHexagonS	14
2.3	Center Biased Fractional Search (CBFPS) Algorithm	14
2.4.1	Introduction to Fractional Pixel Motion Estimation	14
2.4.2	Hierarchical Fractional Pixel Search (HFPS) Algorithm	15
2.4.3	Description of the CBFS Algorithm	16
3	Fast Mode Decision Algorithm in JVT Test Model	18
3.1	SKIP Mode Decision at Early Stage	19
3.2	Selective Intra Mode Decision	19
CHAPTER 3 A NEW FAST SEARCH ALGORITHM FOR MOTION ESTIMATION	22
1	The Concept of Small Diamond Domain	22
2	Small Diamond-based Search (SDBS) Algorithm	24
3.1	Initial Search Pattern	24
3.2	Hirerarchical Motion Search Strategy	24
3.3	Extended Small Diamond-based Search	27
3.4	Handshaking Search Strategy	27
3.5	Motion Adaptive Prediction	29
3	Experimental Results	30
CHAPTER 4 THE PROPOSED MODE DECISION ALGORITHM	35
1	Overview of the Proposed Mode Decision Algorithm	35
2	Observations and Analysis on the Residual Data	37
2.1	4 × 4 Residual Transform in H.264	37
2.2	Pixel-wise QP-adaptive Threshold	39
2.3	Block-wise QP-adaptive Threshold	41
3	Description of QP-Adaptive Mode Decision Algorithm	42
3.1	Early Termination	43
3.2	Mode Selection	47
CHAPTER 5 EXPERIMENTAL RESULTS	51
1	Assessment	51
2	Experiments on CIF Test Sequences	53
3	Experiments on CCIR-601 Test Sequences	63
4	Experiments under Fast High Complexity in JVT	66
CHAPTER 6 CONCLUSIONS AND FUTURE WORKS	76
REFERENCE	77
BIOGRAPHY	80

                                    

[1] ITU-T and ISO/IEC JTC 1, "Generic coding of moving pictures and associated audio information - Part 2: Video," ITU-T Recommendation H.262 - ISO/IEC 13818-2 (MPEG-2), Nov. 1994.
[2] ISO/IEC JTC1, "Coding of audio-visual objects - Part 2: Visual," ISO/IEC 14496-2 (MPEG-4 visual version 1), April 1999; Amendment 1 (version 2), February, 2000; Amendment 4 (streaming profile), Jan. 2001.
[3] Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, "Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC)," ITU-T, Doc. #JVT-G050r1, Mar. 2003.
[4] Joint Video Team Reference Software, Version 9.5 (JM9.5), http://iphome.hhi.de/suehring/tml/download/
[5] Z. Chen, P. Zhou and Y. He, "Fast Motion Estimation for JVT," ITU-T, Doc. #JVT-G016, 2003.
[6] A. K. Jain, "Fundamentals of Digital Image Processing". Englewood Cliffs, NJ: Prentice-Hall, 1989.
[7] Z. Chen, P. Zhou and Y. He, "Fast Integer Pixel and Fractional Pixel Motion Estimation for JVT," ITU-T, Doc. #JVT-F017, 2002.
[8] J. Lee and B. Jeon, "Pruned Mode Decision based on Variable Block Sizes Motion Compensation for H.264," Multimedia Interactive Protocols and Systems (MIPS), Nov. 2003.
[9] I. Choi, J. Lee and B. Jeon, "Efficient coding mode decision in MPEG-4 part-10 AVC/H.264 main profile," 2004 International Conference on Image Processing (ICIP '04), vol. 2, pp. 1141 - 1144, Oct. 24-27, 2004.
[10] J. R. Jain and A. K. Jain, "Displacement measurement and its application in interframe image coding," IEEE Trans. Commun., vol. COM-29, pp. 1799-1806, Dec. 1981.
[11] T. Koga, K. Iinuma, A. Hirano, Y. Iijima, and T. Ishiguro, "Motion compensated interframe coding for video conferencing," Proc. of the National Telecommunications Conference (NTC), pp. G5.3.1--5, New Orleans, LA, Nov. 29-Dec. 3 1981.
[12] R. Li, B. Zeng, and M. L. Liou, "A new three-step search algorithm for block motion estimation," IEEE Trans. Circuits Syst. Video Technol., vol. 4, pp. 438-442, Aug. 1994.
[13] L. M. Po and W. C. Ma, "A novel four-step search algorithm for fast block motion estimation," IEEE Trans. Circuits Syst. Video Technol., vol. 6, pp. 313-317, June 1996.
[14] J. Lu and M. L. Liou, "A simple and efficient search algorithm for blockmatching motion estimation," IEEE Trans. Circuits Syst. Video Technol, vol. 7, pp. 429-433, Apr. 1997.
[15] J. Y. Tham, S. Ranganath, M. Ranganath, and A. A. Kassim, "A novel unrestricted center biased diamond search algorithm for block motion estimation," IEEE Trans. Circuits. Syst. Video Technol., vol. 8, pp. 369-377, Aug. 1998.
[16] S. Zhu and K. K. Ma, "A new diamond search algorithm for fast block-matching motion estimation," IEEE Trans. Image Processing, vol. 9, pp. 287-290, Feb. 2000.
[17] C. Zhu, X. Lin, and L. P. Chau, "Hexagon-based search pattern for fast block motion estimation," IEEE Trans. Circuits Syst. Video Technol, vol. 12, pp. 349-355, May 2002.
[18] X. Jing and L. P. Chau, "An Efficient Three-Step Search Algorithm for Block Motion Estimation," IEEE Trans. Multimedia, vol. 6, pp. 435-438, Jun. 2004.
[19] S. Kappagantula, K. R. Rao, "Motion Compensated Predictive Coding", in SPIE 27th Proc., vol. 432, pp.64-70, Aug. 1983.
[20] M. Bierling, R. Thoma, "Motion compensating field interpolation using a hierarchically structured displacement estimator," Signal Processing, pp.387-404, 1986.
[21] J. Wang, D. Wang, and W. Zhang, "Temporal Compensated Motion Estimation With Simple Block-Based Prediction," IEEE Trans. on Broadcasting, vol. 49, no. 3, pp. 241 - 248, Sep. 2003.
[22] G. Bjontegaard, "Calculation of Average PSNR Differences between RD-curves," Doc. #VCEG-M33, the 13th meeting: Austin, Texas, USA, Apr. 2001.
[23] A. Hallapuro, M. Karczewicz and H. Malvar, "Low Complexity Transform and Quantization - Part I: Basic Implementation," ITU-T, Doc. #JVT-B038, Geneva, Feb. 2002.
[24] Zhou Xuan, Yu Zenghua, and Yu Songyu, "Method for detecting all-zero DCT coefficients ahead of discrete cosine transformation and quantisation," IEE Electronics Letters, vol. 34, pp. 1839-1840, 1998.
[25] J. Makhoul, "A fast cosine transform in one and two dimensions," IEEE. Trans. on Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 27-34, 1980.
[26] G. Sullivan, "Recommended Simulation Common Conditions for H.26L Coding Efficiency Experiments on Low Resolution Progressive Scan Source Material," Doc. #VCEG-N81, the 14th meeting: Santa Barbara, CA, USA. Sep. 2001.

2006-07-08公開

簡易檢索 / 詳目顯示

相關論文