簡易檢索 / 詳目顯示

研究生: 黃琮淵
Huang, Tsung-Yuan
論文名稱: 應用於多種視訊用途之可重組反轉換架構之設計與實現
Design and Implementation of Reconfigurable Inverse Transform Architecture for Multiple Purpose Video Coding
指導教授: 李國君
Lee, Gwo-Giun
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2010
畢業學年度: 98
語文別: 英文
論文頁數: 67
中文關鍵詞: 多規格可重組複雜度分析資料流
外文關鍵詞: multiple standards, reconfigurable, complexity analysis, dataflow
相關次數: 點閱:116下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在這篇論文中,我們提出一個可應用於多規格用途的可重組逆轉換架構以及一個用於 AVC/H.264 的可重組 Hadamard 轉換。藉由伴隨複雜度分析以及資料流建模之由上而下的設計方式以及萃取演算法之間的共通性。透過探索之共通性,提出之可重組設計可以節省 44%的加法器使用量並且有能力去支援包含於MPEG-2 標準中之逆離散餘弦轉換以及 AVC/H.264 標準中所用到的整數逆轉換。此架構透過 TSMC 0.18 微米製程技術合成,工作頻率為 108MHz,其中工作頻率的選擇為透過資料流排程得到的結果。提出之設計的合成面積為 32K 閘,可以看出設計之架構於 VLSI 實現中相較於相關之設計中能有比較好的面積效益。除此之外,提出之逆轉換架構的精度需求能夠符合IEEE1180-1990和ISO/IEC 23002-1標準中的精度規範。因此,基於我們的由上而下的設計方式以及萃取共同性,提出之設計能有較低的面積成本以及足夠的彈性來使用於目標規格為 1920×1088之影像解析度,每秒64張楨圖,以及色彩格式為4:2:0的多規格用途上。

    In this thesis, an area efficient reconfigurable inverse transformation architecture for multiple standards and a directed 2D reconfigurable Hadamard transform architecture for AVC/H.264 are proposed. We present top down design methodology with complexity analysis, commonalities extraction, and dataflow modeling to design reconfigurable architecture. By sharing the commonalities,the adder usage of the proposed reconfigurable inverse transform datapath can be reduced 44% compared with total amount of adders in performing transforms. Then, the reconfigurable architecture is synthesized using TSMC 0.18 technology cell library. The working frequency is 108Mhz, which is derived from the dataflow. The area synthesis result is 32k gates, which indicates that the proposed design has more efficient area than other documented design in VLSI implementation. In addition, the proposed architecture also satisfies the accuracy requirement of IEEE1180-1990 and ISO/IEC 23002-1 specification. Therefore, based on our top-down design approach and commonality extraction, the proposed designs have lower hardware cost and enough flexibility for multi-standard purposes with 1920×1088 resolution and 64 frames per second and the color format is 4:2:0 for real time processing.

    Abstract ii Table of Contents iv List of Tables vii List of Figures viii Chapter 1 Introduction 1 1.1 Background 1 1.2 Motivation 2 1.3 Organization of this Thesis 2 Chapter 2 Overview of Transforms Used in MPEG-2 and AVC/H.264 3 2.1 Introduction to Transform Adopted in MPEG-2 3 2.1.1 DCT/IDCT Properties 4 2.1.2 IDCT Implementation Issues 5 2.1.3 Measure Accuracy Procedure 6 2.2 Introduction to transform in AVC/H.264 7 2.2.1 AVC/H.264 Decoding Flow 7 2.2.2 Introduction to Transforms Adopted in AVC/H.264 8 2.2.2.1 Integer 4×4 transform 8 2.2.2.2 4×4 Hadamard transform 10 2.2.2.3 2×2 Hadamard transform 11 2.2.2.4 Integer 8×8 transform 12 Chapter 3 Introduction to fast transform algorithms and architectures 13 3.1 Fast transform algorithms 13 3.1.1 Chen’s Factorization 14 3.1.2 Lee’s Factorization 14 3.1.3 Fast Inverse Integer Transforms in AVC/H.264 16 3.1.3.1 Inverse 4×4 transform 16 3.1.3.2 Inverse 8×8 transform 17 3.1.3.3 Inverse Hadamard Transform 18 3.2 Transform architectures 20 3.2.1 Distributed Arithmetic 20 3.2.2 CSHM Architectures 20 3.2.3 CORDIC Architectures 21 Chapter 4 Reconfigurable Inverse Transformation Architecture Design 23 4.1 Reconfigurable Video Coding (RVC) 23 4.2 Specification 24 4.3 Proposed Fast IDCT for MPEG-2 25 4.3.1 Scaling Factor Derivation 29 4.3.2 Accuracy measurement 29 4.4 Directed 2D Hadamard Transform 31 4.5 Commonality extraction 33 4.6 Complexity Analysis 34 4.6.1 Number of Operation 34 4.6.2 Degree of Parallelism 36 4.6.3 Pipeline depth 38 4.7 Design Space Exploration 39 4.7.1 High Level Design Space Exploration 40 4.7.1.1 16×16 Granularity 40 4.7.1.2 16×8 Granularity 40 4.7.1.3 8×8 Granularity 41 4.7.1.4 8×4 Granularity 43 4.7.1.5 Conclusion of High Level Design Space Exploration 44 4.7.2 Low Level Design Space Exploration 44 4.7.3 Design Space Exploration with Complexity Analysis 49 4.8 Architecture Design and Implementation 50 4.8.1 Block Diagram of Reconfigurable IQIT module 51 4.8.2 Transpose Memory 52 4.8.3 Reconfigurable Inverse Transform Data Path 53 4.8.4 Reconfigurable Hadamard Transform(RHT) Data Path 56 Chapter 5 Verification and Experimental Results 58 5.1 Verification Process 58 5.1.1 Dataflow model verification 59 5.1.2 Register Transfer Level Verification 59 5.1.3 Gate Level Netlist Verification 60 5.2 Experimental Result 60 5.3 Comparison with related works 62 Chapter 6 Conclusion and future work 63 6.1 Conclusion 63 6.2 Future Work 64 Reference 65

    [1]ISO/IEC 13818-2, Information Technology – Coding of moving pictures and associated audio, 1996.
    [2]ITU-T Recommendation H.264, “Advanced video coding for generic audiovisual services,” Draft, March 2005.
    [3]IEEE CAS Standards Committee, “IEEE Standard Specifications for the Implementations of 8x8 Inverse Discrete Cosine Transform,” IEEE standard 1180-1990, Dec. 1990.
    [4]ISO/IEC JTC/SC29/WG11 N7815 [23002-1 FDIS] Information technology – MPEG video technologies – Part 1: Accuracy requirements for implementation of integer-output 8x8 inverse discrete cosine transform.
    [5]N. Ahmel, T. Natarajan and K.R. Rao, “Discrete cosine transform,” IEEE Trans. Compute., vol. C-23, pp. 88-93, January 1974.
    [6]Rao, K.R., and Yip. P., “Discrete Cosine Transform Algorithms, Advantages, Applications,” Academic Press, Inc. London, 1990.
    [7]H. S. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky, “Low-complexity transform and quantization in H.264/AVC,” IEEE Transaction on Circuits and Systems for Video technology, vol. 13, no.7, pp.598-603, July 2003.
    [8]I. E. G. Richardson, “H.264 and MPEG-4 Video Compression—Video Coding for Next-generation Multimedia,” John Wiley & Sons, 2003
    [9]D. Marpe, T. Wiegand, and S. Gordon, "H.264/MPEG4-AVC fidelity range extensions: Tools, profiles, performance, and application areas," IEEE International Conference on Image Processing, pp.593–596, September 2005.
    [10]W. Chen, C. H. Smith, and S. C. Fralick, “A fast computational algorithm for the discrete cosine transform,” IEEE Transactions on Communications, vol. COMM-25, pp. 1004–1009, September 1977.
    [11]B. G. Lee, “A new algorithm to compute the discrete cosine transform,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp.1243–1245, December 1984.
    [12]S. Gordon, D. Marple, and T. Wiegand, “Simplified Use of 8×8 Transforms-Updated Proposal and Results,” JVT-K028, 11th Meeting ,Munich, Germany, March 2004
    [13]Stanley A. White, “Applications of distributed arithmetic to digital signal processing: a tutorial review,” IEEE ASSP Magazin,vol.6,issue3, pp.4-19, July 1989.
    [14]M. T. Sun, T. C. Chen, and A. M. Gottlieb, “VLSI implementation of a 16 x 16 discrete cosine transform,” IEEE Transactions on Circuits and Systems, vol. 36, no. 4, pp. 610–617, Apr. 1989.
    [15]S. Yu and E. E. Swartzlander Jr., “DCT implementation with distributed arithmetic,” IEEE Transactions on Computations, vol. 50, no. 9, pp. 985–991, Sep. 2001.
    [16]J. Park, S. Kwon, and K. Roy, “Low power reconfigurable DCT design based on sharing multiplication,” in Acoustics, Speech, and Signal Processing, 2002 IEEE International Conference on, 2002, vol. 3, pp.3116-3119.
    [17]C.-C. Sun, P. Donner and J. G¨otze, “Low-complexity multi-purpose IP core for quantized discrete cosine and integer transform,” IEEE ISCAS, pp. 3014-3017, May 2009.
    [18]The MPEG homepage, http://www.chiariglione.org/mpeg
    [19]G. G. Lee, Y.-K. Chen, M. Mattavelli, E. S. Jang, “Algorithm/Architecture Co-Exploration of Visual Computing on Emergent Platforms: Overview and Future Prospects,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 19, no. 11, November 2009.
    [20]K. K. Parhi, VLSI Digital Signal Processing Systems Design and Implementation, John Wiley & Sons, New York, USA, 1999.
    [21]C. Loeffler, A. Lightenberg, and G. S. Moschytz, “Practical fast 1-D DCT algorithms with 11-multiplications,” in Proc. ICASSP, vol. 2, May 1989, pp. 988–991.
    [22]C. -P. FAN and Y. –L. Lin, “Implementations of low-cost hardware sharing architectures for fast 8×8 and 4×4 integer transform in H.264/AVC,” IEICE TRANS FUNDAMENTALS, Vol. E90-A, no. 2, February 2007
    [23]G. G. Lee, “Advanced Reconfigurable Video Coding in Digital Home (2/3),” National Science Council Project Progress Report, 2009.8.1 - 2010.7.31, NSC97-2221-E006-252-MY3.
    [24]Y. C. Chao, H. H. Tsai, Y. H. Lin, J. F. Yang, and B. D. Liu, “A novel design for computation of all transforms in H.264/AVC decoders,” in Proc. IEEE Int. Conf. Multimedia and Expo, pp. 1914–1917, Jul. 2007.
    [25]L. Fanucci and S. Saponara, “Data Driven VLSI Computation for Low Power DCT- Based Video Coding,” in Proc. 9th Int. Conf. Electronics, Circuits, System, pp. 541-544, Sept. 2002.
    [26]S. Lee, and K. Cho, “Architecture of transform circuit for video decoder supporting multiple standards,” Electronics Letters, vol. 44, no. 4, 14th February 2008.
    [27]C. -Y. Huang, L.-F. Chen, and Y.-K. Lai, “A high-speed 2-D transform architecture with unique kernel for multi-standard video applications,” IEEE ISCAS, pp. 21-24, May 2008.

    無法下載圖示
    校外:不公開
    電子論文及紙本論文均尚未授權公開
    QR CODE