簡易檢索 / 詳目顯示

研究生: 吳瑞哲
Wu, Jui-Che
論文名稱: 應用於視覺運算系統之動態估計系統架構設計
System Architecture and Design of Motion Estimation for Visual Computing System
指導教授: 李國君
Lee, Gwo Giun
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2011
畢業學年度: 99
語文別: 英文
論文頁數: 94
中文關鍵詞: 動態估計演算法暨架構共同探索資料流本質演算法複雜度動態補償禎率提升轉換
外文關鍵詞: motion estimation, algorithm/architecture co-exploration, dataflow, intrinsic algorithm complexity, motion compensated frame rate up-conversion
相關次數: 點閱:145下載:4
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 此論文提出的動態估計系統架構為基於演算法暨架構共同探索設計方法所設計,使用由粗顆粒動態向量淬鍊至細顆粒的方法保有極佳演算法效能,於設計演算法時便同時考量架構資訊,而非只專注於改善其精確度的演算法效能之動態估計演算法,卻忽略了伴隨著極高頻寬與繁複運算造成不實際的架構實現,於初期設計運用不同的資料流萃取相對應的本質演算法複雜度以取得系統架構的特性,並評估是否返回修改高抽象層級設計來減少設計成本,於目標規格下,所提出之系統架構已成功實現並驗證於現場可程式化閘陣列平台,支援1920×1080解析度可由60禎率提升至120禎率,時脈操作於148.5赫茲。

    The present thesis proposes the system architecture of motion estimation (ME) based on an algorithm/architecture co-exploration (AAC) design methodology, which is proven to have excellent algorithm performance, with the refinement of the coarse-grained motion vector (MV) into a fine-grained MV. Architecture information is considered in designing the algorithm. However, the use of complex algorithms to enhance the accuracy of ME in terms of algorithm performance is not considered because of the extremely high bandwidth and unduly complex computation required, consequently making their implementation impractical. Varying dataflow enables extraction across a range of intrinsic algorithm complexities as a means to characterize the significance of information related to system architecture in the early phases of design. The intrinsic complexity of the algorithm must be appraised to determine whether back-annotation to a higher level of abstraction should be applied to reduce design costs. In accordance with the target specifications, the proposed system architecture was successfully implemented and verified on a field-programmable array gate (FPGA) board for motion-compensated frame rate up-conversion (MC-FRUC), supporting frame rates from 60 up to 120 frames per second (fps) with a 1920×1080 resolution and at a clock rate of 148.5 MHz.

    Chapter 1 Introduction 1 1.1 Background 1 1.2 Motivation 3 1.3 Organization of Thesis 3 Chapter 2 Used Algorithm of Motion Compensated Frame Rate Up-Conversion 4 2.1 Motion Compensated Frame Rate Up-Conversion (MC-FRUC) 4 2.1.1 Color Conversion 5 2.1.2 Bi-directional Spatial-Temporal Motion Estimation (BiSTME) 5 2.1.2.1 Coarse Motion Estimation (CME) 6 2.1.2.2 Trajectory Tracking Motion Estimation (TTME) 11 2.1.2.3 Refined Motion Estimation (RME) 13 2.1.3 Bi-directional Spatial-Temporal Motion Compensated Interpolation (BiSTMCI) 14 2.1.3.1 Trajectory Tracking Motion Compensation (TTMC) 15 2.1.3.2 Motion Compensated Interpolation (MCI) 16 Chapter 3 System Design of MC-FRUC based on Algorithm/Architecture Co-exploration 17 3.1 Algorithm/Architecture Co-exploration (AAC) 17 3.1.1 Dataflow 20 3.1.2 Intrinsic Complexity Metric 22 3.2 Specification 24 3.3 System Design of MC-FRUC based on AAC 24 3.3.1 High-level Dataflow of BiSTME 25 3.3.2 Complexity Analysis of BiSTME 31 3.4 Design Space Exploration 39 Chapter 4 System Architecture Design for BiSTME 45 4.1 System Architecture of BiSTME 45 4.2 Low-level Dataflow of BiSTME 46 4.2.1 Low-level Dataflow of CME 47 4.2.2 Low-level Dataflow of TTME 54 4.2.3 Low-level Dataflow of RME 57 4.3 Memory Configuration for BiSTME 61 4.3.1 Memory Configuration for Reference Picture Data 62 4.3.2 Memory Configuration for Current Picture Data 65 4.3.3 Memory Configuration for Motion Vector 68 4.3.4 Summary of Memory Configuration 71 4.4 Microarchitecture of BiSTME 72 4.4.1 Microarchitecture of CME 75 4.4.2 Microarchitecture of TTME 78 4.4.3 Microarchitecture of RME 78 Chapter 5 Verification and Experimental Results 80 5.1 Verification Plan 80 5.1.1 Verification of Dataflow 80 5.1.2 Verification of RTL Model 81 5.1.3 Verification of Gate-level Model 81 5.2 Experimental Result 82 5.3 Comparison with Related Works 88 Chapter 6 Conclusion and Future Work 90 6.1 Conclusion 90 6.2 Future Work 91 Reference 92

    [1]Gwo Giun (Chris) Lee, Yen-Kuang Chen, Marco Mattavelli and Euee S. Jang, “Algorithm/Architecture Co-Exploration of Visual Computing on Emergent Platforms: Overview and Future Prospects,” IEEE Trans. on Circuits and Systems for Video Technology, Vol.19, Iss. 11, pp. 1576-1587, Nov. 2009.
    [2]He-Yuan Lin, “Algorithm/Architecture Co-Exploration for Visual Computing,” Ph.D. dissertation, National Cheng Kung University, Tainan, Taiwan, Republic of China, July. 2010.
    [3]T. Wiegand, G. J. Sullivan, G. Bjøntegaard and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Trans. on Circuits and Systems for Video Technology, Vol. 13, pp. 560-576, July. 2003.
    [4]A. Aydin Alatan and Levent Onural, “Estimation of depth fields suitable for video compression based on 3-D structure and motion of objects,” IEEE Trans. on Image Processing, Vol. 7, No.6, pp. 904-908, June. 1998.
    [5]G. de Haan and E.B. Bellers, “Deinterlacing - An overview,” Proceedings of the IEEE, Vol.86, No.9, pp. 1839-1857, Sep. 1998.
    [6]S. C. Park, M. K. Park and M. G. Kang, “Super-resolution image reconstruction: A technical overview,” IEEE Signal Processing Magazine, Vol. 20, Iss. 3, pp. 21-36, May. 2003.
    [7]Gwo Giun Lee, Ming-Jiun Wang, He-Yuan Lin, Drew Wei-Chi Su and Bo-Yun Lin, “Algorithm/Architecture Co-Design of 3D Spatio-Temporal Motion Estimation for Video Coding,” IEEE Trans. on Multimedia, Vol.9, Iss. 3, pp. 455-465, Apr. 2007.
    [8]C. Erbas. System-Level Modeling and Design Space Exploration for Multiprocessor Embedded System-on-Chip Architectures. PhD dissertation, 2006.
    [9]B. Kienhuis, E. Deprettere, K. Vissers, and P. van der Wolf, “An approach for quantitative analysis of application-specfic dataflow architectures,” In Proc. of the Int. Conf. Application-specific Systems, Architectures and Processors, pp. 338–349, 1997.
    [10]F. Balarin, E. Sentovich, M. Chiodo, P. Giusto, H. Hsieh, B. Tabbara, A. Jurecska, L. Lavagno, C. Passerone, K. Suzuki, and A. Sangiovanni-Vincentelli, “Hardware-Software Co-design of Embedded Systems – The POLIS approach,” Kluwer Academic Publishers, 1997.
    [11]Theo A. C. M. Claasen, “An industry perspective on current and future state of the art in system-on-chip (SoC) technology,” Proceedings of the IEEE, Vol. 94, No. 6, pp. 1121-1137, Jun. 2006.
    [12]G. G. Lee, E. S. Jang, M. Mattavelli, and C. J. Tsai, “Text of ISO/IEC FCD 23001-4 Codec Configuration Representation,” ISO/IEC JTC1SC29/WG11, MPEG w9772, Archamps, France, April, 2008.
    [13]Y. S. Tung, G. G. Lee, E.S. Jang, S. Lee, K. Asai, Y. Yamada, and M. Mattavelli, “Text of ISO/IEC FCD 23002-4 Video Tool Library,” ISO/IEC JTC1SC29/ WG11, MPEG w9774, Archamps, France, April, 2008.
    [14]N. Zhang, “Algorithm/architecture co-design for wireless communications systems,” Ph.D. dissertation, EECS, University of California, Berkeley, USA, July, 2001.
    [15]Michael Keating, Pierre Bricaud, “Reuse Methodology Manual,” Kluwer Academic Publishers, 2002.
    [16]Gwo Giun Lee and He-Yuan Lin, “Quantifying Intrinsic Parallelism Via Eigen-Decomposition of Dataflow Graphs for Algorithm/Architecture Co-Exploration,” IEEE 2010 Workshop on Signal Processing Systems (SIPS), pp. 317-322, Oct. 2010.
    [17]J. W. Janneck, D. Miller and D. B. Parlour, “Profiling dataflow programs,” Proceeding of IEEE ICME 2008, pp. 1065-1068, June 2008.
    [18]J. Gorin, M. Raulet, Y-L. Cheng, H-Y. Lin, N. Siret, K. Sugimoto and G. G. Lee, “An RVC Dataflow Description of the AVC Constrained Baseline Profile Decoder,” IEEE International Conference on Image Processing (ICIP2009).
    [19]Gilles KAHN, “The Semantics of a Simple Language for Parallel Programming,” Information Processing 74, North-Holland Publishing, 1974.
    [20]E. A. de Kock, W. J. M. Smits, P. van der Wolf, J.-Y. Brunel, W. M. Kruijtzer, P. Lieverse, K. A. Vissers, and G. Essink, “YAPI: Application modeling for signal processing systems,” in Proc. Design AutomationConf., Los Angeles, CA, 2000, pp. 402–405.
    [21]E.A. Lee and T.M. Parks, “Dataflow process networks,” Proceedings of the IEEE, Vol. 83, pp. 773–801, 1995.
    [22]M. Keaton, P. Bricaud, “Reuse Methodology Manual For System–On–A–Chip Designs” Kluwer Academic Publishers, 1998.
    [23]L. Cai and D. Gajski, “Transaction Level Modeling: An Overview,” Proc. Int'l Conf. HW/SW Codesign and System Synthesis, pp. 19–24, Oct. 2003.
    [24]http://www.microtronix.com/_literature_43354/ViClaro_III_Video_Host_Board.
    [25]Suk-Ju Kang, Dong-Gon Yoo, Sung-Kyu Lee and Young Hwan Kim, “Hardware implementation of motion estimation using a sub-sampled block for frame rate up-conversion,” International SoC Design Conference (ISOCC), pp. II-101 - II-104, Nov. 2008.
    [26]Truong Quang Vinh, Seok-Hwi Park and Young-Chul Kim, “Efficient Architecture for Hierarchical Bidirectional Motion Estimation,” IEEE international conference on Computational Intelligence and Computing Research (ICCIC), pp. 28-29, Dec. 2010.
    [27]O. Tasdizen, H. Kukner, A. Akin and I. Hamzaoglu, “A High Performance Reconfigurable Motion Estimation Hardware Architecture,” Design, Automation & Test in Europe Conference & Exhibition (DATE), pp. 882-885, April. 2009.
    [28]Ozgur Tasdizen and Ilker Hamzaoglu, “A Reconfigurable Frame Interpolation Hardware Architecture for High Definition Video,” Euromicro Coference on Digital System Design, Architectures, Methods and Tools, pp. 714-719, Aug. 2009.

    下載圖示 校內:2021-12-31公開
    校外:2021-12-31公開
    QR CODE