簡易檢索 / 詳目顯示

研究生: 林伯齊
Lin, Bo-chi
論文名稱: 考慮混合時域下與有限資料量應用於點對點的資料傳輸方式之低成本緩衝區結構最佳化方法
Low Cost Buffer Architecture Optimization Methods for Point-to-Point Data Transmission Considering Mixed-Clock Domains and Finite Number of Data Patterns
指導教授: 邱瀝毅
Chiou, Lih-Yih
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2010
畢業學年度: 98
語文別: 英文
論文頁數: 71
中文關鍵詞: 緩衝區最佳化混合時域
外文關鍵詞: buffer, optimization, mixed-clock
相關次數: 點閱:89下載:1
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • GALS系統經常將一個資料流分開成許多較小的功能單元來減少資料在交互傳輸上產生的問題,它使用於大部分的傳送和信號處理的系,因此需要資料緩衝區(buffer) 來溝通兩個不同操作頻率與資料吞吐量的功能單元,再者由於為了迎合各戶的要求在晶片中整合多個標準設計越來越重要,不同的資料樣本經常存在於各種的標準,然而當改變資料樣本的個數和需求將會使的緩衝區的手動撰寫變得複雜。
    在本篇論文我們提出在多種的有限資料樣本和混合時域下緩衝區最佳化的方法,被稱為次緩衝區的叢集,此方法採用圖形著色與次緩衝區切割策略,而設計出的緩衝區在RTL程式的硬體架構不僅包含雙埠SRAM也包含單一埠SRAM,我們的方法有效率的分享資料儲存的緩衝區來找出一個最小的緩衝區。根據實際實驗結果顯示,比較於沒有做最佳化的緩衝區,我們所提出的方法,可以節省59%~80%左右緩衝區的面積

    The GALS approach usually divides a data-flow in the most communication and signal processing systems into numerous smaller enough function units to reduce interconnection problems. Buffers are commonly used in the GALS approach to handle the two function units running with different operating frequencies and throughput requirements. Furthermore, integrating various standards in a chip is increasingly important for meeting versatile consumers’ demands. Multiple data patterns usually exist for various standards. However, the design of buffers capability of handing varying number of data patterns and requirements becomes increasingly complex.
    This work presents a buffer optimization approach capability of dealing with multiple finite data patterns in mixed clock domains, referred to as sub-buffer clustering (SB-Clustering). SB-Clustering adopts a graph coloring first followed by a line partitioning approaches. The buffer thus designed is consisted of not only dual-port SRAM but also single port SRAM at register-transfer level (RTL). Our approach shares the buffers of data patterns effectively and leads to a buffer with smaller memory. Experimental results using realistic test cases show that the proposed method can save the buffer area of the buffer architecture from 59% to 80% when compared with the design without optimization.

    Chapter 1 Introduction 1 1.1 Motivation 1 1.1.1 Mixed-clock domains 1 1.1.2 Buffer minimization 2 1.1.3 Multiple modes on buffer 3 1.1.4 Benefits of high level synthesis 4 1.2 Graph Coloring 6 1.2.1 Vertex coloring 6 1.2.2 Maximum degree first strategy 7 1.3 Thesis Contributions 8 1.4 Thesis Organization 8 Chapter 2 Related Work 9 2.1 Buffer Optimization Methods 9 2.1.1 Data memory minimization by sharing large size buffers 9 2.1.2 Table-based buffer size minimization for multi-mode data transfer under mixed-clock domain 11 I. Single-line-buffer (SLB) 11 II. Parallel-line-buffer (PLB) 12 2.1.2.1 SLBM algorithm 12 2.1.2.2 PLBM algorithm 14 2.1.3 Summary 16 2.2 Mixed-Clock Buffer Design 16 2.2.1 Hardware synthesis for asynchronous communications mechanisms 17 2.2.2 A scalable dual-clock FIFO for data transfers between arbitrary and haltable clock domains 18 2.2.3 Summary 20 Chapter 3 Proposed Low Cost Buffer Architecture Optimization Methods 21 3.1 Problem Definition 21 3.2 Buffer Structure 24 3.3 Proposed Algorithm Flow 27 3.3.1 Initialization and conflict graph generation phase 29 3.3.2 PLB architecture exploration phase 35 3.3.3 RTL codes of buffer controller generation phase 43 3.4 Summary 44 Chapter 4 Experimental Results 45 4.1 Hardware Architecture 45 4.1.1 Architecture of SLB 46 4.1.2 Architecture of PLB 47 4.2 Test Cases 50 4.3 Experimental 1 52 4.3.1 Comparison 1 54 4.3.2 Comparison 2 55 4.4 Experimental 2 58 4.4.1 Comparison with table-based SLBM 59 4.4.2 Comparison with table-based PLBM 60 4.5 Experimental 3 63 Chapter 5 Conclusions and Future Work 67 5.1 Conclusions 67 5.2 Future Work 68 Reference 69

    [1] S. Narayana, “On-Chip Communication Hardware Resources for Globally Asynchronous and Locally Synchronous Systems,” in Proc. of the IEEE Computer Society on the 8th International Symposium on Parallel Architectures, Algorithms and Networks, pp. 208-213, 2005.
    [2] S. Han, X. Guerin, S. Chae, and A.A. Jerraya, “Buffer memory optimization for video codec application modeled in Simulink,” in Proc. of the 43rd annual conference on Design automation, pp. 689-694, 2006.
    [3] Hojun Shim, Naehyuck Chang, and Massoud Pedram, “A Compressed Frame Buffer to Reduce Display Power Consumption in Mobile Systems,” in Proc. of the 2004 Conference on Asia and South Pacific Design Automation, pp. 819- 824, 2004.
    [4] Arvind, R.S. Nikhil, D.L. Rosenband, and N. Dave, “High-level synthesis: an essential ingredient for designing complex ASICs,” in Proc. of the International conference on Computer-aided design, pp. 775-782, 2004.
    [5] P. Coussy, D.D. Gajski, M. Meredith, and A. Takach, "An Introduction to High-Level Synthesis," IEEE Design & Test of Computers, vol.26, no.4, pp.8-17, 2009.
    [6] P. Lieverse. P.V.D. Wolf, E. Deprettere, and K. Vissers, "A methodology for architecture exploration of heterogeneous signal processing systems," in Proc. of the Conference on J.VLSI Signal Processing Systems, pp.181-190, 1999.
    [7] T. Zeitlhofer and B. Wess, “A comparison of graph coloring heuristics for register allocation based on coalescing in interval graphs,” in Proc. of the 2004 International Symposium on Circuits and Systems, pp. IV - 529-32, May 2004.

    [8] J. Vanne, E. Aho, T. Hamalainen, and K. Kuusilinna, “A Parallel Memory System for Variable Block-Size Motion Estimation Algorithms,” IEEE Transactions on Circuits and Systems for Video Technology, vol.18, pp. 538-543, 2008.
    [9] E. Lee and D. Messerschmitt, “Synchronous data flow,” in Proc. of the IEEE, vol.75, pp. 1235-1245, 1987.
    [10] S. Stuijk, M. Geilen, and T. Basten, “SDF^3: SDF For Free,” in Proc. of 6th International Conference on Application of Concurrency to System Design, pp. 276–278, 2006.
    [11] J. Hahn and P.H. Chou, “Buffer optimization and dispatching scheme for embedded systems with behavioral transparency,” in Proc. of the 7th ACM & IEEE International Conference on Embedded Software, pp. 94-103, 2007.
    [12] H. Oh and S. Ha“ Data memory minimization by sharing large size buffers , ” in Proc. of Design Automation Conference on Asia and South Pacific DAC, pp. 491 - 496, Jan. 2000.
    [13] Yu-Shan Chou, “Table-Based Buffer Size Minimization for Multi-mode Data Transfer under Mixed-Clock Domains,” in Department of Electrical Engineering Thesis for Master of Science National Cheng-Kung University, Tainan, Taiwan, July. 2009.
    [14] K. Gorgonio, J. Cortadella, “Hardware Synthesis for Asynchronous Communications Mechanisms,” in Proc. of the International Conference on Chilean Computer Science Society, pp.135-143, Nov. 2008.
    [15] R.W. Apperson, Z. Yu, M.J. Meeuwsen, T. Mohsenin, and B.M. Baas, “A scalable dual-clock FIFO for data transfers between arbitrary and haltable clock domains,” IEEE Transaction on VLSI Systems, vol.15, pp. 1125-1134, 2007.

    下載圖示 校內:2015-09-09公開
    校外:2015-09-09公開
    QR CODE