簡易檢索 / 詳目顯示

研究生: 陳亭宇
Chen, Ting-Yu
論文名稱: 具高表現與低功率之適應性類神經模糊網路數位硬體設計
Digital Hardware Design of an Adaptive Neuro-Fuzzy Network with High Performance and Low Power Consumption
指導教授: 王振興
Wang, Jeen-Shing
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2007
畢業學年度: 95
語文別: 英文
論文頁數: 89
中文關鍵詞: 硬體設計高表現
外文關鍵詞: hardware design, high performance
相關次數: 點閱:51下載:1
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文主旨在將具有線上學習的功能之類神經模糊網路設計成一個低功率和高表現的硬體電路。在硬體電路之整體效能評量方面,由於平均功率消耗(P)、延遲(D)和晶片面積消耗(A)彼此之間之互相影響,無法同時達到最小值;因此,為使整體設計達到最佳表現,研究目標為利用有效之設計方法,達到平均功率消耗、計算執行時間和晶片面積消耗的乘積(PDA)之最小值。首先,我們利用了高階合成設計概念對於整體類神經模糊網路做排程和配置,並比較排程和配置後的結果。由此結果推演出新的排程演算法,進而使整體設計晶片面積達到最小值。接著,再藉由實現各種不同管線式(pipeline)架構之類神經模糊網路之電路的波形和合成資訊來比較其優缺點,並應用低功率電路設計理論來降低功率消耗和方法且結合動態控制步驟(dynamic control steps)加快執行速度。從模擬結果我們發現,結合全域性非同步與區域性同步(globally-asynchronous locally-synchronous)之前饋電路與同步(synchronous)管線式之倒傳遞演算法電路可達成本研究目標。最後,將電路實際測試於智慧型駕車系統軟硬體模擬平台上,驗證其效能。

    This study focuses on the digital hardware design of an adaptive neuro-fuzzy (NF) network with high performance and low power consumption. In the performance evaluation of a hardware design, since there is a tradeoff among power, delay, and area consumption, it is difficult if not impossible to reach their minima simultaneously. Hence, our design objective is to minimize the cost function, formulated as the product of power, delay (execution time) and area consumption (PDA), of the hardware design of the neuro-fuzzy network. First, based on the concepts of the high level synthesis, we scheduled the data flow graph (DFG) of the NF network and allocated the required functional units of the network computation. According to the scheduling and allocation results, we developed a novel datapath scheduling algorithm which can minimize the area of the NF chip. Next, we accessed the advantages and drawbacks of different pipeline structures by comparing the waveforms and synthesis results of different pipeline structures implemented on NF network. We then applied low-power designs and dynamic control steps to the structures for reducing the power consumption and increasing the execution time, respectively. From the simulations and synthesis results, we found that the structure with minimum PDA is to implement the feedforward and backpropagation circuit of the NF network in a globally-asynchronous locally synchronous (GALS) structure and a synchronous pipeline structure, respectively. Finally, the effectiveness of our design has been validated in the hardware/software co-simulation platform of an intelligent car-driving system.

    CHINESE ABSTRACT i ABSTRACT ii ACKNOWLEDGEMENT iii LIST OF TABLES vi LIST OF FIGURES vii Chapter 1 Introduction 1-1 1.1 Motivation 1-1 1.2 Literature Survey 1-2 1.3 Purpose of Study 1-4 1.4 Organization of the Thesis 1-5 Chapter 2 Design Methodologies of High Level Synthesis for Neuro-Fuzzy Networks 2-1 2.1 Adaptive Neuro-Fuzzy Network 2-2 2.1.1 Feedforward Path 2-4 2.1.2 Backpropagation Learning Algorithm 2-5 2.2 High Level Synthesis 2-6 2.2.1 Integer Linear Programming Method 2-8 2.2.2 Integer Linear Programming Method with Chaining 2-14 2.3 Modified Integer Linear Programming Method 2-18 2.3.1 Constraint Multiplexers and De-multiplexers 2-21 2.3.2 A Novel Scheduling Algorithm 2-24 Chapter 3 Hardware Implementations of Adaptive Neuro-Fuzzy Networks 3-1 3.1 Asynchronous Pipeline Structure 3-1 3.2 Synchronous Pipeline Structure 3-3 3.3 Globally Asynchronous Locally Synchronous Pipeline Structure 3-5 3.4 Final Structure of Neuro-Fuzzy Networks 3-9 Chapter 4 The Theorems of Low Power VLSI Circuit Design in Our NF Network 4-1 4.1 Pre-Computation 4-2 4.2 Signal Gating 4-4 4.3 Gray Code 4-6 4.4 Globally Asynchronous Locally Synchronous Structure 4-9 4.5 Multicycle Operations 4-10 Chapter 5 Specification, Verification, Comparison and Simulation Results 5-1 5.1 Design Specification 5-2 5.2 I/O Definition 5-3 5.3 Timing Verification and Comparison 5-3 5.3.1 Synthesis Results 5-4 5.3.2 Waveform Checking 5-5 5.4 Area Comparison 5-6 5.4.1 Synthesis Results 5-7 5.5 Power Comparison 5-8 5.5.1 Power Estimation by PrimePower 5-8 5.6 Performance Comparison 5-9 5.7 Prototyping and System Verification 5-10 5.7.1 Prototyping 5-10 5.7.2 Application on Intelligent Car-Driving System 5-12 5.7.3 Simulation Results 5-15 Chapter 6 Conclusions and Future Work 6-1 6.1 Conclusions 6-1 6.2 Future Work 6-3

    [1] J. S. Wang and C. S. G. Lee, “Self-adaptive neuro-fuzzy inference systems for classification applications,” IEEE Trans. Fuzzy Systems, vol. 10, no. 6, pp. 790-802, Dec. 2002.
    [2] J. S. Wang and C. S. G. Lee, “Self-adaptive recurrent neuro-fuzzy control of an autonomous underwater vehicle,” IEEE Trans. Robotics and Automation, vol. 19, no. 2, pp. 283-295, Apr. 2003.
    [3] A. Rubaai, R. Kotaru, and M. D. Kankam, “A continually online-trained neural network controller for brushless DC motor drives,” IEEE Trans. Industry Applicatations, vol. 36, no. 2, pp. 475-483, Mar.-Apr. 2000.
    [4] Y. H. Kuo, C. I. Kao, and J. J. Chen, “A fuzzy neural network model and its hardware implementation,” IEEE Trans. Fuzzy Systems, vol. 1, no. 3, pp. 171-183, Aug. 1993.
    [5] M. A. Cantin, Y. Blaquiere, Y. Savaria, P. Lavoie, and E. Granger, “Analysis of quantization effects in a digital hardware implementation of a fuzzy ART neural network algorithm,” IEEE Int’l. Symp. Circuits and Systems, vol. 3, pp. 141-144, May 2000.
    [6] T. Hollstein, S. K. Halgamuge, and M. Glesner, “Computer-aided design of fuzzy systems based on generic VHDL specifications,” IEEE Trans. Fuzzy Systems, vol. 4, no. 4, pp. 403-417, Nov. 1996.
    [7] R. Gonzalez, A. Torralba, and L.G. Franquelo, “AFAN: Tool for optimizing fuzzy controllers,” IEEE Micro, vol. 17, no. 5, pp. 50-54, Sept.-Oct. 1997.
    [8] J. F. Hurdle, “The synthesis of compact fuzzy neural circuits,” IEEE Trans. Fuzzy Systems, vol. 5, no. 1, pp. 44-55, Feb. 1997.
    [9] C. F. Juang and J. S. Chen, “Water bath temperature control by a recurrent fuzzy controller and its FPGA implementation,” IEEE Trans. Industrial Electronics, vol. 53, no. 3, pp. 941-949, June 2006.
    [10] C. F. Juang andC. H. Hsu, “Temperature control by chip-implemented adaptive recurrent fuzzy controller designed by evolutionary algorithm,” IEEE Trans. Circuits and Systems, vol. 52, no. 11, pp. 2376-2384, Nov. 2005.
    [11] J. M. Jou, P. Y. Chen, and S. F. Yang, “An adaptive fuzzy logic controller: Its VLSI architecture and applications,” IEEE Trans. Very Large Scale Integration (VLSI) Systems, vol. 8, no. 1, pp. 52-60, Feb. 2000.
    [12] T. H. Li, C. S. Hsu, and Y. J. Su, “Design of a dynamic fuzzy controller IC with application garage parking control,” in Proc. 9th IEEE Int’l. Conf. Fuzzy Systems, vol. 1, pp. 239-244, May. 2000.
    [13] S. H. Lee and Z. Bien, “Design of expandable fuzzy inference processor,” IEEE Trans. Consumer Electronics, vol. 40, no. 2, pp. 171-175, May 1994.
    [14] H. Surmann and A. P. Ungering, “Fuzzy rule-based systems on general-purpose processors,” IEEE Micro, vol. 15, no. 4, pp. 40-48, Aug. 1995.
    [15] A. Costa, A. D. Gloria, and M. Olivieri, “Hardware design of asynchronous fuzzy controllers,” IEEE Trans. Fuzzy Systems, vol. 4, no. 3, pp. 328-338, Aug. 1996.
    [16] G. Ascia, V. Catania, and M. Russo, “VLSI hardware architecture for complex fuzzy systems,” IEEE Trans. Fuzzy Systems, vol. 7, no. 5, pp. 553-570, Oct. 1999.
    [17] J. J. Blake, L. P. Maguire, T. M. McGinnity, and L. J. McDaid, “Using Xilinx FPGAs to implement neural networks and fuzzy systems,” IEE Colloquium on Neural and Fuzzy Systems, pp. 1/1-1/4, May 1997.
    [18] M. Miwa, T. Hashiyama, T. Furuhashi, and S. Okuma, “Cerebellar model arithmetic computer with bacterial evolutionary algorithm and its hardware acceleration using FPGA,” in Proc. IEEE Int’l. Conf. Systems, Man, and Cybernetics, vol. 5, pp. 591-595, Oct. 1999.
    [19] T. Yamakawa, “A fuzzy inference engine in nonlinear analog mode and its application to a fuzzy logic control,” IEEE Trans. Neural Networks, vol. 4, no. 3, pp. 496-522, May 1993.
    [20] C. T. P. Song, S. F. Quigley, and S. Pammu, “A novel CMOS analogue fuzzy inference processor,” in Proc. IEEE Int’l. Symp. Circuits and Systems, vol. 3, pp. 247-250, June 1998.
    [21] J. Diamond, W. Pedrycz, and D. McLeod, “Fuzzy JK flip-flops as computational structures: Design and implementation,” IEEE Trans. Circuits and Systems, vol. 41, no. 3, pp. 2376-2384, Mar. 1994.
    [22] J. Tombs, A. J. Torralba, and L. G. Franquelo, “A PWM fuzzy logic controller,” IEEE Micro, vol. 16, no. 5, pp. 68-71, Oct. 1996.
    [23] I. S. Han, “Hardware implementation of neuro-Fuzzy System with the analogue-digital hybrid neural chip,” IEE Colloquium on Neural and Fuzzy Systems, pp. 4/1-4/4, May 1997.
    [24] A. Heittmann and U. Riickert, “Mixed mode VLSI implementation of a neural associative memory,” in Proc. 7th Int’l. Conf. Microelectronics for Neural, Fuzzy and Bio-Inspired Systems, pp. 299-306, April 1999.
    [25] T. S. Gotarredona and B. L. Barranco, “A low-power current mode fuzzy-ART cell,” IEEE Trans. Neural Networks, vol. 17, no. 6, pp. 1666-1673, Nov. 2006.
    [26] M. Chiaberge, E. Miranda, and L. M. Reyneri, “An HW/SW co-design approach for neuro-fuzzy hardware design,” in Proc. 7th Int’l. Conf. Microelectronics for Neural, Fuzzy and Bio-Inspired Systems, pp. 332-337, April 1999.
    [27] M. Russo, “FuGeNeSys: SW and HW implementation,” in Proc. 1st Int’l. Conf. Knowledge-Based Intelligent Electronic Systems, vol. 1, pp. 209-289, May 1997.
    [28] L. M. Reyneri, “Implementation issues of neuro-fuzzy hardware: Going toward HW/SW codesign,” IEEE Trans. Neural Networks, vol. 14, no. 1, pp. 176-193, Jan. 2003.
    [29] J. H. Lin, “A neuro-fuzzy network with online learning capability: Asynchronous pipeline hardware design,” M.S. thesis, Dept. Electron. Eng., National Cheng Kung Univ., Tainan, Taiwan, 2005.
    [30] C. C. Yu, “Hardware design of an adaptive neuro-fuzzy network with on-chip learning capability and fine grain pipeline structure,” M.S. thesis, Dept. Electron. Eng., National Cheng Kung Univ., Tainan, Taiwan, 2006.
    [31] C. T. Hwang, J. H. Lee, and Y. C. Hsu, “A formal approach to the scheduling problem in high level synthesis,” IEEE Trans. Computer-Aided Design, vol. 10, no. 4, pp. 464-475, Apr. 1991.
    [32] D. D. Gajski, N. N. Dutt, C. H. Wu, and Y. L. Lin, High-Level Synthesis: Introduction to Chip and System Design. Kluwer Academic, 1992.
    [33] P. G. Paulin and J. P. Knight, “Force-directed scheduling for the behavioral synthesis of ASIC’s,” IEEE Trans. Computer-Aided Design, vol. 8, no. 6, pp. 661-679, June 1989.
    [34] I. C. Park and C. M. Kyung, “Fast and near optimal scheduling in automatic data synthesis,” in Proc. 28th Design Automation Conference, pp. 680-685, June 1991.
    [35] R. Jain, A. Mujumdar, A. Sharma, and H. Wang, “Empirical evaluation of some high-level synthesis scheduling heuristics,” in Proc. 28th Design Automation Conference, pp. 686-689, June 1991.
    [36] J. H. Lee, Y. C. Hsu, and Y. L. Lin, “A new integer linear programming formulation for the scheduling problem in data path synthesis,” in Proc. IEEE Int’l. Conf. Computer-Aided Design, pp. 20-23, Nov. 1989.
    [37] G. D. Micheli, Synthesis and Optimization of Digital Circuits. McGRAW-HILL, 1994.
    [38] M. R. Corazao, M. A. Khalaf, L. M. Guerra, M. Potkonjak, and J. M. Rabaey, “Performance optimization using template mapping for datapath-intensive high level synthesis,” IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, vol. 15, no. 8, pp. 877-888, Aug. 1996.
    [39] T. Sadakata and Y. Matsunaga, “A simultaneous module selection, scheduling, and allocation method considering operation chaining with multi-functional units,” IEICE Trans. Fundamental, vol. E90-A, no. 4, pp. 792-799, April 2007.
    [40] Y. Yi, I. Nousias, M. Milward, S. Khawam, T. Arslan, and I. Lindsay, “System-level scheduling on instruction cell based reconfigurable systems,” in Proc. Design, Automation and Test in Europe, vol. 1, pp. 1-6, March 2006.
    [41] M. Rim, R. Jain, and R. D. Leone, “Optimal allocation and binding in high-level synthesis,” in Proc. 29th Design Automation Conference, pp. 120-123, June 1992.
    [42] B. Haroun and B. Sajjadi, “ILP synthesis of signal processing architectures with minimum structural complexity,” in Proc. IEEE Custom Integrated Circuits Conference, pp. 237-240, May 1994.
    [43] B. Haroun and B. Sajjadi, “Synthesis of signal processing structured datapaths for FPGAs supporting RAMs and busses,” in Proc. 3th Int’l. ACM Symp. Field-Programmable Gate Arrays, pp. 75-81, 1995.
    [44] J. Teifel and R. Manohar, “An asynchronous dataflow FPGA architecture,” IEEE Trans. Computers, vol. 53, no. 11, pp. 1376-1392, Nov. 2004.
    [45] M. Keating and P. Bricaud, Reuse Methodology Manual, Kluwer Academic Publishers, 2002.
    [46] A. lyer and D. Marculescu, “Power and performance evaluation of globally asynchronous locally synchronous processors,” in Proc. 29th Annual Int’l. Symp. Computer Architecture, pp. 158-168, May 2002.
    [47] C. L. Su, C. Y. Tsui, and A. M. Despain, “Low power architecture design and compilation techniques for high-performance processors,” Digest of Papers on Compcon Spring, pp. 489-498, March 1994.
    [48] M. Alidina, J. Monteiro, S. Devadas, A. Ghosh, and M. Papaefthymiou, “Precomputation-based sequential logic optimization for low power,” IEEE Trans. Very Large Scale Integration (VLSI) Systems, vol. 2, no. 4, pp. 426-436, Dec. 1994.
    [49] H. Kapadia, L. Benini, and G. D. Micheli, “Reducing switching activity on datapath buses with control-signal gating,” IEEE J. Solid-State Circuits, vol. 34, no. 3, pp. 405-414, March 1999.
    [50] H. Mehta, R. M. Owens, and M. J. Irwin, “Some issue in gray code addressing,” in Proc. 6th Great Lakes Symp. VLSI, pp. 178-181, March 1996.
    [51] A. Iranli and M. Pedram, “Cycle-based decomposition of Markov chains with applications to low-power synthesis and sequence compaction for finite state machines,” IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, vol. 25, no. 12, pp. 2712-2725, Dec. 2006.
    [52] A. Bogliolo, L. Benini, B. Ricco, and G. D. Micheli, “Efficient switching activity computation during high-level synthesis of control-dominated designs,” in Proc. Int’l. Symp. Low Power Electronics and Design, pp. 127-132, 1999.
    [53] M. Chiodo, P. Giusto, A. Jurecska, H. C. Hsieh, A. S. Vincentelli, and L. Lavagno, “Hardware-software codesign of embedded systems,” IEEE Micro, vol. 14, no. 4, pp. 26-36, Aug. 1994.
    [54] E. Shaw and K. Khordoc, “System design based on interface specifications,” in Canadian Conf. Electrical and Computer Engineering, vol. 1, pp. 189-193, March 2000.
    [55] F. Vahid, S. Narayan, and D. D. Gajski, “SpecCharts: a VHDL front-end for embedded systems,” IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, vol. 14, no. 6, pp. 694-706, June 1995.
    [56] S. Park and K. Choi, “Latency minimisation by system clock optimization,” Electronics Letters, vol. 34, no. 9, pp. 862-864, April 1998.
    [57] S. P. Mohanty and N. Ranganathan, “Simultaneous peak and average power minimization during datapath scheduling,” IEEE Trans. Circuits and Systems, vol. 52, no. 6, pp. 1157-1165, June 2005.
    [58] S. C. Hsu, “Implementation of motion planning for car-like robots on ARM-based development board,” M.S. thesis, Dept. Electron. Eng., National Cheng Kung Univ., Tainan, Taiwan, 2006.

    下載圖示 校內:2009-08-07公開
    校外:2009-08-07公開
    QR CODE