簡易檢索 / 詳目顯示

研究生: 莫明華
Mo, Ming-Hua
論文名稱: 基於低成本SPCE061A之無線擬雙人語音互動嵌入式系統設計
An Embedded System Design for Ubiquitous and Anthropomorphic Speech Interaction based on a Cost Effective SPCE061A Micro Controller
指導教授: 王駿發
Wang, Jhing-Fa
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2006
畢業學年度: 94
語文別: 英文
論文頁數: 66
中文關鍵詞: 嵌入式系統設計動態時間校準RF模組
外文關鍵詞: Embedded System Design, RF Module, DTW
相關次數: 點閱:74下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文提出可更換文本架構之互動式無線聲控及對話學習系統,並將系統實現於微控制器(Micro Controller Unit, MCU) SPCE061A嵌入式晶片硬體系統。另外並比較市面上的語音晶片,說明為何選擇SPCE061A MCU作為系統平台。
    在硬體系統部份,使用二組SPCE061A (Master Device and Slave Device) 配合RF模組傳送語音控制信號。Master Device將語音辨識結果透過RF模組傳送到Slave
    Device,當Slave Device接收到控制信號立即產生對應動作。
    在軟體系統部分,主要分為錄音程序、放音程序與語音辨識程序。本論文以此三個程序為核心提出一個互動式系統,此系統包含六個功能;聲控、字典查詢、固定劇本對話、可程式對話、錄音、放音與詞入文出功能,並詳細探討實現的步驟。
    語音辨識程序的功能為語音事件的觸發與追蹤、語音特徵擷取及語音特徵比對。另外,語音辨識程序、錄音程序與放音程序可組合對話學習核心,其功能為模擬雙人的對話,除可提供內定文本的劇情式情境對話,也可自行錄製對話語句,更換對話文本內容,以實現多樣化的學習環境。
    本系統將語音屬性擷取所用到的LPC參數擷取及語音屬性比對所需的動態時間校準(DTW)演算法實現在運算時脈及記憶體資源有限的微控制器(MCU, Micro Controller Unit) SPCE061A,在有限的運算能力及記憶體資源的影響下,本論文提出利用音訊取樣時的閒置時間來處理LPC參數擷取,以及利用有限的記憶體資源來執行動態時間校準(DTW)的方法,而達到與音框同步處理的即時效能。
    最後,希望藉由硬體系統的實現,可以提昇系統的應用層面,提供國內數位內容產業使用。

    In this thesis, we propose an embedded system based on SPCE061A for ubiquitous voice control and interactive dialogue learning system. There are many different kinds of MCU in the market, and we have chosen many kinds of MCU to compare. And explain why we choose SPCE061A MCU as our purpose-built system.
    For hardware design phase, we used two SPCE061A chips (one is for master device and the other is for slave device) and a RF module to communicate the master/slave device. Master device transmit the speech recognition result through RF Module. When Slave Device receives the control signal from the master device, it will generate corresponding response to user.
    For software design phase, system is divided into speech recording procedure, speech playing procedure and speech recognition procedure. We use the speech recording procedure, speech playing procedure and speech recognition procedure as kernel to implement interactive system application. For interactive system applications, we proposed six functions: voice control, dictionary query, fixed dialogue, programmable dialogue, speech record/play and vocabulary-in sentence-out. And probe into the implementation step in detail.
    For the design of embedded system, the LPCC features from speech attribute extraction and dynamic time warping algorithm (DTW) for speech attribute match are implement with the low cost and resource-limited micro controller unit (MCU) - SPCE061A. While sampling input speech data, the idle time of MCU is used to perform LPCC extraction and DTW for frame-synchronous operations to have real-time capability.
    We implement the proposed system as an embedded system, and the system can be used for the application of interactive speech toys for improving the recreation of digital contents industry.

    中文摘要 iii ABSTRACT v ACKNOWLEDGEMENTS vii CONTENTS .viii LIST OF FIGURES xi LIST OF TABLES xiv CHAPTER 1 INTRODUCTION - 1 - 1.1 Background - 1 - 1.2 Previous works - 2 - 1.2.1 Speech Recognition – Deoandent For 8051 (ITRI) - 2 - 1.2.2 Voice Me (HOTECK) - 3 - 1.2.3 Voicedex (HOTECK) - 4 - 1.3 Motivation - 5 - 1.4 Organization of Thesis - 6 - CHAPTER 2 THE ARCHITECTURE OF THE PROPOSED SYSTEM - 7 - 2.1 System description - 7 - 2.1.1 Master device - 8 - 2.1.2 Slave device - 9 - 2.2 Speech recording procedure - 10 - 2.3 Speech playing procedure - 11 - 2.4 Speech recognition procedure - 13 - 2.4.1 Endpoint detection - 13 - 2.4.2 Feature extraction - 17 - 2.4.3 DTW (Dynamic Time Warping) - 19 - CHAPTER 3 EMBEDDED SYSTEM DESIGN FOR THE PROPOSED SYSTEM BASED ON SPCE061A - 21 - 3.1 Overview of SPCE061A - 22 - 3.2 Hardware architecture of the proposed system - 27 - 3.2.1 Regulation Circuit - 28 - 3.2.2 RF Module - 29 - 3.2.3 Motor Driving Circuit - 30 - 3.2.4 External Flash ROM - 31 - 3.2.5 Microphone - 32 - 3.2.6 Amplifier and Speaker - 33 - 3.3 Software architecture of the proposed system - 33 - 3.3.1 FIR Filter (Finite Impulse Response) - 34 - 3.3.2 Frame-Synchronous design for LPCC - 35 - 3.3.3 Fixed-point design for LPCCs features - 37 - CHAPTER 4 SYSTEM APPLICATION AND IMPLEMENTATION - 39 - 4.1 Master device implementation - 39 - 4.2 Slave device implementation - 41 - 4.3 System board implementation - 42 - 4.4 System applications - 43 - 4.4.1 System flow of master device and slave device - 43 - 4.4.2 Memory allocation - 46 - 4.4.3 Architecture of system applications - 47 - 4.4.4 Six examples of system applications - 48 - 4.5 System verification - 57 - CHAPTER 5 CONCLUSIONS AND FUTURE WORKS - 58 - REFERENCES - 60 - APPENDIX - 62 - 作者簡歷 - 66 -

    [1] Pro-Chuan Lin, Jhing-Fa Wang, Shun-Chieh Lin, and Ming-Hua Mo, “An Embedded System Design for Ubiquitous Speech Interactive Applications Based on a Cost Effective SPCE061A Micro Controller”, Accepted to the 3rd IFIP International Conference on Ubiquitous Intelligence and Computing (UIC-06) and published to LECTURE NOTES IN COMPUTER SCIENCE (LNCS)
    [2] Yung-Shing Kuo, Jhing-Fa Wang, “Embedded System Design based on SPCE061A for Interactive Spoken Dialogue Learning System with a Programmable Dialogue”
    [3] K. Sukun, N. Sergiu and P. Rabin K., “Hardware Speech Recognition in Low Cost, Low Power Devices”, computer science division (university of California, Berkeley) cs252 class project, Spring 2003.
    [4] B.L. Zeigler and B. Mazor, “DIALOG DESIGN FOR A SPEECH-INTERACTIVE AUTOMATION SYSTEM”, GTE Laboratories Incorporated, 1994
    [5] Bellman R., Dynamic Programming, Princeton University Press, 1957. EMBEDDED SPEECH RECOGNITION SYSTEM
    [6] L. Rabiner and B. H. Juang, “FUNDAMENTALS OF SPEECH RECOGNITION.” Prentice-Hall, Inc., 1993.
    [7] Li-Chang Wen, “Partial Matching Algorithm and SOPC Design for Multi-Language Spoken Sentence Retrieval”, Master’s thesis, Department of Electrical Engineering, National Cheng Jung University, 2004.
    [8] Patterson Hennessy, “Computer Organization and Design - The Hardware/Software Interface”, Morgan Kaufmann, 2000
    [9] SUNPLUS, “SPCE061A DataSheet”, SUNPLUS TECHONOLEGY CO., Ltd., DEC. 2004.
    [10] SUNPLUS, “IDE User guide”, SUNPLUS TECHONOLEGY CO., Ltd., JULY. 2003.
    [11] SUNPLUS, “SPCE040A/060A/061A Programming Guide v1.2”, SUNPLUS TECHONOLEGY CO., Ltd., MAY. 2004.
    [12] SUNPLUS, “SUNPLUS SPRS512C Datasheet”, SUNPLUS TECHONOLEGY CO., Ltd., March. 2003.
    [13] SUNPLUS, “SUNPLUS SPRS512C Datasheet”, SUNPLUS TECHONOLEGY CO., Ltd., March. 2003
    [14] SUNPLUS, “SUNPLUS µ'nSP® Instruction Set”, SUNPLUS TECHONOLEGY CO., Ltd., JULY. 2002
    [15] SUNPLUS, “C Programming in SUNPLUS SPCE061A”, SUNPLUS TECHONOLEGY CO., Ltd., March. 2003
    [16] EZ talk, http://www.eztalk.to/
    [17] Talk to Me, http://www.auralog.com/
    [18] MyET, http://www.myet.com/en/Index.htm
    [19] LiveABC, http://www.liveabc.com/english/

    下載圖示 校內:2008-08-24公開
    校外:2008-08-24公開
    QR CODE