研究生: |
郭昀昇 Kuo, Yung-Shing |
---|---|
論文名稱: |
基於SPCE061A嵌入式單晶片實現可更換文本架構之互動式語音對話學習硬體系統 Embedded System Design based on SPCE061A for Interactive Spoken Dialogue Learning System with a Programmable Dialogue |
指導教授: |
王駿發
Wang, Jhing-Fa |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
論文出版年: | 2005 |
畢業學年度: | 93 |
語文別: | 英文 |
論文頁數: | 48 |
中文關鍵詞: | 語音學習 、對話系統 、動態時間校準 、微控制器 、嵌入式 |
外文關鍵詞: | Dialogue, LPCC, DTW, SPCE061A, MCU, Embedded System |
相關次數: | 點閱:77 下載:2 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出一可更換文本架構之互動式語音對話學習系統,並將系統實現於微控制器(Micro Controller Unit, MCU) SPCE061A的嵌入式單晶片硬體系統,使其單晶片具備有語音辨識及具有互動性的語音對話功能。在互動式語音對話學習系統上,除可提供內定文本的劇情式情境對話練習外,也可自行錄製對話語句,更換對話文本內容,以實現多樣化的對話內容。而在系統使用過程中,與使用者進行語音對話時,系統將處理使用者對話事件觸發與追蹤、使用者語音對話屬性擷取與系統所要回應使用者對話之事件偵測。對於嵌入式單晶片硬體系統實現方面,本系統將語音對話屬性擷取所用到的LPC參數擷取及事件偵測所需的動態時間校準(DTW)演算法實現在運算時脈及記憶體資源拘限的微控制器(MCU,Micro Controller Unit)SPCE061A硬體模組上,在有限的運算能力及記憶體資源的影響下,本論文內提出利用音訊取樣時的閒置時間,來處理LPC參數擷取,以及利用有限的記憶體資源來執行動態時間校準(DTW)的方法,而達到在音框同步處理上的即時性效能。最後,希望藉由該硬體系統的實現,可以提昇互動式語音對話系統的應用層面,如互動式語音對話IC玩具,使能建構成語音學習及資訊的互通平台,提供國內數位內容產業使用。
In this thesis, an embedded system based on SPCE061A for interactive spoken dialogue learning system (ISDLS) with a programmable dialogue is designed, and it provides capabilities for speech recognition and interactive spoken dialogue. For the proposed ISDLS, besides default dialogue, users can record programmable dialogue content for multiform practice of conversation. When users begin to have a dialogue, ISDLS provides functional capabilities for dialogue trigger and tracking, dialogue attribute extraction and dialogue event feedback to make ISDLS interactive with users. For the design of embedded system, the LPCC features from dialogue attribute extraction and dynamic time warping algorithm for dialogue event feedback are implemented with the low cost and resource-limited micro controller unit (MCU) - SPCE061A. While sampling input speech, the idle time of MCU is used to perform LPCC extraction and DTW for frame-synchronous operations to have real-time capability for ISDLS. We implement the ISDLS embedded system, and the system can be used for the application of interactive speech toys for improving the recreation of digital contents industry.
[1] Christophe L´evy, Georges Linar`es1, Pascal Nocera1, Jean-Francois Bonastre1, “REDUCING COMPUTATIONAL AND MEMORY COST FOR CELLULAR PHONE”, IEEE ICASP 2004
[2] Dong Wang, Liang Zhang, Jia Liu and Runsheng Liu, “EMBEDDED SPEECH RECOGNITION SYSTEM ON 8-BIT MCU CORE”, IEEE ICASP 2004
[3] B.L. Zeigler and B. Mazor, “DIALOG DESIGN FOR A SPEECH-INTERACTIVE AUTOMATION SYSTEM”, GTE Laboratories Incorporated, 1994
[4] Bellman R., Dynamic Programming, Princeton University Press, 1957. EMBEDDED SPEECH RECOGNITION SYSTEM
[5] L. Rabiner and B. H. Juang, “FUNDAMENTALS OF SPEECH RECOGNITION.” Prentice-Hall, Inc., 1993.
[6] HUANG, ACERO and HON, “Spoken Language Processing - A Guide to Theory, Algorithm and System Development”, PH/PTR, 2001
[7] Patterson Hennessy, “Computer Organization and Design - The Hardware/Software Interface”, Morgan Kaufmann, 2000
[8] SUNPLUS, “SPCE061A DataSheet”, SUNPLUS TECHONOLEGY CO., Ltd., DEC. 2004.
[9] SUNPLUS, “IDE User guide”, SUNPLUS TECHONOLEGY CO., Ltd., JULY. 2003.
[10] SUNPLUS, “SPCE040A/060A/061A Programming Guide v1.2”, SUNPLUS TECHONOLEGY CO., Ltd., MAY. 2004.
[11] SUNPLUS, “SUNPLUS SPRS512C Datasheet”, SUNPLUS TECHONOLEGY CO., Ltd., March. 2003.
[12] SUNPLUS, “SUNPLUS SPRS512C Datasheet”, SUNPLUS TECHONOLEGY CO., Ltd., March. 2003
[13] SUNPLUS, “SUNPLUS µ'nSP Instruction Set”, SUNPLUS TECHONOLEGY CO., Ltd., JULY. 2002
[14] SUNPLUS, “C Programming in SUNPLUS SPCE061A”, SUNPLUS TECHONOLEGY CO., Ltd., March. 2003