National Cheng Kung University Electronic Theses and Dissertations System

Graduate student:	黃宗德 Huang, Zong-de
Thesis title:	實現於系統晶片之智慧型卡拉OK系統 Realization of an Intelligent Karaoke System on SoC Platform
Advisor:	楊家輝 Yang, Jar-ferr
Degree:	Master
Department:	College of Electrical Engineering and Computer Science - Institute of Computer & Communication Engineering
Year of publication:	2008
Academic year of graduation:	96 (ROC calendar)
Language:	Chinese
Number of pages:	67
Keywords:	Speech time and pitch scaling (語音變速變調)

Karaoke Television (KTV), which is popular today, originated in Japan. It combines television video with a karaoke accompaniment system so that users can watch the screen and sing along with the background melody and the lyric captions. KTV relieves the pressures of daily life and has become an important leisure activity.

In this thesis, we combine speech modulation technology with the karaoke system we have developed. The proposed speech modulation algorithm extends the vocal tract (vocoder) model used in speech coding. It processes singing-voice signals and works together with the main melody information provided by MIDI in real time. Finally, we implement on a System-on-Chip (SoC) platform an intelligent karaoke system that automatically corrects off-key singing.
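
The abstract does not give the correction procedure itself, so the following is only a minimal sketch of the arithmetic such a system would need: turning a MIDI melody note into a target frequency and deriving the pitch scaling ratio for a sung frame. The function names are hypothetical, and the equal-tempered tuning with A4 = 440 Hz at MIDI note 69 is an assumption; none of this is taken from the thesis.

```python
import math

A4_MIDI_NOTE = 69    # MIDI note number of A4 (standard convention)
A4_FREQ_HZ = 440.0   # assumed concert-pitch tuning, not stated in the record


def midi_note_to_hz(note: int) -> float:
    """Equal-tempered frequency of a MIDI note number."""
    return A4_FREQ_HZ * 2.0 ** ((note - A4_MIDI_NOTE) / 12.0)


def pitch_scale_factor(sung_hz: float, target_note: int) -> float:
    """Ratio by which the sung pitch would be scaled to reach the target note."""
    if sung_hz <= 0.0:
        # Unvoiced frame or failed pitch estimate: leave the pitch unchanged.
        return 1.0
    return midi_note_to_hz(target_note) / sung_hz


def cents_off(sung_hz: float, target_note: int) -> float:
    """Signed deviation from the target note in cents (100 cents = 1 semitone)."""
    return 1200.0 * math.log2(sung_hz / midi_note_to_hz(target_note))
```

For example, a frame sung at 430 Hz against a melody note of 69 is roughly 40 cents flat, and `pitch_scale_factor(430.0, 69)` returns 440/430, the factor a pitch-scaling stage would apply to that frame.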

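Table of Contents
List of Figures
List of Tables
Chapter 1 Introduction
1 Background and Motivation
2 Thesis Outline
Chapter 2 Characteristics of Speech and Singing Voice and Their Modulation Techniques
1 Basic Characteristics of Sound
1.1 The Three Elements of Sound
1.2 Properties of Musical Tones
2 Characteristics of the Singing Voice
2.1 Techniques Used in Singing
2.2 Differences Between Speech and Singing Voice
3 Speech Modulation Techniques
3.1 PSOLA (Pitch Synchronous Overlap and Add)
3.2 WSOLA (Waveform Similarity Overlap and Add)
3.3 Phase Vocoder
Chapter 3 Modulation Algorithm Based on Spectral Parameters
1 Time-Frequency Analysis of Speech
2 Parameterization of the Speech Signal
2.1 Estimation of the Speech Spectral Envelope
2.2 Accurate Pitch Period Estimation
2.3 Voiced/Unvoiced (V/UV) Decision
3 Adjustment of Speed and Pitch Parameters
4 Parametric Synthesis and Post-Processing
4.1 Parametric Synthesis
4.2 Signal Post-Processing
4.3 Smoothing of Voiced/Unvoiced Transition Frames
Chapter 4 Implementation of the Intelligent Karaoke System
1 Architecture of the Hardware Platform
2 Audio Interface Driver Flow of the Hardware Platform
3 Standard MIDI Files and Main Melody Information
3.1 Introduction to the Standard MIDI Format
3.2 Obtaining the Main Melody Information
4 Pitch Correction Flow
Chapter 5 Conclusion
References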
                                    

On campus: publicly available from 2013-08-29
Off campus: publicly available from 2013-08-29
