| Graduate student: | 黃宗德 Huang, Zong-de |
|---|---|
| Thesis title: | 實現於系統晶片之智慧型卡拉OK系統 Realization of an Intelligent Karaoke System on SoC Platform |
| Advisor: | 楊家輝 Yang, Jar-ferr |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science, Institute of Computer & Communication Engineering |
| Year of publication: | 2008 |
| Academic year of graduation: | 96 (ROC calendar) |
| Language: | Chinese |
| Number of pages: | 67 |
| Chinese keyword: | 語音變速變調 |
| English keyword: | Speech time and pitch scaling |
Karaoke Television (KTV), which is popular today, originated in Japan. It combines television video with a karaoke accompaniment system so that users can watch the screen and sing along with the background melody and the lyric captions. KTV relieves the pressures of daily life and has become an important modern leisure activity.
In this thesis, we combine speech modulation technology with the karaoke system we developed. The proposed speech modulation algorithm is derived from the vocal tract (vocoder) model used in speech coding techniques. It processes the singing voice and works together with the main-melody information provided by MIDI in real time. Finally, we implement an intelligent karaoke system on a System-on-Chip (SoC) platform that combines speech modulation with MIDI and automatically corrects off-pitch singing.
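The abstract does not spell out how the MIDI melody drives the pitch correction, so the following is only an illustrative sketch of the underlying arithmetic, not the thesis's algorithm. It assumes 12-tone equal temperament with A4 = 440 Hz (MIDI note 69), and all function names are hypothetical. Given a detected sung frequency and the target MIDI note from the melody track, it computes the pitch-scaling ratio a pitch shifter would need to apply and the size of the error in cents.

```python
import math

A4_HZ = 440.0   # assumed tuning reference
A4_MIDI = 69    # MIDI note number of A4


def midi_to_hz(note: int) -> float:
    """Frequency of a MIDI note number in equal temperament."""
    return A4_HZ * 2.0 ** ((note - A4_MIDI) / 12.0)


def pitch_scale_ratio(sung_hz: float, target_note: int) -> float:
    """Factor by which the sung pitch must be multiplied to hit the target note."""
    if sung_hz <= 0:
        raise ValueError("sung_hz must be positive")
    return midi_to_hz(target_note) / sung_hz


def cents_off(sung_hz: float, target_note: int) -> float:
    """Signed error in cents; positive means the singer is sharp."""
    if sung_hz <= 0:
        raise ValueError("sung_hz must be positive")
    return 1200.0 * math.log2(sung_hz / midi_to_hz(target_note))
```

A ratio above 1 means the voice has to be shifted up, and a ratio below 1 means it has to be shifted down. A real system would also need pitch detection and a time-domain or spectral pitch shifter, neither of which is shown here.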