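| Graduate student: | 黃一展 Huang, Yi-Chan |
|---|---|
| Thesis title: | 諧波偵測及估計於HVXC編碼器之快速實現 Harmonic Detection and Estimation for HVXC Speech Coders |
| Advisor: | 楊家輝 Yang, Jar-Fe |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science, Department of Electrical Engineering (on-the-job master's program) |
| Year of publication: | 2003 |
| Academic year of graduation: | 91 (ROC calendar) |
| Language: | Chinese |
| Number of pages: | 93 |
| Keywords (Chinese, translated): | voiced/unvoiced decision method, low-rate speech coder, harmonic magnitude estimation, harmonic vector excitation coder |
| Keywords (English): | voiced/unvoiced decision algorithm, HVXC, speech coder, MPEG-4, estimation of the harmonic magnitudes |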
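This thesis studies fast algorithmic implementation of the Harmonic Vector eXcitation Coder (HVXC), the low-bit-rate speech coding standard in MPEG-4. First, we propose a multi-stage voiced/unvoiced decision method for the encoder, so that the encoder can adaptively change the flow of the whole algorithm according to the characteristics of the speech signal, saving unnecessary computation without degrading the quality of the synthesized speech. Next, because estimating the harmonic magnitudes first requires an accurate pitch period, we propose a tree search method that adaptively changes its search range according to the characteristics of the speech signal, efficiently saving further unnecessary computation.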
In this thesis, we develop fast algorithms for the MPEG-4 low-bit-rate HVXC speech coder that adaptively reduce computation. First, we propose a novel multi-stage voiced/unvoiced decision algorithm. According to the characteristics of the speech signal being encoded, each decision stage adaptively adjusts the flow of the encoder to avoid unnecessary computation while maintaining the same speech quality. In the HVXC speech coder, the pitch is an indispensable parameter for successful estimation of the harmonic magnitudes. Hence, we also propose a tree search method to refine the estimated pitch. The proposed method adaptively adjusts the search range of candidate pitches according to the characteristics of the speech signal being processed, further reducing needless computation.
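The abstract does not give the details of the tree search, so the following is only a minimal sketch of one way a coarse-to-fine pitch refinement could look, not the method of the thesis. It assumes a harmonic-sum score (DFT magnitudes summed at multiples of a candidate fundamental) and a search that keeps the best of a few candidates at each level and then narrows the range around it. The names `harmonic_score` and `refine_pitch`, and the parameters `half_range`, `branches`, and `depth`, are hypothetical and chosen for illustration.

```python
import cmath
import math


def harmonic_score(frame, f0, fs, n_harm=5):
    """Sum of DFT magnitudes at the first n_harm multiples of f0 (in Hz).

    Assumed scoring function for illustration only.
    """
    total = 0.0
    for k in range(1, n_harm + 1):
        w = 2.0 * math.pi * k * f0 / fs  # angular frequency of harmonic k
        total += abs(sum(x * cmath.exp(-1j * w * n) for n, x in enumerate(frame)))
    return total


def refine_pitch(frame, f0_coarse, fs, half_range=8.0, branches=5, depth=4):
    """Coarse-to-fine search for the fundamental frequency around f0_coarse.

    Each level evaluates `branches` evenly spaced candidates, keeps the best
    one, and shrinks the range to one grid step on either side of it.
    """
    center = f0_coarse
    for _ in range(depth):
        step = 2.0 * half_range / (branches - 1)
        candidates = [center - half_range + i * step for i in range(branches)]
        center = max(candidates, key=lambda f: harmonic_score(frame, f, fs))
        half_range = step  # next level searches only around the winning branch
    return center


if __name__ == "__main__":
    fs = 8000.0
    true_f0 = 200.0
    # Synthetic voiced frame: five equal-amplitude harmonics, 40 ms long.
    frame = [
        sum(math.sin(2.0 * math.pi * k * true_f0 * n / fs) for k in range(1, 6))
        for n in range(320)
    ]
    print(refine_pitch(frame, f0_coarse=195.0, fs=fs))
```

In this sketch the initial `half_range` is the place where a signal-dependent search range could be plugged in, for example a narrower range for frames whose coarse pitch estimate is already reliable. How the thesis actually adapts the range is not stated in the abstract.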