簡易檢索 / 詳目顯示

研究生: 陳桂雪
Chen, Guei-Shiue
論文名稱: MPEG-4音響編碼與遞迴MDCT演算法之研究與分析
Studies and Analyses of MPEG-4 Audio Coders and Recursive MDCT Algorithms
指導教授: 楊家輝
Yang, Jar-Ferr
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2003
畢業學年度: 91
語文別: 中文
論文頁數: 76
中文關鍵詞: 修正式離散餘弦轉換頻域加權交錯之向量量化音響編碼MPEG-4進階音樂編碼
外文關鍵詞: TwinVQ, MDCT, MPEG-4 AAC, Audio Coding
相關次數: 點閱:112下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文主要針對MPEG-4進階音響編碼和頻域加權交錯向量量化編碼之實現上做研究。根據數位訊號處理晶片的硬體技術,提出修正離散餘弦轉換法(MDCT)和反修正離散餘弦轉換法(IMDCT)的遞迴架構,以加速在硬體實現的執行速率。並且本論文更進一步分析頻域加權交錯向量量化編碼架構,討論其優點和模擬此編碼器應用於網路傳輸上的抗雜訊情形。

    The research of this thesis is focused on the implementation of MPEG-4 advanced audio coding (AAC) and transform domain weighted interleaved vector quantization (TwinVQ) standards. According to digital signal processing chip, we propose recursive structures for realizing modified discrete cosine transform (MDCT) and inverse MDCT (IMDCT) to speed up computation. Finally, in this thesis, we also analyze the structure of TwinVQ audio coders and discuss its advantages for low bit rate audio. The simulations show that the TwinVQ audio coder is more robust over noisy communication channels than the AAC coders.

    摘要 i Abstract ii 誌謝 iii 目錄 iv 圖目錄 vii 表目錄 ix 第1章 簡介 1 1.1 研究背景與動機 1 1.2 感官式音訊編碼 3 1.3 心理音響原理(Psychoacoustic principle) 4 1.3.1 聽覺的絕對臨界值 4 1.3.2 臨界頻帶 6 1.3.3 遮罩效應(Masking effect) 8 1.3.3.1 頻域遮罩(Spectral Masking) 9 1.3.3.2 時域遮罩(Temporal Masking) 10 1.4 章節安排 11 第2章 MPEG-4進階音樂編碼 12 2.1 心理音響模型 13 2.2 增益控制(Gain control) 18 2.2.1 多相正交濾波器 19 2.2.2 增益偵測器 20 2.2.3 增益修正器 20 2.3 時頻轉換(Time-frequency transform) 21 2.3.1 視窗形狀調整 22 2.3.2 轉換頻率區塊的切換 23 2.4 時域雜訊變形(Temporal Noise Shaping, TNS) 24 2.4.1 時頻對稱性關係 25 2.4.2 預測編碼做雜訊變形 25 2.4.3 編碼流程 25 2.5 長期預測(Long Term Prediction, LTP) 27 2.6 感知式雜訊替代 28 2.7 量化 29 2.8 無失真編碼 32 2.8.1 分段 32 2.8.2 分類與交錯 33 2.8.3 哈夫曼編碼 33 第3章 修正式離散餘弦轉換之遞迴架構 35 3.1 修正式離散餘弦轉換(MDCT)之遞迴架構 36 3.2 反修正式離散餘弦轉換(IMDCT)之遞迴架構 42 3.3 結果 50 第4章 頻域加權交錯之向量量化 52 4.1 頻譜平坦化 53 4.1.1 線性預測編碼之平坦化 53 4.1.2 巴喀封包之平坦化 54 4.1.3 振幅正規化 56 4.2 加權交錯之向量量化 61 4.2.1 音框分割和交錯(interleave)方式 61 4.2.2 向量量化 63 第5章 結論 72 參考文獻 73

    [1] ISO-IEC JTC1/SC29/WG11 N2503GA, "Information Technology - Coding of Audiovisual Objects (part3: Audio)," May, 1998.
    [2] M. Bosi, K. Brandenburg et al., "ISO/IEC MPEG-2 Advanced Audio Coding," J. Audio Eng. Soc., Vol.45, No.10, pp.789-812, October 1997.
    [3] H. Purnhagen, "An Overview Of Mpeg-4 Audio Version 2 (1999)," 17th AES Convention.
    [4] T. PAINTER and A. SPANIAS, "Perceptual Coding of Digital Audio," Proc. of The IEEE, VOL. 88, NO. 4, APRIL 2000.
    [5] Z. Fastl, "Psycaocoustics: Facts and Models,".
    [6] J. Herre and J. D. Johnston, "Continuously signal-adaptive filterbank for high-quality perceptual audio coding," IEEE ASSP Workshop on, 19-22, Oct 1997.
    [7] DONALD SCHULZ, "Improving Audio Codecs by Noise Substitution," J. Audio Eng. Soc., 1996.
    [8] N. I. Cho and S. U. Lee, "A fast 4x4 DCT algorithm for the recursive 2-D DCT," IEEE Trans. Signal Process., vol. 40, pp. 2166-2173, Sept. 1992.
    [9] E. Feig and S. Winograd, "Fast algorithms for the discrete cosine transform," IEEE Trans. Signal Process., vol. 40, pp. 2174-2193, Sept. 1992.
    [10] G. Goertzel, "An algorithm for the calculation of finite trigonometric series," Amer. Math. Monthly, vol. 65, pp. 34-35, 1958.
    [11] L. P. Chau and W. C. Siu, "Recursive algorithm for the discrete cosine transform with general length," Electron. Lett., vol 30, no. 3, Feb. 1994, pp. 197-198.
    [12] Z. Wang, G. A. Jullien, and W. C. Miller, "Recursive algorithms for the forward and inverse discrete cosine transform with arbitrary length," IEEE Signal Processing Lett., vol. 1, no. 7, pp. 101-102, July 1994.
    [13] Y. H. Chan, L. P. Chau, and W. C. Siu, "Efficient implementation of discrete cosine transform using recursive filter structure," IEEE Trans. Circuits Syst. Video Technol., vol. 4, no. 6, pp. 550-552, Dec. 1994.
    [14] M. F. Aburdene, J. Zheng, and R. J. Kozick, "Computation of discrete cosine transform using Clenshaw's recurrence formula," IEEE Signal Processing Lett., vol. 2, no. 8, pp. 155-156, Aug. 1995.
    [15] A. Y. Wu and K. J. R. Liu, "A low-power and low-complexity DCT/IDCT VLSI architecture based on backward Chebyshev recursion," in Proc. 1994, IEEE Int. Symp. Circuits and Syst., London, U.K., May 1994.
    [16] A. Y. Wu and K. J. R. Liu, "Algorithm-based low-power transform coding architectures: the multirate approach," IEEE Trans. VLSI Syst., vol. 6, no. 4, pp. 707-718, Dec. 1998.
    [17] K. J. R. Liu, C. T. Chiu, R. K. Kolagotla, and J. F. Jala, "Optimal unified architectures for the real-time computation of time-recursive discrete sinusoidal transforms," IEEE Trans. Circuits Syst. Video Technol., vol. 4, no. 2, pp. 168-180, Apr. 1994.
    [18] D. Y. Chan, J. F. Yang, and S. Y. Chen, "Regular implementation algorithms of time domain aliasing cancellation," IEE Proc. Vis. Image Signal Process., vol. 143, no. 6, pp. 387-392, Dec. 1996.
    [19] J. L. Wang, C. B. Wu, B. D. Liu, and J. F. Yang, "Implementation of the discrete cosine transform and its inverse by recursive structures," in Proc. SiPS'99, vol. 1, Oct. 1999, pp. 120-130.
    [20] J. Wang and Z. Dong , "A fast algorithm for modified discrete cosine transform," in Proc. Int. Conf. Commun. Technol., vol. 1, May 1996, pp.445-448.
    [21] A. V. Oppenheim and R. W. Schafer, "Discrete-time Signal Processing", 1999.
    [22] Y. T. Hwang, N. J. Liu and M. C. Tsai, "An MPEG-4 TwinVQ Based on High Quality Audio Codec Design", IEEE Signal Processing Systems, 2001.
    [23] N. Iwakami, T. Moriya et al., "Fast encoding algorithms for MPEG-4 TwinVQ audio tool," ICASSP of IEEE, 2001.
    [24] N. Iwakami, T. Moriya and S. Miki, "High-Quality Audio Coding at less than 64kbit/s by Using TwinVQ," Proc. ICASS'95, pp.937-940, 1995.
    [25] T. Moriya and M. Honda, "Two-channel Conjugate Vector Quantizer for Noisy Channel Speech Coding," IEEE JSAC, vol.6 pp.425-431, 1988.
    [26] K. Ikeda, T. Moriya and N. Iwakami, "Error Protected TwinVQ Audio Coding at less than 64kbit/s," Proc. IEEE Speech Coding Workshop, pp. 33-34, 1995.
    [27] K. Ikeda, T. Moriya, N. Iwakami and S. Miki, "A design of TwinVQ audio-codec for personal communication systems," Fourth IEEE International Conference on Universal Personal communications, 1995, pp. 803-807.
    [28] T. Moriya, N. Iwakami, K. Ikeda and S. Miki, "Extension and complexity reduction of TwinVQ audio coder," Proc. of ICASSP, 1996, vol.2, pp. 1029-1032.
    [29] V. Nikolajevic, G. Fettweis, "Computation of forward and inverse MDCT using clenshaw's recurrence formula", IEEE Trans. On Signal Processing, vol.51, no.5, May 2003

    下載圖示 校內:2004-06-23公開
    校外:2004-06-23公開
    QR CODE