研究生: |
陳麒竹 Chen, Chi-chu |
---|---|
論文名稱: |
基於心理聲學的低頻頻寬擴展結合小波轉換 Low Frequency Bandwidth Extension with Wavelet Transform based Psychoacoustic |
指導教授: |
雷曉方
Lei, Sheau-Fang |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
論文出版年: | 2008 |
畢業學年度: | 96 |
語文別: | 中文 |
論文頁數: | 78 |
中文關鍵詞: | 小波轉換 、低頻頻寬擴展 、虛擬音高 、遺失基頻 |
外文關鍵詞: | Wavelet transform, Low frequency bandwidth extension, Virtual pitch, Missing fundamental |
相關次數: | 點閱:93 下載:3 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
對於輕便型音訊裝置在數位音訊的錄放裝置中,如果沒有特殊的低音擴音器或較為昂貴的播放器或耳機,僅利用一般小型多媒體的播放器或耳機播放較低頻的低音是較為困難的。因而眾多文獻指出了低頻頻寬擴展的方法或者可稱作虛擬音高(虛擬低音),可以在不提高成本的情況下播放出較低頻的低音部份。虛擬音高是利用”遺失基頻”的心理聲學概念去建立的,不過由於當在產生虛擬音高的同時,也會造成非線性失真,導致音訊失真,進而使人類感知聽覺受到影響。因而我們的研究針對虛擬低音中的非線性失真的部份,提出利用小波轉換信號分析的方法,改善其非線性失真所帶來的音訊失真。最後達到優化虛擬低音的目標。
In digital audio playback systems for portable audio device, there is a strong demand to produce deep bass using small multimedia speakers and earphones, without the need for additional subwoofer or expensive speakers/earphones. Therefore, many documents have pointed out the low frequency bandwidth extension method or can be called the virtual pitch (virtual bass), and it can comparatively broadcast out the bass part of low frequency in case of rising cost. The virtual pitch is utilizing and setting up because of ‘missing fundamental’ of the psychoacoustic concept. But because will cause non-linear distortion while producing virtual pitch, causing the audio signal to be distorted, and making the perceptual hearing of human influences. Therefore our research focuses on the non-linear distortion part of virtual bass, utilizing the wavelet transform and improve its non-linear distortion caused the audio signal of distortion. Finally, reach the goal of optimizing virtual bass.
[1] R. M. Aarts, 25 jaar AES in Nederland (Gebotekst, Zoetermeer, The Netherlands, 1999), chap. 17, pp. 158–159.
[2] R. M. Aarts, S. P. Straetemans, “Circuit, Audio System and Method for Processing Signals, and a Harmonics Generator,” US patent 6,111,960 (2000 Aug. 29).
[3] J.L. Goldstein, “Auditory nonlinearity.” J. Acoust. Soc. Am., 41(3):676-689, 1967
[4] N. Guttman and S. Pruzansky., “Lower limits of pitch and musical pitch,” J. Speech Hear. Res., 5(3):207-214, 1962
[5] W. S. Gan, S. M. Kuo, and C. W. Toh.,“Virtual bass for home entertainment, multimedia PC, game stationand portable audio systems”, IEEE Trans. Cons. Electron, 47(4), 787-793, 2001.
[6] Cartwright JHE, Gonza’ lez DL, Piro O (2001), “Pitch perception: a dynamical-systems perspective.” Proc Natl Acad Sci USA 98: 4855–4859.
[7] Ronald M. Aarts, Erik Larsen, “Reproducing Low-Pitched Signals through Small Loudspeakers”, J. Audio Eng. Soc., Vol. 50, No. 3, 2002 March.
[8] Erik Larsen, Ronald M. Aarts, “Audio Bandwidth Extension”, 2004.
[9] R.M. Aarts, E. Larsen, and D. Schobben, “Improving perceived bass and reconstruction of high frequencies for band limited signals,” Proc. IEEE Benelux Workshop on Model based Processing and Coding of Audio (MPCA-2002), pp.59–71, Nov. 2002.
[10] Ronald M. Aarts, “Applications of DSP for sound reproduction Improvement,” AES International Conference, Copenhagen, Denmark, 2003 May
[11] A. Mertens, Signal Analysis. Wavelets, Filter Banks, Time-Frequency Transforms and Applications. Chichester, UK: John Wiley & Sons, 1999
[12] J. Morlet, “Sampling theory and wave propagation,” In NATO ASI series, Issues in acoustic signal/image processing and recognition, volume 1, pages 233-261. Springer, New York, 1983.
[13] J. Morlet, G. Arens, I. Fourgeau, and D. Giard, “Wave propagation and sampling theory,” Geophysics, 47:203-236, 1982
[14] S. Mallat, S. Zhong, “Character of signal from multiscale edges,” IEEE Trans. Pattern Anal. Mach. Intel. 14 (7) (July 1992) 710-732.
[15] S. Mallat, “A theory for multiresolution signal decomposition: the wavelet representation,” IEEE Trans. Pattern Anal. Mach. Intell. 11 (7) (July 1989) 674-691.
[16] J. Mariano Merino, “Complexity of pitch and timbre concepts.” Physics Education, 33(2), 105-109. 1998.
[17] Hamza Ozer, Ismail Avcibas, Bulent Sankur, Nasir Memon, “Steganalysis of Audio Based on Audio Quality Metrics,” Proceedings of SPIE-IS&T Electronic Imaging, SPIE vol. 5050, pp.55-66 2003.
[18] Alan V. Oppenheim, Ronald W. Schafer, John R. Buck, “Discrete-Time Signal Processing,” Prentice Hall Signal Processing Series.
[19] D. Ben Tzur, M. Colloms, “The Effect of MaxxBass Psychoacoustic Bass Enhancement System on Loudspeaker Design,” presented at the 106th Convention of the Audio Engineering Society, J. Audio Eng. Soc. (Abstracts), vol. 47, pp. 517 (1999 June), preprint 4892.
[20] G. Tzanetakis, G. Essl, and P. Cook. Audio Analysis using the Discrete Wavelet Transform. In Proc. Conf. in Acoustics and Music Theory Applications. WSES, Sept. 2001.
[21] W. J. Warren, W. R. Hewlett, “An analysis of the Intermodulation Method of Distortion Measurement,” Proceeding of the IRE, Vol. 36, pp: 457- 466, (1948 April).
[22] ISO 226-1987(E), “Acoustics––Normal Equal-Loudness Level Contours,” International Standards Organization, Geneva, Switzerland (1987).
[23] Udo Zolzer, “DAFX - Digital Audio Effects,” 2002.