| 研究生: |
張世樺 Chang, Shi-Hua |
|---|---|
| 論文名稱: |
使用音高調變結合遮蔽效應實現心理聲學之低頻頻寬擴展 Low Frequency Bandwidth Extension of Psychoacoustic with Pitch Scaling and Masking Effect |
| 指導教授: |
雷曉方
Lei, Sheau-Fang |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2010 |
| 畢業學年度: | 98 |
| 語文別: | 中文 |
| 論文頁數: | 95 |
| 中文關鍵詞: | 低頻頻寬擴展 、音高調變 、遮蔽效應 、虛擬音高 、遺失基頻 |
| 外文關鍵詞: | Low frequency bandwidth extension, Pitch scaling, Masking effect, Virtual pitch, Missing fundamental |
| 相關次數: | 點閱:133 下載:8 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
一般常見的數位音訊錄放設備,如果沒有品質較佳的播放器或耳機,或者是特別針對低音的播放裝置,僅利用一般的多媒體播放器或耳機要播放出低頻聲音是一件困難的事情。為了讓一般播放設備也能播出較低頻的聲音,相關文獻中提出以低頻頻寬擴展(又稱作虛擬音高或虛擬低音)的方法來改善此缺失,這個方法是利用心理聲學原理中的”遺失基頻”概念來模擬呈現低頻聲音。不過在產生虛擬音高的同時,通常會造成原本聲音有一定程度的失真,尤其當諧波產生器為非線性裝置時,人耳感受到不和諧的聲音特別明顯。
本研究針對此問題提出改進方法,研究成果顯示透由音高調變結合遮蔽效應產生虛擬音高的方法,能改善使用非線性裝置產生虛擬音高所導致聲音不和諧的感受;另一方面使用遮蔽效應能有效針對低頻能量較強的聲音做虛擬低音處理,能保有相同低音效果且降低因為產生虛擬音高所造成的失真。
It can hardly generate deep bass sound with small multimedia speakers or earphones in general digital audio playback devices. According to scientific and technical literatures, a low frequency bandwidth extension (also known as virtual pitch or virtual bass) method, which uses ‘missing fundamental ‘of the psychoacoustic concept to simulate the low frequency sound, is adopted for a general audio playback device to generate deep bass sound without an additional subwoofer or high equality speakers/ earphones. But when virtual pitches are generated, the original audio signal is accordingly distorted, and the unpleasant inharmonious sound is especially obvious for human ears while harmonic generator of non-linear type is used.
This research proposes improving the method to this problem, and the research result shows the method applying pitch scaling and masking effect to virtual pitch generation, which is capable of improving unpleasant feelings of inharmonious sound for human ears caused by using non-linear device to generate virtual pitch. Applying the masking effect can also effectively help moderate low frequency sounds with strong energy by virtual low frequency processing, and thus help remain the same base effect while reducing the distortion caused by virtual pitch generation.
[1] R. M. Aarts and E. Larsen, "Reproducing Low-Pitched Signals through Small Loudspeakers," Audio Eng. Soc., vol. 50, pp. 147-164, March 2002
[2] J. H. E. Cartwright, D. L. Gonza' lez, and O. Piro, "Pitch perception: A dynamical-systems perspective," Proceedings of the National Academy of Sciences of the United States of America, vol. 98, pp. 4855-4859, 2001.
[3] D. Ben-Tzur and M. Colloms, "The Effect of MaxxBass Psychoacoustic Bass Enhancement System on Loudspeaker Design," presented at the 106th Convention of the Audio Engineering Society, J. Audio Eng. Soc. (Abstracts), vol. 47, p. 517, May 1999
[4] E. Larsen and R. M. Aarts, "Audio Bandwidth Extension," 2004.
[5] U. Zolzer, "DAFX - Digital Audio Effects," 2002.
[6] W. S. Gan, S. M. Kuo, and C. W. Toh, "Virtual bass for home entertainment, multimedia PC, game station and portable audio systems," Consumer Electronics, IEEE Transactions on, vol. 47, pp. 787-796, 2001.
[7] 蔡國隆, 王光賢, and 余聰賢, 聲學原理與噪音量測控制: 全華出版社, 2005.
[8] ISO-226:1987, "Acoustics––Normal Equal-Loudness Level Contours," International Standards Organization, Geneva, Switzerland, 1987.
[9] 陳麒竹, 基於心理聲學的低頻頻寬擴展結合小坡轉換: 碩士論文-國立成功大學電機工程研究所, 2008.
[10] R. M. Aarts and S. P. Straetemans, "Circuit, Audio System and Method for Processing Signals, and a Harmonics Generator," US patent 6,111,960, Aug. 29 2000.
[11] J. L. Goldstein, "Auditory nonlinearity," J. Acoust. Soc. Am, vol. 41(3), 1967.
[12] McGraw-Hill, DIGITAL SIGNAL PROCESSING: A Computer Based Approach, 3rdEdition, 2005.
[13] L. B. Jackson, Digital Filters and Signal Processing with MATLAB Exercises, 1995.
[14] B. Carlson, J. Einarsson, J.-H. Kim, S. Nystrom, H. Renefeldt, and A. Wadenberg, "Real Time Pitch Scaling," Royal Institute of technology(KTH), 2001.
[15] E. Moulines and J. Laroche, "Non-parametric techniques for pitch-scale and time-scale modifications of speech," Speech Commun., vol. 16, pp. 175-205, 1995.
[16] S. M. Sprenger, "Pitchscaling using the Fourier Transform," http://www.dspdimension.com/admin/pitch-shifting-using-the-ft/, 2001.
[17] A. Hammarwall, "Automatic music mixing using beat tracking and pitch scaling," Royal Institute of Technology(KTH), 2003.
[18] K. P. Padhi, S. S. Abeysekera, J. Absar, and S. George, "Scaling of audio signals using frequency domain techniques," in Signals, Systems and Computers, 2000. Conference Record of the Thirty-Fourth Asilomar Conference on, 2000, pp. 1484-1488 vol.2.
[19] H. Roder, "Amplitude, Phase, and Frequency Modulation," Proceedings of the IRE, vol. 19, pp. 2145-2176, 1931.
[20] J. L. Flanagan and R. M. Golden, "Phase Vocoder," Bell System Technical Journal, vol. 45, pp. 1493-. 1509, July 18 1966.
[21] E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models. Springer-Verlag,second ed., 1999.
[22] T. Painter and A. Spanias, "Perceptual coding of digital audio," Proceedings of the IEEE, vol. 88, pp. 451-515, 2000.
[23] J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria," Selected Areas in Communications, IEEE Journal on, vol. 6, pp. 314-323, 1988.
[24] J. D. Johnston, "Estimation of perceptual entropy using noise masking criteria," in Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, 1988, pp. 2524-2527 vol.5.
[25] I. I.-I. Standard, "Information Technology - Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s," 1993.
[26] Andersin, T. H., and J. K, "Phase modeling of instrument sounds based on psycho acoustic experiments," In Proceedings of the MOSART Workshop on Current Research Directions in Computer Music, pp. 170-173, 2001.
[27] Hamza Ozer, Ismail Avcibas, Bulent Sankur, and Nasir Memon, "Steganalysis of Audio Based on Audio Quality Metrics,," Proceedings of SPIE-IS&T Electronic Imaging, SPIE, vol. 5050, 2003.
[28] R. Huber and B. Kollmeier, "PEMO-Q----A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 14, pp. 1902-1911, 2006.