簡易檢索 / 詳目顯示

研究生: 張世樺
Chang, Shi-Hua
論文名稱: 使用音高調變結合遮蔽效應實現心理聲學之低頻頻寬擴展
Low Frequency Bandwidth Extension of Psychoacoustic with Pitch Scaling and Masking Effect
指導教授: 雷曉方
Lei, Sheau-Fang
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2010
畢業學年度: 98
語文別: 中文
論文頁數: 95
中文關鍵詞: 低頻頻寬擴展音高調變遮蔽效應虛擬音高遺失基頻
外文關鍵詞: Low frequency bandwidth extension, Pitch scaling, Masking effect, Virtual pitch, Missing fundamental
相關次數: 點閱:133下載:8
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 一般常見的數位音訊錄放設備,如果沒有品質較佳的播放器或耳機,或者是特別針對低音的播放裝置,僅利用一般的多媒體播放器或耳機要播放出低頻聲音是一件困難的事情。為了讓一般播放設備也能播出較低頻的聲音,相關文獻中提出以低頻頻寬擴展(又稱作虛擬音高或虛擬低音)的方法來改善此缺失,這個方法是利用心理聲學原理中的”遺失基頻”概念來模擬呈現低頻聲音。不過在產生虛擬音高的同時,通常會造成原本聲音有一定程度的失真,尤其當諧波產生器為非線性裝置時,人耳感受到不和諧的聲音特別明顯。
    本研究針對此問題提出改進方法,研究成果顯示透由音高調變結合遮蔽效應產生虛擬音高的方法,能改善使用非線性裝置產生虛擬音高所導致聲音不和諧的感受;另一方面使用遮蔽效應能有效針對低頻能量較強的聲音做虛擬低音處理,能保有相同低音效果且降低因為產生虛擬音高所造成的失真。

    It can hardly generate deep bass sound with small multimedia speakers or earphones in general digital audio playback devices. According to scientific and technical literatures, a low frequency bandwidth extension (also known as virtual pitch or virtual bass) method, which uses ‘missing fundamental ‘of the psychoacoustic concept to simulate the low frequency sound, is adopted for a general audio playback device to generate deep bass sound without an additional subwoofer or high equality speakers/ earphones. But when virtual pitches are generated, the original audio signal is accordingly distorted, and the unpleasant inharmonious sound is especially obvious for human ears while harmonic generator of non-linear type is used.
    This research proposes improving the method to this problem, and the research result shows the method applying pitch scaling and masking effect to virtual pitch generation, which is capable of improving unpleasant feelings of inharmonious sound for human ears caused by using non-linear device to generate virtual pitch. Applying the masking effect can also effectively help moderate low frequency sounds with strong energy by virtual low frequency processing, and thus help remain the same base effect while reducing the distortion caused by virtual pitch generation.

    摘要 i ABSTRACT ii 誌謝 iii 目錄 iv 表目錄 vii 圖目錄 viii 第一章 - 導論 1 1.1. 研究目的 1 1.2. 虛擬音高(Virtual Pitch) 1 1.3. 等響度曲線定理(Equal Loudness Contour) 2 1.3.1. 響度(Loudness) 2 1.3.2. 等響度曲線圖(Equal Loudness Contour) 3 1.4. 如何產生虛擬低音 4 1.4.1. 非線性系統(NLD)[4] 4 1.4.2. 頻域音高變調 4 1.5. 諧波產生方法的選擇 5 1.6. 章節組織 5 第二章 - 非線性虛擬低音產生 6 2.1. 非線性虛擬低音演算法架構 6 2.2. 高通濾波器(HFIL) 6 2.3. 第一帶通濾波器(FIL1) 7 2.4. 第二帶通濾波器(FIL2) 7 2.5. 非線性系統(NLD) 7 2.5.1. 全波整流器[4] 8 2.5.2. 改良型乘法器[4] 9 2.6. 諧波信號的增益(Gain) 11 2.7. 非線性系統結合小波轉換[9] 12 2.8. 小波轉換對於非線性產生器之效果[9] 13 第三章 - 頻域諧波產生處理 14 3.1. 頻域虛擬低音演算法架構 14 3.2. 離散傅立葉轉換(DFT) 15 3.3. 音高調變(Pitch Scaling) [14-18] 16 3.3.1. 相角聲碼器理論(Theory of the phase vocoder) 16 3.3.2. 音窗化(Framing) 16 3.3.3. 短時間傅立葉轉換(STFT) 17 3.3.4. 相位處理(Phase Processing) 18 3.3.5. 調變(Scaling) 19 3.3.6. 相位反處理(Phase Deprocessing) 21 3.3.7. 短時間傅立葉逆轉換(iSTFT) 22 3.4. 其他頻域上移頻方法 22 3.4.1. 計算基頻對應的諧波頻率強度調整法 22 3.4.2. 信號調變(Signal Modulation)[19] 23 3.4.3. 時變相角聲碼器(Time Scale Phase Vocoder)[20] 24 第四章 - 心理聲學模型(Psychoacoustic Model) 26 4. 前言 26 4.1. 心理聲學特性 26 4.1.1. 聽覺絕對臨界值(Absolution Threshold of Hearing) 26 4.1.2. 臨界頻帶(Critical Band) 27 4.1.3. 遮蔽效應(Masking Effect) 29 4.1.3.1. 頻域遮蔽(Frequency Masking) 29 4.1.3.2. 時域遮蔽(Temporal Masking) 30 4.2. 心理聲學模型架構流程 30 4.2.1. 臨界頻帶解析(Critical Band Analysis) 31 4.2.2. 展開函數(Spreading Function) 33 4.2.3. 音調係數與遮蔽臨界值估算 34 4.2.4. 臨界值估算的正規化處理 35 4.2.5. 與絕對聽覺臨界值比較算出聽覺遮蔽臨界值 36 第五章 -音高調變結合遮蔽效應之虛擬低音架構 37 5.1. 架構流程圖 37 5.1.1. 帶通濾波器與高通濾波器 37 5.1.2. 短時間傅立葉轉換 38 5.1.3. 欲產生諧波的低頻信號先做遮蔽效應運算處理 38 5.1.4. 音高調變諧波產生器運算處理 40 第六章 -諧波產生分析與比較 50 6.1. 頻率方式產生諧波比較 50 6.1.1. 輸入為單一頻率弦波 50 6.1.2. 輸入為多個不同頻率之弦波 56 6.1.3. 輸入音檔信號 59 6.1.3.1. 音檔中低頻能量最高的三個音框諧波處理頻譜圖 60 6.1.3.2. 經過不同諧波產生處理之低頻總能量譜 65 6.1.3.3. 不同頻域諧波產生方法比較結果 66 6.2. 音高調變結合遮蔽效應與小波轉換結合非線性裝置比較 67 6.2.1. 諧波失真比較 67 6.2.2. 諧波頻譜差異度比較 68 6.2.3. ODG分析比較 88 6.3. 主觀測試結果 89 6.4. 數據分析結論 91 第七章 - 結論與未來工作 92 7.1. 結論 92 7.2. 未來工作 92 參考文獻 93

    [1] R. M. Aarts and E. Larsen, "Reproducing Low-Pitched Signals through Small Loudspeakers," Audio Eng. Soc., vol. 50, pp. 147-164, March 2002
    [2] J. H. E. Cartwright, D. L. Gonza' lez, and O. Piro, "Pitch perception: A dynamical-systems perspective," Proceedings of the National Academy of Sciences of the United States of America, vol. 98, pp. 4855-4859, 2001.
    [3] D. Ben-Tzur and M. Colloms, "The Effect of MaxxBass Psychoacoustic Bass Enhancement System on Loudspeaker Design," presented at the 106th Convention of the Audio Engineering Society, J. Audio Eng. Soc. (Abstracts), vol. 47, p. 517, May 1999
    [4] E. Larsen and R. M. Aarts, "Audio Bandwidth Extension," 2004.
    [5] U. Zolzer, "DAFX - Digital Audio Effects," 2002.
    [6] W. S. Gan, S. M. Kuo, and C. W. Toh, "Virtual bass for home entertainment, multimedia PC, game station and portable audio systems," Consumer Electronics, IEEE Transactions on, vol. 47, pp. 787-796, 2001.
    [7] 蔡國隆, 王光賢, and 余聰賢, 聲學原理與噪音量測控制: 全華出版社, 2005.
    [8] ISO-226:1987, "Acoustics––Normal Equal-Loudness Level Contours," International Standards Organization, Geneva, Switzerland, 1987.
    [9] 陳麒竹, 基於心理聲學的低頻頻寬擴展結合小坡轉換: 碩士論文-國立成功大學電機工程研究所, 2008.
    [10] R. M. Aarts and S. P. Straetemans, "Circuit, Audio System and Method for Processing Signals, and a Harmonics Generator," US patent 6,111,960, Aug. 29 2000.
    [11] J. L. Goldstein, "Auditory nonlinearity," J. Acoust. Soc. Am, vol. 41(3), 1967.
    [12] McGraw-Hill, DIGITAL SIGNAL PROCESSING: A Computer Based Approach, 3rdEdition, 2005.
    [13] L. B. Jackson, Digital Filters and Signal Processing with MATLAB Exercises, 1995.
    [14] B. Carlson, J. Einarsson, J.-H. Kim, S. Nystrom, H. Renefeldt, and A. Wadenberg, "Real Time Pitch Scaling," Royal Institute of technology(KTH), 2001.
    [15] E. Moulines and J. Laroche, "Non-parametric techniques for pitch-scale and time-scale modifications of speech," Speech Commun., vol. 16, pp. 175-205, 1995.
    [16] S. M. Sprenger, "Pitchscaling using the Fourier Transform," http://www.dspdimension.com/admin/pitch-shifting-using-the-ft/, 2001.
    [17] A. Hammarwall, "Automatic music mixing using beat tracking and pitch scaling," Royal Institute of Technology(KTH), 2003.
    [18] K. P. Padhi, S. S. Abeysekera, J. Absar, and S. George, "Scaling of audio signals using frequency domain techniques," in Signals, Systems and Computers, 2000. Conference Record of the Thirty-Fourth Asilomar Conference on, 2000, pp. 1484-1488 vol.2.
    [19] H. Roder, "Amplitude, Phase, and Frequency Modulation," Proceedings of the IRE, vol. 19, pp. 2145-2176, 1931.
    [20] J. L. Flanagan and R. M. Golden, "Phase Vocoder," Bell System Technical Journal, vol. 45, pp. 1493-. 1509, July 18 1966.
    [21] E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models. Springer-Verlag,second ed., 1999.
    [22] T. Painter and A. Spanias, "Perceptual coding of digital audio," Proceedings of the IEEE, vol. 88, pp. 451-515, 2000.
    [23] J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria," Selected Areas in Communications, IEEE Journal on, vol. 6, pp. 314-323, 1988.
    [24] J. D. Johnston, "Estimation of perceptual entropy using noise masking criteria," in Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, 1988, pp. 2524-2527 vol.5.
    [25] I. I.-I. Standard, "Information Technology - Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s," 1993.
    [26] Andersin, T. H., and J. K, "Phase modeling of instrument sounds based on psycho acoustic experiments," In Proceedings of the MOSART Workshop on Current Research Directions in Computer Music, pp. 170-173, 2001.
    [27] Hamza Ozer, Ismail Avcibas, Bulent Sankur, and Nasir Memon, "Steganalysis of Audio Based on Audio Quality Metrics,," Proceedings of SPIE-IS&T Electronic Imaging, SPIE, vol. 5050, 2003.
    [28] R. Huber and B. Kollmeier, "PEMO-Q----A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 14, pp. 1902-1911, 2006.

    下載圖示 校內:2015-02-11公開
    校外:2015-02-11公開
    QR CODE