| Graduate student: | 吳子懿 Wu, Tzu-Yi |
|---|---|
| Thesis title: | 基於線性預估之高品質音訊高頻重建 High-Quality Audio Bandwidth Extension Method Based on LPC |
| Advisor: | 雷曉方 Lei, Sheau-Fang |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science - Department of Electrical Engineering |
| Year of publication: | 2006 |
| Academic year of graduation: | 94 (ROC calendar) |
| Language: | English |
| Number of pages: | 71 |
| Keywords (Chinese): | 頻帶複製, 高頻重建, 音訊編碼, 線性預測 |
| Keywords (English): | linear prediction, band replication, audio coding, bandwidth extension |
Over the past several years, digital audio has become an important means for people to listen to and share music, with MP3 being the most widespread example. Recent applications, however, demand ever higher compression ratios, in some cases lowering the bit rate from 192 kbps to 64 kbps. At such low bit rates the high-frequency part of the spectrum is distorted, which makes the audio sound flat and muffled. Bandwidth extension (high-frequency reconstruction) is a technique developed in recent years that takes a different approach from traditional perceptual audio coding: exploiting the fact that the human auditory system is not equally sensitive to low and high frequencies, it uses a small amount of side information to reconstruct the lost high-band spectrum, so that low-bit-rate audio retains its full bandwidth and the quality of a traditional perceptual audio coder is improved. Our research focuses on this technique. We propose a method in which the encoder uses linear prediction to record the characteristics of the original high band, and the decoder reconstructs a high-band spectrum that approximates the original. The method can be combined with a traditional perceptual audio coder to compensate for its high-band distortion, achieving the goal of new digital audio coding: low bit rate with high quality.
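The abstract does not give the details of the proposed encoder, so the following is only a generic illustration of how linear prediction can summarize a spectral envelope with a handful of coefficients; it is not the thesis's actual algorithm. It is a minimal sketch assuming the standard autocorrelation method with the Levinson-Durbin recursion. The function names (`autocorrelation`, `levinson_durbin`, `lpc_envelope`), the frame length, the prediction order, and the test tone are all hypothetical choices made for this example.

```python
import math


def autocorrelation(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations; returns (a, err) with a[0] == 1.

    The predictor is x[n] ~= -sum(a[k] * x[n - k] for k in 1..order).
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # degenerate (e.g. silent) frame: stop early
        acc = r[i] + sum(a[k] * r[i - k] for k in range(1, i))
        k_i = -acc / err  # reflection coefficient for this order
        prev = a[:]
        for k in range(1, i):
            a[k] = prev[k] + k_i * prev[i - k]
        a[i] = k_i
        err *= 1.0 - k_i * k_i  # remaining prediction error energy
    return a, err


def lpc_envelope(a, gain, num_bins):
    """Magnitude of gain / A(e^{jw}) at num_bins frequencies in [0, pi)."""
    env = []
    for b in range(num_bins):
        w = math.pi * b / num_bins
        re = sum(a[k] * math.cos(w * k) for k in range(len(a)))
        im = -sum(a[k] * math.sin(w * k) for k in range(len(a)))
        env.append(gain / math.hypot(re, im))
    return env


# Illustrative input (an assumption): a single tone at 0.3 * pi rad/sample.
frame = [math.cos(0.3 * math.pi * n) for n in range(512)]
r = autocorrelation(frame, 2)
a, err = levinson_durbin(r, 2)
envelope = lpc_envelope(a, math.sqrt(max(err, 0.0)), 64)
peak_bin = max(range(64), key=envelope.__getitem__)
print(peak_bin)  # should land close to 0.3 * 64 = 19.2
```

In a bandwidth-extension setting, a decoder could in principle use such an envelope to shape a regenerated high band, which is why a few coefficients per frame can stand in for the full high-band spectrum. How the thesis quantizes, transmits, and applies its coefficients is not stated in the abstract.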