簡易檢索 / 詳目顯示

研究生: 林希哲
Lin, Sei-Jay
論文名稱: 基於小波轉換之語音增強系統的硬體實現
VLSI Design for Wavelet Based Speech Enhancement System
指導教授: 雷曉方
Lei, Sheau-Fang
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2006
畢業學年度: 94
語文別: 英文
論文頁數: 73
中文關鍵詞: 語音增強小波轉換
外文關鍵詞: speech enhancement, wavelet transform
相關次數: 點閱:83下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本篇論文的主要目的是設計一個基於小波轉換的語音增強系統演算法的硬體,並將其實現與驗證。論文內容主要分成小波包轉換與語音增強系統演算法實現兩部分:
    在小波包轉換實現的部分,我們提出一個摺疊式小波包的分解架構,此架構不需額外的儲存單元便可以實現我們所需要的小波包轉換,在合成的部分,我們提出的架構需要一半音框長度的記憶體來儲存小波合成的係數,最後我們將小波包分解以及結合的架構整合在一起並與其他的架構做比較,證明我們的架構在面積上比其他的架構更為優秀。
    在語音增強系統演算法的硬體實現的部分,我們使用兩種不同的處理方式來實現,一種是由小波包轉換完成後直接輸入語音增強系統來做運算,另一種是由記憶體輸入小波包係數來做運算,此兩種不同的處理方式分別稱為即時處理以及非即時處理,在效能的表現上,即時處理的架構擁有較好的執行時間而非即時處理的架構則擁有較小的面積,我們的實現方式是以 Verilog HDL 描述所提出的電路架構,並使用TSMC 0.18 um的技術合成,最後在 FPGA 上進行合成及佈局繞線後驗證。

    In this thesis, we focus on the hardware implementation of a wavelet-based speech enhancement system and verify it. It consists of two parts: In the first part we proposed the architecture of analysis and synthesis wavelet packet transform. The folded architecture is used to implement the analysis lifting wavelet packet and not need extra memory unit to save the temporary coefficients. The architecture of synthesis wavelet packet transform needs extra memory that has half of frame length. Finally we combined the architecture of analysis and synthesis and compare with other architectures. It proves our architecture has smaller area.
    In the second part we proposed two different processing types of speech enhancement system, named real time and non-real time processing respectively. The real time processing has the input from analysis wavelet transform and non-real time processing has the input from the memory. The real time processing has lower latency and the non-real time has smaller area. The proposed architectures are synthesized with TSMC 0.18 um technology. Finally we use ARM development platform to verify our architectures.

    摘要 i Abstract ii Acknowledgment iii List of Tables vi List of Figures vii Chapter 1 -Introduction 1 1.1 Background 1 1.2 Motivation 3 1.3 Organization of Thesis 4 Chapter 2 -Overview of System 5 2.1 Introduction 5 2.2 Traditional Discrete wavelet transform 7 2.3 Lifting Discrete Wavelet Transform 10 2.4 The perceptual wavelet packet transform 15 2.5 Adaptive noise estimation and threshold 16 Chapter 3 The architecture of lifting discrete wavelet packet transform 20 3.1 Introduction 20 3.2 The Folded Architecture of Lifting DWT 20 3.3 The architecture of synthesis Lifting DWT 34 3.4 Combine the analysis and synthesis Lifting DWT 38 Chapter 4 -The architecture of speech enhancement system 42 4.1 Introduction 42 4.2 The architecture of proposed enhancement algorithm 44 4.2.1 The architecture of real time processing 44 4.2.2 The architecture of non-real time processing 45 4.3 The architecture of each function block 47 4.3.1 The architecture of average power unit 47 4.3.2 The architecture of noise estimation unit 49 4.3.3 The architecture of SNR unit 52 4.3.4 The architecture of Adaptive threshold and soft threshold unit 54 4.4 The precision and hardware source of proposed architecture 55 Chapter 5 -Experimental results and verification 59 5.1 Design flow 59 5.2 Experimental results 60 5.2.1 compare with other architecture 60 5.2.2 Area analysis 61 5.3 Verification 62 5.3.1 Verification environment 62 5.3.2 Ram based interface connection 63 5.3.3 Simulation result 64 Chapter 6 -Conclusion and future work 68 References 70

    [1] S.F. Boll, ”Suppression of acoustic noise in speech using spectral subtraction,"IEEE Trans. Acoustics, Speech, and Signal Processing, vol. ASSP-29, no.2, pp. 113-120 ,1979.
    [2] Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean square error short time spectral amplitude estimator,” IEEE Trans. Acoust., Speech, SignalProcessing, vol. ASSP-32, pp. 1109–1121, 1984.
    [3] M. Berouti, R. Schwartz and J. Makhoul ,”Enhancement of speech corrupted by acoustic noise,” Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '79, vol. 4, pp.208 - 211, 1979.
    [4] N. Virag,” Speech enhancement based on masking properties of the auditory system,” Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, vol. 1, pp.796 - 799, 1995.
    [5] N. Virag, ”Single channel speech enhancement based on masking properties of the human auditory system,” IEEE Trans. Speech and Audio Processing, vol. 7 , Issue: 2, pp.126 - 137, 1999.
    [6] D. L. Donoho, “De-Noising by Soft-Thresholding, “IEEE Trans. Inform. Theory, vol.41, pp. 613-627, 1995.
    [7] D.L. Donoho and I.M. Johnstone, “Ideal Spatial Adaptation by Wavelet
    Shrinkage,“ Biometrika, vol. 81, pp. 425-455, 1994.
    [8] Y. Ephraim and H. L. Van-Trees, “A signal subspace approach for speech
    enhancement,” IEEE Trans. Speech Audio Processing, vol. 3, pp. 251–266, July 1995.
    [9] A.-J. D. Veen, E. F. Deprettere and A. L. Swindlhurst, “Subspace-based signal analysis using singular value decomposition,” Proc. IEEE, vol. 91, pp. 1277-1308, 1993.
    [10] Sheau-Fang Lei and Ying-Kai Tung "Speech Enhancement for Nonstationary Noises by Wavelet Packet Transform and Adaptive Noise Estimation," to be published in 2005 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS2005)
    [11] E. Zwicker and E. Terhardt,” Analytical Expressions for Critical-Band Rate and Critical Bandwidth as a Function of Frequency,” JASA, vol. 68, pp. 1523-1525, 1980.
    [12] I. Daubechies, “orthonormal bases of compactly supported wavelets,” Comm. In Pure And Applied Math., Vol41, No7 ,pp.909-996, 1988.
    [13] S. Mallat, “A theory for multiresolution signal decomposition: The wavelet representation,” IEEE Trans. Pattern Anal. Machine Intell., vol. 11, pp. 674–693, July 1989.
    [14] S. Mallat, “Multifrequency Channel Decompositions of Images and Wavelet Models,” IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 37, pp. 2091-2110, 1989. 71
    [15] A.S.Lewis and G. Knowles, “VLSI architecture for 2-D Daubnchies wavelet
    transform without multipliers”, Electronic Letters, Vol.27, No2, pp171-173,1991
    [16] Jer Min Jou, Pei-Yin chen, Ming Shiang Liang, ”A scalable pipelined architecture for separable 2-D discrete wavelet transform”, Design Automation Conference, Vol1, pp205-208, 1999
    [17] Ferretti, M.; Rizzo, D.;”Handling borders in systolic architecture for the 1-D discrete wavelet transform for perfect reconstruction”, Signal Processing, IEEE Transactions on [see also Acoustics, Speech, and Signal Processing, IEEE Transactions on] Volume 48, Issue 5, May 2000 Page(s):1365 – 1378
    [18] Vishwanath, M.; Owens, R.M.; Irwin, M.J.;”VLSI architectures for the discrete wavelet transform”, Circuits and Systems II: Analog and Digital Signal Processing, IEEE Transactions on [see also Circuits and Systems II: Express Briefs, IEEE Transactions on] Volume 42, Issue 5, May 1995 Page(s):305 - 316 1995
    [19] Grzeszczak, A.; Mandal, M.K.; Panchanathan, S.; “VLSI implementation of discrete wavelet transform” Very Large Scale Integration (VLSI) Systems, IEEE Transactions on Volume 4, Issue 4, Dec. 1996 Page(s):421 - 433
    [20] Chu Yu; Chien-An Hsieh; Sao-Jie Chen;” Design and implementation of a highly efficient VLSI architecture for discrete wavelet, Custom Integrated Circuits Conference, 1997., Proceedings of the IEEE 1997 5-8 May 1997 Page:237 – 240
    [21] W. Sweldens, “The Lifting Scheme: A Custom-Design Construction of Biorthogonal Wavelet,” Applied and Computational Harmonic Analysis, vol. 3, pp. 186-200, 1996.
    [22] I. Pinter, “Perceptual Wavelet-Representation of Speech Signals and its Application to Speech Enhancement,” Computer Speech and Language, vol. 10, no. 1, pp. 1-22, 1996.
    [23] B. Carnero and A. Drygajlo ,” Perceptual speech coding and enhancement using frame-synchronized fast wavelet packet transform algorithms,” Signal Processing, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 47 , Issue: 6, pp.1622-1635, 1999.
    [24] S. H. Chen and J. F. Wang, “Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator,” Journal of VLSI Signal Processing, vol. 36, Issue 2-3 , pp. 125 – 139, 2004.
    [25] I. Daubechies and W. Sweldens, “Factoring wavelet transforms into lifting steps,” J. Fourier Anal. Appl., vol. 4, pp. 247–269, 1998.
    [26] Andra, K.; Chakrabarti, C.; Acharya, T.;’”A VLSI architecture for lifting-based forward and inverse wavelet transform”, Signal Processing, IEEE Transactions on [see also Acoustics, Speech, and Signal Processing, IEEE Transactions on] Volume 50, Issue 4, April 2002 Page(s):966 – 977
    [27] Hongyu Liao; Mandal, M.Kr.; Cockburn, B.F.;” Efficient architectures for 1-D and 2-D lifting-based wavelet transforms”, Signal Processing, IEEE Transactions on [see also Acoustics, Speech, and Signal Processing, IEEE Transactions on] Volume 52, Issue 5, May 2004 Page(s):1315 - 1326
    [28] Pei-Yin Chen;,” VLSI implementation for one-dimensional multilevel lifting-based wavelet transform”, Computers, IEEE Transactions on Volume 53, Issue 4, April 2004 Page(s):386 - 398
    [29] Chao-Tsung Huang; Po-Chih Tseng; Liang-Gee Chen,” Efficient VLSI architectures of lifting-based discrete wavelet transform by systematic design method”, Circuits andSystems, 2002. ISCAS 2002. IEEE International Symposium on Volume 5, 26-29 May 2002 Page(s):V-565 - V-568 vol.5
    [30] “Efficient VLSI Architecture for Lifting-Based Discrete Wavelet Transform”,南台科技大學/電子工程系/90/碩士/090STUT0428002 ,研究生:鄭嘉賢 指導教授:陳培殷
    [31] Bing-Fei Wu; Chung-Fu Lin; “A rescheduling and fast pipeline VLSI architecture for lifting-based discrete wavelet transform” ,Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on ,Volume 2, 25-28 May 2003 Page(s):II-732 - II-735 vol.2
    [32] Jer Min Jou; Yeu-Horng Shiau; Chin-Chi Liu; “Efficient VLSI architectures for the biorthogonal wavelet transform by filter bank and lifting scheme”, Circuits and Systems, 2001. ISCAS 2001. The 2001 IEEE International Symposium on, Volume 2, 6-9 May 2001 Page(s):529 - 532 vol. 2
    [33] Synopsys corporation,”Synopsys On-Line Documentation,” National Chip
    Implementation Center, Taiwan, 2004.
    [34] I. M. Johnstone and B. W. Silverman, "wavelet threshold estimators for data with correlated noise,"J. Roy. Statist. Soc. B, vol. 59, pp.319-351, 1997.

    下載圖示 校內:2008-09-08公開
    校外:2008-09-08公開
    QR CODE