簡易檢索 / 詳目顯示

研究生: 張凱行
Chang, Kai-Hsing
論文名稱: 一個基於子空間追蹤演算法之語音強健系統及其硬體設計
Algorithm/Hardware Design of a Subspace Tracking Based Speech Enhancement System
指導教授: 王駿發
Wang, Jhing-Fa
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2004
畢業學年度: 92
語文別: 英文
論文頁數: 72
中文關鍵詞: 子空間追蹤雜訊消除硬體設計
外文關鍵詞: hardware design, subspace tracking, noise reduction
相關次數: 點閱:93下載:4
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  •   在此篇論文中,我們基於子空間追蹤演算法設計一個語音強鍵系統。所提出的演算法中,整合了聽覺分頻濾波推架構來做為前處理。藉由實驗模擬結果發現,以TAICAR資料庫音檔來做測試,所提出的架構比傳統子空間方法語音強健效果較好。由其在汽車雜訊環境中,低頻雜訊 (低於1KHz) 經過分頻濾波推後能被有效的去除。對於即時系統應用,我們設計一個子空間追蹤演算法的管線化超大型積體電路架構。不用延持技術,應用前瞻技術方法推導硬體,子空間追蹤演算法中的資料相依的危障能被順利解決。我們所推導的管線子空間演算法架構之收斂速度比延持的PASTd架構還要更快。在硬體設計中,為了減少晶片面積,我們共用乘法器來分擔多個乘法,這使得乘法器數目減少相對節省面積大小,也使得且濾波器階數和乘法器數目無關。我們所推導出來的子空間追蹤演算法實現於ARM-Based Aletra EpxA10發展板上。其工作頻率為9.7MHz。

      In this thesis, we describe a design of signal subspace speech enhancement based on subspace tracking algorithm. The proposed algorithm incorporates a perceptual filterbank which is derived from psycho-acoustic model with signal subspace processing. The experimental results which were obtained by testing TAICAR database show that our approach is better than conventional subspace methods. The low frequency noises (below 1KHz) in car noisy environments are suppressed efficiently after applying the perceptual filterbank. For real time applications, we derive a pipelined VLSI architecture of the subspace tracking algorithm. The data hazard of subspace tracking algorithm is solved by using Look-Ahead method without delayed updating. The convergence rate of our architecture is faster than those of delayed PASTd architectures. To save the chip area, a shared technique for the arithmetic of multiplication units is adopted. It makes the number of multipliers be independent with the filter length. This architecture has been realized in ARM-based ALTERA EPXA10 Development Board with frequency at 9.7MHz. Simulation results are presented to validate our algorithm and hardware architectures.

    中文摘要 iv Abstract v Acknowledgment vi Lists of Figures x Lists of Tables xii Chapter 1 INTRODUCTION 1 1.1 Speech Enhancement Algorithms and Applications 2 1.2 Signal Subspace Technique and Applications 3 1.3 Related Studies 4 1.4 Motivation 7 1.5 Organization of Thesis 9 Chapter 2 SINGAL SUBSPACE BAESD SPEECH ENHANCEMTN 10 2.1 Problem Description 10 2.2 The Decomposition of Signal and Noise Subspace 10 2.2.1 Concept of Signal and Noise Subspace 11 2.2.2 Subspace Decomposition Using Karhunen-Lo`eve Transformation 12 2.3 Subspace Estimation from Noisy Data 14 2.4 Rank Estimation 14 2.5 Linear Signal Estimators 15 2.5.1 Linear Least Square Estimator 16 2.5.2 Minimum Variance Estimator 16 2.5.3 Time Domain Constraint 17 Chapter 3 THE PROPOSED SPEECH ENHANCEMENT ALGORITHM 19 3.1 Overall Speech Enhancement System 19 3.2 Perceptual Filterbank 20 3.3 Subspace Tracking Algorithm 25 3.3.1 Comparison of subspace tracking algorithms 26 3.3.2 Projection Approximation Subspace Tracking (PAST) Algorithm 27 3.3.3 Projection Approximation Subspace Tracking Deflation (PASTd) Algorithm 28 Chapter 4 HARDWARE DESIGN OF SUBSPACE TRACKING ALGORITHM 30 4.1 Design Issues of The PASTd Algorithm 30 4.2 The Proposed Pipelined PASTd Architecture 31 4.2.1 Look-Ahead Substitution for PASTd Architecture 31 4.2.2 Sharing Multiplication Arithmetic Units Description 33 4.2.3 The Proposed Architecture 35 4.2.4 Converge Characteristic of the Proposed PASTd Architecture 39 4.2.5 Apply Pipelined PASTd Architecture to Proposed Speech Enhancement System 41 4.3 The Hardware Design Evaluation 41 4.3.1 Real-Time Issues 42 4.3.2 Precision Analysis 42 Chapter 5 EXPERIMENTAL RESULTS AND HARDWARE VERILFICATION 45 5.1 The Evaluation of Proposed Speech Enhancement Algorithm 45 5.1.1 TAI-CAR Data Base Description 46 5.1.2 Performance of the Method for Additive Car Noise 47 5.1.3 Performance of the Method for Real Environment In-Car Noise 52 5.2 SOPC Implementation of the Proposed Pipelined Architecture Algorithm 54 5.2.1 Introduction to SOPC 56 5.2.2 Design Flow and Strategy 56 5.3 Hardware Functional Simulation Results 59 5.4 Chip Features 61 Chapter 6 CONCLUSIONS AND FUTRURE WORKS 65 REFERENCES 67

    [1]. Y. Ephraim and H. L. Van-Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp. 251-266, July 1995.
    [2]. B. Yang, "Projection approximation subspace tracking," IEEE Trans. Signal Processing, vol. 43, pp. 95-107, Jan. 1995.
    [3]. Fan Xu and Willson, A.N., Jr., "Novel systolic architectures for signal subspace tracking,"in Proc. IEEE Circuits and Systems,Vol.2 , pp.8-11 Aug. 2000.
    [4]. F. Xu and A.N. Jr., "A high-performance architecture for an RLS-like eigenvector algorithm," in Proc. of ICSP, pp. 559-562, 2000.
    [5]. M. Kuropatwinski, D. Leckschat, K. Kroschel, and A. Czyzewski, "Integration of speech enhancement and coding techniques," in Proc. IEEE Workshop on Speech Coding, (Porvoo, Finland), pp. 168-170, May 1999.
    [6]. D. Kim, Y. Park, I. Kim, and S. Park, "The effect of the speech enhancement algorithm for the sensorineural hearing impairment listener," in Proc. Twentieth Annual Int. Conf. IEEE Eng. in Med. and Bio. Soc., vol. 6, (Piscataway, New Jersey), pp. 3150-3153, Oct. 1998.
    [7]. D. O'Shaughnessy, P. Kabal, D. Bernardi, L. Barbeau, C.-C. Chu, and J.-L. Moncet,"Applying speech enhancement to audio surveillance," in Proc. IEEE Int. Carnahan Conf. on Crime Countermeasures, (Lexington, KY), pp. 69-71, Oct. 1988.
    [8]. B. L. Sim, Y. C. Tong, J. S. Chang and C. T. Tan, "A parametric formulation of the generalized spectral subtraction method, " IEEE Trans. Speech and Audio Processing, vol. 6, pp. 328-337, Jul. 1998.
    [9]. J. Rissanen, "MDL denoising," IEEE Trans. Info. Theory, vol. 46, pp. 2537-2543, Nov. 2000.
    [10]. X. Shen and L. Deng, "A dynamic system approach to speech enhancement using the H-infinity filtering algorithm," IEEE Trans. Speech and Audio Processing, vol. 7, pp.391-399, Jul. 1999.
    [11]. W. G. Knecht, M. E. Schenkel and G. S. Moschytz, "Neural network filters for speech enhancement," IEEE Trans. Speech and Audio Processing, vol. 3, pp. 433-438, Nov. 1995.
    [12]. R. DeGroat, "Noniterative subspace tracking," IEEE Trans. Signal Processing, vol. SP-40, pp. 571-577, March 1992.
    [13]. P. Strobach, "Bi-iteration SVD subspace tracking algorithms," IEEE Trans. Acoust., Speech, Signal Processing, vol. 45, pp. 1222-1240, May 1997.
    [14]. I. Karasalo, "Estimating the covariance matrix by signal subspace averaging," IEEE Trans. Acoust., Speech, Signal Processing, vol. 34, pp. 8-12, Feb. 1986.
    [15]. Carlos E. Davila, "Efficient, High performance, Subspace Tracking for Time-Domain Data," IEEE Trans. Signal Processing, vol. 48, pp. 3307 - 3315, Dec. 2000.
    [16]. M. Klein, P. Kabal, "Signal subspace speech enhancement with perceptual post-filtering," in Proc. IEEE ICASSP, May. 2002, pp.537-540.
    [17]. Jong Uk Kim, S.G. Kim and C.D. Yoo, "The incorporation of masking threshold to subspace speech enhancement," in Proc. IEEE ICASSP, Apr. 2003.
    [18]. A. Rezayee and S. Gazor, "An adaptive KLT approach for speech enhancement," IEEE trans. Speech Audio Processing, vol. 9, pp.87-95, Feb. 2001.
    [19]. N. R. Shanbhag and K. K. Parhi, "Relaxed look-ahead pipelined LMS adaptive filters and their application to ADPCM coder," IEEE trans. Circuit SySt. II, vol. 40, pp.753-766, Dec. 1993.
    [20]. G. Long, F. Ling and J.G. Proakis, "The LMS algorithm with delayed coefficient adaptation," IEEE Trans. Acoust. Speech, Signal Processing, vol. 37, pp. 1397-1405, Sept. 1989.
    [21]. ______, "Corrections to the LMS algorithm with delayed coefficient adaptation," IEEE Trans. Signal Processing, vol. 40, pp. 230-232, Jan. 1992.
    [22]. V. F. Pisarenko, "The retrieval of harmonics from a covariance function," Geophys. J.R. Astr. Soc., vol. 33, pp. 347-366, 1973.
    [23]. F. Jabloun and B. Champagne, "On the use of masking properties of the human ear in the signal subspace speech enhancement approach," in Int. Workshop on Acoustic Echo and noise Control, (Darmstadt, Germany), Sept. 2001.
    [24]. Yi Hu; Loizou, P.C., "A perceptually motivated approach for speech enhancement," IEEE Trans. Speech and Audio Processing, Vol. 11, Sept. 2003.
    [25]. E. Oja, "A simplified neuron model as a principal component analyzer," J.Math. Biol., vol. 15, pp. 267-273, 1982.
    [26]. A.Harada, K. Nishikawa and H. Kiya, " Pipelined architecture of the LMS adaptive digital filter with the minimum output latency," IEICE Trans., vol. E81-A, no.8, Aug. 1998.
    [27]. A.Harada, K. Nishikawa and H. Kiya, "A pipelined architecture for normalized LMS adaptive digital filter," IEICE Trans., vol. E82-A, no.2, Feb. 1999.
    [28]. M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE Int. Acoust. Speech, Signal Processing, pp.208-211. 1979.
    [29]. P. Lockwood and J.Boudy, "Experiments with a nonlinear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars," Speech Audio Processing, vol. 3, pp. 251-266, 1995.
    [30]. J. Huang and Y. Zhao, "A DCT-based fast signal subspace technique for robust speech recognition," IEEE Trans. Speech and Audio Processing, vol. 8, pp. 747-751, Nov. 2000.
    [31]. Altera Corporation The Programmable Solutions Company [Online]. Available: http://www.altera.com/literature/lit-index.html
    [32]. S. Haykin, Adaptive Filter Theory. Prentice Hall, 4th edition, 2002.
    [33]. I. T. Jollie, Principal Component Analysis. Springer Series in Statistics, Springer-Verlag, 1st ed., 1986.
    [34]. Shi-Huang Chen and Jhing-Fa Wang, "Speech enhancement using perceptual wavelet packet decomposition and Teager energy operator," accepted to appear in The Journal of VLSI Signal Processing Systems, Special Issue on Real World Speech Processing.
    [35]. I. Pinter, "Perceptual wavelet-representation of speech signals and its application to speech enhancement," Computer Speech and Language, 10(1), pp. 1-22, 1996.
    [36]. O. Ghitza, "Auditory model and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech and Audio Processing, vol. 2, pp. 115-132, 1994
    [37]. Lawrence Rabiner and Biing-Hwang Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
    [38]. V.Solo and X. Kong, "Performance analysis of adaptive eigenanalysis algorithms," IEEE Trans. Signal Processing, vol. 46, Mar. 1998.
    [39]. T. Gustafson, "Instrumental variable subspace tracking using projection approximation," IEEE Trans. Signal Processing, vol. 46, Mar. 1998.
    [40]. J. Yang and M.Kaveh, "Adaptive eigensubspace algorithms for direction of frequency estimation and tracking," IEEE Trans. Acoust., Speech Signal Processing, vol. 36, pp. 241-251, 1998.
    [41]. Jhing-Fa Wang, Hsien-Chang Wang and Chung-Hsien Yang, "TAICAR - A Collection of In-Car Mandarin Speech Database in Taiwan," O-COCOSDA2003 / PACLIC17, Singapore.
    [42]. ALTERA, "Synplify & Quartus II Design Methodology," Document Part NO AN-226-4.1 http://www.altera.com , February, 2003.
    [43]. Synplify, "Synplify Pro Tutorial," Synplicity, Inc. http://www.synplicity.com June. 2003.

    下載圖示 校內:2005-08-10公開
    校外:2005-08-10公開
    QR CODE