| 研究生: |
張凱行 Chang, Kai-Hsing |
|---|---|
| 論文名稱: |
一個基於子空間追蹤演算法之語音強健系統及其硬體設計 Algorithm/Hardware Design of a Subspace Tracking Based Speech Enhancement System |
| 指導教授: |
王駿發
Wang, Jhing-Fa |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2004 |
| 畢業學年度: | 92 |
| 語文別: | 英文 |
| 論文頁數: | 72 |
| 中文關鍵詞: | 子空間追蹤 、雜訊消除 、硬體設計 |
| 外文關鍵詞: | hardware design, subspace tracking, noise reduction |
| 相關次數: | 點閱:93 下載:4 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在此篇論文中,我們基於子空間追蹤演算法設計一個語音強鍵系統。所提出的演算法中,整合了聽覺分頻濾波推架構來做為前處理。藉由實驗模擬結果發現,以TAICAR資料庫音檔來做測試,所提出的架構比傳統子空間方法語音強健效果較好。由其在汽車雜訊環境中,低頻雜訊 (低於1KHz) 經過分頻濾波推後能被有效的去除。對於即時系統應用,我們設計一個子空間追蹤演算法的管線化超大型積體電路架構。不用延持技術,應用前瞻技術方法推導硬體,子空間追蹤演算法中的資料相依的危障能被順利解決。我們所推導的管線子空間演算法架構之收斂速度比延持的PASTd架構還要更快。在硬體設計中,為了減少晶片面積,我們共用乘法器來分擔多個乘法,這使得乘法器數目減少相對節省面積大小,也使得且濾波器階數和乘法器數目無關。我們所推導出來的子空間追蹤演算法實現於ARM-Based Aletra EpxA10發展板上。其工作頻率為9.7MHz。
In this thesis, we describe a design of signal subspace speech enhancement based on subspace tracking algorithm. The proposed algorithm incorporates a perceptual filterbank which is derived from psycho-acoustic model with signal subspace processing. The experimental results which were obtained by testing TAICAR database show that our approach is better than conventional subspace methods. The low frequency noises (below 1KHz) in car noisy environments are suppressed efficiently after applying the perceptual filterbank. For real time applications, we derive a pipelined VLSI architecture of the subspace tracking algorithm. The data hazard of subspace tracking algorithm is solved by using Look-Ahead method without delayed updating. The convergence rate of our architecture is faster than those of delayed PASTd architectures. To save the chip area, a shared technique for the arithmetic of multiplication units is adopted. It makes the number of multipliers be independent with the filter length. This architecture has been realized in ARM-based ALTERA EPXA10 Development Board with frequency at 9.7MHz. Simulation results are presented to validate our algorithm and hardware architectures.
[1]. Y. Ephraim and H. L. Van-Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp. 251-266, July 1995.
[2]. B. Yang, "Projection approximation subspace tracking," IEEE Trans. Signal Processing, vol. 43, pp. 95-107, Jan. 1995.
[3]. Fan Xu and Willson, A.N., Jr., "Novel systolic architectures for signal subspace tracking,"in Proc. IEEE Circuits and Systems,Vol.2 , pp.8-11 Aug. 2000.
[4]. F. Xu and A.N. Jr., "A high-performance architecture for an RLS-like eigenvector algorithm," in Proc. of ICSP, pp. 559-562, 2000.
[5]. M. Kuropatwinski, D. Leckschat, K. Kroschel, and A. Czyzewski, "Integration of speech enhancement and coding techniques," in Proc. IEEE Workshop on Speech Coding, (Porvoo, Finland), pp. 168-170, May 1999.
[6]. D. Kim, Y. Park, I. Kim, and S. Park, "The effect of the speech enhancement algorithm for the sensorineural hearing impairment listener," in Proc. Twentieth Annual Int. Conf. IEEE Eng. in Med. and Bio. Soc., vol. 6, (Piscataway, New Jersey), pp. 3150-3153, Oct. 1998.
[7]. D. O'Shaughnessy, P. Kabal, D. Bernardi, L. Barbeau, C.-C. Chu, and J.-L. Moncet,"Applying speech enhancement to audio surveillance," in Proc. IEEE Int. Carnahan Conf. on Crime Countermeasures, (Lexington, KY), pp. 69-71, Oct. 1988.
[8]. B. L. Sim, Y. C. Tong, J. S. Chang and C. T. Tan, "A parametric formulation of the generalized spectral subtraction method, " IEEE Trans. Speech and Audio Processing, vol. 6, pp. 328-337, Jul. 1998.
[9]. J. Rissanen, "MDL denoising," IEEE Trans. Info. Theory, vol. 46, pp. 2537-2543, Nov. 2000.
[10]. X. Shen and L. Deng, "A dynamic system approach to speech enhancement using the H-infinity filtering algorithm," IEEE Trans. Speech and Audio Processing, vol. 7, pp.391-399, Jul. 1999.
[11]. W. G. Knecht, M. E. Schenkel and G. S. Moschytz, "Neural network filters for speech enhancement," IEEE Trans. Speech and Audio Processing, vol. 3, pp. 433-438, Nov. 1995.
[12]. R. DeGroat, "Noniterative subspace tracking," IEEE Trans. Signal Processing, vol. SP-40, pp. 571-577, March 1992.
[13]. P. Strobach, "Bi-iteration SVD subspace tracking algorithms," IEEE Trans. Acoust., Speech, Signal Processing, vol. 45, pp. 1222-1240, May 1997.
[14]. I. Karasalo, "Estimating the covariance matrix by signal subspace averaging," IEEE Trans. Acoust., Speech, Signal Processing, vol. 34, pp. 8-12, Feb. 1986.
[15]. Carlos E. Davila, "Efficient, High performance, Subspace Tracking for Time-Domain Data," IEEE Trans. Signal Processing, vol. 48, pp. 3307 - 3315, Dec. 2000.
[16]. M. Klein, P. Kabal, "Signal subspace speech enhancement with perceptual post-filtering," in Proc. IEEE ICASSP, May. 2002, pp.537-540.
[17]. Jong Uk Kim, S.G. Kim and C.D. Yoo, "The incorporation of masking threshold to subspace speech enhancement," in Proc. IEEE ICASSP, Apr. 2003.
[18]. A. Rezayee and S. Gazor, "An adaptive KLT approach for speech enhancement," IEEE trans. Speech Audio Processing, vol. 9, pp.87-95, Feb. 2001.
[19]. N. R. Shanbhag and K. K. Parhi, "Relaxed look-ahead pipelined LMS adaptive filters and their application to ADPCM coder," IEEE trans. Circuit SySt. II, vol. 40, pp.753-766, Dec. 1993.
[20]. G. Long, F. Ling and J.G. Proakis, "The LMS algorithm with delayed coefficient adaptation," IEEE Trans. Acoust. Speech, Signal Processing, vol. 37, pp. 1397-1405, Sept. 1989.
[21]. ______, "Corrections to the LMS algorithm with delayed coefficient adaptation," IEEE Trans. Signal Processing, vol. 40, pp. 230-232, Jan. 1992.
[22]. V. F. Pisarenko, "The retrieval of harmonics from a covariance function," Geophys. J.R. Astr. Soc., vol. 33, pp. 347-366, 1973.
[23]. F. Jabloun and B. Champagne, "On the use of masking properties of the human ear in the signal subspace speech enhancement approach," in Int. Workshop on Acoustic Echo and noise Control, (Darmstadt, Germany), Sept. 2001.
[24]. Yi Hu; Loizou, P.C., "A perceptually motivated approach for speech enhancement," IEEE Trans. Speech and Audio Processing, Vol. 11, Sept. 2003.
[25]. E. Oja, "A simplified neuron model as a principal component analyzer," J.Math. Biol., vol. 15, pp. 267-273, 1982.
[26]. A.Harada, K. Nishikawa and H. Kiya, " Pipelined architecture of the LMS adaptive digital filter with the minimum output latency," IEICE Trans., vol. E81-A, no.8, Aug. 1998.
[27]. A.Harada, K. Nishikawa and H. Kiya, "A pipelined architecture for normalized LMS adaptive digital filter," IEICE Trans., vol. E82-A, no.2, Feb. 1999.
[28]. M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE Int. Acoust. Speech, Signal Processing, pp.208-211. 1979.
[29]. P. Lockwood and J.Boudy, "Experiments with a nonlinear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars," Speech Audio Processing, vol. 3, pp. 251-266, 1995.
[30]. J. Huang and Y. Zhao, "A DCT-based fast signal subspace technique for robust speech recognition," IEEE Trans. Speech and Audio Processing, vol. 8, pp. 747-751, Nov. 2000.
[31]. Altera Corporation The Programmable Solutions Company [Online]. Available: http://www.altera.com/literature/lit-index.html
[32]. S. Haykin, Adaptive Filter Theory. Prentice Hall, 4th edition, 2002.
[33]. I. T. Jollie, Principal Component Analysis. Springer Series in Statistics, Springer-Verlag, 1st ed., 1986.
[34]. Shi-Huang Chen and Jhing-Fa Wang, "Speech enhancement using perceptual wavelet packet decomposition and Teager energy operator," accepted to appear in The Journal of VLSI Signal Processing Systems, Special Issue on Real World Speech Processing.
[35]. I. Pinter, "Perceptual wavelet-representation of speech signals and its application to speech enhancement," Computer Speech and Language, 10(1), pp. 1-22, 1996.
[36]. O. Ghitza, "Auditory model and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech and Audio Processing, vol. 2, pp. 115-132, 1994
[37]. Lawrence Rabiner and Biing-Hwang Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
[38]. V.Solo and X. Kong, "Performance analysis of adaptive eigenanalysis algorithms," IEEE Trans. Signal Processing, vol. 46, Mar. 1998.
[39]. T. Gustafson, "Instrumental variable subspace tracking using projection approximation," IEEE Trans. Signal Processing, vol. 46, Mar. 1998.
[40]. J. Yang and M.Kaveh, "Adaptive eigensubspace algorithms for direction of frequency estimation and tracking," IEEE Trans. Acoust., Speech Signal Processing, vol. 36, pp. 241-251, 1998.
[41]. Jhing-Fa Wang, Hsien-Chang Wang and Chung-Hsien Yang, "TAICAR - A Collection of In-Car Mandarin Speech Database in Taiwan," O-COCOSDA2003 / PACLIC17, Singapore.
[42]. ALTERA, "Synplify & Quartus II Design Methodology," Document Part NO AN-226-4.1 http://www.altera.com , February, 2003.
[43]. Synplify, "Synplify Pro Tutorial," Synplicity, Inc. http://www.synplicity.com June. 2003.