成功大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	江佩芳 Chiang, Pei-Fang
論文名稱：	混合激發線性預測語音編碼之研究 The Research of Mixed Excitation Linear Prediction ( MELP ) Speech Coding
指導教授：	廖德祿 Liao, Teh-Lu
學位類別：	碩士 Master
系所名稱：	工學院 - 工程科學系 Department of Engineering Science
論文出版年：	2002
畢業學年度：	90
語文別：	英文
論文頁數：	66
中文關鍵詞：	語音編碼
外文關鍵詞：	MELP, speech coding
相關次數：	點閱：105 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

語音編碼不論是在資料壓縮或是語音的傳輸上都極為重要，而語音傳輸是目前最主要也最普遍的通訊傳輸服務。儘管語音編碼技術很早就發展出來，但是隨著科技的發展依舊不斷的推陳出新。新的語音編碼傳輸位元率已由8Kbps（CELP）與4.8Kbps（CS-ACELP）發展至2.4Kbps（MELP）。也因為傳輸位元率的降低，語音品質就只能由更複雜的演算法來提升，這使得實現快速語音編碼相當困難。
繼FS1016 CELP 4.8K bps之後, 美國國防部於1996年宣佈了混合激發線性預測編碼器(MELP)，為2.4K bps語音編碼的新標準，其目的是在傳輸網路頻寬限制下，藉由壓縮後資料的傳送，來節省傳輸位元率，並在解碼端能迅速還原音質良好的聲音訊號。
因此本論文也特別針對MELP語音編碼器架構之各單元作一介紹。並著手於程式的撰寫，程式部分是以C語言來實現，且列舉幾個例子來比較合成語音與原始語音的差異，結果驗證合成語音與原始語音以人耳辨識幾乎一樣。最後，提出語音壓縮系統未來之研究方向。

Speech coding is very important in data compression and speech communication. Speech communication is at present the most dominant and common service in telecommunication networks. Though, speech coding technology is well developed, science and technology still to find new ways of speech coding from old theories continuously. The transmission bit rate of new digital speech coding techniques has dropped from 8 Kbps (CELP), 4.8Kpbs (CS-ACELP) to 2.4 Kbps (MELP). As the bit rate decreasing, the speech quality can only be maintained by employing very complex algorithms which are difficult to implement in real time.
After FS1016 CELP 4.8K bps, the MELP vocoder was selected as the new 2400 bps Federal Standard speech coder by the United Stated Department of Defense Digital Voice Processing Consortium (DDVPC) in 1996. The purpose is in limited bandwidth of the transmission network by data transfer after compression, to economize bit rates, and to return to high quality of the original speech signal quickly in the decoder.
Hence, the units of MELP coder will be described in this thesis. The algorithm will be programmed in C language. We list several examples to compare the difference between the synthesized speech and the original speech. Experiments show that the synthesized speech sounds almost identical to the original speech. Finally, research direction in the future of speech compression system will be discussed.

Abstract in Chinese	I
Abstract in English	III
Acknowledgement	V
Contents	VI
List of Figures	IX
List of Tables	XI

Chapter 1  Introduction	1
1 Motivation	1
2 Organization	2
Chapter 2  Speech Coding	3
1 Coding Strategies	3
1.1 Waveform Coding	4
1.2 Parametric Coding	5
1.3 Hybrid Coding	5
2 LPC Basics	6
2.1 Linear Predictive Analysis	7
2.2 Solutions to LPC Analysisn	9
3 Pitch Prediction	13
4 Analysis-by Synthesis (AbS) Coding of Speech	14
5 Excitation Signal	15
6 Voice Compression Standards	16
Chapter 3  MELP Algorithm Description	17
1 MELP coder Introduction 	17
2 Encoder	17
2.1 Low Frequency Remove	18
2.2 Pitch Calculation	18
2.3 LPC Analysis	25
2.4 Aperiodic Flag	26
2.5 Fourier Magnitude Calculation	26
2.6 Quantization	26
2.7 Error Protection And Bit Packing	33
2.8 Transmission Bit Stream	33
3 Decoder	34
3.1 Bit Unpacking and Error Correction	35
3.2 Noise Attenuation	38
3.3 Parameter Interpolation	39
3.4 Mixed Excitation Generation	40
3.5 Adaptive Spectral Enhancement	43
3.6 Linear Prediction Synthesis	44
3.7 Gain Adjustment	45
3.8 Pulse Dispersion	45
3.9 Synthesis Loop Control	45
Chapter 4  Implementation of MELP Vocoder and Results	46
1 Analog Specification	46
1.1 Format	46
1.2 Attributes	47
2 Implementation and Results	48
Chapter 5  Conclusions and Future Research Works	61
1 Conclusions	61
2 Future Research Works	61
References	63
Appendix	65

                                    

[1] A. M. Kondoz, “Digital speech :/coding for low bit rate communication systems”, J. Wiley, Chichester, 1994.
[2] Speech coding: http://cslu.cse.ogi.edu/HLTsurvey/ch10node4.html
[3] T.T. Teo, E.C. Tan,” Implementation of 2400 bps MELP vocoder on TMS320C44”, Signal Processing Proceedings, 1998. ICSP '98. 1998 Fourth International Conference on , 1998, pp. 576-579.
[4] T. E. Tremain, M. A. Kohler, T. G. Champion, “Philosophy and Goals of the DoD 2400 bps Vocoder Selection Process”, proceedings of ICASSP, IEEE, pp. 1137-1140,1996.
[5] MELP Vocoder Algorithm: http://www.aspi.com/tech/specs/pdfs/melp.pdf
[6] A. Nejat Ince, “Digital speech processing :/speech coding, synthesis, and recognition”, Kluwer Academic Publishers, Boston, 1992.
[7] Waveform Codecs: http://www-mobile.ecs.soton.ac.uk/speech_codecs/waveform.html
[8] MELP (Mixed Excitation Linear Prediction) Speech Coder: http://www.hait.ac.il/staff/commEng/Noam_Amir/speech/melp/contents.htm
[9] Source Codecs: http://www-mobile.ecs.soton.ac.uk/speech_codecs/source.html
[10] Hybrid Codecs: http://www-mobile.ecs.soton.ac.uk/speech_codecs/hybrid.html
[11] Lawrence R. Rabiner, Ronald W. Schafer, “Digital processing of speech signals”, Prentice-Hall, N.J., 1978.
[12] ITN530 Corporate Telecommunication, Voice Compression: http://sky.fit.qut.edu.au/~rolf/itn530/ass982/ang/voice.htm
[13] “Analog to Digital Conversion of Voice by 2400 Bit/second Mixed Excitation Linear Prediction (MELP)”, Draft FIBS Publication on MELP, May 28, 1998.
[14] Xiao Lin, Choon Boon Lim, Soo Peng Hoh, Gang Li, Hai Bin Huang, “Real time implementation of low bit rate speech encoder MELP on TMS320C54x DSP”, Signal Processing Proceedings, 2000. WCCC-ICSP 2000. 5th International Conference on , Volume: 2 , 2000, pp. 651-654.
[15] “Final Report for Fixed-Point and TMS320C5x Implementation of the 2.4 kbps MELP Federal Standard Speech Coder”, TI DSPS R&D Center, April 29, 1998.
[16] Teo Tse Tsong, Tan Eng Chong, “Real-Time Implementation of MELP Vocoder”.
[17] McCree, A.; Kwan Truong; George, E.B.; Barnwell, T.P.; Viswanathan, V., “An Enhanced 2.4 Kbit/s MELP Coder”, Speech Coding for Telecommunications, 1995. Proceedings., 1995 IEEE Workshop on, pp. 101-102.
[18] McCree, A.; Kwan Truong; George, E.B.; Barnwell, T.P.; Viswanathan, V., “A 2.4 kbit/s MELP coder candidate for the new U.S. Federal Standard”, Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on , Volume: 1 , 1996, pp. 200-203.
[19] Supplee, L.M.; Cohn, R.P.; Collura, J.S.; McCree, A.V., “MELP: the new Federal Standard at 2400 bps”, Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on , Volume: 2 , 1997, pp. 1591-1594.
[20] A New Federal Standard Algorithm for 2400bps Coded Voice: http://maya.arcon.com/ddvpc/24results.htm
[21] MELP speech coding examples: http://www.ee.bilkent.edu.tr/~cetin/melp.html
[22] Speech Analysis http://svr-www.eng.cam.ac.uk/~ajr/SA95/
[23] Waveform Coding Techniques: http://www.cisco.com/warp/public/788/signalling/waveform_coding.html

校內：2052-07-05公開
校外：2052-07-05公開電子論文尚未授權公開，紙本請查館藏目錄

簡易檢索 / 詳目顯示

相關論文