簡易檢索 / 詳目顯示

研究生: 張維城
Chang, Wei-Chen
論文名稱: 多通道遞迴式類神經網路音訊分析/合成模型暨MPEG-4結構音訊下之應用
A Multi-Channel Recurrent Network Analysis/Synthesis Model and Its MPEG 4 Structured Audio Application
指導教授: 蘇文鈺
Su, Wen-Yu
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2002
畢業學年度: 90
語文別: 中文
論文頁數: 60
中文關鍵詞: 交換式鋼琴合成法耦合現象振幅調變無限脈衝響應模型合成法耦合弦模型類神經網路訓練演算法
外文關鍵詞: amplitude modulation, IIR synthesis method, neural network training algorithm, commuted piano synthesis method, coupled string model, coupling phenomenon
相關次數: 點閱:72下載:4
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 敲弦樂器如鋼琴通常會有數根琴弦綁在同一個琴橋上的情形,而由於強烈的耦合現象(coupling phenomenon),使得產生的樂音具有高度複雜的振幅調變情形。所以,如何決定一適當的合成模型及其參數以合成近似原音的合成音一直是個困難的問題。
    本論文基於三個前人的工作:耦合弦模型(coupled string model)、交換式鋼琴合成法(commuted piano synthesis method)、無限脈衝響應模型合成法(IIR synthesis method),提出一個多通道遞迴式類神經網路架構。我們期望在不具備樂器物理特性的知識的情況下,電腦能藉著此一類神經網路訓練演算法自動地分析樂音以擷取適當的合成參數。

    Struck string instruments such as pianos usually have groups of strings terminated at some common bridges, respectively. Because of the strong coupling phenomenon, the produced tones exhibit highly complex amplitude modulation patterns. Therefore, it is difficult to adjust synthesis model parameters according to the recorded instruments such that the synthesized tones can match the measurements.
    In this paper, a multi-channel synthesis model is proposed based on three previous works, the coupled string model, the commuted piano synthesis method and the IIR synthesis method. This work attempts to automatically extract the synthesis model parameters by using a neural-network training algorithm without knowing any physical properties of the instruments.

    摘要 i 誌謝 iii 目錄 iv 表目錄 vi 圖目錄 vii 符號 ix 第一章 緒論 1 1.1 引言 1 1.2 研究背景與動機 1 1.3 章節概要 2 第二章 耦合現象 3 2.1 耦合現象的產生 3 2.2 鋼琴的模型合成法 7 2.2.1 耦合弦合成模型 7 2.2.2 交換式鋼琴合成法 10 第三章 多通道遞迴式類神經網路模型 14 3.1 分析/合成模型概述 14 3.2 雙通道合成模型範例 15 3.3 類神經網路架構 16 3.3.1 無限脈衝響應模型合成法 16 3.3.2 琴橋網路 17 3.3.3 多層感知網路架構 20 3.4 類神經網路學習演算法 21 3.4.1 SARPROP演算法 23 3.4.2 多階段訓練程序 26 第四章 模擬實驗 28 4.1 鋼琴的分析與討論 28 4.2 揚琴的分析與討論 32 第五章 合成演算法於MPEG4-SA之應用 37 5.1 MPEG4-SA簡介 37 5.2 多通道音訊合成模型的SAOL實作 39 第六章 結論與未來研究方向 48 6.1 結論 48 6.2 未來研究方向 48 參考文獻 50 附錄 53

    [1] Alvin W. Y. Su and S. F. Liang, “Dynamics Modeling of Musical Strings by Linear Scattering Recurrent Networks”, International Computer Symposium 1996.
    [2] Alvin W. Y. Su and S. F. Liang, “Synthesis of Plucked-String Tones by Physical Modeling with Recurrent Neural Networks”, in Proceedings of the IEEE 1997 Workshop on Multimedia Signal Processing, pp.71-76. Princeton, NJ, 1997.
    [3] Alvin W. Y. Su and S. F. Liang, “A Generalized Model-Based Analysis/Synthesis Method for Plucked-String Instruments by Using Recurrent Neural Networks”, the 106th AES Convention and Conference, Invited Paper, 5/1999.
    [4] Alvin W. Y. Su, S. F. Liang and C. T. Lin, “Model-Based Synthesis of Plucked String Instruments by Using a Class of Scattering Recurrent Networks”, IEEE Trans. on Neural Networks, vol.11, no.1, pp.1-16, Jan. 2000.
    [5] Alvin W. Y. Su and S. F. Liang, “A New Automatic IIR Analysis/Synthesis Technique For Plucked-String Instruments”, IEEE Trans. On Speech and Audio, vol.9, no.7, p.747, 10/2001.
    [6] Eric D. Scheirer, “THE MPEG-4 STRUCTURED AUDIO STANDARD”, E15-401D MIT Media Laboratory Cambridge MA 02139.
    [7] Erik Holsinger, “How Music and Computers Work”, Ziff-Davis Press, Calif., 1994.
    [8] F. J. Pineda, “Recurrent Backpropagation and the Dynamical Approach to Adaptive Neural Computation,” Neural Computation, vol. 1, pp. 161-172, 1989.
    [9] G. Weinreich, “Coupled Piano String ”, Journal of the Acoustical Society of America, vol. 62, no. 6, pp. 1474-1484, Dec. 1977.
    [10] H. Szu, “Fast simulated annealing,” in J. S. Denker, ed., Neural Networks for Computing, pp. 420-425, American Institute of Physics, New York, 1986.
    [11] ISO/IEC JTC 1/SC 29/WG 11 N2503-sec5, “Information technology – Coding of audio-visual objects, Part 3 : Audio, Section 5 : Structured Audio” ,1999-3-10.
    [12] J. Chowning, "The Synthesis of Complex Audio Spectra by Means of Frequency Modulation," Journal of the Audio Engineering Society 21(7), 1973; reprinted in Computer Music Journal 1(2), 1977.
    [13] J. O. Smith III, “Waveguide Filter Tutorial”, in Proceedings of the 1987 International Computer Music Conference, pp. 9-16, Champaign-Urbana. 1987.
    [14] J. O. Smith III, “Physical Modeling using Digital Waveguides”, Computer Music Journal special issue on Physical Modeling of Music Instrument, Part I, vol. 16, no. 4, pp.74-91, Winter/1992.
    [15] J. O. Smith III, “Efficient Synthesis of Stringed Musical Instruments”, in Proceedings of the 1993 International Computer Music Conference, pp. 64-71, Tokyo, Japan, September 1993.
    [16] J. O. Smith III and S. A. Van Duyne, “Commuted Piano Synthesis”, in Proceedings of the 1995 International Computer Music Conference, pp. 319-326, Banff. 1995.
    [17] J. O. Smith III, ”Physical Modeling Synthesis Update” Computer Music Journal, vol. 20, no. 2 , pp. 44-56, MIT Press, Summer 1996.
    [18] “MPEG-4 Structured Audio Page of Machine Listening Group in MIT Media Laboratory“, http://sound.media.mit.edu/mpeg4/
    [19] “MPEG-4 Structured Audio: Developer Tools Page of CS Division in UC Berkeley ”, http://www.cs.berkeley.edu/~lazzaro/sa/index.html
    [20] M. Riedmiller and H. Braun, “A direct adaptive method for faster backpropagation learning: The RPROP algorithm,” in Proceedings of ICNN 93, pp. 586-591, San Francisco, CA, 1993.
    [21] N. K. Treadgold and T. D. Gedeon, “Simulated Annealing and Weight Decay in Adaptive Learning: The SARPROP Algorithm”, IEEE Trans. on Neural Networks, vol. 9, no. 4, pp. 662-668, 1998.
    [22] R. J. Williams, and Jing Peng, “An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories,” Neural Computation, vol. 2, pp. 490-501, 1990.
    [23] Samuel Pellman, “An Introduction to the Creation of Electroacoustic Music”, Wadsworth Pub. Co., Calif., 1994.
    [24] S. F. Liang, “Dynamics Modeling of Musical String by ANN”, Master thesis, Depart. Of Control Eng., Chiao-Tung University, Taiwan, 1996.
    [25] S. F. Liang and Alvin W. Y. Su, “Dynamics Modeling of Musical String by linear Scattering Recurrent Network”, ICS’96 Proceedings on Artificial Intelligence, p263-270, Taiwan, 1996.
    [26] S. F. Liang, Alvin W. Y. Su and C. T. Lin, “A New Recurrent-Network Based Music Synthesis Method for Chinese Plucked-String Instruments-Pipa and Qin”, International Joint Conference on Neural Networks, 7/1999.
    [27] S. F. Liang and Alvin W. Y. Su, “Recurrent Neural Network Based Physical Model for the Chin and Other Plucked-String Instruments”, Journal of Audio Engineering Society, vol. 48, no. 11, pp. 1045-1059, 11/2000.
    [28] T. Tolonen, V. Valimaki and M. Karjalainen, “Modeling of Tension Modulation Nonlinearity in Plucked String”, IEEE Trans. On Speech and Audio, vol.8, no.3, pp.300, May 2000.

    下載圖示 校內:2003-08-02公開
    校外:2003-08-02公開
    QR CODE