簡易檢索 / 詳目顯示

研究生: 黃捷
Huang, Jay
論文名稱: 運用樣式索引技術之高效性內涵式音樂檢索
Novel Pattern Indexing Techniques for Efficient Content-based Music Retrieval
指導教授: 曾新穆
Tseng, Vincent Shin-Mu
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2008
畢業學年度: 96
語文別: 中文
論文頁數: 64
中文關鍵詞: 音樂內涵式檢索資料探勘音樂片段搜尋關聯樣式多媒體資料庫
外文關鍵詞: Content-based music retrieval, Data mining, association patterns, music fragment search, multimedia database
相關次數: 點閱:108下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 隨著多媒體擷取技術之進步,多媒體資料已和我們的生活更加緊密結合,尤以音樂為盛。如何準確且有效率地擷取使用者感興趣的音樂是音樂搜尋之主要目的。由於音樂資料是屬於時序性資料,目前許多研究除了利用音樂的低階特徵值外,也同時考慮音樂之時序意義,試著找出關聯性以相互結合。然而,許多方法不斷致力於追求更高準確度之同時,卻往往忽略了效率的考量。在本研究中,我們發展三種快速樣式索引技術,利用編碼原則,讓音樂片段以低維度的群組定義來取代高維度之低階特徵值,藉以降低計算複雜度,同時考慮存在於音樂中的連續關聯性,搭配其關聯程度來產生關聯樣式組合,以提昇搜尋準確率,並透過獨特設計之快速樣式索引樹,可快速地找到在資料庫中最有可能包含該片段的音樂。實驗結果顯示,我們所提出之方法可以在很短的執行時間內完成音樂搜尋,在準確度上也有很好的表現。

    With the progress of multimedia capturing techniques, multimedia data, especially music data is more and more tightly bound up with our life. How to access the user-interested music efficiently and effectively is a hot topic for content-based music retrieval. Because music data is temporal, many previous works used not only the low-level features of music itself but also the sequence of the features. Most of the works put their concentration on enhancing the accuracy but ignore another important issue: the efficiency. In our method, the high dimensionality of low-level features is first reduced to improve the retrieval performance by the labeling approach. Then we make use of the temporal continuity to find the valuable patterns within a sliding window and construct the index structure. Through our proposed method, the music that is related to user’s query can be found rapidly.

    ABSTRACT II 摘要 III 表目錄 VI 圖目錄 VII 第一章 導論 1 1.1 研究目的 1 1.2 問題描述 2 1.3 研究方法概述 3 1.4 論文貢獻 4 1.5 論文架構 4 第二章 文獻探討 5 2.1 音樂內涵分析 5 2.2 時間序列符號化相關研究 6 2.3 字串序列相似比對研究 7 2.4 相關之音樂內涵式搜尋研究 9 2.4.1 以特徵值選取(Feature selection)為基準之音樂搜尋系統 10 2.4.2 以編碼方式為基準之音樂搜尋系統 11 第三章 研究方法 14 3.1 方法架構介紹 14 3.2 音樂資料前處理 16 3.2.1 音樂特徵值擷取 16 3.2.2 音樂特徵值編碼 18 3.2.3 關聯樣式分析 20 3.2.4 建立索引結構 23 3.3 音樂搜尋階段 24 3.4 改良模組 29 3.4.1 AFPI(Advanced Fast Pattern Index) 30 3.4.2 FAA(Fusion of AFPI and Alignment) 36 第四章 實驗分析 39 4.1 實驗分析 39 4.2 實驗規劃 41 4.3 實驗結果分析 42 4.3.1 實驗參數設定 42 4.3.2 音樂片段長度對準確度與效率之影響 44 4.3.3 前N名結果的準確度比對 46 4.3.4 音樂類別對準確度影響 47 4.3.5 資料庫音樂數量上升對準確度之影響 48 4.3.6 方法各步驟執行時間 49 4.3.7 輸入音樂內容發生變更時對準確度之影響 50 4.4 實驗總結 54 第五章 結論 57 5.1 研究結論 57 5.2 未來研究發展及應用 58 5.2.1 未來研究方向 58 5.2.2 未來可應用方向 59

    [1] R. Agrawal, T. Imielinski, and A. Swami, “Mining Association Rules between Sets of Items in Large Databases.” In Proc. of the ACM SIGMOD international conference on Management of data, pp. 207-216, Washington, DC, USA, May 1993.
    [2] R.Agrawal and R.Srikant, “Fast Algorithms for Mining Association Rules.” In Proc. of the 20th International Conference on Very Large Data Bases, pp. 487-499, Santiago de Chile, Chile, September 1994.
    [3] Hsin-Chien Chiang and Jer-Shyan Wu, “Analysis and Development on Pairwise Sequence Alignment Tool : BLAST.” , Chung Hua University, 2003.
    [4] Bin Cui, Jialie Shen , Gao Cong , Heng Tao Shen and Cui Yu, “Exploring Composite Acoustic Features for Efficient Music Similarity Query. ” In Proc. of the Multimedia, Santa Barbara, California, USA, October 23-27, 2006.
    [5] Chan, K. and Fu, A. W. “Efficient Time Series Matching by Wavelets, ” In proc. of the 15th IEEE Int'l Conference on Data Engineering, Sydney, Australia, pp 126-133, Mar 23-26, 1999.
    [6] C. Faloutsos, M. Ranganathan, and Y. Manolopoulos, “Fast Subsequence Matching in Time-Series Databases.” In proc. of the ACM SIGMOD Int’l Conference on Management of Data, Minneapolis, pp 419-429, May 24-27, 1994.
    [7] J. Foote, “An Overview of Audio Information Retrieval,” In proc. of Multimedia, ACM, Press/Springer-Verlag , pp. 2-11, January 1992.
    [8] J. Foote, “Content-based retrieval of music and audio,” in SPIE Multimedia Storage Archiving Systems II, vol.3229, C.C.J. Kuo et al., Eds., pp. 138-147, 1997.
    [9] J. Foote, “Arthur: Retrieving orchestral music by long-term structure.” In Proc. of the First International Symposium on Music Information Retrieval (ISMIR), Plymouth, USA, October 2000.
    [10] J. Foote , Matthew Cooper , Unjung Nam , “Audio Retrieval by Rhythmic Similarity”, In Proc. of Institute Research Coordination Acoustics Music (IRCAM) , 2002.
    [11] Yu-Ting Huang , Vincent S. Tseng, ”Efficient Content-based Video Retrieval by Using Pattern Indexing Technique .” , National Cheng Kung University , Tainan , Taiwan , R.O.C , 2007.
    [12] D. Huron, “Perceptual and cognitive applications in music information retrieval.” In Proc. of International Symposium of Music Information Retrieval (ISMIR), Plymouth, Massachusetts, October 23-25, 2000.
    [13] N. Hu and R. B. Dannenberg. “A comparison of melodic database retrieval techniques using sung queries.” In Proc. of the Second ACM/IEEE-CS Joint Conference on Digital Libraries, pages 301–307, Portland, USA, July 2002.
    [14] Iman S.H. Suyoto, Alexandra L. Uitdenbogerd and Falk Scholer, “Effective Retrieval of Polyphonic Audio with Polyphonic Symbolic Queries.” In Proc. of Multimedia Information Retrieval (MIR) , Augsburg, Bavaria , Germany , September 28-29 ,2007.
    [15] Jamedo music database : http://www.jamendo.com/en/
    [16] Jean-Julien, Aucouturier and M. Sandler, “Finding repeating patterns in acoustical musical signals: Applications for audio thumbnailing.” In Proc. of the Audio Engineering Society 22nd International Conference on Virtual, Synthetic and Entertainment Audio, pages 412–421, Espoo, Finland, June 2002.
    [17] K. Kashino, G. Smith, and H. Murase, “Time-Series Active Search for Quick Retrieval of Audio and Video,” In Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Phoenix, Arizona, March 15-19, 1999.
    [18] E. Keogh, K. Chakrabarti, M. Pazzani and S. Mehrotra, ” Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases.” In Proc. of ACM SIGMOD Conference on Management of Data. Santa Barbara, CA, pp 151-162, May 21-24, 2001.
    [19] J. Kruskal and D. Sankoff, “An Anthology of Algorithms and Concepts for Sequence Comparison,” In Time Warps, String Edits, and Macromolecules: the Theory and Practice of String Comparison, eds. D. Sankoff and J. Kruskal, CSLI , Publications, (Stanford) ,1999.
    [20] Qing Li , Byeong Man Kim , Dong Hai Guan , Duk whan Oh , “A Music Recommender Based On Audio Feature” In Proc. of SIGIR’04 , Sheffield, South Yorkshire, UK, July 25-29, 2004.
    [21] R. J. Larsen and M. L. Marx, ”An Introduction to Mathematical Statistics and Its Applications.” Prentice Hall, Englewood, Cliffs, N.J. 2nd Edition.
    [22] Chih-Chin Liu and Chuan-Sung Huang , “A Singer Identification Technique for Content-Based Classification of MP3 Music Object” In Proc. of 11th ACM International Conference on Information and Knowledge Management (CIKM) ,Mclean, Virginia, November 4-9, 2002.
    [23] Jessica Lin , Eamon Keogh , Stefano Lonardi and Bill Chiu , “A Symbolic Representation of Time Series , with implications for Streaming Algorithm” , In Proc. of Data Mining and Knowledge Discovery Workshop (DMKD) , San Diego ,USA , ACM 1-58113-763-x , June 13 ,2003.
    [24] M. Müller, F. Kurth, and T. Röder, “Towards an efficient algorithm for automatic score-to-audio synchronization”. In Proc. of International Conference on Music Information Retrieval (MIR), New York , USA, October 15-16, 2004.
    [25] B. Pardo and M. Sanghi, “Polyphonic musical sequence alignment for database search.” In Proc. of the International Conference on Music Information Retrieval (MIR), London, England, September 11-25, 2005.
    [26] Kyu-Sik Park, Won-Jung Yoon, Kang-Kue Lee, Sang-Heon On and Ki-Man Kim, “MRTB Framework: A Robust Content-Based Music Retrieval and Browsing.” In IEEE Transactions on Consumer Electronics , Vol.51 , No.1 , February 2005.
    [27] Jermy Pickens , Juan Pablo Bello , Giuliano Monti , Tim Crawford , Matthew Dovey , Mark Sandler and Don Byrd , “Polyphonic Score Retrieval Using Polyphonic Audio Queries : A Harmonic Modeling Approach. “In Proc. of Institute Research Coordination Acoustics Music (IRCAM) , 2002.
    [28] J. Pickens and C. Iliopoulos, “Markov random fields and maximum entropy modeling for music information retrieval”. In Proc. of International Symposium of Music Information Retrieval (ISMIR), London, UK, September 11-15, 2005.
    [29] J. Saunders, “Real-time discrimination of broadcast speech/music.” In IEEE International conference on Acoustics , speech , and Signal Processing (ICASSP-96), vol.11 , pp. 993-996 , Atlanta, Georgia, May 7-10, 1996.
    [30] S. Shalev-Shwartz, J. Keshet, and Y. Singer, “Learning to align polyphonic music.” In Proc. of International Symposium of Music Information Retrieval (ISMIR), Barcelona, Spain. October 10-14, 2004.
    [31] J. Shifrin and W. P. Birmingham, “Effectiveness of HMM-based retrieval on large databases.” In Proc. of International Symposium of Music Information Retrieval (ISMIR), Washington, D.C., USA, October 2003.
    [32] F. Soulez, X. Rodet, and D. Schwarz, “Improving polyphonic and poly-instrumental music to score alignment.” In Proc. of International Symposium of Music Information Retrieval (ISMIR), Washington, D.C., USA, October 2003.
    [33] R. Srikant and R. Agrawal, “Mining Generalized Association Rules.” In Proc. of the 21th International Conference on Very Large Data Bases, pp. 420-431, Zurich, Switzerland, September 1995.
    [34] Dacheng Tao , Hao Liu , Xiaoou Tang , “K-BOX: A Query-by-Singing based Music Retrieval System”. In Prof. of Multimedia (MM) , New York , USA , October 10-16 , 2004.
    [35] R. Typke, F. Wiering, and R. C. Veltkamp, “A search method for notated polyphonic music with pitch and tempo fluctuations”. In Proc. of International Symposium on Music Information Retrieval (ISMIR) , Barcelona, Spain. October 10-14, 2004.
    [36] A. L. Uitdenbogerd and J. Zobel. “Melodic matching techniques for large music databases.” In Proc. of the 1999 ACM Multimedia Conference, pages 57–66, Orlando, USA, November 1999.
    [37] E. Wold, T. Blum, D. Keislar, and J. Wheaton, “Content-based classification, search, and retrieval of audio,” In IEEE , Multimedia, vol.3, no. 2, Boston, November 18-22, 1996.

    下載圖示 校內:2009-08-18公開
    校外:2010-08-18公開
    QR CODE