簡易檢索 / 詳目顯示

研究生: 林孜燕
Lin, Tzu-Yen
論文名稱: 智慧型BioAgent及基因資料管理之整合系統
Integrated Intelligent BioAgent and Gene Data Management System for Bio-Informatiics
指導教授: 李強
Lee, Chiang
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2005
畢業學年度: 93
語文別: 中文
論文頁數: 73
外文關鍵詞: database, data management, automatic integration mechanism, biological information
相關次數: 點閱:75下載:1
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  •   對於位在網路上molecular biology distributed databases擁有大量且為異質性資料庫系統,molucular biologists從其中掘出應研究需求獨特的基因資料並將所需的基因資料傳至使用者端,而資料之間所應被具有意義的聯繫關係並使其支援相異file format資料與資料間的交互參考性,縮短資料量處理時間和使得使用者端資料的多次可再利用性,依照個別研究自定需求的資料呈現上,我們提供一個具有自動化且整合的機制來解決這些問題,達成由系統幫助使用者收集管理基因資料的目標是我們研究主要的主題.

     This thesis discusses the issue of integration of multiple heterogeneous databases storing molecular biological information. This integrated system is mainly used for molecular biologists to allow them mine useful information from the multiple heterogeneous databases. The issues that ought to be resolved include the determination of relationships between data,conformation of heterogeneous data, cross-referencing of data in different databases,minimization of processing time,etc. We propose in this thesis an automatic integration mechanism to resolve these problems so as to achieve the goal of automatic collecting,conformaing,and managing the required information.

    1. Introduction 1 2. Motivation 3 2.1 Distributed Gene Data Retrieval……………………………………3 2.2 Gene Data Storage and Management …………………………………7 3. Related Work 11 3.1 Intelligent BioAgent…………………………………………………11 3.2 Gene Data Information Storage and Management…………………16 3.3 Conclusions ……………………………………………………………20 4. Proposed Solution 22 4.1 Proposed Solution ……………………………………………………22 4.2 Thesis Organization …………………………………………………27 5. System Architecture — Application and Extraction Layer 28 5.1 敘論………………………………………………………………………28 5.2 Intelligent BioAgent…………………………………………………31 5.2.1 Workflow…………………………………………………………31 5.2.2 Data Retrieval Module ………………………………………35 5.2.3 Page Parser ……………………………………………………39 5.2.4 Recovery Process………………………………………………41 5.2.5 Timer ……………………………………………………………44 6. System Architecture — Manipulation Layer and Internal Layer 45 6.1 Update Monitor…………………………………………………………45 6.2 Data Storage Module …………………………………………………47 6.2.1 Gene Data Storage ……………………………………………47 6.2.2 Gene Data儲存格式的命名 ……………………………………59 6.2.3 hyperlink改寫 …………………………………………………60 6.3 Data Compaction and Query Module…………………………………60 6.3.1 查詢蒐集文件……………………………………………………60 6.3.2 查詢文件內容……………………………………………………61 7. System Operations and Related Technology 63 8. Conclusions 70

    [1] Edward S. Chen and Daniel B. Davison, “Distributing molecular biology information:Gopher, WAIS and the University of Houston Gene-Server”, Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing:states of the art and practice-1993, Pages 634-640, 1993.
    [2] Patrick Herde and Peter R. Sibbald, “Integration of molecular biology data collections using object oriented databases and programming”, Addendum to the proceedings on Object-oriented programming system, languages, and applications (Addendum), Pages 177-178, 1992.
    [3] UniGene, Homo Sapiens Library browser, http://www.ncbi.nlm.nih.gov/UniGene/UGOrg.cgi?TAXID=9606
    [4] J. Setubal and J. Meidanis, “Introduction to computational molecular biology”, PWS Publishing Company, 1997
    [5] Windows MSDN online, http://msdn.microsoft.com/default.asp
    [6] UniGene, http:// www.ncbi.nlm.nih.gov/UniGene/UGOrg.cgi
    [7] GenBank, http:// www.ncbi.nlm.nih.gov/
    [8] “HyperText Markup Language (HTML)”, HyperText Markup Language (HTML): Working and Background Materials, http://www.w3.org/hypertext/WWW/MarkUp/MarkUp.html
    [9] M. Cutler, H. Deng, S. S. Maniccam, and W. Meng, “A New Study on Using HTML Structures to Improve Retrieval”, Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence, Pages 406-409, 1999
    [10] Hasan M. Jamil, “Achieving Interoperability of Genome Databases Through Intelligent Web Mediators”, International Conference on Bioinformatics and Biomedical Engineering, Pages 118-125, 2000.
    [11] Paola Atzeni、Paolo Merialdo, Giansalvatore Mecca, “To Weave the Web”, Proceedings of the 23rd International Conference on VLDB, Pages 206-215, 1997
    [12] “Hypertext Transfer Protocol (HTTP):A protocol for networked information”, http://www.w3.org/hypertext/WWW/Protocols/HTTP/HTTP2.html
    [13] James Watson and Francis Crick, “Molecular Structure of Nucleic Acids”, Nature, Vol. 171, Page 737, 1953
    [14] Fernando Ferri, Christina Chiselli, Patrizia Grifoni and Macro Padula, “Toward A Retrieval of HTML Documents Using A Semantic Approach”, IEEE International Conference on Multimedia and Expo(Ⅲ), Pages 1571-1574, 2000
    [15] TelePort, http://www.tenmax.com
    [16] David W. Cheung, Ben Kao, Joseph Lee, “Discovering User Access Patterns on the World Wide Web”, Proceedings of the 1st Pacific-Asia Conference on Knowledge Discovery and Data Mining, Pages 1-14
    [17] Ling Liu, Calton Pu, Wei Tang, David Buttler, John Biggs, Tong Zhou, Paul Benninghoff, Wei Han, Fenghua Yu, “CQ:a personalized update monitoring toolkit”, Proceeding of ACM Intl’ Conference on Management of Data (SIGMOD), Pages 547-549, 1998
    [18] Seung-Jin Lim, Tiu-Kai Ng, “WebView:A Tool for Retrieving Internal Structures and Extracting Information from HTML Documents”, Proceedings of the 6th International Conference on Database Systems for Advanced Applications, Pages 71-80, 1999
    [19] UniGene Sequences, http://www.ncbi.nlm.nih.gov/UniGene/
    [20] Genome, http://genome.chop.edu
    [21] ACM, http://www.acm.org
    [22] D. Harman, E. Fox, R. Baeza-Yates, and W. Lee, “Inverted files”, In W. Frakes and R. Baeza-Yates, editors, Information Retrieval: Algorithms and Data Structures, Chapter 3, Pages 28-43, 1992
    [23] Seung-Jin Lim, Yiu-kai Ng, “An Automated Approach for retrieving hierarchical data from HTML tables”, Proceedings of the 8th International Conference on Information and Knowledge Management, Pages 466-474, 1999
    [24] R. S. Boyer and J. S. Moore, “A Fast String Searching Algorithm”, Communications of the ACM, Volume 20, Issue 10, Pages 762-772, 1977
    [25] D. E. Knuth, J. H. Morris, V. B. Pratt, “Fast Pattern Matching in Strings”, SIAM Journal of Computing, Pages 323-350, 1977
    [26] Hung-Yu Kao, Shian Hua Lin, Jan-Ming Ho, Ming-Syan Chen, “Entropy-Based Link Analysis for Mining Web Informative Structures”, Proceedings of the 11th International Conference On Information and Knowledge Management, Pages 574-581, 2002

    下載圖示 校內:2006-01-19公開
    校外:2006-01-19公開
    QR CODE