研究生: |
陳震宇 Chen, Cheng-Yu |
---|---|
論文名稱: |
自動產生淬取基因與基因作用關係之規則 Mining Extraction Rules from Biomedical Documents |
指導教授: |
蔣榮先
Chiang, Jung-Hsien |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Department of Computer Science and Information Engineering |
論文出版年: | 2003 |
畢業學年度: | 91 |
語文別: | 中文 |
論文頁數: | 62 |
中文關鍵詞: | 文件探勘、資訊萃取、萃取規則、序列性模式探勘 |
外文關鍵詞: | Text Mining、Information Extraction、Extraction |
相關次數: | 點閱:89 下載:1 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出結合文件探勘和序列性模式探勘的方法,以自動產生淬取醫學文件中”基因和基因之間作用關係”的規則;內容共分為以下若干個部份: 首先以文件中自然語言書寫的習慣,配合文件探勘技術和序列性模式探勘方法,找出在句中可能具有描述基因和基因之間作用關係的模式;接下來將已產生的模式,作為資訊淬取技術的”淬取規則”,由醫學文件中符合淬取規則的基因之間的關係淬取出來,然後將這些輸出結果合併,例如,”基因A 作用關係基因B”和”基因B 作用關係 基因C”可以結合成”基因A 作用關係 基因B 作用關係 基因C”,所有的關係句合併之後,將會形成”基因-基因作用關係網路”,其中,”作用關係字 ”等事先定義的正負向調控關係關鍵字,最後,以圖形的方式呈現基因和基因之間的作用關係。我們會提供以圖形化介面的程式提供使用由大量醫學文件中淬取基因之間的作用關係,並以”PubMed”提供的醫學文件,作為驗証本論文在實際上可行性的資料集。
None
參考文獻
[1] Agrawal R. and R. Srikant. ; Fast algorithms for mining association rules in large databases. In VLDB-94, September 1994.
[2] Agrawal R. and R. Srikant. ; Mining Sequential Patterns. In Proc. of the 11th Int'l Conference on Data Engineering, Taipei, Taiwan, March 1995.
[3] Apte C, F. Damerau and S. Weiss. ; Text Mining with Decision Rules and Decision Trees. Workshop on Learning from Text and the Web, Conference on Automated Learning and Discovery, Pitts burgh, PA, 1998.
[4] Blaschke C. and A. Valencia ; The Frame-based Module of the SUISEKI Information Extraction System. IEEE Intelligent Systems, pages 14-20, 2002.
[5] Brill E. ; A simple rule-based part of speech tagger. In Proceedings of the Third Conference on Applied Natural Language Processing, ACL, Trento, Italy, 1992.
[6] Grishman R. ; Information extraction: Techniques and challenges. In Maria Teresa Pazienza, editor, Information Extraction. Springer-Verlag, Lecture Notes in Artificial Intelligence, Rome, 1997.
[7] Jenssen T., A. Lagreid, J. Komorowski and E. Hovig ; A literature network of human genes for high-throughput analysis of gene expression. Nature genetics, volume 28, pages 21 – 28, 2001.
[8] Marcotte E. M., L. Xenarios, and D. Eisenberg ; Mining literature for protein-protein interactions. Bioinformatics, Vol. 17, pages 359-363, 2001.
[9] Neto, J. L., A. D. Santos, C. A. A. Kaestner, and A. A. Freitas (2000). ; Document Clustering and Text Summarization. In Proceedings, 4th Int. Conference on Practical Applications of Knowledge Discovery and Data Mining (PADD-2000), 41-55. London: The Practical Application Company.
[10] Ono T. ; Automated extraction of information on protein-protein interactions from the biological literature, Bioinformatics Volume 17, Issue 11, November 2 2001
[11] Palakal M., M. Stephens, S. Mukhopadhyay, R. Raje, and S. Rhodes ; A Multi level Text Mining Method to Extract Biological Relationships. IEEE Computer Society Bioinformatics Conference, August 14 - 16, 2002.
[12] Park J. S., M. Chen, and P. S. Yu. ; An effective hash based algorithm for mining association rules. In ACM SIGMOD Intl. Conf. Management of Data, May 1995.
[13] Riloff E. ; Automatically constructing a dictionary for information extraction tasks. In Proceedings of the Eleventh National Conference on Artificial Intelligence, AAAI Press / MIT Press, pages 811-816, 1993.
[14] Riloff E. and J. Shoen ; Automatically Acquiring Conceptual Patterns Without an Annotated Corpus. In Proceeding of the Third Workshop on very Large Corpora, 148-161, 1995.
[15] Riloff E. ; Automatically Generating Extraction Patterns from Untagged Text. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, Portland, OR, 1044-1049, 1996.
[16] Staab S. ; Mining information for Functional Genomics. IEEE Intelligent System, 2002.
[17] Thomsa J., D. Milward, C. Ouzounis, S. Pulman and M. Carroll ; Automatic Extraction of Protein Interactions from Scientific Abstract. In Altman et al. [3], pages 538—549, 2000.
[18] Usama M. F., G. Piatetsky-Shapiro, and P. Smyth. ; Advances in Knowledge Discovery and Data Mining, chapter From Data Mining to Knowledge Discovery: An overview. AAAI/MIT Press, 1996.
[19] Yen S. J. and A.L.P. Chen ; An Efficient Approach to Discovering Knowledge from Large Databases. 4th International Conference on Parallel and Distributed Information Systems (PDIS '96) , December 18 - 20, 1996.