簡易檢索 / 詳目顯示

研究生: 鄭宇軒
Cheng, Yu-Hsuan
論文名稱: 基於事件知識圖以及廣義知網建立新聞對話服務機器人
News Bot: A Conversational News Service Based On Event-centric Knowledge Graphs and E-HowNet
指導教授: 盧文祥
Lu, Wen-Hsiang
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2017
畢業學年度: 105
語文別: 英文
論文頁數: 55
中文關鍵詞: 事件抽取敘事框架聊天機器人應用服務
外文關鍵詞: Event extraction, Narrative scheme, Chabot application
相關次數: 點閱:83下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 對於人們來說,新聞是了解世界各地大小事情所不可或缺的資源,而網路新聞在近幾年更是變得越來越熱門。雖然現在檯面上已有相當多的新聞應用服務,但是僅有少數可以讓使用者有效率地瞭解新聞內容。為了解決這個問題,我們首先提出Syntax Rule-based Event Processing Model (SREPM)來擷取新聞文章中以(主詞, 動詞, 受詞)模式出現的事件。接著,在廣義知網的知識支援下,我們定義了一個結合「人、事、時、地、物」實體與事件鍊,並且以後者為中心的知識圖架構。最後,基於事件知識圖以及廣義知網裡的知識,我們推出了一個新聞對話服務機器人,它不僅可以提供視覺化後的事件知識圖以供讀者快速了解新聞概要,更能在五合學的基礎上,回應讀者們相關的問題。

    News is an essential source for people to understand recent events around the world, and online news is becoming more and more popular in these years. Despite various services and apps specializing in news, few of them can fit the needs of understanding news stories efficiently. In order to solve this problem, we first proposed a Syntax Rule-based Event Processing Model (SREPM) that can extract events in articles on the basis of (Subject, Verb, Object) pattern. Second, with the supporting knowledge from E-HowNet, we defined an event-centric knowledge graph which combines the recognized entities of Time, Places, Items, Roles and Events with event-chains from SREPM, making visualization of it with the purpose of offering quick outline for user. Last but not least, we proposed News Bot – a conversational news service system based on the event-centric knowledge graph, aiming to answer questions about news articles regarding to Five W’s in journalism.

    摘要 III Abstract IV 致謝 V Table of Contents VI List of Tables VIII List of Figures IX Chapter 1 Introduction 1 1.1 Background 1 1.2 Motivation 2 1.3 Method 3 1.4 Contributions 7 1.5 Organization of this Dissertation 7 Chapter 2 Related Work 8 2.1 Event Extraction 8 2.2 Narrative Scheme 9 2.3 News Application 10 Chapter 3 Method 11 3.1 System Architecture 11 3.2 Tools: CKIP Chinese Parser and E-HowNet 12 3.2.1 CKIP Chinese Parser 12 3.2.2 E-HowNet 13 3.3 Entities Identification 14 3.3.1 Time 15 3.3.2 Places 15 3.3.3 Items 16 3.3.4 Roles 17 3.3.5 Events 18 3.4 Events Processing 19 3.4.1 Event Extraction 21 3.4.2 Event-Chains Generation 24 3.5 News Bot 27 3.5.1 Events Extraction 28 3.5.2 Request Understanding 28 3.5.3 Request Matching 30 3.5.4 Response Generating 32 Chapter 4 Experiments 35 4.1 Experiment Setup 35 4.1.1 Dataset 35 4.1.2 Evaluation Metrics 36 4.2 Experiment of Event Extraction 37 4.3 Experiment of Entities Identification 41 4.4 Experiment of News Bot 45 4.4.1 Testing on Threshold of Similarity 45 4.4.2 Experiment of Answer Generation 46 Chapter 5 Conclusions 49 Acknowledgment 50 Reference 50

    [1] Nathanael Chambers and Dan Jurafsky. 2008. Unsupervised learning of narrative event chains. In Proceedings of ACL-08, Hawaii, USA
    [2] Nathanael Chambers and Dan Jurafsky. 2009. Unsupervised Learning of Narrative Schemas and their Participants. In ACL/IJCNLP 2009
    [3] Chinese Knowledge Information Processing Group Academia Sinica Institute of Information Science. 1998. 中央研究院平衡語料庫的內容與說明.
    [4] Chinese Knowledge Information Processing Group Academia Sinica Institute of Information Science. 2009. E-HowNet Technical Report.
    [5] Delia Rusu, Jožef Stefan International, James Hodson and Anthony Kimball. 2014. Unsupervised Techniques for Extracting and Clustering Complex Events in News. In Proceedings of the Second Workshop on EVENTS, ACL 2014.
    [6] Zhaohui Wu, Chen Liang and C. Lee Giles. 2015. Storybase: Towards Building a Knowledge Base for News Events. In Proceedings of ACL-IJCNLP 2015 System Demonstrations, pages 133–138.
    [7] Gianluca Demartini, Malik Muhammad Saad Missen, Roi Blanco and Hugo Zaragoza. 2010. Entity summarization of news articles. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pages 795-796.
    [8] Piek Vossen, Tommaso Caselli and Yiota Kontzopoulou. 2015. Storylines for structuring massive streams of news. In Proceedings of the First Workshop on Computing News Storylines, pages 40–49.
    [9] Bishan Yang and Tom Mitchell. 2016. Joint Extraction of Events and Entities within a Document Context. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pp. 289-299.
    [10] Shize Xu, Shanshan Wang and Yan Zhang. 2013. Summarizing Complex Events: a Cross-modal Solution of Storylines Extraction and Reconstruction. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1281–1291.
    [11] Gevorg Poghosyan and Georgiana Ifrim. 2016. Real-time News Story Detection and Tracking with Hashtags. In Proceedings of 2nd Workshop on Computing News Storylines, pages 20–29.
    [12] C. C. Aggarwal and K. Subbian. 2012. Event detection in social streams. In SDM, 2012.
    [13] Newman, David, Chemudugunta Chaitanya, Smyth Padhraic, and Steyvers Mark. 2006. Analyzing entities and topics in news articles using statistical topic models. In IEEE, 2006.
    [14] Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space.
    [15] Min-Chul Yang, Nan Duan, Ming Zhou and Hae-Chang Rim. 2014. Joint Relational Embeddings for Knowledge-based Question Answering. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 645–650.
    [16] Zhiwei Li, Mingjing Li, Wei-ying Ma. 2005. A Probabilistic Model for Retrospective News Event Detection. In SIGIR, 2005.
    [17] Chuan-jie Lin, Hsin-hsi Chen, Che-chia Liu, Jin-he Tsai and Hong-jia Wong. 2001. Open Domain Question Answering on Heterogeneous Data. In Proceedings of ACL Workshop on Human Language Technology and Knowledge Management.
    [18] Katsiaryna Stalpouskaya and Christian Baden. 2015. To Do or Not to Do: the Role of Agendas for Action in Analyzing News Coverage of Violent Conflict. In Proceedings of the First Workshop on Computing News Storylines, pages 21–29.
    [19] Viet Ha-Thuc, Yelena Mejova, Christopher Harris and Padmini Srinivasan. 2009. A relevance-based topic model for news event tracking. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pages 764-765.
    [20] Chen Lin, Dingding Wang , Chun Lin , Yang Chen , Jingxuan Li and Tao Li. Generating Event Storylines from Microblogs. In CKIM, 2012.
    [21] Peng Zhang , Wenjie Li , Furu Wei , Qin Lu and Yuexian Hou. 2008. Exploiting the role of position feature in chinese relation extraction. In In Proceedings of LREC, 2008.
    [22] Lifu Huang and Lian’en Huang. 2013. Optimized Event Storyline Generation based on Mixture-Event-Aspect Model. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 726–735.
    [23] Amr Ahmed, Qirong Ho, Jacob Eisenstein, Eric Xing, Alexander J Smola, and Choon Hui Teo. 2011. Unified analysis of streaming news. In WWW, pages
    267–276.
    [24] Egoitz Laparra, Itziar Aldabe and German Rigau. 2015. From TimeLines to StoryLines: A preliminary proposal for evaluating narratives. In ACL-IJCNLP 2015.
    [25] Emma Barker and Robert Gaizauskas. 2016. Summarizing Multi-Party Argumentative Conversations in Reader Comment on News. In Proceedings of the Third Workshop on Argument Mining, ACL 2016.
    [26] Enrique Alfonseca, Daniele Pighin, and Guillermo Garrido. 2013. Heady:
    News headline abstraction through event pattern clustering. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pages 1243–1253.
    [27] Yu Hong, Jianfeng Zhang, Bin Ma, Jianmin Yao, Guodong Zhou, and Qiaoming Zhu. 2011. Using cross-entity inference to improve event extraction. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 1127–1136. Association for Computational Linguistics.
    [28] Joint event extraction via structured prediction with global features. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics.
    [29] Xavier Tannier and V´eronique Moriceau. 2013. Building event threads out of multiple news articles. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing EMNLP.
    [30] Neil McIntyre and Mirella Lapata. 2009. Learning to tell tales: A data-driven approach to story generation. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pages 217–225, Singapore.
    [31] Nathanael Chambers. 2013. Event schema induction with a probabilistic entity-driven model. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing.
    [32] David McClosky, Mihai Surdeanu, and Christopher D Manning. 2011. Event extraction as dependency parsing. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL), pages 1626–1635. Association for Computational Linguistics.
    [33] Shasha Liao and Ralph Grishman. 2010. Using document level cross-event inference to improve event extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL), pages 789–797. Association for Computational Linguistics.
    [34] Lev Ratinov and Dan Roth. 2009. Design challenges and misconceptions in named entity recognition. In Proceedings of the Thirteenth Conference on Computational Natural Language Learning, pages 147–155. Association for Computational Linguistics.
    [35] Hoifung Poon and Pedro Domingos. 2009. Unsupervised semantic parsing.
    In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1, pages 1–10. Association for Computational Linguistics.
    [36] Mike Mintz, Steven Bills, Rion Snow, and Dan Jurafsky. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2, pages 1003–1011. Association for Computational Linguistics.
    [37] Peng Li, Jing Jiang, and Yinglin Wang. 2010. Generating templates of entity summaries with an entity-aspect model and pattern mining. In Proceedings of the 48th annual meeting of the Association for Computational Linguistics, pages 640–649. ACL.
    [38] Jonathan Berant, Andrew Chou, Roy Frostig, and Percy Liang. 2013. Semantic parsing on Freebase from question-answer pairs. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1533–1544, Seattle, Washington, USA, October. Association for Computational Linguistics.
    [39] Xuchen Yao and Benjamin Van Durme. 2014. Information extraction over structured data: Question answering with freebase. In Proceedings of the 52nd
    Annual Meeting of the Association for Computational Linguistics, pages 956–966, Baltimore, Maryland, June. Association for Computational Linguistics.
    [40] Jason Weston, Antoine Bordes, Oksana Yakhnenko, and Nicolas Usunier. 2013. Connecting language and knowledge bases with embedding models for relation extraction. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1366–1371, Seattle, Washington, USA, October. Association for Computational Linguistics.

    無法下載圖示 校內:2022-07-25公開
    校外:不公開
    電子論文尚未授權公開,紙本請查館藏目錄
    QR CODE