簡易檢索 / 詳目顯示

研究生: 李美鳳
Lee, Mei-Feng
論文名稱: 基於動態群集串聯的概念推論:以網路貼文為應用案例
Dynamic Clustering Combination for Concept Inference in Online Messages
指導教授: 鄧維光
Teng, Wei-Guang
學位類別: 碩士
Master
系所名稱: 工學院 - 工程科學系碩士在職專班
Department of Engineering Science (on the job class)
論文出版年: 2020
畢業學年度: 108
語文別: 中文
論文頁數: 56
中文關鍵詞: 概念推論概念聯想動態群集群集串聯
外文關鍵詞: concept inference, concept association, dynamic clustering, cluster combination
相關次數: 點閱:154下載:12
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 人類在學習語言與知識過程中,建構與推論概念間的關聯是個很重要的步驟,本論文藉由觀察到幼兒學習知識概念的方式,透過本身具備的已知但少量的概念詞彙推論未知概念,得到建構概念詞彙推論流程與推論模組的靈感,進而提出三個研究問題面向進行探索,分別為:(1)概念萃取的可行性、(2)概念推論方法的適切性、(3)捕捉概念變異。依據研究問題面向的探索,本研究設計出一概念詞彙推論系統與推論邏輯對應方法,達到以運算方式來建構出類似人們對於概念詞彙聯想,延伸與跳躍等多樣化的概念詞彙。由實驗的案例可以發現,推論邏輯方法以評估方法所呈現的數值分析結果,推論平均成效為本研究提出的動態群集串聯方法(Dynamic Clustering Combination)與資訊檢索方法優於一般群集法,但若以文字概念詞彙分析結果而言,動態群集串聯方法更能夠表現出本研究目標的成果。

    In the learning process of language and knowledge, the construction and inference of the relations of concepts are very important steps. This paper is motivated by observing the inferences process from that children have their own, few and known conceptual vocabulary to generate unknown concepts. Then, three aspects are included to explore this conceptual vocabulary inference module. They are (1) the feasibility of concept extraction, (2) the appropriateness of concept inference methods, and (3) the capturing of concept variations. This research has conducted a concept vocabulary inference system and inference logic correspondence method to achieve the construction of a variety of conceptual vocabulary similar to people's association, extension and jumping of conceptual vocabulary by a computational approach. In experimental results, the average performance of concept inference of the proposed Dynamic Clustering Combination and the retrieval-based combination approaches are better than the ones of general clustering methods. Moreover, our proposed approach can reach the practical diversity of concept extension.

    摘要 i Extended Abstract ii 誌謝 xi 目錄 xii 表目錄 xiv 圖目錄 xv 第一章 緒論 1 1.1 研究背景與動機 1 1.2 研究目標 8 第二章 文獻探討 11 2.1 自然語言處理 12 2.2 資料分群技術-K平均演算法 14 第三章 動態群集概念推論系統設計與方法 16 3.1 系統設計 16 3.2 系統架構 17 3.3 歸納模組 19 3.4 推論模組 21 第四章 實驗與評估 25 4.1 資料集 25 4.2 評估方法 28 4.3 實驗設計 30 4.4 實驗結果 31 第五章 個案與概念詞彙分析 41 第六章 結論 50 參考文獻 52

    [1] R. Kuhn, R. De Mori, “A Cache-based Natural Language Model for Speech Recognition,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 12, no. 6, pp. 570-583, 1990.
    [2] J. Andreas, A. Vlachos, S. Clark, “Semantic Parsing as Machine Translation,” Association for Computational Linguistics, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics , Volume 2: Short Papers, pp. 47-52, 2013.
    [3] V. Pham, et al., “Dropout Improves Recurrent Neural Networks for Handwriting Recognition,” Proceedings of the 14th International Conference on Frontiers in Handwriting Recognition, 2014.
    [4] E. Cambria, B.White, “Jumping NLP Curves: A Review of Natural Language Processing Research,” IEEE ComputatIonal Intellgence Magazine, pp. 48-57, 2014.
    [5] Wikipedia contributors. “林奕含. ” 維基百科, 自由的百科全書. Accessed on 15 Jun., 2015 [Internet]. Available: https://zh.wikipedia.org/wiki/%E6%9E%97%E5%A5%95%E5%90%AB
    [6] G. Rickheit, W. Schnotz, H. Strohner, “The Concept of Inference in Discourse Comprehension,” Advances in Psychology, pp. 3-49, 1985.
    [7] S. Hinduja, JW. Patchin, “Bullying, Cyberbullying, and Suicide,” Journal of Archives of Suicide Research, vol. 14, no. 3, pp. 206-221, 2010.
    [8] R. Slonje, A. Frisen, A. Frisen, P. K Smith, “The Nature of Cyberbullying, and Strategies for Prevention,” Computers in Human Behavior, 2012
    [9] G. Kwan, M. Skoric, “Facebook Bullying: An Extension of Battles in School,” Computers in Human Behavior, vol. 29, no. 1, pp. 16-25, 2013.
    [10] S. Menini, G. Moretti, et al., “A System to Monitor Cyberbullying Based on Message Classification and Social Network Analysis,” Proceedings of the Third Workshop on Abusive Language Online, pages 105–110, 2019.
    [11] W. He, S. Zhab, L. Lia, “Social Media Competitive Analysis and Text Mining: A Case Study in the Pizza Industry,” International Journal of Information Management, vol. 33, no. 3, pp. 464-472, 2013.
    [12] M. Mostafa, “More than Words: Social Networks’ Text Mining for Consumer Brand Sentiments,” Expert Systems with Applications, vol. 40, pp. 4241-4251, 2013.
    [13] R. Lu, H. Tsao, et al., “Sentiment Analysis of Brand Personality Positioning Through Text Mining,” Proceedings of Journal of Information Technology Research(JITR), vol. 12, 2019.
    [14] K. Shu, A. Sliva, S. Wang, et al., “Fake News Detection on Social Media: A Data Mining Perspective,” Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD) Explorations, vol. 19, no. 1, pp. 22-36, 2016.
    [15] H. Allcott, M. Gentzkow, “Social Media and Fake News in the 2016 Election,” Proceedings of Journal of Economic Perspectives, vol. 31, pp. 211-236, 2017.
    [16] L. Wang, Y. Wang, G. de Melo, G. Weikum, "Five Shades of Untruth: Finer-Grained Classification of Fake News," IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 593-594, 2018.
    [17] S. Kai, W. Suhang, L. Huan, “Beyond News Contents: The Role of Social Context for Fake News Detection,” Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining(WSDM), pp.312-320, 2019.
    [18] M. Halvey, M. Keane , “An Assessment of Tag Presentation Techniques,” Proceedings of the international conference on World Wide Web (WWW), pp.1313-1314, 2007.
    [19] M. Allahyari, S. Pouriyeh, et al., “A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques,” Proceedings of Knowledge Discovery and Data Mining(KDD) Bigdas, 2017.
    [20] M. Hearst, “Untangling Text Data Mining,” Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics (ACL), pp.3-10 , 1999.
    [21] U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, “From Data Mining to Knowledge Discovery in Databases”, AIMagazine, vol. 17, no. 3, p. 37, 1996.
    [22] U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, “Knowledge Discovery and Data Mining: Towards a Unifying Framework,” Proceedings of Knowledge Discovery and Data Mining(KDD), vol. 96, pp. 82-88, 1996.
    [23] W. J. Frawley, G. Piatetsky-Shapiro, and C. J. Matheus, “Knowledge Discovery in Databases: An Overview”, AIMag, vol. 13, no. 3, p. 57, 1992.
    [24] Z. S. Harris,” Distributional Structure”, Proceedings of Journal of WORD, vol.10, pp.146-162, 1954.
    [25] Y. Fu, Y Feng, and J. Cunningham,”Paraphrase Generation with Latent Bag of Words”, Neural Information Processing Systems, 2019.
    [26] E. Cambria and A. Hussain, “Sentic Computing: Techniques, Tools, and Applications”, SpringerBriefs in Cognitive Computation book series,volume 2, 2012.
    [27] M. Wrzalik and D. Krechel, "Balanced Word Clusters for Interpretable Document Representation," International Conference on Document Analysis and Recognition Workshops (ICDARW), pp. 103-109, 2019.
    [28] P. Li, K. Mao, Y. Xu, Q. Li, J. Zhang, “Bag-of-Concepts representation for document classification based on automatic knowledge acquisition from probabilistic knowledge base”, Knowledge-Based Systems, 2020.
    [29] C. Snigdha, P. Haoruo, R. Dan, “Story Comprehension for Predicting What Happens Next,” Proceedings of the Conference on Empirical Methods in Natural Language Processing(EMNLP)", pp. 1603-1614, 2017.
    [30] F. Michael, S. Swapna, “Sentiment Analysis and Lexical Cohesion for the Story Cloze Task,” Proceedings of the 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics, pp. 62-67, 2017.
    [31] Sun Junyi, ”jieba中文分詞”. Accessed on Sep. 23, 2015[Online]. Available: https://github.com/fxsjy/jieba
    [32] W. Ma, K. Chen, “A bottom-up merging algorithm for Chinese unknown word extraction,” Proceedings of the second SIGHAN workshop on Chinese language processing, pp.31-38, 2003.
    [33] S. P. Mehdi Allahyari, M. Assefi, S. Safaei, et al., “A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques,” Proceedings of Knowledge Discovery and Data Mining(KDD) Bigdas, 2017.
    [34] H. Kim, H. Kyul Kim, S. Cho, “Rening Initial Points for K-Means Clustering,” Proceedings of the Fifteenth International Conference on Machine Learning(ICML), pp. 91-99, 1998.
    [35] G. Hamerly, C. Elkan, “Alternatives to the K-means Algorithm that Find Better Clusterings,” Proceedings of the eleventh international conference on Information and knowledge management (CIKM), 2002.
    [36] D. Lin, X. Wu, “Phrase Clustering for Discriminative Learning,” Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, vol. 2, pp. 1030-1038, 2009.
    [37] C. Manning, P. Raghavan, H. Schutze, “Introduction to Information Retrieval,” Cambridge University Press, 2008.
    [38] K. Chan, W. Ke, and S. Im, “Probability K-means Clustering for Neural Network Architecture,” Proceedings of International Conference on Advances in Artificial Intelligence (ICAAI), pp. 52-57, 2019.
    [39] M. Rana1, R Afrin, et al., “Concept Extraction from Ambiguous Text Document using K-means”, International Research Journal of Engineering and Technology (IRJET), vol 6, pp. 5317-5330, 2019.

    下載圖示 校內:2021-01-02公開
    校外:2021-01-02公開
    QR CODE