| Graduate student: | 林聖軒 Lin, Sheng-Xuan |
|---|---|
| Thesis title: | 基於雙向觀點及主題資訊之立場偵測 Bidirectional Perspective with Topic Information for Stance Detection |
| Advisor: | 高宏宇 Kao, Hung-Yu |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science, Department of Computer Science and Information Engineering |
| Year of publication: | 2020 |
| Academic year of graduation: | 108 |
| Language: | English |
| Number of pages: | 47 |
| Keywords (Chinese): | 立場偵測、雙向觀點、預訓練語言模型、主題模型 |
| Keywords (English): | Stance detection, bidirectional perspective, pre-trained model, topic model |
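Because the Internet makes sharing so easy, many social networking sites and online news outlets spread large amounts of false news. Such fake news can cause panic and unrest in society or serve political purposes. Automatic fake news detection can classify fake news quickly and help society establish whether information is true as events unfold, without lengthy and laborious manual checking. Judging the veracity of news by analyzing the stance of the public or of news reports is currently one of the mainstream approaches. Stance detection has therefore become a research area of growing interest in recent years, and detecting stance accurately has become a primary goal in fake news detection.
The goal of this research is to detect the stance of news accurately in order to guide the identification of fake news. This thesis proposes a stance detection network based on the pre-trained BERT language model. BERT is trained with unsupervised learning on external corpora and thereby acquires general language knowledge. In recent years, BERT-based transfer learning methods have been widely used and have achieved excellent results, and downstream tasks benefit from the prior language knowledge learned by the pre-trained model. Most previous methods use inference information from only a single direction when classifying stance, which may miss important information. We therefore propose a bidirectional inference stance detection model that uses information from both perspectives to form a more comprehensive basis for stance classification. Finally, we define stance detection as a hierarchically structured task and use a hierarchical classification system, together with the topic information of the text, to help classify stance. Experimental results show that our model classifies stance more accurately.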
Because of the convenience of the Internet, many websites and online news outlets spread misinformation, causing panic and trepidation in society. Automatic fake news detection can classify fake news and help society determine whether information is true or false without manual checking. Detecting fake news by analyzing stance is one of the mainstream methods, and stance detection has become a popular research field in recent years. Accurately detecting stance has therefore become a primary goal in fake news detection.
This research aims to detect the stance of news accurately, and we propose a method based on the pre-trained BERT language model. Most previous work uses knowledge from only a single inference direction when classifying stance, which may lose important information. We therefore propose a bidirectional inference stance detection model that leverages bidirectional perspective information to classify stance more comprehensively. We also define stance detection as a hierarchically structured task, and we use hierarchical classification and incorporate topic information to help stance classification. Experimental results show that our model classifies stance more accurately.
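
The bidirectional and hierarchical ideas described above can be illustrated with a small sketch. It is not the thesis's implementation: the abstract does not specify how the two directions are encoded or fused, or how the hierarchy is defined. The sketch assumes that the two directions correspond to the two orderings of a BERT-style sentence pair, that the per-direction distributions are fused by simple averaging, and that the hierarchy first separates related from unrelated pairs and then picks a fine-grained stance. The label names and every function name here are hypothetical, and the topic-information component is left out.

```python
from typing import List, Tuple

# Assumed fine-grained label set; the abstract does not list the labels.
FINE_LABELS = ["agree", "disagree", "discuss"]


def build_bidirectional_inputs(text_a: str, text_b: str) -> Tuple[str, str]:
    """Return the same text pair in both orders, in BERT sentence-pair layout."""
    forward = f"[CLS] {text_a} [SEP] {text_b} [SEP]"
    backward = f"[CLS] {text_b} [SEP] {text_a} [SEP]"
    return forward, backward


def combine_directions(p_forward: List[float], p_backward: List[float]) -> List[float]:
    """Fuse the two per-direction distributions by averaging (an assumed rule)."""
    if len(p_forward) != len(p_backward):
        raise ValueError("both directions must score the same label set")
    return [(f + b) / 2.0 for f, b in zip(p_forward, p_backward)]


def hierarchical_stance(p_related: float, p_fine: List[float],
                        threshold: float = 0.5) -> str:
    """Stage 1 decides related vs. unrelated; stage 2 picks the stance if related."""
    if p_related < threshold:
        return "unrelated"
    best = max(range(len(FINE_LABELS)), key=lambda i: p_fine[i])
    return FINE_LABELS[best]


if __name__ == "__main__":
    fwd, bwd = build_bidirectional_inputs("headline text", "body text")
    print(fwd)
    print(bwd)
    # Placeholder probabilities standing in for real model outputs.
    fused = combine_directions([0.6, 0.3, 0.1], [0.4, 0.3, 0.3])
    print(hierarchical_stance(p_related=0.9, p_fine=fused))
```

In a real system the placeholder probabilities would come from a fine-tuned classifier applied to each ordering. Averaging is only one possible fusion rule; concatenating the two encodings before a shared classification layer would be another.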