| 研究生: |
楊政晃 Yang, Cheng-Huang |
|---|---|
| 論文名稱: |
推特上的競爭辨識 Competition Identification in Twitter |
| 指導教授: |
高宏宇
Kao, Hung-Yu |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 資訊工程學系 Department of Computer Science and Information Engineering |
| 論文出版年: | 2013 |
| 畢業學年度: | 101 |
| 語文別: | 英文 |
| 論文頁數: | 47 |
| 中文關鍵詞: | 推特 、社群網路 、意見探勘 、情感分析 |
| 外文關鍵詞: | Twitter, Social Network, Opinion Mining, Sentiment Analysis |
| 相關次數: | 點閱:106 下載:3 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
推特是當今最受歡迎的微網誌平台,提供了使用者透過各式各樣的方式分享資訊,除了透過推特網站之外,還可以經由手機簡訊以及手機上的應用程式來發表訊息。現在每天有超過一千四百萬個活躍使用者,產生超過三千四百萬則訊息,所以推特成為了非常好的資源用來探勘大眾意見。傳統在網路上的意見探勘專注於分析特定領域的意見,例如電影、書本或是產品;利用使用者發表對特定產品的文章當作資源,然後分析這些文章的意見。但是都僅侷限對單一物件的意見進行探勘,這些方法在即時訊息流中並無法針對不同物件間進行比較,甚至找到比較標的。
在我們的研究中,我們希望透過意見探勘找出一個物件與其他物件間的競爭關係。此研究內,我們提出了兩物件競爭關係以及相對意見的想法來做競爭的辨識。本論文建構了一個架構於推特上進行競爭辨識的系統。方法部分分成兩個階段,資料處理階段將推特資料處理成有用的資訊;而排名階段會根據提出的物件競爭圖對相對意見做分析,並且整合所有意見以用來辨識出競爭關係。
我們的研究結果顯示:相對意見有助於我們辨識出競爭關係,而且我們提出的整合意見方法也對此問題研究結果有幫助。
Twitter, one of the most popular microblogging services allow users (twitterer) to post messages (tweet) through the website interface, SMS, or a range of Apps for mobile devices. It has over 140 million active users, generating over 340 million messages per day. As a result of rapidly increasing numbers of tweets, Twitter becomes a valuable source for mining people’s opinion and sentiment. Tradition opinion mining from general contents (Blog articles, review articles) on the web focuses on specific domains such as movie, book or consumer product. They gather lots of articles regarding the particular product, and analysis the opinion in the articles. Different from mining the opinion from particular objects, we want to know whether object A can be found when mining opinions from object B, and conclude the competition between object A and object B.
In this paper, we introduce the competition relation between two objects, and propose the idea of the relative opinion to identify the competition. We construct a framework to identify the competition in Twitter. Our method consists of two stages: First, the data process stage processes the raw data from Twitter and constructs the opinion conversions. The ranking stage mines the relative opinion between objects. This stage also aggregates the opinions to identify the competition.
Our result shows that the relative opinion is a good indicator to identify the competition. The opinion aggregation algorithm we proposed can help us to improve the result of the identification.
[1] J. Bollen, H. Mao, and X.-J. Zeng, "Twitter mood predicts the stock market," CoRR, vol. abs/1010.3003, 2010.
[2] W. Chamlertwat, P. Bhattarakosol, T. Rungkasiri, and C. Haruechaiyasak, "Discovering Consumer Insight from Twitter via Sentiment Analysis," J. UCS, vol. 18, pp. 973-992, 2012.
[3] D. Davidov, O. Tsur, and A. Rappoport, "Enhanced sentiment learning using Twitter hashtags and smileys," presented at the Proceedings of the 23rd International Conference on Computational Linguistics: Posters, Beijing, China, 2010.
[4] M.-C. de Marnee and C. Manning, "Stanford typed dependencies manual."
[5] M.-C. de Marneffe, B. MacCartney, and C. D. Manning, "Generating Typed Dependency Parses from Phrase Structure Trees," presented at the LREC, 2006.
[6] K. Gimpel, N. Schneider, B. O'Connor, D. Das, D. Mills, J. Eisenstein, M. Heilman, D. Yogatama, J. Flanigan, and N. A. Smith, "Part-of-speech tagging for Twitter: annotation, features, and experiments," presented at the Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2, Portland, Oregon, 2011.
[7] A. Go, R. Bhayani, and L. Huang, "Twitter Sentiment Classification using Distant Supervision," ed.
[8] S. A. Golder and M. W. Macy, "Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures," Science, vol. 333, pp. 1878-1881, September 30, 2011 2011.
[9] M. Hu and B. Liu, "Mining and summarizing customer reviews," presented at the Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, Seattle, WA, USA, 2004.
[10] M. Hu and B. Liu, "Mining opinion features in customer reviews," presented at the Proceedings of the 19th national conference on Artifical intelligence, San Jose, California, 2004.
[11] X. Huang, X. Wan, and J. Xiao, "Learning to find comparable entities on the web," presented at the Proceedings of the 13th international conference on Web Information Systems Engineering, Paphos, Cyprus, 2012.
[12] A. Jain and P. Pantel, "How do they compare? Automatic identification of comparable entities on the Web," in Information Reuse and Integration (IRI), 2011 IEEE International Conference on, 2011, pp. 228-233.
[13] L. Jiang, M. Yu, M. Zhou, X. Liu, and T. Zhao, "Target-dependent Twitter sentiment classification," presented at the Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Portland, Oregon, 2011.
[14] S.-M. Kim and E. Hovy, "Determining the sentiment of opinions," presented at the Proceedings of the 20th international conference on Computational Linguistics, Geneva, Switzerland, 2004.
[15] B. Liu, M. Hu, and J. Cheng, "Opinion observer: analyzing and comparing opinions on the Web," presented at the Proceedings of the 14th international conference on World Wide Web, Chiba, Japan, 2005.
[16] B. O'Connor, R. Balasubramanyan, B. R. Routledge, and N. A. Smith, "From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series," in Proceedings of the International AAAI Conference on Weblogs and Social Media, ed, 2010.
[17] L. Page, S. Brin, R. Motwani, and T. Winograd, "The PageRank Citation Ranking: Bringing Order to the Web," Stanford InfoLab, Technical Report1999.
[18] A. Pak and P. Paroubek, "Twitter as a Corpus for Sentiment Analysis and Opinion Mining," presented at the Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), Valletta, Malta, 2010.
[19] B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Found. Trends Inf. Retr., vol. 2, pp. 1-135, 2008.
[20] B. Pang, L. Lee, and S. Vaithyanathan, "Thumbs up?: sentiment classification using machine learning techniques," presented at the Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10, 2002.
[21] B. Shenghua, "Competitor Mining with the Web," IEEE Transactions on Knowledge and Data Engineering, vol. 20, pp. 1297-1310, 2008.
[22] A. Tumasjan, T. O. Sprenger, P. G. Sandner, and I. M. Welpe, "Predicting elections with twitter: What 140 characters reveal about political sentiment," in Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media, ed, 2010, pp. 178-185.
[23] P. D. Turney, "Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews," presented at the Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, Philadelphia, Pennsylvania, 2002.
[24] X. Wang, F. Wei, X. Liu, M. Zhou, and M. Zhang, "Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach," in Proceedings of the 20th ACM international conference on Information and knowledge management, Glasgow, Scotland, UK, 2011, pp. 1031-1040.
[25] T. Wilson, P. Hoffmann, S. Somasundaran, J. Kessler, J. Wiebe, Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan, "OpinionFinder: a system for subjectivity analysis," presented at the Proceedings of HLT/EMNLP on Interactive Demonstrations, Vancouver, British Columbia, Canada, 2005.