簡易檢索 / 詳目顯示

研究生: 游士誼
Yu, Scott
論文名稱: 基於使用者目的偵測之網路短語查詢的改善
Improving Short-query Web Search Based on User Goal Identification
指導教授: 盧文祥
Lu, Wen-Hsiang
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2006
畢業學年度: 94
語文別: 中文
論文頁數: 52
中文關鍵詞: 使用者目的使用者目的偵測全球資訊網網路搜尋機率模型
外文關鍵詞: probabilistic model, Web search, user goal identification, User goal, World Wide Web
相關次數: 點閱:128下載:1
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 隨著全球網際網路的蓬勃發展,使用者可以輕鬆的透過搜尋引擎,得到全球豐富的資訊。搜尋引擎根據使用者的查詢詞,與已收集的網頁比對關鍵字,最後搜尋引擎依照其制訂的排名方式,將相關網址列出供使用者點選。然而,此情形將會導致搜尋引擎找出大量的文件,而增加使用者的負擔。因此,本篇論文藉由使用者下達的查詢詞所得到的搜尋結果,透過語法結構,使用動詞與名詞的組合,發展使用者目的偵測之機率模型,可以自動地產生使用者背後真正的目的──使用者目的,最後再利用使用者目的對原始查詢詞的搜尋結果進行重新排名,以減低使用者的負擔。

    As the Internet grows quickly, users can easily get rich information from the World Wide Web. Search engines use keywords in users’ queries to match the pages they collected. Then these engines sort related URLs and list search results by their ranking method to let user click them. However, a great deal of search results provided by search engines would add users’ load of reading information. This paper uses search results and the pair of verb and noun occurring in search results to identify user goals. We propose a probabilistic model of user goal identification in order to automatically find the needs behind users (user goals), and then re-rank the search results based on user goal identification.

         目錄 …………………………………………………………………iii 圖目錄 …………………………………………………………………v 表目錄 ………………………………………………………………vii 第一章 緒論 …………………………………………………………1 1.1 研究背景…………………………………………………………1 1.2 研究動機…………………………………………………………1 1.3 環境與問題描述…………………………………………………2 1.4 研究貢獻…………………………………………………………9 1.5 論文架構…………………………………………………………9 第二章 相關研究……………………………………………………10 2.1 搜尋研究議題 …………………………………………………10 2.2 使用者目的分析 ………………………………………………12 2.3 個人化搜尋 ……………………………………………………13 第三章 使用者目的偵測與搜尋結果重新排名……………………15 3.1 使用者目的偵測方法 …………………………………………15 3.2 基本機率模型 …………………………………………………17 3.3 獨立式機率模型 ………………………………………………20 3.4 搜尋結果重新排名 ……………………………………………22 第四章 實驗分析……………………………………………………25 4.1 實驗資料和評估方法 …………………………………………25 4.1.1 視窗範圍分析 ………………………………………………26 4.1.2 使用者目的偵測模型比較 …………………………………28 4.1.3 依詞性類別分析使用者目的 ………………………………31 4.1.4  瀏覽型與非瀏覽型的比較…………………………………33 4.2 搜尋結果重新排名分析 ………………………………………36 第五章 結論與未來研究方向………………………………………44 5.1 結論 ……………………………………………………………44 5.2 未來研究方向 …………………………………………………44 第六章 參考文獻……………………………………………………46 附錄一 動詞詞性類別標籤…………………………………………49 附錄二 名詞詞性類別標籤…………………………………………50 附錄三 高頻查詢詞列表與類別……………………………………51

    [1]  E. Balfe and B. Smyth, “Query mining for community based web search,” in Proceedings of the Web intelligence, IEEE/WIC/ACM international Conference on (Wi'04), pages 594-598, 2004.
    [2]  R. Barrett, P. P. Maglio, and D. C. Kellem, "How to personalize the Web," in Proceedings of the SIGCHI conference on Human factors in computing systems, pages 78-52, 1997.
    [3]  A. Board, “A taxonomy of web search,” SIGIR Forum, vol. 36, issue 2, pages 3-10, 2002.
    [4]  O. Buyukkokten, J. Cho, H. Garcia-Molina, L. Gravano, and N. Shivakumar, “Exploiting geographical location information of web pages,” ,in Proceedings of the ACM SIGMOD Workshop on the Web and Databases (WebDB'99), June 1999.
    [5]  S. Chien and N. Immorlica, "Semantic similarity between search engine queries using temporal correlation," in Proceedings of the 14th international conference on World Wide Web, pages 2-11, 2005.
    [6]  H. Cui, J.-R. Wen, J.-Y. Nie, and W.-Y. Ma, “Probabilistic Query Expansion Using Query Logs,” in Proceedings of the 11th International Conference on World Wide Web, pages 325–332, 2002.
    [7]  J. Ding, L. Gravano, and N. Shivakumar, “Computing Geographical Scopes of web Resources,” in Proceedings of the 26th International Conference on Very Large Data Bases, pages 545–556, 2000.
    [8]  L. Gravano, and V. Hatzivassiloglou, and R. Lichtenstein, “Categorizing Web Queries According to Geographical Locality,” in Proceedings of the 12th international Conference on information and Knowledge Management (CIKM’04), pages 325-333, 2003.
    [9]  Ş. Gündüz and M. T. Özsu, "A Web page prediction model based on click-stream tree representation of user behavior," in Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 535-540, 2003.
    [10] G. Jeh and J. Widom, "Scaling personalized web search," in Proceedings of the 12th international conference on World Wide Web, pages 271-279, 2003.
    [11] U. Lee, Z. Liu and J.Cho, “Automatic Identification of User Goals in Web Search,” in Proceedings of the 14th International Conference on World Wide Web, pages 391–400, 2005.
    [12] P. Maglio and R. Barrett, "Intermediaries personalize information streams," Commun. ACM, vol. 43, pp. 96-101, 2000.
    [13] L. Page, S. Brin, R. Motwani, and T. Winograd, "The pagerank citation ranking: Bringing order to the web," in Stanford Digital Libraries Working Paper, 1998.
    [14] D.E. Rose and D. Levinson, “Understanding User Goals in Web Search,” in Proceedings of the 13th International Conference on World Wide Web, pages 13–19, 2004.
    [15] C. Silverstein, M. Henzinger, H. Marais and M. Moricz, “Analysis of a Very Large Web Search Engine Query Log,” SIGIR Forum, vol. 33, issue 3, pages 6-12, 1999.
    [16] C. G. Thomas and G. Fischer, "Using agents to personalize the Web," in Proceedings of the 2nd international conference on Intelligent user interfaces, pages 53-60, 1997.
    [17] W. Wang and O. R. Zaïane, "Clustering Web sessions by sequence alignment," presented at Proceedings of the 13th international workshop on Database and Expert Systems Applications, pages 394-398, 2002.
    [18] H.J. Zeng, Q.C. He, Z. Chen, W.Y. M and J. Ma, “Learning to cluster web search results,” in Proceedings of the 27th annual international conference on Research and development in information retrieval, pages 210-217, 2004.
    [19] Q. Zhao, C.-H. Hoi, T.-Y Liu, S.S. Bhowmick, M. R. Lyu ,and W.-Y Ma, “Time-Dependent Semantic Similarity Measure of Queries Using Historical ClickThrough Data,” in Proceedings of the international conference on World Wide Web, pages 543-552, 2006.

    下載圖示 校內:2007-09-04公開
    校外:2009-09-04公開
    QR CODE