研究生: |
朱彥侖 Chu, Yen-Lun |
---|---|
論文名稱: |
可全句查詢之學術寫作搭配詞偵錯及推薦系統 Sentence-wide Collocation Recommendation System With Error Detection For Academic Writing |
指導教授: |
王宗一
Wang, Tzone-I |
學位類別: |
碩士 Master |
系所名稱: |
工學院 - 工程科學系 Department of Engineering Science |
論文出版年: | 2014 |
畢業學年度: | 102 |
語文別: | 中文 |
論文頁數: | 75 |
中文關鍵詞: | 搭配詞 、語料庫建置 、搭配詞分析 、搭配詞查詢系統 |
外文關鍵詞: | Collocation, corpus build, collocation analysis, collocation system |
相關次數: | 點閱:165 下載:1 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
英文研究論文寫作在學術競爭激烈的今日十分重要。在學術寫作中,使用正確且恰當的搭配組合,不只可以將寫作內容的句子銜接得當,還可以精確地表達寫作者的想法,幫助閱讀者理解文意,達到分享研究成果之目的。因此,搭配詞查詢系統的發展,對於提升英文寫作品質、正確使用措辭用字尤有助益。
然而,在不同領域中,對於論文寫作的用字遺詞、寫作風格皆不盡相同,因此學術寫作與一般寫作在搭配詞的使用上,亦略有差異。另外,現有的搭配詞推薦系統多以關鍵字來查詢結果,但搭配詞往往在句子中扮演著表達意義的重要角色且一個句子中通常會存在多個搭配詞組,寫作者並不容易選用正確的關鍵字來查詢搭配詞。
故本研究,以輔助學術論文寫作為目的,蒐集大量專業及高品質的學術文獻建置語料庫並利用自然語言處理技術開發一學術搭配詞推薦系統。並且,本系統亦允許使用者以自然語句輸入,系統可自動偵測搭配詞的位置及種類並依據語意推薦其他的同義搭配詞。在實驗中,可全句查詢之學術寫作搭配詞偵錯及推薦系統的首位搭配詞,其有效性高達73.1%。撰寫者將可參考本系統所推薦的搭配詞及其頻率等資訊,輔助寫作者來選擇更適合的搭配詞或改寫句子,以減少搭配詞使用上的錯誤,進而提升論文的寫作品質。
Collocation plays an important role in English article writing. This research builds a collocation corpus for academic writing reference. Based on the collocation corpus, this research also establishes a sentence-wide collocation recommendation and error detection system with for academic writing. The corpus is built from Science Citation Index (SCI) papers and industry field thesis, which are collected and processed by a formal procedure developed in this research. The first step of the procedure uses the Stanford Parser to parse and retrieve collocations sentence by sentence from those papers and thesis. The second step classifies the retrieved collocations in different types and gathers their information to form a collocation corpus specifically for academic article writings. The use of the corpus is through a web-based collocation system interface built in this study. Distinguished from other collocation systems found on the web nowadays, this system can do full sentence collocation error detections and recommendations. After several conducted experiments, the system is proved to be able to give satisfiable feedbacks and recommendations for scientific article authors. Although the collocation corpus now is not complete enough to give the most precise results, the formal procedure can still keep enhancing the corpus and improving the system by automatically collecting articles from various fields.
Ackermann, K., & Chen, Y.-H. (2013). Developing the Academic Collocation List (ACL) – A corpusdrivenand expert-judged approach. Journal of English for Academic Purposes.
Ackermann, K., Jong, J. H. A. L. d., Kilgarriff, A., & Tugwell, D. (2011). The Pearson International Corpus of Academic English (PICAE). Proceedings of CorpusLinguistics.
Bahns, J. (1993). Lexical collocations: a contrastive view. ELT journal.
Benson, M., Benson, E., & Ilson, R. F. (1986). The BBI Combinatory Dictionary of English A guide to word combinations.
Bouma, G. (2009). Normalized (Pointwise) Mutual Informationin Collocation Extraction. Proceedings of the Biennial GSCL Conference.
Chen, Y.-C., Yen, T.-X., & CHang, J. S. (2012). Associating Collocations with WordNet Senses Using Hybrid Models. Proceedings of the Twenty-Fourth Conference on Computational Linguistics and Speech Processing.
Church, K. W., & Hanks, P. (1990). WORD ASSOCIATION NORMS, MUTUAL INFORMATION, AND LEXICOGRAPHY.
Clear, J. (1995). T-SCORE AND MUTUAL INFORMATION SCORE FROM BIRMINGHAM CORPUS WEBSITE. from http://lingua.mtsu.edu/chinese-computing/docs/tscore.html
Davies, M. (2008). The Corpus of Contemporary American English: 450 million words, 1990–present., from http://corpus.byu.edu/coca/
Futagi, Y., Deane, P., Chodorow, M., & JoelTetreault. (2008). A computational approach to detecting collocation errors in the writing of nonnativespeakers of English. Computer Assisted Language Learning.
Gao, Z.-M. (2013). Automatic Identification of English Collocation Errors based on Dependency Relations. 27th Pacific Asia Conference on Language, Information, and Computation, 550-555.
Jian, J.-Y., Chang, Y.-C., & Chang, J. S. (2004). TANGO: Bilingual Collocational Concordancer. Annual Conference of the Association for Computational Linguistics.
Leed, R. L., & Nakhimovsk, A. D. (1979). LEXICAL FUNCTIONS AND LANGUAGE LEARNING. The Slavic and East European Journal, 23, 104-113.
Lewis, M. (2002). Implementing the Lexical Approach. Boston: Thomson Heinle.
Li, C.-C. (2005). A Stduy Of Collocational Error Types In ESL/EFL College Learners' Writing. from https://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=3&cad=rja&uact=8&ved=0CDMQFjAC&url=http%3A%2F%2Fethesys.lib.mcu.edu.tw%2FETD-db%2FETD-search%2Fgetfile%3FURN%3Detd-0730105-205237%26filename%3Detd-0730105-205237.pdf&ei=Oi_VU6CDEM_98QWbwICoBA&usg=AFQjCNE4t1EBM0yTzjYKjLFhDs7yhoLh4w&sig2=QYlNX9fvbK_K6NtCX88ydw
Lin, J., & Wilbur, W. J. (2008). Modeling actions of PubMed users with n-gram language models. Information retrieval.
Losee, R. M. (2001). Natural Language Processing in Support of Decision-making: Phrase and Part-of-speech Tagging. Information Processing & Management, 37(6), 769-787.
Marneffe, M.-C. d., & Manning, C. D. (2013). Stanford typed dependencies manual.
Miller, G. A. (1995). WordNet: a lexical database for English. Communications of the ACM, 38(11), 39-41.
NLPlab, N. (2010). Collocation Checker. from http://miscollocation-richtrf.rhcloud.com/
Peter, H. (1998). Phraseology and Second Language Proficiency. Applied Linguistics.
Peter, W. (2006). The Porter stemming algorithm: then and now. Electronic library and information systems, 40(3), 219-223.
Press, O. U. (2002). Oxford Collocations Dictionary for Students of English.
Qingzhao, Z. (2011). Using Corpora in Autonomous Study of English Vocabulary:A Case Study of STORM. Proceedings of The 16th Conference of Pan-Pcific Association of Applied Linguistics.
reilly, N. (2013). A comparative analysis of present and past participial adjectives and their collocations In the corpus of contemporary American english (coca). from http://www.slideshare.net/NataliaReillyPhD/reilly-natalia-v201312ma
S, A., & Kaimal, R. (2012). DOCUMENT SUMMARIZATION USING POSITIVE POINTWISE MUTUAL INFORMATION. International Journal of Computer Science & Information Technology, 4.
Smadja, F. (1989). Lexical Co-occurrence: The Missing Link Journal for Literary and Linguistic Computing.
Smadja, F. (1993). Retrieving Collocations from Text: Xtract. Association for Computational Linguistics.
Smadja, F. A. (1991). From n-grams to collocations an evaluation of xtract. Proceedings of the 29th annual meeting on Association for Computational Linguistics.
Thanopoulos, A., Fakotakis, N., & Kokkinakis, G. (2002). Comparative Evaluation of Collocation Extraction Metrics The International Conference on Language Resources and Evaluation.
Wu, J.-C., Chang, Y.-C., Mitamura, T., & Chang, J. S. (2010). Automatic Collocation Suggestion in Academic Writing. Proceedings of the ACL 2010 Conference, 115-119.
Zhang, X. (1993). English collocations and their effect on the writing of native and non-nativeCollegeFreshmen. Unpublished doctoral dissertation, Indiana University of Pennsylvania,Pennsylvania.
教育部高教司. (2004). 大學校務評鑑規劃與實施計畫-評鑑手冊. fromhttp://academic.ntou.edu.tw/service/dia/ntou/book1.pdf
廖柏森. (2008). 英文研究論文寫作搭配詞指引.