簡易檢索 / 詳目顯示

研究生: 洪偉傑
HUNG, WEI-CHIEH
論文名稱: 航空安全系統大數據解析之初步探究
Preliminary Study on Aviation Safety System using Big Data Analytics
指導教授: 林清一
Lin, Chin E.
學位類別: 碩士
Master
系所名稱: 工學院 - 民航研究所
Institute of Civil Aviation
論文出版年: 2016
畢業學年度: 104
語文別: 英文
論文頁數: 56
中文關鍵詞: 大數據解析引擎壽命預測航空數據
外文關鍵詞: Big data analytic, Engine life cycle, Aviation data
相關次數: 點閱:123下載:10
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 大數據解析已成近年來相當熱門的話題,各領域皆試著把新思維帶入企業中,以提升營收或是找到潛在模式。大數據解析的基礎建立於大量、多樣化、快速產生的數據量以及其準確性,並且需要搜集、分析、應用才能展現它的價值。航空業對於數據產生、保存上更是鉅細靡遺,被法規和相關手冊嚴格規範,若能更妥善利用這些數據,將會使得航空業成為更安全、更有效率的產業。
    基於CRISP-DM(Cross Industry Standard Process for Data Mining),本論文將這套已發展成熟並廣泛利用的架構利用於航空業,初步探討大數據解析之思維。由於航空業瞬息萬變,數據解析又是針對個案研究,一套通用的程序可以讓分析師面對各種狀況時具備數據解析思維。本論文以引擎壽命預測做為範例,將引擎感測器產生的數據進入數據解析程序並探討,找出其隱藏價值。
    人類的判斷力與直覺,配合數據解析的客觀性,將成為強而有力的洞見。

    Big data analysis has become a very popular topic recently. In various fields are trying to bring the new insight into their enterprise for increasing revenue or finding potential patterns. Furthermore the airline can seek new way to serve the marketplace and increase their profit, balancing both safety and revenue. Basis for big data analysis is on the volume, variety, velocity, and veracity of data. It needs to be well collected, analyzed, and applied that can reveal its value. For aviation, there is a strict regulation for collecting, preserving data, but most of the data are not well used.
    In this study, based on CRISP-DM, a robust and well-proven data mining methodology, as the fundamental concept to create a data analytic processing which is suitable for aviation. A generic data thinking can help analysts to handle with diversified data type, a variety data source, and different analyzing software. We focus on the procedure that gives aviation analysts a logical and data thinking. The purpose of this study is to create a preliminary approach of aviation data analytic, because there is not a standard operation procedure yet. However, data analytic is a case by case study, it is impossible to have an all-powerful tool which is able to analyze every database. An engine sensor database is used in this procedure to give example for data thinking and investigating. Try to find hidden information to reveal its value.
    Bringing big data analytic into aviation safety system does not mean to exclude experts' opinion. Human work with data can generate great insights.

    摘要 I ABSTRACT II 誌謝 III List of Figures VI List of Tables VII CHAPTER I INTRODUCTION 1 1.1 Motivation 1 1.2 Main Idea 3 1.3 Scope 4 1.4 Literature survey 4 1.5 Thesis Outline 9 CHAPTER II RESEARCH BACKGROUND 10 2.1 Background 10 2.2 Data collection method 11 2.3 Aviation Data 11 2.4 Unlabeled and Labeled Data 14 2.5 Supervised, Semi-Supervised and Unsupervised Learning 15 2.6 Perspective difficulties 16 CHAPTER III METHODOLOGY 18 3.1 Basic concept of data mining 18 3.1.1 Data mining and big data 18 3.1.2 Applications of data mining 20 3.2 Basic concept of CRISP-DM 21 3.2.1 Aviation business Understanding 23 3.2.2 Data Understanding 24 3.2.3 Data Preparation 26 3.2.4 Modeling 28 3.2.5 Evaluation 29 3.2.6 Deployment 29 CHAPTER IV RESULTS AND DISCUSSION 31 4.1 Expected outcomes 31 4.2 Aviation business Understanding 32 4.3 Data Understanding 35 4.4 Data Preparation 38 4.4.1 Time series shifting 38 4.5 Modeling 45 4.6 Evaluation 47 4.7 Deployment 49 CHAPTER V CONCLUSIONS 50 REFERENCES 52

    [1] N. Graham, “Aviation Safety: Making a safe system even safer”, Air Navigation Bureau International Civil Aviation Organization Video Message, October, 2010.
    [2] FAA, Aeronautical Decision-Making, pilot's handbook of aeronautical knowledge, chapter 17, 2014.
    [3] J. Mouawad,& C. Drew, “Airline industry at its safest since the dawn of the jet age,” The New York Times Vol. 11, 2013
    [4] A. Mosleh, A. Dias, G. Eghbali, & K. Fazen," An integrated framework for identification, classification, and assessment of aviation systems hazards." Probabilistic Safety Assessment and Management. Springer London, 2004.
    [5] J. Caldwell, "Fatigue Countermeasures in Aviation". Aviation, Space, and Environmental Medicine 80 (1): pp. 29–59, 2009
    [6] V. M. Schonberger, & K. Cukier., “Big Data: A Revolution That Will Transform How We Live, Work, and Think”, “Books of The Times”, May 2013.
    [7] T. J. Wholey, IBM Global Business Services White Paper, “Commercial Aviation and Aerospace: Big Data Analytics for Advantage, Differentiation and Dollars”, December 2014.
    [8] M. Finnegan, “Boeing 787s to Create Half a Terabyte of Data Per Flight, Says Virgin Atlantic”, March, 2013.
    [9] Y. Dodge, the Oxford Dictionary of Statistical Terms, Oxford University Press, 6 edition. ISBN 0199206139, September 2006.
    [10] W. Willcox, "The Founder of Statistics", Review of the International Statistical Institute Vol. 5, No. 4, pp. 321-328, 1938.
    [11] A. Hald, a History of Mathematical Statistics, Wiley, ISBN 0471179124, 1998.
    [12] Agrawal, & K. Gopal, Biomonitoring of Water and Waste water. Springer Science & Business Media, 2013.
    [13] D. J. Rumsey, & D. Unger, U Can: Statistics for Dummies, John Wiley & Sons, Inc., pp. 31, ISBN: 978-1-119-08485-3, July 2015.
    [14] W. J. Frawley, G. P. Shapiro, & C. J. Matheus, "Knowledge discovery in databases: An overview.” AI magazine 13. 3: 57, 1992.
    [15] G. P. Shapiro, "Knowledge discovery in real databases: A report on the IJCAI-89 Workshop." AI magazine 11.4: 68, 1990
    [16] U. Fayyad et al., "From data mining to knowledge discovery in databases”, AI magazine 17.3: 37, 1996.
    [17] Shearer, “the CRISP-DM model: the new blueprint for data mining”, Journal of Data Warehousing, Volume 5 Number 4, 2000.
    [18] G. P. Shapiro, “ CRISP-DM, still the top methodology for analytics, data mining, or data science projects”, available in April 2016 from website: http://www.kdnuggets.com/2014/10/crisp-dm-top-methodology-analytics-data-mining-data-science-projects.html
    [19] S. Ayhan, J. Pesce, P. Comitz, D. Sweet, S. Bliesner, & G. Gerberick," Predictive analytics with aviation big data." Integrated Communications, Navigation and Surveillance Conference, ICNS, 2013.
    [20] Computer Sciences Corporation Federal Sector, “Aircraft Situation Display to Industry: Functional Description and Interface Control Document for the XML Version.” Version 1.8, Available from website in April 206: http://www.fly.faa.gov/ASDI/asdi.html, 2011.
    [21] J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, & A. H. Byers, “Big Data: The next frontier for innovation, competition and productivity”, Technical report, McKinsey Global Institute, May 2011.
    [22] V. Dhar, “Data Science and Prediction”, Communications of the ACM, vol. 56, no. 12, December 2013.
    [23] M.G. Siegler, “Eric Schmidt: Every 2 Days We Create As Much Information As We Did Up To 2003”,Techcrunch Newspaper, Aug 4, 2010.
    [24] M. Schroeck, R. Shockley, J. Smart, D. Romero-Morales, & P. Tufano, “Analytics: The real-world use of big data”, IBM Global Business Services Business Analytics and Optimization Executive Report, October 2012.
    [25] Z. Nazeri, E. Bloedorn, & P. Ostwald, "Experiences in mining aviation safety data." ACM SIGMOD Record, Vol. 30. No. 2. ACM, 2001.
    [26] The INTEL IT Center, “Big Data Visualization: Turning Big Data Into Big Insights”, March 2013. Available from website: http://www.intel.com/content/www/us/en/big-data/big-data-visualization-turning-big-data-into-big-insights.html
    [27] Rockwell Collins, Aircraft Communications Addressing and Reporting System (ACARS), available from website in April 2016: https://www.rockwellcollins.com/Services_and_Support/Information_Management/~/media/DA843DB0792946C58740F613328E5022.ashx
    [28] Pagels, "Aviation Data Mining." Scholarly Horizons: University of Minnesota, Morris Undergraduate Volume 2 Issue 1 Article 3, 2015.
    [29] Trewartha, "Investigating data mining in MATLAB" , Bachelor Dissertation, Department of Science, Rhodes University, 2006.
    [30] X. ZHU, “Semi-supervised learning literature survey”, Computer Sciences TR 1530, University of Wisconsin, 2005.
    [31] Example of supervised and semi-supervised learning in binary classification problems, Available from website: http://bioinformatics.oxfordjournals.org/content/24/6/783/F1.expansion.html
    [32] Persing, & V. Ng, "Semi-supervised cause identification from aviation safety reports", Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Vol. 2, 2009.
    [33] T. De Bie, T. T. Maia, & A. de Pádua Braga, "Machine Learning with Labeled and Unlabeled Data.", ESANN, 2009.
    [34] M. S. Chen, J. Han, & P. S. Yu, "Data mining: an overview from a database perspective." Knowledge and data Engineering, IEEE Transactions pp.866-883, 1996.
    [35] Harris, E. Bloedorn, & N. Rothleder, “Recent Experiences with Data Mining in Aviation Safety.” Special Interest Group on Management of Data, Data Mining and Knowledge Discovery (SIGMOD-DMKD) Workshop, 1998.
    [36] S. Brin, R. Motwani, & C. Silverstein, "Beyond market baskets: Generalizing association rules to correlations." ACM SIGMOD Record. Vol. 26. No. 2. ACM, 1997.
    [37] J. Widom, "Research problems in data warehousing." Proceedings of the fourth international conference on Information and knowledge management. ACM, 1995.
    [38] S. T. Gahane, “The Conceptual Overview: Challenges and Opportunities with “BIG DATA””, International Journal of IT, Engineering and Applied Sciences Research (IJIEASR) Volume 2, No. 9, pp.23-28, September 2013.
    [39] T. J. Wholey, IBM Global Business Services White Paper, “Commercial Aviation and Aerospace: Big Data Analytics for Advantage, Differentiation and Dollars.” December 2014. Available from website: http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?infotype=SA&subtype=WH&htmlfid=GBW03316USEN
    [40] D. Al-Jumeily, A. Hussain, C. Mallucci, & C. Oliver, Applied Computing in Medicine and Health, Morgan Kaufmann, pp.110, 2015.
    [41] P. Chapman, J. Clinton, R. Kerber, T. Khabaza, T. Reinartz, C. Shearer, & R. Wirth ,"CRISP-DM 1.0 Step-by-step data mining guide." 2000.
    [42] Harper & S. D. Pickett, “Methods for mining HTS data”, Drug Discovery Today, Volume 11, Numbers 15/16, pp. 694-699, 2006.
    [43] Holmes, A. Donkin, & I. H. Witten, “Weka: A machine learning workbench”, Intelligent Information Systems, 1994. Proceedings of the 1994 Second Australian and New Zealand Conference, 1994.
    [44] T. Wang, J. Yu, D. Siegel, & J. Lee, “A similarity-based prognostics approach for remaining useful life estimation of engineered systems. In Prognostics and Health Management. ”, International Conference on, IEEE, pp. 1-6, 2008.
    [45] R. Kohavi, " A study of cross-validation and bootstrap for accuracy estimation and model selection.", Ijcai, Vol. 14, No. 2, 1995.

    下載圖示 校內:立即公開
    校外:2018-07-04公開
    QR CODE