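Graduate student: |
林孝忠 Lin, Hsiao-Chung |
---|---|
Thesis title: |
A Performance Comparison of Fuzzy Clustering with Volume Prototypes (群集中心具有體積之模糊分群績效比較) |
Advisor: |
吳植森
Wu, Chih-Sen |
Degree: |
Master |
Department: |
College of Management - Institute of Information Management |
Publication year: | 2004 |
Graduation academic year: | 92 (ROC calendar) |
Language: | Chinese |
Pages: | 52 |
Chinese keywords: | similarity measure, fuzzy clustering, volume prototype based fuzzy clustering algorithm, cluster merging |
English keywords: | Similarity Measure, Fuzzy Clustering, Volume Prototype Based Fuzzy Clustering Algorithm, Cluster Merging |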
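Running a business requires many kinds of data to support decision making, and analyzing data to uncover its structure and meaning is important for managers. When the distribution structure of the data is unclear, clustering algorithms from the data mining field can help discover the hidden patterns in the data and the real-world concepts they represent. In general, fuzzy sets are well suited to handling incomplete or noisy data. Objective-function-based fuzzy clustering algorithms require the number of clusters to be fixed in advance and cannot determine it automatically. This study therefore uses volume prototype based fuzzy clustering algorithms, which apply a similarity measure within the algorithm to determine the best number of clusters automatically. Cluster validity indices are then used to compare and verify the clustering results of these algorithms.
The study uses two modified fuzzy clustering algorithms, ME-FCM and ME-GK, which add a modified fuzzy covariance matrix and a cluster merging mechanism to achieve three objectives: (1) find the fuzzy partition and cluster centers that minimize the objective function; (2) determine the most appropriate number of clusters automatically during clustering, without specifying it in advance; (3) modify the fuzzy covariance matrix to reduce computational error and produce better clustering results. Empirical results show that the modified fuzzy clustering algorithms achieve better clustering results and higher classification accuracy.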
Decision making is essential to business operation, and it relies on good models built from various kinds of data. Analyzing data and finding its meaning are important tasks for knowledge workers. In the real world, data are hidden in very complicated databases, so techniques such as decision and clustering methods are needed to discover the hidden patterns and real meanings of the data. In general, fuzzy sets can be used to process incomplete and noisy data. However, objective function based fuzzy clustering algorithms require the number of clusters to be assigned at the beginning of the process. This research uses volume prototype based fuzzy clustering algorithms together with a similarity measure to determine the number of clusters automatically. In addition, we use cluster validity indices to compare the performance of volume prototype based fuzzy clustering algorithms with that of other fuzzy clustering algorithms.
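The idea of using a similarity measure to reduce the number of clusters can be sketched as follows. This is a minimal illustration, not the thesis's procedure: it assumes a fuzzy Jaccard similarity between membership rows and a fixed merge threshold, and the function names and the `threshold` parameter are hypothetical.

```python
import numpy as np

def fuzzy_jaccard_similarity(U):
    """Pairwise similarity between clusters.

    U is a (c, n) membership matrix: U[i, k] is the degree to which
    point k belongs to cluster i. Assumed measure (not from the thesis):
    sum of element-wise minima divided by sum of element-wise maxima.
    """
    c = U.shape[0]
    S = np.eye(c)
    for i in range(c):
        for j in range(i + 1, c):
            inter = np.minimum(U[i], U[j]).sum()
            union = np.maximum(U[i], U[j]).sum()
            S[i, j] = S[j, i] = inter / union if union > 0 else 0.0
    return S

def merge_most_similar(U, threshold=0.5):
    """Merge the most similar pair of clusters if it exceeds the threshold.

    Returns U unchanged when no pair is similar enough; otherwise returns
    a (c - 1, n) matrix in which the two membership rows are summed.
    """
    S = fuzzy_jaccard_similarity(U)
    np.fill_diagonal(S, -np.inf)  # ignore self-similarity
    i, j = np.unravel_index(np.argmax(S), S.shape)
    if S[i, j] < threshold:
        return U
    merged = U[i] + U[j]
    keep = [k for k in range(U.shape[0]) if k not in (i, j)]
    return np.vstack([U[keep], merged[None, :]])
```

Starting from a deliberately large number of clusters and calling `merge_most_similar` between clustering passes until the matrix stops shrinking is one way to let the data set the final cluster count.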
Two algorithms were developed in this study: ME-FCM and ME-GK. The algorithms find the fuzzy partition and cluster centers that minimize the objective function. One distinctive feature of the algorithms is that the number of clusters is determined automatically. Another is that a modified fuzzy covariance matrix is used to reduce computational error. The results show that the modified fuzzy clustering algorithms achieve better clustering results and higher classification accuracy.
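For orientation, the objective-function minimization that such algorithms build on can be sketched with standard fuzzy c-means (FCM). This is the textbook alternating update with Euclidean distance, not ME-FCM or ME-GK: it has no volume prototypes, no modified covariance matrix, and no merging step. The fuzzifier `m`, the tolerance, and the random initialization are assumptions.

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, max_iter=100, tol=1e-5, seed=0):
    """Standard FCM: alternate center and membership updates.

    X is an (n, d) data matrix and c the (fixed) number of clusters.
    Returns U of shape (c, n) and centers V of shape (c, d).
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    U = rng.random((c, n))
    U /= U.sum(axis=0, keepdims=True)  # each point's memberships sum to 1
    for _ in range(max_iter):
        W = U ** m
        V = (W @ X) / W.sum(axis=1, keepdims=True)  # weighted means
        # Squared Euclidean distance from every center to every point.
        D2 = ((X[None, :, :] - V[:, None, :]) ** 2).sum(axis=2)
        D2 = np.fmax(D2, 1e-12)  # guard against division by zero
        inv = D2 ** (-1.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=0, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return U, V
```

Note that `c` must be supplied up front here, which is exactly the limitation the abstract describes for objective function based methods.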