簡易檢索 / 詳目顯示

研究生: 許佑銘
Hsu, Yu-Ming
論文名稱: 群集分析於模型觀點抽樣設計之應用
Utilization of Cluster Analysis on the Sampling Selection of the Model-based Sampling Survey
指導教授: 趙昌泰
Chao, Chang-Tai
學位類別: 碩士
Master
系所名稱: 管理學院 - 統計學系
Department of Statistics
論文出版年: 2016
畢業學年度: 104
語文別: 中文
論文頁數: 54
中文關鍵詞: 群集分析最佳化抽樣策略模型觀點抽樣推論
外文關鍵詞: Cluster analysis, Optimal sampling strategy, Model-based sampling
相關次數: 點閱:176下載:5
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在有限母體的抽樣調查中,當母體單元之間具有相關性時,若欲預測感興趣的母體量,應如何適當地從N個母體單元中選取n樣本單元,以得到較低的均方預測誤差,一向是抽樣理論及應用中的基本問題。在過去所提出的最佳化抽樣策略中,雖能適當地選取樣本單元,以達到最小化均方預測誤差,但往往伴隨繁複的計算過程,並且其最佳化的過程在實用上有一定之難度,此外,過去的最佳化抽樣策略,必須先已知母體真實的分配模型,才可使用並且得到最佳化之結果。
    本論文中,基於多變量分析的群集分析方法,提出了兩種抽樣策略。在已知母體間相關性矩陣,利用相關性矩陣及群集分析的分群方法,在所給定之樣本數下,先將母體單元分群後再進行樣本選取,這兩種抽樣策略不需完整的母體模型假設,僅需要母體間的相關性矩陣,不需繁複的計算過程,即可選取最適當的樣本單元進行預測。透過模擬研究證明所提出的兩種抽樣策略,優於簡單隨機抽樣的預測結果,再利用實際資料,解釋兩種抽樣設計如何在實際資料上使用,以得到較好的預測結果。

    For the prediction problem in survey sampling under a finite population, n sampling units are selected out of N population units and observed to predict the population quantity of interest. The optimal sampling strategies proposed by different authors in the past can be used to select the optimal sample with which the mean-square error can be minimized. However, the computational load can be very extensive, and the optimization algorithm is not easy to implement. Additionally, the exact population distribution has to be assumed.
    Two model-based sampling selection methods based on Cluster Analysis under a given sample size n are proposed in this article. Both design are better than SRSWOR in terms of given lower prediction mean-square error. These sampling methods do not require extensive computation nor exact population distribution to select the sampling units. Simulation study shows that they can be more effective than SRSWOR. An example on the utilization of the proposed sampling methods in practice is also presented.

    摘要. . . . . . . . . .i 英文摘要. . . . . . . .ii 誌謝. . . . . . . . . .vi 目錄. . . . . . . . . .vii 表目錄. . . . . . . . .ix 圖目錄. . . . . . . . .x 第1 章 序論. . . . . . .1 第2 章 抽樣設計. . . . .4 第2.1 節 抽樣設計一. . . .6 第2.2 節 抽樣設計二. . . .6 第3 章 模擬研究. . . . . .8 第3.1 節資料生成. . . . .8 第3.2 節母體預測量. . . . .10 第3.3 節模擬結果. . . . . .12 第3.3.1 節高斯模型. . . . .12 第3.3.2 節對數高斯模型. . .19 第4 章實例分析. . . . . . .26 第4.1 節黃石國家公園二氧化碳排放量. . . .26 第4.2 節德州Wolfcamp 含水層. . . . . .30 第5 章問題與討論. . . . . . . . . . . .37 參考文獻. . . . . . . . . . . .38 附錄A 高斯分配模擬結果. . . . . .40 A.1 K-平均法. . . . . . . . . .40 A.2 K-物件法. . . . . . . . . .42 附錄B 對數高斯分配使用BLUP 模擬結果. . . .45 B.1 K-平均法. . . . . . . . . . .45 B.2 K-物件法. . . . . . . . . . .47 附錄C 對數高斯分配使用BUP 模擬結果. . . .50 C.1 K-平均法. . . . . . . . . . .50 C.2 K-物件法. . . . . . . . . . .52

    1. Anderson TW. (1984). An Introduction to Multivariate Statistical Analysis, 2nd ed., John Wiley & Sons, New York.
    2. Basu D. (1969). Role of the sufficiency and likelihood principles in sample survey theory., Sankhyã, A31, 441–454.
    3. Bolfarine H, Zacks S. (1992). Prediction Theory for Finite Population, Springer Verlag, New York.
    4. Brissette FP, Khalili M, Leconte R. (2007). Efficient stochastic generation of multi-site synthetic precipitation data., Journal of Hydrology, 345, 121–133.
    5. Chao CT. (2003). Markov chain Monte Carlo on adaptive sampling selections., Environmental and Ecological Statistics, 10, 129–151.
    6. Chao CT. (2004). Selection of samplings units under a correlated population based on the eigensystem of the population covariance matrix., Environmetrics, 15, 757-775.
    7. Chao CT, Thompson SK. (2001). Optimal adaptive selection of sampling sites., Environmetrics, 12, 517–538.
    8. Chartrand G., Orllermann OR. (1993), Applied and Algorithmic Graph Theory, McGraw-Hill, New york.
    9. Cressie NAC. (1993). Statistics for Spatial Data, Revised version, Wiley, New York.
    10. Flores LA, Martı´nez LI, Ferrer CM. (2003). Environmetrics, 14, 45–61.
    11. Johnson RA, Wichern DW. (1998). Applied Multivariate Statistical Analysis, 4th ed., Prentice-Hall Inc: Upper Saddle River, N.J.
    12. Mate´rn B. (1986). Spatial Variation, 2nd ed. Springer-Verlag, Berlin.
    13. Rencher AC. (1995). Methods of Multivariate Analysis, John Wiley & Sons, New York.
    14. Ripley BD. (1981). Spatial Statistics, Wiley, New York.
    15. Sacks J, Schiller S. (1988). Spatial design. In Statistical Decision Theory and Related Topics IV, Vol. 2, Gupta SS, Beregr JO (eds)., Springer, New York, 385-395.
    16. Solomon H, Zacks S. (1970). Optimal design of sampling from finite populations: a 38 critical review and indication of new research areas., Journal of the American Statistical Association, 65, 653–677.
    17. Thompson SK, Seber GAF. (1996). Adaptive Sampling, Wiley, New York.
    18. Werner C, Brantley SL, Boomer K. (2000). CO2 emissions related to the Yellowstonevolcanic system 2. Statistical sampling, total degassing, and transport mechanisms., Journal of Geophysical Research, 105(10), 831–846.
    19. Zacks S. (1969). Bayes sequential design of fixed size samples from finite population., Journal of American Statistical Association, 64, 1342–1369.

    下載圖示 校內:2021-07-31公開
    校外:2021-07-31公開
    QR CODE