簡易檢索 / 詳目顯示

研究生: 游惠群
Yu, Hui-Chun
論文名稱: 隨機右設限資料的核密度估計之帶寬選擇
Bandwidth selection for kernel density estimate for randomly right-censored data
指導教授: 吳鐵肩
Wu, Tiee-Jian
學位類別: 博士
Doctor
系所名稱: 管理學院 - 統計學系
Department of Statistics
論文出版年: 2013
畢業學年度: 101
語文別: 英文
論文頁數: 64
中文關鍵詞: 收斂速度訊息界限核密度估計特徵函數設限資料帶寬選擇
外文關鍵詞: Bandwidth selection, characteristic function, censored data, convergence rate, information bound, kernel density estimation
相關次數: 點閱:111下載:3
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本文考慮在隨機樣本數為n的隨機右設限(randomly right-censored)資料下,以核密度方法估計(kernel density estimator)存活時間之機率密度函數(lifetime density)f之帶寬(bandwidth)選擇問題。此法將Chiu(1992)提出的帶寬選擇法由完整資料(complete data)推廣至隨機右設限資料。其關鍵在於修正樣本特徵函數(sample characteristic function)的高頻區,使得高頻區降低的變異比增加的偏誤更為顯著。在f與核函數(kernel function)符合特定的平滑條件下,本文提出的帶寬選擇法以最佳(root n)速度收斂至常態分佈,並合理猜測其漸近變異數達到訊息界線(information bound)。模擬研究設定了符合實務的樣本數與設限比率,結果顯示本文提出之帶寬選擇法表現優異,更甚於交叉驗證(cross-validation)選擇法。

    Based on randomly right-censored sample of size n, the problem of selecting the global bandwidth in kernel density estimation of lifetime density f is investigated. A stabilized bandwidth selector, which is an extension to censored data of the complete-sample selector of Chiu (1992), is proposed. The key idea of our selector is to modify the weighted sample characteristic function beyond some cut-off frequency to reduce the sample variations without significantly inflating the bias. It is shown that under some smoothness conditions on f and the kernel, our selector is asymptotically normal distributed with the optimal root n relative convergence rate and attains the (conjectured) information bound. The excellent performances of the proposed selector at practical sample sizes are clearly demonstrated in simulation studies. In particular, the proposed selector performs conclusively better than the one selected by cross-validation.

    1 Introduction 1 2 Literature review 3 2.1 Kernel estimate for complete data case . . . . . . . . . . . . . . . . . . 3 2.2 Boundary effects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2.3 Kernel estimator for right censored data . . . . . . . . . . . . . . . . . 9 3 The proposed method 14 3.1 Fourier transform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 3.2 The Proposed Bandwidth Selector . . . . . . . . . . . . . . . . . . . . . 16 3.3 The Main Theoretical Results . . . . . . . . . . . . . . . . . . . . . . . 17 3.4 The Modification of the Proposed Bandwidth Selector . . . . . . . . . . 19 4 Simulation results 21 5 Discussion and future research 23 6 Proofs 24 Appendix A Tables 37 Appendix B Figures 46 Bibliography 61 List of Tables A.1 Simulation Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 A.2 Simulation results for the kernel estimation of Gamma(10,1) . . . . . . 39 A.3 Simulation results for the kernel estimation of Weibull(10,20) . . . . . . 40 A.4 Simulation results for the kernel estimation of Gamma(4,1) . . . . . . . 41 A.5 Simulation results for the kernel estimation of Weibull(4,20) . . . . . . 42 A.6 Simulation results for the kernel estimation of Exponential(1) . . . . . 43 A.7 Simulation results for the kernel estimation of Bimodal I . . . . . . . . 44 A.8 Simulation results for the kernel estimation of Bimodal II . . . . . . . . 45 List of Figures 2.1 The effect of bandwidth on kernel estimate. . . . . . . . . . . . . . . . 4 2.2 Boundary effects on exponential distribution. . . . . . . . . . . . . . . . 8 2.3 (a)An example of the weight function;(b)Pre-weighted exponential distribution with mean equals to 1. . . . . . . . . . . . . . . . . . . . . . . 9 3.1 Cut-off frequency selection. . . . . . . . . . . . . . . . . . . . . . . . . . 20 B.1 Estimated bandwidth density of βSTA, β∞ and βCV for the model ♯ 1 . 47 B.2 Estimated bandwidth density of βSTA, β∞ and βCV for the model ♯ 2 . 48 B.3 Estimated bandwidth density of βSTA, β∞ and βCV for the model ♯ 3 . 49 B.4 Estimated bandwidth density of βSTA, β∞ and βCV for the model ♯ 4 . 50 B.5 Estimated bandwidth density of βSTA, β∞ and βCV for the model ♯ 5 . 51 B.6 Estimated bandwidth density of βSTA, β∞ and βCV for the model ♯ 6 . 52 B.7 Estimated bandwidth density of βSTA, β∞ and βCV for the model ♯ 7 . 53 B.8 Estimated density of the model ♯ 1 . . . . . . . . . . . . . . . . . . . . 54 B.9 Estimated density of the model ♯ 2 . . . . . . . . . . . . . . . . . . . . 55 B.10 Estimated density of the model ♯ 3 . . . . . . . . . . . . . . . . . . . . 56 B.11 Estimated density of the model ♯ 4 . . . . . . . . . . . . . . . . . . . . 57 B.12 Estimated density of the model ♯ 5 . . . . . . . . . . . . . . . . . . . . 58 B.13 Estimated density of the model ♯ 6 . . . . . . . . . . . . . . . . . . . . 59 B.14 Estimated density of the model ♯ 7 . . . . . . . . . . . . . . . . . . . . 60

    1. Blum, J. R. and Susarla, V. (1980) Maximal deviation theory of density and failure estimates based on censored data. In Multivariate analysis V (P. R. Krishnaiah, ed.) 213-222. North-Holland, New York.
    2. Bowman, A. (1984). An alternative method of cross-validation for the smoothing of density estimates. Biometrika, 71(2), 353-360.
    3. Brillinger, D. R. (1981). Time series data analysis and theory. Holt, Rinehart and Windston, New York.
    4. Cao, R. and J`acome, M. A. (2007) Almost sure asymptotic representation for the presmoothed distribution and density estimators for censored data. Statistics, 41(6), 517-534.
    5. Chiu, S.T. (1991a). Bandwidth selection for kernel density estimation. The Annals of Statististics, 19(4), 1833-1905.
    6. Chiu, S.T. (1991b). The effect of discretization error on bandwidth selection for kernel density estimation. Biometrika, 78(2), 436-441.
    7. Chiu, S.T. (1992). An automatic bandwidth selector for kernel density estimation. Biometrika, 79(4), 771-782.
    8. Cheng, M., Fan, J. and Marron, J.S. (1997). On automatic boundary corrections. The Annals of statistics, 25(4), 1691-1708.
    9. Davis, K.B. (1975). Mean square error properties of density estimates. The Annals of statistics, 3(4), 1025-1030.
    10. Diggle, P. (1985). A kernel method for smoothing point process data. Journal of the Royal Statistical Society C, 34(2), 138-147.
    11. Efromovich, S. (2001). Density estimation under random censorship and order restrictions: from asymptotic to small samples. Journal of the American Statistical Association, 96, 667-684.
    12. Fan, J. and Marron, J.S. (1992). Best possible constant for bandwidth selection. The Annals of statistics, 20, 2057-2070.
    13. F¨oldes, A., Rejt¨o, L. and Winter, B. B. (1981). Strong consistency properties of nonparametric estimators for randomly censored data, II: Estimation of density
    and failure rate. Periodica Mathematica Hungariaca 12 15-29.
    14. Hall, P. (1983). Large sample optimality of least squares cross-validation in density estimation. The Annals of Statistics, 4, 1156-1174.
    15. Hall, P. and Marron, J.S. (1991a). Lower bounds for bandwidth selection in density estimation. Probability Theory and Relative Fields, 90, 149-173.
    16. Hall, P. and Marron, J.S. (1991b). Local minimum in cross-validations. Journal of the Royal Statistical Society B, 53, 245-252.
    17. Hall, P., Sheather, S. J., Jones, M. C. and Marron, J.S. (1991). On optimal data-based bandwidth selection in kernel density estimation. Biometrika, 78, 263-269.
    18. Jones, M. C. (1991). The role of ISE and MISE in density estimation. Statistics and Probability Letters, 12, 51-56.
    19. Jones, M. C. (1993). Simple boundary correction for kernel density estimation. Statistics and Computing, 3, 135-146.
    20. Jones, M. C., Marron, J. S. and Park, B. U. (1991). A simple root n bandwidth selector. The Annals of Statistics, 19, 1919-1932.
    21. Kaplan, E. L. and Meier, P. (1958). Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53, 457-481.
    22. Kulasekera, K. B. and Padgett, W. J. (2006). Bayes bandwidth selection in kernel density estimation with censored data. Journal of Nonparametric Statistics, 18, 129-143.
    23. Lo, S. H., Mack, Y. P. and Wang, J. L. (1989). Density and hazard rate estimation for censored data via strong representation of the Kaplan-Meier estimator, Probability Theory and Relative Fields, 80, 461-473.
    24. Marron, J. S. and Padgett, W. J. (1987). Asymptotically optimal bandwidth selection for kernel density estimators from randomly right-censored samples, The Annals of Statistics, 15(4), 1520-1535.
    25. Marron, J. S. and Ruppert, D. (1994). Transformations to reduce boundary bias in kernel density estimation, Journal of the Royal Statistical Society B, 56(4),
    563-671.
    26. Nadaraya, E. A. (1974). On the integral mean square error of some nonparametric estimates for the density function. Theory of Probability & Its Application, 19(1), 133-141.
    27. Rice, J. (1984). Boundary modification for kernel regression, Communication in Statistics Part A - Theory & method, 13, 893-900.
    28. Rudemo, M. (1982). Empirical choice of histograms and kernel density estimators. Scandinavian Journal of Statistics, 9, 65-78.
    29. Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis. Chapman and Hall, London.
    30. Stone, C. J. (1980). Optimal convergence rates for nonparametric estimations, The Annals of Statistics, 8, 1348-1360.
    31. Stute, W. (1995). The central limit theorem under random censorship, The Annals of Statistics, 23(2), 422-439.
    32. Woodroofe, M. (1970). On choosing a delta-sequence, The Annals of Mathematical Statistics, 41(5), 1665-1671.
    33. Wu, T.J. (1995). Adaptive root n estimates of integrated squared density derivatives. The Annals of Statistics, 23, 1474-1495.
    34. Wu, T.J. (1997). Root n bandwidth selectors for kernel estimation of density derivatives. Journal of the American Statistical Association, 92(438), 536-547.
    35. Wu, T.J. and Lin, Y. (2000). Information bound for bandwidth selection in kernel estimation of density derivatives. Statistica Sinica, 10, 457-473.
    36. Wu, T.J. and Tsai, M, -H. (2004). Root n bandwidths selectors in multivariate kernel density estimation. Probability Theory and Relative Fields, 129, 537-558.

    下載圖示 校內:2018-07-30公開
    校外:2018-07-30公開
    QR CODE