| 研究生: |
古怡杰 Gu, Yi-Jay |
|---|---|
| 論文名稱: |
統計型局部線性內嵌法於台灣手語多視角手形辨識 Statistical LLE for Multi-view TSL Hand Shape Recognition |
| 指導教授: |
謝璧妃
Hsieh, Pi-Fuei |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 資訊工程學系 Department of Computer Science and Information Engineering |
| 論文出版年: | 2006 |
| 畢業學年度: | 94 |
| 語文別: | 英文 |
| 論文頁數: | 57 |
| 中文關鍵詞: | 非線性降維方法 、手形辨識 、局部線性內嵌法 |
| 外文關鍵詞: | hand shape recognition, locally linear embedding, nonlinear dimensionality reduction |
| 相關次數: | 點閱:120 下載:1 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
以二維影像辨識物體時,當欲辨識的物體並非由一個固定的視角成像時,此時特徵空間的資料分佈便會較為複雜。這是由於在特徵空間中,二維影像在不同視角下產生的差異與大量的手形類別數目所造成。本篇論文嘗試利用所提出的「統計型局部線性內嵌法」來辨識多視角之台灣手語手形資料。「局部線性內嵌法」為一非監督式非線性降維方法,它利用局部線性之特性,以獲得內嵌於高維空間中之低維流形;藉由維持鄰近區域之結構,保留住原資料分佈之非線性結構。雖然「局部線性內嵌法」具有在低維空間保持資料非線性結構之能力,但是在將高維資料投影到低維空間的過程,並沒有將類別資訊納入考慮。以鑑於此,本文提出「統計型局部線性內嵌法」,以統計之方式提供類別資訊,以加強「局部線性內嵌法」在處理有關分類之能力。實驗以UCI資料集與台灣手語多視角手形之辨識應用,來驗證「統計型局部線性內嵌法」在分類之效能。實驗結果展現,「統計型局部線性內嵌法」比起原「局部線性內嵌法」獲致更好的分類效能。
The image-based object recognition problem becomes complicated when the objects of interest are not posed at a fixed view. In recognition of sign language, the variation of a hand shape due to multiple views and the large number of hand shapes (classes) yield a data distribution with a complicated nonlinear structure in the feature space. This makes it difficult to preserve the class separability under a linear transformation of dimensionality reduction. The locally linear embedding (LLE) is an unsupervised nonlinear dimensionality reduction approach that utilizes the local linearity to discover the low dimensional manifold embedded in the high dimensional space. This suggests that LLE may preserve the neighborhood configuration for the nonlinear structure of the multi-view hand shape data distribution. Although LLE has capability to recover the global nonlinear structure from locally linear fits, the class label information is not taken into account when mapping samples from the high dimensional space to a low dimensional feature space. The statistical LLE is thus proposed herein to improve the capability of LLE associated with classification by incorporating the class label information statistically. In experiments, the statistical LLE was applied to a multi-view TSL hand shape dataset to reduce dimensionality prior to classification. Several UCI datasets were also utilized to validate the proposed approach. Experimental results show that the statistical LLE gave a superior classification performance compared to the original LLE and the linear dimensionality reduction methods such as LDA and PCA.
[1] S. C. W. Ong and S. Ranganath, “Automatic sign language analysis: a survey and the future beyond lexical meaning,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 6, June 2005.
[2] R. H. Liang and M. Ouhyoung, “A real-time continuous gesture recognition system for sign language,” Proc. of IEEE Int’l Conf. on Automatic Face and Gesture Recognition, April 1998.
[3] G. Fang, W. Gao and D. Zhao, “Large vocabulary sign language recognition based on hierarchical decision trees,” Proc. of Int’l Conf. on Multimodal Interfaces, 2003.
[4] Y. Nam, K. Wohn and H. L. Kwang, “Modeling and recognition of hand gesture using colored Petri nets,” IEEE Trans. Systems, Man and Cybernetics—Part A: Systems and Humans, vol. 29, no. 5, Sep. 1999.
[5] K. Fukunaga, Introduction to Statistical Pattern Recognition, second ed., New York: Academic Press, 1990.
[6] H. S. Seung and D. D. Lee, “The manifold ways of perception,” Science, vol. 290, Dec. 2000.
[7] S. T. Roweis and L. K. Saul, “Nonlinear dimensionality reduction by locally linear embedding,” Science, vol. 290, Dec. 2000.
[8] Y. Cui and J. Weng, “Appearance-based hand sign recognition from intensity image sequence,” Computer Vision and Image Understanding, vol. 78, no. 2, May 2002.
[9] R. C. Gonzalez and R. E. Woods, Digital Image Processing, second ed., Addison-Wesley, 2002.
[10] K. Arbter, W. E. Snyder, H. Burkhardt and G. Hirzinger, “Application of affine-invariant Fourier descriptors to recognition of 3-D objects,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, no. 7, July 1990.
[11] F. S. Chen, C. M. Fu and C. L. Hung, “Hand gesture recognition using a real-time tracking method and hidden Markov models,” Image and Vision Computing, vol. 21, no. 8, Aug. 2003.
[12] Y. Wu and T. S. Hung, “View-independent recognition of hand postures,” Proc. of IEEE Int’l. Conf. on Computer Vision and Pattern Recognition, vol. 2, June 2000.
[13] J. Triesch and C. Malsburg, “A system for person-independent hand posture recognition against complex background,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 12, Dec. 1990.
[14] L. Bretzner, I. Laptev and T. Lindeberg, “Hand gesture recognition using multi-scale colour features, hierarchical models and particle filtering,” Proc. of IEEE Int’l Conf. on Automatic Face and Gesture Recognition, May 2002.
[15] Y. Hamada, N. Shimada and Y. Shirai, “Hand shape estimation under complex backgrounds for sign language recognition,” Proc. of IEEE Int’l Conf. on Automatic Face and Gesture Recognition, May 2004.
[16] M. J. Jones and J. M. Rehg, “Statistical color models with applications to skin detection,” Proc. of IEEE Int’l Conf. on Computer Vision and Pattern Recognition, vol. 1, 1999.
[17] Q. H. Thu, M. Meguro and M. Kaneko, “Skin-color extraction in images with complex background and varying illumination,” Proc. of IEEE Workshop on Application of Computer Vision, 2002.
[18] N. Habili, C. C. Lim and A. Moini, “Segmentation of the face and hands in sign language video sequences using color and motion cues,” IEEE Trans. Curcuits and Systems for Video Technology, vol. 14, no. 8, Aug. 2004.
[19] D. Chai and N. Ngan, “Face segmentation using skin-color map in videophone application,” IEEE Trans. Circuits and Systems for Video Techniques, vol. 9, no. 4, June 1999.
[20] S. L. Phung, A. Bouzerdoum and D. Chai, “Skin segmentation using color pixel classification: analysis and comparison,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 1, Jan. 2005.
[21] V. Vezhnevets, V. Sazonov and A. Andreeva, “A survey on pixel-based skin color detection techniques.” Proc. of IEEE Int’l. Conf. on Computer Graphics and Vision, 2003.
[22] M. H. Yang and N. Ahuja, “Gaussian mixture model for human skin color and its applications in image and video databases,” Conf. on Storage and Retrieval for Image and Video Database, vol. 3656, 1999.
[23] J. B. Tenenbaum, V. Silva and J. C. Langford, “A global geometric framework for nonlinear dimensionality reduction,” Science, vol. 290, Dec. 2000.
[24] M. Belkin and P. Niyogi, “Laplacian eigenmaps and spectral techniques for embedding and clustering,” Neural Information Processing Systems, 2001.
[25] L. Yang, “Distance-preserving projection of high-dimensional data for nonlinear dimensionality reduction,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, no. 9, Sep. 2004.
[26] L. K. Saul and S. T. Roweis, “Think globally, fit locally: unsupervised learning of low dimensional manifolds,” Journal of Machine Learning Research, vol. 4, 2003.
[27] D. Ridder, O. Kouropteva, O. Okun, M. Pietikäinen and R. P. W. Duin, “Supervised locally linear embedding,” Proc. of Joint Int’l Conf. on ICANN/ICONIP, 2003.
[28] D. Lian, J. Yang, Z. Zheng and Y. Chang, “A facial expression recognition system based on supervised locally linear embedding,” Pattern Recognition Letters, vol. 26, 2005.
[29] X. Geng, D. Zhan and Z. Zhou, “Supervised nonlinear dimensionality reduction for visualization and classification,” IEEE Trans. Systems, Man and Cybernetics—Part B:Cybernetics, vol. 35, no. 6, Dec. 2005.
[30] X. Jia and J. A. Richards, “Cluster-space representation for hyperspectral data classification,” IEEE Trans. Geoscience and Remote Sesing, vol. 40, no. 3, March 2002.
[31] C. L. Blake and C. J. Merz, “UCI Repository of Machine Learning Databases,” http://www.ics.uci.edu/~mlearn/MLRepository.html, 1998.