研究生: |
蔡仲齡 Tsai, Zhong-Ling |
---|---|
論文名稱: |
含語者驗證之小型場所人臉辨識門禁系統的研發 Facial Recognition System in a Small Place with Speaker Validation |
指導教授: |
周榮華
Chou, Jung-Hua |
學位類別: |
碩士 Master |
系所名稱: |
工學院 - 工程科學系 Department of Engineering Science |
論文出版年: | 2008 |
畢業學年度: | 96 |
語文別: | 中文 |
論文頁數: | 68 |
中文關鍵詞: | 人臉辨識 、主分量分析 、語者辨識 、倒頻譜 、門禁系統 |
外文關鍵詞: | PCA, facial recognition, speaker validation, cepstrum |
相關次數: | 點閱:97 下載:6 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本研究所完成之含語者驗證之人臉辨識門禁系統,適用於一般家庭、辦公室或小型公司等成員人數少於20人的團體。本系統可分成三個子系統:成員資料庫建立系統、人臉影像辨識系統及語者驗證系統。
利用人臉影像辨識系統辨識是否為成員,若為成員便准許進入,否則用語者驗證系統加以確認,來確保居家安全。以資料庫建立系統,建立專屬的人臉影像資料庫與語音資料庫。人臉影像辨識系統部分,乃將擷取到的影像使用膚色及五官偵測切割出單純的人臉影像,再利用主分量分析法來降低維度以減少運算量,最後使用最近歐式距離判斷是否為成員,若判斷為成員即可准許進入;反之,則進入語者驗證系統,引導使用者說出固定語句,使用倒頻譜擷取特徵,最後以最近歐式距離判斷,並與人臉影像結果做對照,驗證聲音是否為候選成員所屬,身分符合即可進入,反之則判定為非成員,禁止進入。
此門禁系統於即時運作時,其正確接受率達93%以上,辨識一人在影像部分所需時間在0.5秒上下,若需經由語者驗證系統則驗證時間約1秒左右。
In this thesis, a facial recognition system with speaker validation in a small place is designed. This system is suitable for a small place that has people less than 20. There are three sub - systems in the system - the database system, facial image recognition system, and speaker validation system.
First, use the database system to construct a facial image database and a speaker voice database which are the special database for the present system. Second, if the facial image recognition system judges that the user is a member of the database, the system allows that user to enter. Otherwise, the user has to be tested by the speaker voice validation system. The system lets the user pass, if the answer of the speaker voice validation system matches that of the facial image recognition system.
In the system, the accuracy rate is 93% above. It costs about 0.5 seconds to recognize one person in the facial image recognition system, and 1 second in the speaker voice validation system.
[1] P. N. Belhumeur, J, P. Hespanha, D. J. Kriegman “Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection” IEEE Transactions on pattern analysis and machine intelligence, vol.19, NO.7, pp. 711-720, July 1997.
[2] C. Garcia, G. Tziritas, “Face Detection Using Quantized Skin Color Regions Merging and Wavelet Packet Analysis” IEEE Tran. Multimedia, vol. 1, pp. 264-277, September 1999.
[3] A. M. Martinez, A. C. Kak, “PCA versus LDA” IEEE Transactions on pattern analysis and machine intelligence, vol.23, NO.2, pp. 228-233, February 2001.
[4] R. C. Gonzalez, R. E. Woods “Digital Image Processing 2nd ed.” by Prientice-Hall, Inc 2002.
[5] 王科翔 『多重人臉偵測驗證與識別系統』 國立成功大學工程科學系碩士班 碩士論文 2005年。
[6] R. L. Hsu, M. Abdel-Mottaleb, and Anil K. Jain “Face Detection in Color Image"IEEE Transactions on pattern analysis and machine intelligence, vol. 24, NO.5, pp. 696-706, May 2002.
[7] K. C. Kwak and W. Pedrycz, “Face recognition using an enhanced independent component analysis approach,” IEEE Trans. on Neural Networks, Vol. 18, No. 2, pp. 530-541, 2007.
[8] I. Guizatdinova and V. Surakka “Detection of Facial Landmarks from Neutral, Happy, and Disgust Facial Images” The 13-th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision'2005, WSCG 2005, University of West Bohemia, Campus Bory, Plzen-Bory, Czech Republic, January 31 - February 4, 2005 , Regular Papers, pp. 55-62.
[9] C. E. Thomaz, R. Q. Feitosa, A. Veiga, “Design of Radial Basis Function Network as Classifier in Face Recognition Using Eigenfaces” Neural Networks, 1998. Proceedings. Vth Brazilian Symposium on 9-11 Dec. 1998 Page(s):118 – 123
[10] M. J. Er, S. Wu, J. Lu, H. L. Toh, “Face recognition with radial basis function (RBF) neural networks” IEEE Transactions on Neural Networks, vol.13, NO.3, May 2002.
[11] R. O. Duda and P. E. Hart, D. G. Stork “Pattern Classification 2nd ed.” John Wiley & SONS, Inc, 2005.
[12] Y. Gizatdinova and V. Surakka “Feature-Based Detection of Facial Landmarks from Neutral and Expressive Facial Images"IEEE Transactions on pattern analysis and machine intelligence, vol.28, NO.1, pp. 135-139, January 2006.
[13] J. Qin, Z. S. He, “A SVM face recognition method based on Gabor-featured key points” Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference onVolume 8, 18-21 Aug. 2005 Page(s):5144 - 5149 Vol. 8
[14] 王小華 『語音信號處理』全華圖書股份有限公司 96年出版。
[15] H. Jiang, X. W. Li, and Chaojun Liu, “Large Margin Hidden Markov Models for Speech Recognition” IEEE Transactions on audio, speech, and language processing, vol.14, NO.5, pp. 1584-1595, September 2006.
[16] P. Somervuno, A. Harma, and Seppo Fagerlund, “Parametric Representation of Bird Sounds for Automatic Species Recognition” IEEE Transactions on audio, speech, and language processing, vol.14, NO.6, pp. 2252-2263, November 2006.
[17] S. Wu, L. Jiang, S. Xie, and Allen C. B. Yeo “A Robust Method for Detecting Facial Orientation in Infrared Images” The Journal of The Pattern Recognition Society 39, pp. 303-309, 2006.
[18] C. Xie, J. Zhang, F. Long, “A Dynamic Feature Extraction Based on Wavelet Transforms for Speaker Recognition” Electronic Measurement and Instruments, 2007. ICEMI '07. 8th International Conference on Aug. 16 2007-July 18 2007 Page(s):1-595 - 1-598.
[19] K. S. Lee “Statistical Approach for Voice Personality Transformation” IEEE Transactions on audio, speech, and language processing, vol.15, NO.2, pp. 641-651, February 2007.
[20] N. Chatzichrisafis, V. Fiakoloukas, V. Digalakis, C. Harizakis, “Gaussian Mixture Clustering and Language Adaptation for the Development of a New Language Speech Recognition System” IEEE Transactions on audio, speech, and language processing, vol.15, NO.3, pp. 928-938, March 2007.
[21] 鍾偉仁 『語者辨認與驗證之初步研究』國立臺灣大學電信工程學研究所 碩士論文 2005年。