National Cheng Kung University Electronic Theses and Dissertations System

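Graduate student:	王浩丞 Wang, Hao-Cheng
Thesis title:	Design and Implementation of Fuzzy Q-learning Based Weight-lifting Auto-balancing Control Strategy for Adult-sized Humanoid Robots (以模糊Q-學習為基礎之負重自動平衡控制策略於大型人形機器人之研製)
Advisor:	李祖聖 Li, Tzuu-Hseng S.
Degree:	Master
Department:	College of Electrical Engineering and Computer Science - Department of Electrical Engineering
Year of publication:	2014
Graduation academic year:	102 (ROC calendar, 2013-2014)
Language:	English
Number of pages:	71
Keywords (Chinese):	大型人形機器人 (adult-sized humanoid robots), 模糊Q-學習演算法 (fuzzy Q-learning algorithm), FIRA機器人舉重競賽 (FIRA robot weight-lifting competition)
Keywords (English):	Adult-sized Humanoid Robots, Fuzzy Q-Learning Control, FIRA Weight-lifting Competition
Usage:	Views: 112, Downloads: 1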

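This thesis proposes a controller based on the fuzzy Q-learning algorithm to improve the walking stability of adult-sized humanoid robots carrying a load. It first introduces the hardware and software architecture, hardware specifications, and design concepts of the laboratory's third-generation adult-sized humanoid robot. For the robot to adapt to different load conditions, it must obtain information about its environment from sensors, so sensors and feedback control are added to the robot's motion control system. The proposed fuzzy Q-learning controller takes the sensor data as input and produces corresponding outputs that adjust the motors to suit each load condition. The sensors comprise a nine-axis inertial measurement unit, whose data are processed by a Kalman filter, and pressure sensors used to compute the zero moment point. The controller is trained in two stages: the first learns the weight of the controller's influence on each pose in the gait, and the second trains the fuzzy Q-learning algorithm itself. After training, the robot can use this feedback controller to balance itself under different load conditions and keep its gait stable. Finally, experiments based on the weight-lifting event of the FIRA robot competition demonstrate the feasibility and effectiveness of the proposed feedback controller.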
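The zero moment point mentioned in the abstract is commonly estimated from foot force sensors as the force-weighted average of the sensor positions. The block below is a minimal sketch of that calculation, not the thesis's implementation. The function name `zmp_from_force_sensors`, the sensor coordinates in `SENSOR_POSITIONS` (eight sensors, assumed here to be four per foot), and the contact threshold `min_total_force` are all assumptions made for illustration.

```python
from typing import Optional, Sequence, Tuple

Point = Tuple[float, float]


def zmp_from_force_sensors(
    positions: Sequence[Point],
    forces: Sequence[float],
    min_total_force: float = 1.0,
) -> Optional[Point]:
    """Estimate the ZMP as the force-weighted mean of the sensor positions.

    positions: (x, y) of each sensor in a common ground frame (metres).
    forces: vertical force measured by each sensor (newtons).
    min_total_force: hypothetical threshold below which the feet are
        treated as not in contact, so no ZMP is reported.
    """
    if len(positions) != len(forces):
        raise ValueError("positions and forces must have the same length")

    # Clip negative readings, which can appear as sensor noise.
    clipped = [max(f, 0.0) for f in forces]
    total = sum(clipped)
    if total < min_total_force:
        return None

    x = sum(p[0] * f for p, f in zip(positions, clipped)) / total
    y = sum(p[1] * f for p, f in zip(positions, clipped)) / total
    return (x, y)


# Hypothetical sensor layout in metres: x points forward, y to the left.
SENSOR_POSITIONS = [
    (0.10, 0.13), (0.10, 0.05), (-0.10, 0.13), (-0.10, 0.05),      # left foot
    (0.10, -0.05), (0.10, -0.13), (-0.10, -0.05), (-0.10, -0.13),  # right foot
]
```

A feedback controller could compare this estimate with a reference point inside the support polygon and use the difference as one of its inputs. Returning `None` when the total force is too small avoids dividing by a near-zero sum while the robot is lifted or unloaded.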

This thesis proposes a control method that improves the ability of adult-sized humanoid robots to adapt to weight-lifting situations. First, the hardware and software architecture and the design concepts of the third-generation adult-sized robot, David III, are introduced. So that the robot can automatically balance its motion while lifting weights, feedback control is added to the motion control system. The feedback sensors include a three-axis accelerometer and a three-axis gyroscope, whose signals are processed by a Kalman filter, and eight force sensors that provide the robot's zero moment point (ZMP). These feedback signals are the inputs to a fuzzy Q-learning controller, which adjusts the motions to keep the robot stable. The controller is trained in two stages: the first fits the output weights of each pose in the motion patterns, and the second trains the controller's rule table. Experiments show that the controller allows the adult-sized robot to walk stably whether it lifts a weight or carries a backpack. The developed controller thus keeps the robot balanced in different situations, giving it a human-like ability to adapt to various environments.
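To illustrate what training the rule table of a fuzzy Q-learning controller involves, the block below sketches one common textbook formulation: each fuzzy rule keeps a q-value per candidate action, the output is the firing-strength-weighted blend of the actions chosen by the rules, and a temporal-difference error is shared among the rules in proportion to their firing strengths. This is a generic sketch and not the controller described in the thesis. The class `FuzzyQLearner`, the single scalar input, the triangular membership functions, and all parameter values are assumptions made for illustration.

```python
import random
from typing import List, Sequence, Tuple


def triangular(x: float, left: float, center: float, right: float) -> float:
    """Triangular membership function that peaks at `center`."""
    if x <= left or x >= right:
        return 0.0
    if x <= center:
        return (x - left) / (center - left)
    return (right - x) / (right - center)


class FuzzyQLearner:
    """Rule table with one row of q-values per fuzzy rule (illustrative)."""

    def __init__(self, centers: Sequence[float], actions: Sequence[float],
                 alpha: float = 0.1, gamma: float = 0.9,
                 epsilon: float = 0.1, seed: int = 0) -> None:
        self.centers = list(centers)   # rule centers, sorted ascending
        self.actions = list(actions)   # candidate outputs for every rule
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = [[0.0] * len(self.actions) for _ in self.centers]
        self.rng = random.Random(seed)

    def firing_strengths(self, x: float) -> List[float]:
        """Normalized rule activations for a scalar input."""
        x = min(max(x, self.centers[0]), self.centers[-1])  # clamp to range
        last = len(self.centers) - 1
        raw = []
        for i, c in enumerate(self.centers):
            left = self.centers[i - 1] if i > 0 else c - 1.0
            right = self.centers[i + 1] if i < last else c + 1.0
            raw.append(triangular(x, left, c, right))
        total = sum(raw)
        return [r / total for r in raw]

    def act(self, x: float) -> Tuple[float, List[int], List[float]]:
        """Pick one action per rule (epsilon-greedy) and blend them."""
        w = self.firing_strengths(x)
        chosen = []
        for row in self.q:
            if self.rng.random() < self.epsilon:
                chosen.append(self.rng.randrange(len(self.actions)))
            else:
                chosen.append(max(range(len(row)), key=row.__getitem__))
        action = sum(wi * self.actions[ci] for wi, ci in zip(w, chosen))
        return action, chosen, w

    def update(self, w: Sequence[float], chosen: Sequence[int],
               reward: float, next_x: float) -> float:
        """Apply one temporal-difference update and return the TD error."""
        q_sa = sum(wi * self.q[i][ci]
                   for i, (wi, ci) in enumerate(zip(w, chosen)))
        w_next = self.firing_strengths(next_x)
        v_next = sum(wi * max(row) for wi, row in zip(w_next, self.q))
        td_error = reward + self.gamma * v_next - q_sa
        for i, (wi, ci) in enumerate(zip(w, chosen)):
            self.q[i][ci] += self.alpha * td_error * wi
        return td_error
```

In a balance-control setting, the input `x` might be a normalized ZMP or tilt error, the blended action a joint-angle correction, and the reward a penalty on the remaining error. Those mappings are hypothetical here, since the record does not give the thesis's state, action, or reward definitions.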

Abstract
Acknowledgment
Contents
List of Figures
List of Tables

Chapter 1. Introduction
1 Motivation
2 Thesis Organization

Chapter 2. Hardware Specifications of Adult-sized Humanoid Robot
1 Introduction
2 The Configurations of David III
3 Hardware Specification
3.1 Actuators
3.2 Motion Controller
3.3 Zigbee Module
3.4 9-axes IMU
3.5 Camera
3.6 Computer
3.7 Force Sensor
3.8 Li-poly Batteries
3.9 Materials
3.10 Gear
4 Special Design for David III
4.1 Anti-twisting Structure
4.2 Block of Motors
5 Summary

Chapter 3. Sensor Processing
1 Introduction
2 Vision
3 Accelerometer and Gyroscope with Kalman Filter
4 Force Sensor and Zero Moment Point (ZMP)
5 Summary

Chapter 4. Auto-balance Control Strategy Based on Fuzzy-Q Learning Control
1 Introduction
2 Concept of Fuzzy-Q Learning Control Method
2.1 Concept of Fuzzy Logic Control
2.2 Concept of Q-learning
2.3 Concept of Fuzzy Q-learning
3 Gait Pattern of David III
4 The Learning Process of Auto-balance Control Method
4.1 The first stage learning
4.2 The second stage learning
5 The Interface of the learning process
6 Summary

Chapter 5. Experiment Results
1 Introduction
2 Experimental Results
2.1 Comparison of Motions before and after the Learning Process
2.2 Control Strategy with Different Weight Lifting Situations
2.3 Auto-balance control when a backpack on robot’s back

Chapter 6. Conclusions and Future Works
1 Conclusions
2 Future Works

References


                                    


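On campus: publicly available from 2019-08-05
Off campus: not available. The electronic thesis has not been authorized for public release; for the print copy, please consult the library catalog.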
