| 研究生: |
王浩丞 Wang, Hao-Cheng |
|---|---|
| 論文名稱: |
以模糊Q-學習為基礎之負重自動平衡控制策略於大型人形機器人之研製 Design and Implementation of Fuzzy Q-learning Based Weight-lifting Auto-balancing Control Strategy for Adult-sized Humanoid Robots |
| 指導教授: |
李祖聖
Li, Tzuu-Hseng S. |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2014 |
| 畢業學年度: | 102 |
| 語文別: | 英文 |
| 論文頁數: | 71 |
| 中文關鍵詞: | 大型人形機器人 、模糊Q-學習演算法 、FIRA機器人舉重競賽 |
| 外文關鍵詞: | Adult-sized Humanoid Robots, Fuzzy Q-Learning Control, FIRA Weight-lifting Competition |
| 相關次數: | 點閱:112 下載:1 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出一運用模糊Q-學習演算法之控制器,以改進大型人形機器人在負重時行走之穩定性。本論文首先介紹本實驗室第三代大型人形機器人軟硬體架構、硬體規格與設計概念。為了讓機器人能適應不同負重情況,用感測器來獲得環境資訊是必要的,所以在機器人動作控制系統中加入了感測器與回授控制。在本論文所提模糊Q-學習演算法控制器中,係以這些感測器之資料為輸入,並產生相對應之輸出來對馬達做出調整,以適應不同負重時的情況,感測器包含經卡爾曼濾波器處理之九軸慣性感測器與用於計算零力矩點之壓力感測器。本論文所提控制器分為兩個學習階段,第一階段係學習控制器對步態中各姿態之影響權重,第二階段則對模糊Q-學習演算法做出學習,學習完成後,機器人便能藉由此回授控制器在不同的負重情況時進行自主平衡,維持步態的穩定性。最後,以FIRA機器人競賽的舉重項目進行實驗,證明本論文所提回授控制器之可行性與效果。
This thesis proposes a control method that improves the ability of adult-sized humanoid robots to adapt to weight-lifting situations. First, the architecture of the hardware, software and the design concepts for the third generation adult-sized robot, David III, are introduced. In order to achieve the goal of having humanoid robots automatically balance their motion for weight-lifting situations, feedback control is added to the motion control system. The feedback sensors include a three-axis accelerometer and a three-axis gyroscopic, which would be processed by Kalman filter, as well as eight force sensors providing the zero moment point (ZMP) information on the robot. These feedback signals are used as the input of a Fuzzy Q-learning controller, which adjusts the motions to keep the stabilization of the robot. The Fuzzy Q-learning controller consists of two stages: one is the stage of fitting the output weights of each pose in motion patterns, and the second is training the rule-table of the controller. The experiment shows that the controller allows the adult-sized robot to walk stably no matter whether it lifts a weight or has a backpack on its back. Thus, the developed controller does indeed keep the balance of the robot in different situations, which gives the robot the ability to adapt to various environments in the manner of human beings.
[1] FIRA, http://www.fira.net/
[2] RoboCup, http://www.robocup.org/
[3] K. Harada, S. Kajita, K. Kaneko, and H. Hirukawa, “Dynamics and balance of a humanoid robot during manipulation tasks,” IEEE Trans. Robotics, vol. 22, no. 3, pp. 568-575, Jun. 2006.
[4] S. Kajita, T. Nagasaki, K. Kaneko, and H. Hirukawa, “ZMP-based biped running control,” IEEE Trans. Robotics, vol. 14, no. 2, pp. 63-72, Jun. 2007.
[5] P. Sardain and G. Bessonnet, “Zero moment point-measurements from a human walker wearing robot feet as shoes,” IEEE Trans. Systems, Man and Cybernetics, Part A: Systems and Humans, vol. 34, no. 5, pp. 638-648, Sep. 2004.
[6] C. Fu and K. Chen, “Gait synthesis and sensory control of stair climbing for a humanoid robot,” IEEE Trans. Industrial Electronics, vol. 55, no. 5, pp. 2111-2120, May. 2008.
[7] K. Erbatur and O. Kurt, “Natural ZMP trajectories for biped robot reference generation,” IEEE Trans. Industrial Electronics, vol. 56, no. 3, pp. 835-845, Mar. 2009.
[8] Q. Huang, K. Kaneko, K. Yokoi, S. Kajita, T. Kotoku, N. Koyachi, H. Arai, N. Imamura, K. Komoriya, and K. Tanie, “Balance control of a biped robot combining off-line pattern with real-time modification,” in Proc. IEEE Int. Conf. Robotics and Automation, 2000, pp. 3346-3352.
[9] P. Sardain and G. Bessonnet, “Forces acting on a biped robot. Center of pressure-zero moment point,” IEEE Trans. Systems, Man and Cybernetics, Part A: Systems and Humans, vol. 34, pp. 630–372, Sep. 2004.
[10] J. P. Ferreira, M. Crisóstomo and A. P. Coimbra, “ZMP trajectory reference for the sagittal plane control of a biped robot based on a human CoP and gait”, in Proc. of the 2009 IEEE/ RSJ International Conf. on Intelligent Robots and Systems, pp.1588-1593, 2009
[11] F. Asano, M. Yamakita, N. Kamamichi, and Z. W. Luo, “A novel gait generation for biped walking robots based on mechanical energy constraint,” IEEE Trans. Robotics and Automation, vol. 20, no. 3, pp. 565-573, Jun. 2004.
[12] Q. Huang and Y. Nakamura, “Sensory reflex control for humanoid walking,” IEEE Trans. Robotics, vol. 21, no. 5, pp. 977-984, Oct. 2005.
[13] P. Sardain, M. Rostami, and G. Bessonnet, “An anthropomorphic biped robot: dynamic concepts and technological design,” IEEE Trans. Systems, Man and Cybernetics, Part A: Systems and Humans, vol. 28, no. 6, pp.823-838, Nov. 1998.
[14] Y. D. Kim, B. J. Lee, J. H. Ryu, and J. H. Kim, “Landing force control for humanoid robot by time-domain passivity approach,” IEEE Trans. Robotics, vol. 23, no. 6, pp. 1294-1301, Dec. 2007.
[15] P.Y. Glorennec, “Fuzzy Q-leaming and dynamical fuzzy Q-leaming”; in Proc. of IEEE International Conference on Fuzzy Systems, Orlando, vol. 1, pp. 474-479, Jun 1994.
[16] H. R. Berenji, “Fuzzy Q-Learning: A new approach for fuzzy dynamic programming,” in Proc. IEEE Int. Conf. Fuzzy Systems, 1994, pp. 486–491.
[17] P. Y. Glorennec and J. Jouffe, “Fuzzy Q-learning,” inProc. 6th IEEE Int. Conf. Fuzzy Systems, 1997.
[18] ROBOTIS, http://www.robotis.com/
[19] Arduino Due, http://arduino.cc/en/Main/ArduinoBoardDue
[20] Zigbee-100, http://support.robotis.com/en/product/auxdevice/communication/zigbee_manual.htm
[21] DMP, http://www.dmp.com.tw/
[22] Logitech [Online], Available: http://www.logitech.com/
[23] GIGABYTE, http://www.gigabyte.tw/
[24] Tekscan, http://www.tekscan.com/
[25] C.-H. Li, Design and implememtation of vision and strategy systems for the FIRA HuroCup competition, Master Thesis, National Cheng Kung University, Jul. 2011.
[26] C.-M. Chang, Design and implementation of vision and strategy system for humanoid robot soccer competition, Master Thesis, National Cheng Kung University, Jul. 2009.
[27] K. T. Holland, R. A. Holman, T. C. Lippmann, and J. Stanley, “Practical use of video imagery in nearshore oceanographic field studies,” IEEE Journal of Oceanic Engineering, vol. 22, no. 1, pp. 81-92, Jan 1997.
[28] Y. I. Abdel-Aziz and H. M. Karara, “Direct linear transformation from comparator coordinates into object space coordinates in close-range photogrammetry,” in Proc. ASP/UI Symp. Close-Range Photogrammetry, Urbana, IL, 1971, pp. 1–18.
[29] G. Welch and G. Bishop, “An introduction to the Kalman filter,” Dept. Comput. Sci., Univ. North Carolina, Chapel Hill, NJ, 2001.
[30] Julier, S.J., Uhlmann, J.K. “Unscented filtering and nonlinear estimation,” Proceedings of the IEEE, vol. 92, issue. 3, pp. 401–422, Mar 2004
[31] C. C. Lee, “Fuzzy logic in control system: Fuzzy logic controller-part I,” IEEE Trans. Systems, Man and Cybernetics, vol. 20, no. 2, pp. 404-418, 1990.
[32] C. C. Lee, “Fuzzy logic in control system: Fuzzy logic controller-part II,” IEEE Trans. Systems, Man and Cybernetics, vol. 20, no. 2, pp. 419-435, 1990.
[33] T. Takagi and M. Sugeno, “Fuzzy identification of systems and its application to modeling and control,” IEEE Trans. Systems, Man and Cybernetics, vol. 15, no. 1, pp. 116-132, 1985.
[34] C. J. C. H. Watkins, Learning with delayed rewards, Ph.D Thesis, Cambridge University, Psychology Department, 1989.
[35] L. A. Zadeh, “Fuzzy sets,” Information and control, vol. 8, no. 3, pp. 338-353, 1965.
[36] L. A. Zadeh, “Fuzzy algorithm,” Information and control, vol. 12, no. 2, pp. 94-102, 1968.
[37] C. J. C. H. Watkins and P. Dayan, “Q-learning,” Mach. Learn., vol. 8, pp. 279-292, 1992.
[38] R. S. Sutton and A. G. Barto, Reinforcement learning: An Introduction, MIT Press, Cambridge University, MA, 1998.
[39] K.-F. Lee, Design and implememtation of particle swarm optimization gait learning method for adult-sized humanoid robots, Master Thesis, National Cheng Kung University, Jul. 2012.
[40] FIRA HuroCup laws,
https://docs.google.com/document/d/1YgsunoOlx9Bg6bNirxuCqHSWQbIZEIlA9KlDZkQy1RA/pub
校內:2019-08-05公開