| 研究生: |
侯駿賢 Hou, Chun-Hsien |
|---|---|
| 論文名稱: |
以相互式類神經網路概念學習演算法
為基礎之雙機器人合作互動 Neural Network based Self and Mutual Concept Learning for Robot Cooperation |
| 指導教授: |
李祖聖
Li, Tzuu-Hseng S. |
| 學位類別: |
碩士 Master |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2015 |
| 畢業學年度: | 103 |
| 語文別: | 英文 |
| 論文頁數: | 64 |
| 中文關鍵詞: | 互相學習類神經網路 、概念學習 、機器人合作 |
| 外文關鍵詞: | Mutual learning neural network, concept learning, robot cooperation |
| 相關次數: | 點閱:76 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
機器人合作在機器人研究領域,是一個重要且具未來性的議題。本論文提出一基於類神經網路演算法之相互式學習法,讓機器人透過溝通與分享自我學習的成果,提升彼此概念學習的能力,讓人們可以建構複雜的想法,相互溝通,以及理解並預測這個世界。讓機器人具備學習概念的能力,無異是朝向智慧化與人性化的一大步。本論文提出之互相學習類神經網路(MLNN)系統,讓機器人具備學習概念,以及合作與溝通的能力,並實現在智慧箱任務中。在該任務當中,機器人需辨識出桌上物件之形狀與顏色,並將物件投入由另一隻機器人所拿之智慧箱之相對應洞口。互相學習類神經網路,是由倒傳遞的類神經網路(BPNN)所衍生。視覺系統使用HSV 過濾背景,找出顏色與形狀,再利用雙線性方法讓影像正規化。機器人可透過TCP/IP通訊系統互相通訊,分享彼此的辨識結果,並規劃手臂動作。互相學習類神經網路系統,可同時更新雙機器人之類神經網路系統中的權重值。系統比較雙機器人之辨識結果,與過去辨識之正確性,並做出選擇。具有較高正確性的機器人,會將自身學習的權重值,傳送給另一隻機器人,以達到相互學習之目的。經由多次的學習過程,機器人將可以成功地建立概念。本篇論文所提出的方法,透過Matlab模擬學習過程,並且實際運用在兩隻機器人上。實驗的結果,證明本方法之可行性與有效性。
Multi-robot cooperation is an important issue in robotics. This thesis proposes a self and mutual concept learning algorithm based on Neural Network (NN) for robot cooperation. Robots learn a concept not only by themselves but also from each other, and they cooperate to complete a complicated task, that of Form Fitter. In the form fitter game, one robot explores the textures of shapes and grasps a shape on the desk to put on a box held by another robot. The Mutual Learning Neural Network (MLNN) system evolved from the Backpropagation Neural Network (BPNN). This visual system extracts the color and shape of objects by HSV and normalizes the images by bilinear interpolation. Robots utilize the TCP/IP communication system to communicate with each other and generate a series action of arm. The MLNN system updates both weights in the neural network system of each robot at the same time. The system compares the recognition results of both robots and chooses the better one. The robot which has greater accuracy will translate its learning weight to the other one to improve the performance of both robots. After learning many times, both robots can learn a concept. Finally, the proposed method is simulated by Matlab and demonstrated in two home service robots. The experimental results show the efficiency and feasibility of the MLNN system.
Reference
[1] D. Boud, R. Cohen, and J. Sampson, “Peer learning and assessment,” Assessment & Evaluation in Higher Education, vol. 24, no. 4, pp. 413-426, 1999.
[2] W. Damon, “Peer education: The untapped potential,” Journal of Applied Developmental Psychology, vol. 5, no, 4, pp. 331-343, 1984.
[3] M. Dorigo and L. M. Gambardella, “Ant colony system: a cooperative learning approach to the traveling salesman problem,” IEEE Transactions on Evolutionary Computation, vol. 1, no. 1, pp. 53-66, 1997.
[4] B. J. Millis and P. G. Cottell Jr, Cooperative learning for higher education faculty. Series on higher education, Oryx Press, 1997.
[5] D. W. Johnson, “An educational psychology success story: social interdependence theory and cooperative learning,” Educational Researcher, vol. 38, no. 5, pp. 365–379, 2009.
[6] Concept learning [Online]. Available: http://en.wikipedia.org/wiki/Concept_learning
[7] J. Bruner, J. J. Goodnow, and G. A. Austin, A study of thinking. New York: Science Editions, 1967.
[8] R. G. Suchman and T. Trabasso,“ Stimulus preference and cue function in young children's concept attainment,” Journal of Experimental Child Psychology, vol. 3, no. 2, pp.188-198, 1966.
[9] A. H. Buss and P. J. Lang, “Psychological deficit in schizophrenia: I. Affect, reinforcement, concept attainment,” Journal of Abnormal Psychology, vol. 70, no.1, pp. 2-24, 1965.
[10] C. W. Peters, “A comparison between the Frayer model of concept attainment and the textbook approach to concept attainment,” Reading Research Quarterly, pp. 252-254, 1974.
[11] R. L. Goldstone and A. Kersten, “Concepts and categorization,” in Handbook of psychology, 2003.
[12] D. L. Medin and E. E. Smith, “Concepts and concept formation,” Annual review of psychology, vol. 35, no. 1, pp. 113-138, 1984.
[13] A. B. Markman and V. S. Makin, “Referential communication and category acquisition,” Journal of Experimental Psychology: General, vol. 127, no. 4, pp. 331-354, 1998.
[14] E. E. Smith and S. A. Sloman, “Similarity-versus rule-based categorization,” Memory & Cognition, vol. 22, no.4, pp. 377-386, 1994.
[15] E. Rosch and C. B. Mervis, “Family resemblances: Studies in the internal structure of categories,” Cognitive Psychology, vol. 7, no. 4, pp. 573-605, 1975.
[16] C. B. Mervis and E. Rosch, “Categorization of natural objects,” Annual Review of Psychology, vol. 32, no. 1, pp. 89-115, 1981.
[17] L. W. Barsalou, “The instability of graded structure: Implications for the nature of concepts,” in Concepts and Conceptual Development: Ecological and Intellectual Factors in Categorization, CUP Archive, pp. 101-140, 1987.
[18] D. W. Aha, D. Kibler, and M. K. Albert, “Instance-based learning algorithms,” Machine learning, vol. 6, no. 1, pp. 37-66, 1991.
[19] M. J. Tarr, P. Williams, W. G. Hayward, and I. Gauthier, “ Three-dimensional object recognition is viewpoint dependent,” Nature Neuroscience, vol. 1, no. 4, pp. 275-277, 1998.
[20] C. K. Riesbeck and R. C.Schank, Inside case-based reasoning. Psychology Press, 2013.
[21] T. Hastie and R. Tibshirani, “Discriminant adaptive nearest neighbor classification,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 18, no. 6, pp. 607-616, 1996.
[22] T. Kohonen, “Improved versions of learning vector quantization,” in Proceeding of IJCNN International Joint Conference on Neural Networks, pp. 545-550, 1990.
[23] V. Chandrasekaran and Z.-Q. Liu, “Topology constraint free fuzzy gated neural networks for pattern recognition,” IEEE Transactions on Neural Network, vol. 9, no. 3, pp. 483-502, 1998.
[24] S.-W. Lee and H.-H. Song, “A new recurrent neural-network architecture for visual pattern recognition,” IEEE Transactions on Neural Network, vol. 8, no. 2, pp. 331-340, 1997.
[25] W. S. McCulloch and W. H. Pitts, “A logical calculus of the ideas immanent in nervous activity,” Bulletin of Mathematical Biophysics, vol. 5, pp.115-133, 1943.
[26] D. O. Hebb, The Organization of Behavior, Wiley, 1949.
[27] F. Rosenblatt, “The Perceptron: A probabilistic model for information storage and organization in the brain,” Psychological Review, vol. 65, pp. 386-408, 1958.
[28] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning internal representations by error propagation,” in Parallel Distributed Processing: Explorations in the Microstructure of Cognition, MIT Press, Cambridge, MA, vol. 1, pp. 318-362, 1986.
[29] B. Hao and X. Dai, “The collision-free motion of robot with fuzzy neural network,” in Proceedings of 2nd International Conference on Industrial and Information Systems, vol. 2, pp. 219-222, 2010.
[30] L. Zhao, W. Pawlus, H. R. Karimi, and K. G. Robbersmyr, “Data-based modeling of vehicle crash using adaptive neural-fuzzy inference system,” IEEE/ASME Transactions on Mechatronics, vol. 19, no. 2, pp. 684-696, 2014.
[31] G. Yahui, and D. Xianzhong, “Kinematic cooperation analysis and trajectory teaching in multiple robots system for welding,” in Proceedings of IEEE Conference Emerging Technologies & Factory Automation (ETFA), pp. 1-8, 2011.
[32] M. Namvar, and A. Farhad, “Adaptive force-motion control of coordinated robots interacting with geometrically unknown environments,” IEEE Transactions on Robotics, vol. 21, no. 4, pp. 678-694, 2005.
[33] N. Vahrenkamp, E. Kuhn, T. Asfour, and R. Dillmann, “Planning multi-robot grasping motions,” in Proceedings of IEEE-RAS International Conference on Humanoid Robots (Humanoids), pp. 593-600, 2010.
[34] T. Liu and M. Q.-H. Meng, “Study on cooperation between humanoid robot Nao and Barrett WAM,” in Proceedings of IEEE International Conference on Robotics and Biomimetics (ROBIO), pp. 980-983, 2012.
[35] Y. Wang and C.W. de Silva, “Sequential Q-Learning with Kalman filtering for multirobot cooperative transportation,” IEEE/ASME Transactions on Mechatronics, vol. 15, no. 2, pp. 261-268, 2010.
[36] Y. Cai and S. X. Yang, “Fuzzy logic-based multi-robot cooperation for object-pushing,” in Proceedings of IEEE International Conference on Information and Automation (ICIA), pp. 273-278, 2011.
[37] Y. Mohan and S. G. Ponnambalam, “An extensive review of research in swarm robotics,” in Proceedings of World Congress on Nature & Biologically Inspired Computing, pp. 140-145, 2009.
[38] Y. Khaluf, E. Mathews, and F. J. Rammig, “Self-organized cooperation in swarm robotics,” in Proceedings of IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops (ISORCW), pp. 217-226, 2011.
[39] B. Gerardo and W. Jing, “Swarm intelligence,” in Proceedings of the Seventh Annual Meeting, Tokyo, Japan, pp.425-428, 1989.
[40] F. De Rango and N. Palmieri, “A swarm-based robot team coordination protocol for mine detection and unknown space discovery,” in Proceedings of International Wireless Communications and Mobile Computing Conference (IWCMC), pp. 703- 708, 2012.
[41] S. Chattunyakit, T. Kondo, I. Nilkhamhang, T. Phatrapornnant, and I. Kumazawa, “Two foraging algorithms for a limited number of swarm robots,” in Proceedings of SICE Annual Conference (SICE), pp. 1056-1061, 2013.
[42] L. Vig and J. A. Adams, “Multi-robot coalition formation,” IEEE Transactions on Robotics, vol. 22, no. 4, pp. 637-649, 2006.
[43] T. Fong, C. Thorpe, and C. Baur, “Multi-robot remote driving with collaborative control,” IEEE Transactions on Industrial Electronics systems, vol. 50, no. 4, pp. 699-704, 2003.
[44] W. Burgard, M. Moors, C. Stachniss, and F.E. Schneider, “Coordinated multi-robot exploration,” IEEE Transactions on Robotics, vol. 21, no. 3, pp. 376-386, 2005.
[45] L. Riazuelo, J. Civera, and J. M. M. Montiel, “C TAM: A Cloud framework for cooperative tracking and mapping,” Robotics and Autonomous Systems, vol.62, no. 4, pp.401-413, 2014.
[46] M. Tenorth, A. C. Perzylo, R. Lafrenz and M. Beetz, “Representation and exchange of knowledge about actions, objects, and environments in the RoboEarth framework,” IEEE Transactions on Automation Science and Engineering, vol. 10, no. 3, pp. 643-651, 2013.
[47] M. Waibel, M. Beetz, J. Civera, R. D'Andrea, J. Elfring, et al, “RoboEarth,” IEEE Robotics & Automation Magazine, vol. 18, no. 2, pp. 69-82, June 2011
[48] L. Riazuelo, M. Tenorth, D. Di Marco, M. Salas, D. Galvez-Lopez, L. Mosenlechner, L. Kunze, M. Beetz, J.D. Tardos, L. Montano, J.M. Martinez Montiel, “RoboEarth Semnatic Mapping: A cloud enabled knowledge-based approach,” IEEE Transactions on Automation Science and Engineering (TASE): Special Issue on Cloud Robotics and Automation, vol. 12, no, 2, pp. 1-12, 2015.
[49] L. Kunze, T. Roehm, and M. Beetz, “Towards semantic robot description languages,” in Proceedings of 2011 IEEE International Conference on Robotics and Automation (ICRA), 2011.
[50] D. Hearn, M. P. Baker, Computer Graphics. Prentice Hall International, ISBN 0-13-165598-1, pp. 302-205, 1986
[51] R. Gonzalez and R. E. Woods, Digital Image Processing, 2nd ed. Prentice Hall Press, ISBN 0-201-18075-8, p. 295.
[52] J. Denavit and R. S. Hartenberg, “A kinematic notation for lower-pair mechanisms based on matrices,” ASME J. Appl. Mech, vol. 23, pp. 215–221, 1955.
[53] R. S. Hartenberg and J. Denavit, Kinematic synthesis of linkages. McGraw-Hill series in mechanical engineering. New York: McGraw-Hill. p. 435, 1965.
校內:2020-08-21公開