ROBOT LEARNING OF OBJECT MANIPULATION TASK ACTIONS FROM HUMAN DEMONSTRATIONS

Maria Kyrarini; Muhammad Abdul Haseeb; Danijela Ristić-Durrant; Axel Gräser

doi:10.22190/FUME170515010K

First page

217

Last page

229

Abstract

Robot learning from demonstration is a method that enables robots to learn in a way similar to humans. This paper presents a framework that enables robots to learn from multiple human demonstrations via kinesthetic teaching. The robot learns both the high-level sequence of actions and the low-level trajectories it must follow to perform the object manipulation task. Multiple human demonstrations are recorded, and only the most similar demonstrations are selected for robot learning. The high-level learning module identifies the sequence of actions of the demonstrated task. A model of the demonstrated trajectories is learned using Dynamic Time Warping (DTW) and a Gaussian Mixture Model (GMM). The learned trajectory is then generated from the learned GMM by Gaussian Mixture Regression (GMR). In the online working phase, the sequence of actions is identified, and experimental results show that the robot performs the learned task successfully.

Keywords

Robot Learning by Demonstration, Dynamic Time Warping, Gaussian Mixture Model, Gaussian Mixture Regression, Sequence of Actions
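
The abstract states that only the most similar demonstrations are kept and that DTW is used when modelling trajectories. The following is a minimal sketch of how DTW could serve as a similarity measure for that selection step. It is not the authors' implementation: the abstract does not give the selection criterion, so ranking by summed pairwise DTW distance is an assumption, and the names `dtw_distance` and `select_most_similar` are hypothetical.

```python
import math


def dtw_distance(a, b):
    """Classic dynamic-programming DTW cost between two point sequences.

    a, b: lists of equal-dimension tuples, e.g. (x, y, z) end-effector positions.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean local distance
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def select_most_similar(demos, keep):
    """Return sorted indices of the `keep` demonstrations with the smallest
    summed DTW distance to all other demonstrations (assumed criterion)."""
    totals = []
    for i, demo_i in enumerate(demos):
        total = sum(
            dtw_distance(demo_i, demo_j)
            for j, demo_j in enumerate(demos)
            if j != i
        )
        totals.append((total, i))
    return sorted(index for _, index in sorted(totals)[:keep])
```

Because DTW allows one sequence to pause while the other advances, two demonstrations that follow the same path at different speeds get a low distance, while a demonstration that follows a different path is ranked last and can be discarded.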

ISSN: 0354-2025 (Print)

ISSN: 2335-0164 (Online)

COBISS.SR-ID 98732551

ZDB-ID: 2766459-4
