Skip to main content
Log in

Self-supervised damage-avoiding manipulation strategy optimization via mental simulation

  • Original Research Paper
  • Published:
Intelligent Service Robotics Aims and scope Submit manuscript

Abstract

Everyday robotics are challenged to deal with autonomous product handling in applications such as logistics or retail, possibly causing damage to the items during manipulation. Traditionally, most approaches try to minimize physical interaction with goods. However, this paper proposes to take into account any unintended object motion and to learn damage-minimizing manipulation strategies in a self-supervised way. The presented approach consists of a simulation-based planning method for an optimal manipulation sequence with respect to possible damage. The planned manipulation sequences are generalized to new, unseen scenes in the same application scenario using machine learning. This learned manipulation strategy is continuously refined in a self-supervised, simulation-in-the-loop optimization cycle during load-free times of the system, commonly known as mental simulation. In parallel, the generated manipulation strategies can be deployed in near-real time in an anytime fashion. The approach is validated on an industrial container-unloading scenario and on a retail shelf-replenishment scenario.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Notes

  1. http://roblog.eu.

  2. http://gazebosim.org.

  3. http://ompl.kavrakilab.org/.

  4. https://github.com/Rasoul77/promts.

  5. The initial training set size should depend on the maximum tolerable time until the classifier is required for the first time.

  6. Convergence of the optimization method strongly depends on the concrete implementation and application scenario and is hard to define generically as explained in the evaluation section.

  7. https://docker.com.

  8. https://github.com/jacobs-robotics/gazebo-mental-simulation.

  9. https://tobias.doernba.ch/research/datasets/mental-simulation.

References

  1. Stoyanov T, Vaskevicius N, Mueller CA, Fromm T, Krug R, Tincani V, Mojtahedzadeh R, Kunaschk S, Mortensen Ernits R, Ricao Canelhas D, Bonilla M, Schwertfeger S, Bonini M, Halfar H, Pathak K, Rohde M, Fantoni G, Bicchi A, Birk A, Lilienthal A, Echelmeyer W (2016) No more heavy lifting: robotic solutions to the container unloading problem. Robot Autom Mag 23(4):94–106

    Article  Google Scholar 

  2. Stilman M, Schamburek JU, Kuffner J, Asfour T (2007) Manipulation planning among movable obstacles. In: International conference on robotics and automation

  3. Katz D, Venkatraman A, Kazemi M, Bagnell JA, Stentz A (2013) Perceiving, learning, and exploiting object affordances for autonomous pile manipulation. In: Robotics: science and systems

  4. Eppner C, Hofer S, Jonschkowski R, Martin-Martin R, Sieverling A, Wall V, Brock O (2016) Lessons from the Amazon picking challenge: Four aspects of building robotic systems. In: Robotics: science and systems

  5. Correll N, Bekris K, Berenson D, Brock O, Causo A, Hauser K, Okada K, Rodriguez A, Romano J, Wurman P (2018) Analysis and observations from the first amazon picking challenge. Trans Autom Sci Eng 15(1):172–188

    Article  Google Scholar 

  6. Pavlichenko D, Martin Garcia G, Koo S, Behnke S (2018) KittingBot: A mobile manipulation robot for collaborative kitting in automotive logistics. In: International conference on intelligent autonomous systems

  7. Kahneman D, Tversky A (1981) The simulation heuristic. Stanford university, Department of psychology, Technical report

  8. Battaglia P, Hamrick J, Tenenbaum J (2013) Simulation as an engine of physical scene understanding. Proc Natl Acad Sci USA 110(45):18,327-32

    Article  Google Scholar 

  9. Kunze L, Beetz M (2017) Envisioning the qualitative effects of robot manipulation actions using simulation-based projections. Artif Intell 247:352–380

    Article  MathSciNet  Google Scholar 

  10. Akbari A, Gillani M, Rosell J (2015) Task and motion planning using physics-based reasoning. In: International conference on emerging technologies and factory automation

  11. Weitnauer E, Haschke R, Ritter H (2010) Evaluating a physics engine as an ingredient for physical reasoning. In: International conference on simulation, modeling, and programming for autonomous robots

    Chapter  Google Scholar 

  12. Hangl S, Ugur E, Szedmak S, Piater J. (2016) Robotic playing for hierarchical complex skill learning. In: International conference on intelligent robots and systems

  13. Bozcuoglu AK, Beetz M (2017) A cloud service for robotic mental simulations. In: International conference on robotics and automation

  14. Haidu A, Beetz M (2016) Action recognition and interpretation from virtual demonstrations. In: International conference on intelligent robots and systems, pp 2833–2838

  15. Levine S, Pastor P, Krizhevsky A, Quillen D (2016) Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. In: International symposium on experimental robotics

  16. Fromm T, Birk A (2016) Physics-based damage-aware manipulation strategy planning using scene dynamics anticipation. In: International conference on intelligent robots and systems

  17. Fromm T, Mueller CA, Pfingsthorn M, Birk A, Di Lillo P (2017) Efficient continuous system integration and validation for deep-sea robotics applications. In: Oceans

  18. Winkler J, Balint-Benczedi F, Wiedemeyer T, Beetz M, Vaskevicius N, Mueller CA, Fromm T, Birk A (2016) Knowledge-enabled robotic agents for shelf replenishment in cluttered retail environments. In: International conference on autonomous agents and multiagent systems

  19. Vaskevicius N, Mueller CA, Bonilla M, Tincani V, Stoyanov T, Fantoni G, Pathak K, Lilienthal A, Bicchi A, Birk A (2014) Object recognition and localization for robust grasping with a dexterous gripper in the context of container unloading. In: International conference on automation science and engineering

  20. Mueller CA, Pathak K, Birk A (2014) Object shape categorization in RGBD images using hierarchical graph constellation models based on unsupervisedly learned shape parts described by a set of shape specificity levels. In: International conference on intelligent robots and systems

  21. Vaskevicius N, Pathak K, Ichim A, Birk A (2012) The Jacobs Robotics approach to object recognition and localization in the context of the ICRA’11 solutions in perception challenge. In: International conference on robotics and automation

  22. Koenig N, Howard A (2004) Design and use paradigms for gazebo, an open-source multi-robot simulator. In: International conference on intelligent robots and systems

  23. Sucan I, Moll M, Kavraki L (2012) The open motion planning library. Robot Autom Mag 19(4):72–82

    Article  Google Scholar 

  24. Mojtahedzadeh R, Lilienthal A (2015) A principle of minimum translation search approach for object pose refinement. In: International conference on intelligent robots and systems

  25. Okada K, Haneda A, Nakai H, Inaba M, Inoue H (2004) Environment manipulation planner for humanoid robots using task graph that generates action sequence. In: International conference on intelligent robots and systems

  26. Kitaev N, Mordatch I, Patil S, Abbeel P (2015) Physics-based trajectory optimization for grasping in cluttered environments. In: International conference of robotics and automation

  27. Dogar M, Hsiao K, Ciocarlie M, Srinivasa S (2012) Physics-based grasp planning through clutter. In: Robotics: science and systems

  28. Goldberg K (1990) Stochastic plans for robotic manipulation. Ph.D. thesis

  29. Taeubig H, Baeuml B, Frese U (2011) Real-time swept volume and distance computation for self collision detection. In: International conference on intelligent robots and systems

  30. von Dziegielewski A, Hemmer M, Schoemer E (2015) High precision conservative surface mesh generation for swept volumes. Trans Autom Sci Eng 12(1):764–769

    Google Scholar 

  31. Abdel-Malek K, Yang J, Blackmore D, Joy K (2006) Swept volumes: fundation [sic], perspectives, and applications. Int J Shape Model 12(1):87–127

    Article  Google Scholar 

  32. Mojtahedzadeh R, Bouguerra A, Schaffernicht E, Lilienthal A (2015) Support relation analysis and decision making for safe robotic manipulation tasks. Robot Auton Syst 71:99–117

    Article  Google Scholar 

  33. Rockel S, Konecny S, Stock S, Hertzberg J, Pecora F, Zhang J (2015) Integrating physics-based prediction with semantic plan execution monitoring. In: International conference on intelligent robots and systems

  34. Pastor P, Kalakrishnan M, Chitta S, Theodorou E, Schaal S (2011) Skill learning and task outcome prediction for manipulation. In: International conference on robotics and automation

  35. Panda S, Hafez A, Jawahar C (2013) Learning support order for manipulation in clutter. In: International conference on intelligent robots and systems

  36. Sjöö K, Jensfelt P (2011) Learning spatial relations from functional simulation. In: International conference on intelligent robots and systems

  37. Fischinger D, Weiss A, Vincze M (2015) Learning grasps with topographic features. Int J Robot Res 34(9):1167–1194

    Article  Google Scholar 

  38. Kappler D, Bohg J, Schaal S (2015) Leveraging big data for grasp planning. In: International conference on robotics and automation, pp 4304–4311

  39. Li W, Leonardis A, Fritz M (2017) Visual stability prediction for robotic manipulation. In: International conference on robotics and automation, pp 2606–2613

  40. Mottaghi R, Rastegari M, Gupta A, Farhadi A (2016) What happens if... learning to predict the effect of forces in images. In: European conference on computer vision

  41. de Sa CR, Soares C, Knobbe A, Cortez P (2017) Label ranking forests. Expert Syst 34(1):1–8

    Google Scholar 

  42. Cheng W, Huehn J, Huellermeier E (2009) Decision tree and instance-based learning for label ranking. In: International conference on machine learning

  43. Zhou Y, Liu Y, Gao XZ, Qiu G (2014) A label ranking method based on Gaussian mixture model. Knowl Based Syst 72:108–113

    Article  Google Scholar 

  44. Grbovic M, Djuric N, Vucetic S (2012) Learning from pairwise preference data using Gaussian mixture model. In: Preference learning workshop, European conference on artificial intelligence

  45. Huellermeier E, Fuernkranz J, Cheng W, Brinker K (2008) Label ranking by learning pairwise preferences. Artif Intell 172(16–17):1897–1916

    Article  MathSciNet  Google Scholar 

  46. Huellermeier E, Fuernkranz J (2004) Ranking by pairwise comparison: a note on risk minimization. In: International conference on fuzzy systems

  47. Joachims T (2002) Optimizing search engines using clickthrough data. In: Conference on knowledge discovery and data mining

  48. Sutton C, McCallum A (2011) An introduction to conditional random fields. Mach Learn 4(4):267–373

    Article  Google Scholar 

  49. Burges C, Shaked T, Renshaw E, Lazier A, Deeds M, Hamilton N, Hullender G (2005) Learning to rank using gradient descent. In: International conference on machine learning, pp 89–96

  50. Joachims T (2006) Training linear SVMs in linear time. In: Conference on knowledge discovery and data mining

  51. Xing Z, Pei J, Keogh E (2010) A brief survey on sequence classification. ACM SIGKDD Explor Newslett 12(1):40

    Article  Google Scholar 

  52. le Cessie S, van Houwelingen J (1992) Ridge estimators in logistic regression. Appl Stat 41(1):191–201

    Article  Google Scholar 

  53. Regier T, Carlson L (2001) Grounding spatial language in perception: an empirical and computational investigation. J Exp Psychol Gen 130(2):273–298

    Article  Google Scholar 

  54. Kluth T, Burigo M, Knoeferle P (2016) Shifts of attention during spatial language comprehension: a computational investigation. In: International conference on agents and artificial intelligence

  55. Schwarz G (1978) Estimating the dimension of a model. Annal Stat 6(2):461–464

    Article  MathSciNet  Google Scholar 

  56. Vaskevicius N, Pathak K, Birk A (2014) Fitting superquadrics in noisy , partial views from a low-cost RGBD sensor for recognition and localization of sacks in autonomous unloading of shipping containers. In: International conference on automation science and engineering

  57. Weisz J, Allen P (2012) Pose error robust grasping from contact wrench space metrics. In: International conference on robotics and automation

  58. Kendall M (1938) A new measure of rank correlation. Biometrika 30(1/2):81–93

    Article  Google Scholar 

  59. Shalev-Shwartz S, Singer Y (2006) Efficient learning of label ranking by soft projections onto polyhedra. J Mach Learn Res 7:1567–1599

    MathSciNet  MATH  Google Scholar 

  60. Kumar R, Vassilvitskii S (2010) Generalized distances between rankings. In: International conference on world wide web

  61. Merkel D (2014) Docker: lightweight linux containers for consistent development and deployment. Linux J 2014(239):2

    Google Scholar 

  62. Morgan S, Branicky M (2004) Sampling-based planning for discrete spaces. In: International conference on intelligent robots and systems

  63. Reif J (1985) Depth-first search is inherently sequential. Inf Process Lett 20(5):229–234

    Article  MathSciNet  Google Scholar 

  64. Peshkin M, Sanderson A (1987) Planning robotic manipulation strategies for sliding objects. In: International conference on robotics and automation, vol 4, pp 696–701

Download references

Acknowledgements

The research leading to the presented results has received funding from the European Union’s Seventh Framework program (EU FP7 ICT-2) within the project “Cognitive Robot for Automation of Logistics Processes” (RobLog) under grant agreement No. 270350.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tobias Doernbach.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Tobias Doernbach was formerly known as Tobias Fromm.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Doernbach, T. Self-supervised damage-avoiding manipulation strategy optimization via mental simulation. Intel Serv Robotics 12, 333–357 (2019). https://doi.org/10.1007/s11370-019-00286-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11370-019-00286-7

Keywords

Navigation