Abstract
We improve the traditional Q(\( \lambda \))-learning algorithm by adding an obstacle area expansion strategy. The new algorithm, named OAE-Q(\( \lambda \))-learning, is applied to path planning in complex environments. The contributions of OAE-Q(\( \lambda \))-learning are as follows: (1) it expands concave obstacle areas in the environment, so that the agent avoids repeating invalid actions after falling into such an area; (2) it removes the expanded obstacle areas from the state space, which reduces the learning state space and accelerates the convergence of the algorithm. Extensive experimental results validate the effectiveness and feasibility of OAE-Q(\( \lambda \))-learning for path planning in complex environments.
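The abstract does not give the expansion procedure itself, but the idea of folding concave pockets into the obstacle area can be illustrated on a grid map. The sketch below is a plausible reading, not the authors' exact method: a free cell whose 4-neighbourhood is blocked on three or more sides is a dead end the agent could only enter and back out of, so it is marked as an obstacle and thereby removed from the learnable state space; the rule is applied repeatedly until the pocket is filled.

```python
def expand_obstacles(grid, protected=()):
    """Illustrative obstacle-area expansion on a grid map.

    grid: list of lists, 1 = obstacle, 0 = free cell.
    protected: cells (e.g. start/goal) that must never be absorbed.
    Returns an expanded copy; the input grid is left unchanged.
    """
    rows, cols = len(grid), len(grid[0])
    g = [row[:] for row in grid]
    changed = True
    while changed:  # repeat until no concave pocket cell remains
        changed = False
        for r in range(rows):
            for c in range(cols):
                if g[r][c] == 1 or (r, c) in protected:
                    continue
                blocked = 0
                for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    nr, nc = r + dr, c + dc
                    # map borders count as blocked, like walls
                    if not (0 <= nr < rows and 0 <= nc < cols) or g[nr][nc] == 1:
                        blocked += 1
                if blocked >= 3:  # dead-end cell: only one way out
                    g[r][c] = 1
                    changed = True
    return g


# A U-shaped (concave) obstacle: the interior pocket gets absorbed,
# while ordinary free cells along the border stay learnable.
grid = [[0, 0, 0, 0, 0],
        [0, 1, 0, 1, 0],
        [0, 1, 0, 1, 0],
        [0, 1, 1, 1, 0],
        [0, 0, 0, 0, 0]]
expanded = expand_obstacles(grid)
```

After expansion, the two pocket cells inside the U become obstacles, so the agent never enters the trap during learning; states on the open border are untouched.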
Abbreviations
OAE-Q(\( \lambda \))-learning: Q(\( \lambda \))-learning based on the obstacle area expansion strategy
Acknowledgements
The research of this paper is supported by the National Natural Science Foundation of China.
Funding
The authors are partially supported by NSFC (61573285).
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Cite this article
Chen, H., Ji, Y. & Niu, L. Reinforcement learning path planning algorithm based on obstacle area expansion strategy. Intel Serv Robotics 13, 289–297 (2020). https://doi.org/10.1007/s11370-020-00313-y