Heuristic Q-learning based on experience replay for three-dimensional path planning of the unmanned aerial vehicle.

Xie, Ronglei; Meng, Zhijun; Zhou, Yaoming; Ma, Yunpeng; Wu, Zhe

Science progress 2019 pp. 36850419879024

Abstract

Existing reinforcement learning algorithms converge with difficulty in three-dimensional path planning for unmanned aerial vehicles because the state space is very large. To address this, this article proposes a reinforcement learning algorithm that combines a heuristic function with an experience replay mechanism based on the maximum average reward. Knowledge of track performance is used to construct the heuristic function, which guides the unmanned aerial vehicle's action selection and reduces useless exploration. The experience replay mechanism based on the maximum average reward increases the utilization of high-quality samples and speeds up convergence. Simulation results show that the proposed three-dimensional path planning algorithm learns efficiently, with significantly improved convergence speed and training performance.
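The two mechanisms named in the abstract can be illustrated with a small sketch. The abstract gives no formulas, so everything concrete here is an assumption made for illustration and not the authors' implementation: the obstacle-free 5x5x5 grid, the six-move action set, the 0/1 reward, the Manhattan-distance heuristic, the weight `xi`, and the rule of keeping the `keep` episodes with the highest average reward and replaying them in reverse order. Action selection is epsilon-greedy over Q(s, a) + xi * H(s, a), while the final path is read from the learned Q-table alone.

```python
import random
from collections import defaultdict

# Six axis-aligned unit moves on a 3D grid (assumed action set).
ACTIONS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def step(state, action, goal, size):
    """Toy environment (assumption): move one cell, clipped to the grid; reward 1 only at the goal."""
    nxt = tuple(min(max(s + a, 0), size - 1) for s, a in zip(state, action))
    done = nxt == goal
    return nxt, (1.0 if done else 0.0), done


def heuristic(state, action, goal):
    """Assumed heuristic: +1 if the move shortens the Manhattan distance to the goal, else -1."""
    before = sum(abs(g - s) for g, s in zip(goal, state))
    after = sum(abs(g - s - a) for g, s, a in zip(goal, state, action))
    return float(before - after)


def select_action(q, state, goal, epsilon, xi, rng):
    """Epsilon-greedy over Q(s, a) + xi * H(s, a); the heuristic biases exploration."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    scores = [q[(state, i)] + xi * heuristic(state, a, goal) for i, a in enumerate(ACTIONS)]
    return scores.index(max(scores))


def q_update(q, s, a, r, s2, done, alpha, gamma):
    """Standard one-step Q-learning update."""
    best_next = 0.0 if done else max(q[(s2, i)] for i in range(len(ACTIONS)))
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def train(size=5, start=(0, 0, 0), goal=(4, 4, 4), episodes=200,
          alpha=0.5, gamma=0.95, epsilon=0.1, xi=1.0, keep=5, seed=0):
    rng = random.Random(seed)
    q = defaultdict(float)
    elite = []  # (average_reward, transitions), highest average first
    for _ in range(episodes):
        state, transitions = start, []
        for _ in range(10 * size ** 3):  # step cap per episode
            a = select_action(q, state, goal, epsilon, xi, rng)
            nxt, r, done = step(state, ACTIONS[a], goal, size)
            q_update(q, state, a, r, nxt, done, alpha, gamma)
            transitions.append((state, a, r, nxt, done))
            state = nxt
            if done:
                break
        # Rank the episode by its average reward and keep only the best few.
        avg = sum(t[2] for t in transitions) / len(transitions)
        elite.append((avg, transitions))
        elite.sort(key=lambda e: e[0], reverse=True)
        del elite[keep:]
        # Replay the retained episodes backwards so goal reward propagates quickly.
        for _, episode in elite:
            for s, a, r, s2, d in reversed(episode):
                q_update(q, s, a, r, s2, d, alpha, gamma)
    return q, elite


def greedy_path(q, start, goal, size):
    """Follow argmax Q (no heuristic) from start until the goal or a step limit."""
    path, state = [start], start
    for _ in range(size ** 3):
        a = max(range(len(ACTIONS)), key=lambda i: q[(state, i)])
        state, _, done = step(state, ACTIONS[a], goal, size)
        path.append(state)
        if done:
            break
    return path
```

Calling `q, elite = train()` and then `greedy_path(q, (0, 0, 0), (4, 4, 4), 5)` returns a list of grid cells from the start to the goal. With the 0/1 reward assumed here, an episode's average reward is the reciprocal of its length, so the retained episodes are simply the shortest successful ones; a different reward design would change which samples count as "excellent".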

Keywords

Q-learning; path planning; unmanned aerial vehicle; experience replay; heuristic information

Access

DOI:

10.1177/0036850419879024

Citation

ID: 96681

Ref Key: xie2019heuristicscience
