Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks

Hu, Ye; Chen, Mingzhe; Saad, Walid; Poor, H. Vincent; Cui, Shuguang

Computer Science > Machine Learning

arXiv:2005.12394 (cs)

[Submitted on 25 May 2020]

Title:Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks

Authors:Ye Hu, Mingzhe Chen, Walid Saad, H. Vincent Poor, Shuguang Cui

View PDF

Abstract:In this paper, the design of an optimal trajectory for an energy-constrained drone operating in dynamic network environments is studied. In the considered model, a drone base station (DBS) is dispatched to provide uplink connectivity to ground users whose demand is dynamic and unpredictable. In this case, the DBS's trajectory must be adaptively adjusted to satisfy the dynamic user access requests. To this end, a meta-learning algorithm is proposed in order to adapt the DBS's trajectory when it encounters novel environments, by tuning a reinforcement learning (RL) solution. The meta-learning algorithm provides a solution that adapts the DBS in novel environments quickly based on limited former experiences. The meta-tuned RL is shown to yield a faster convergence to the optimal coverage in unseen environments with a considerably low computation complexity, compared to the baseline policy gradient algorithm. Simulation results show that, the proposed meta-learning solution yields a 25% improvement in the convergence speed, and about 10% improvement in the DBS' communication performance, compared to a baseline policy gradient algorithm. Meanwhile, the probability that the DBS serves over 50% of user requests increases about 27%, compared to the baseline policy gradient algorithm.

Comments:	6 pages, Fig.4
Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Networking and Internet Architecture (cs.NI); Machine Learning (stat.ML)
Cite as:	arXiv:2005.12394 [cs.LG]
	(or arXiv:2005.12394v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2005.12394

Submission history

From: Ye Hu [view email]
[v1] Mon, 25 May 2020 20:43:59 UTC (4,992 KB)

Computer Science > Machine Learning

Title:Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators