VQC-Based Reinforcement Learning with Data Re-uploading: Performance and Trainability

Coelho, Rodrigo; Sequeira, André; Santos, Luís Paulo

doi:10.1007/s42484-024-00190-z

Quantum Physics

arXiv:2401.11555v2 (quant-ph)

[Submitted on 21 Jan 2024 (v1), last revised 12 Nov 2024 (this version, v2)]

Title:VQC-Based Reinforcement Learning with Data Re-uploading: Performance and Trainability

Authors:Rodrigo Coelho, André Sequeira, Luís Paulo Santos

View PDF HTML (experimental)

Abstract:Reinforcement Learning (RL) consists of designing agents that make intelligent decisions without human supervision. When used alongside function approximators such as Neural Networks (NNs), RL is capable of solving extremely complex problems. Deep Q-Learning, a RL algorithm that uses Deep NNs, achieved super-human performance in some specific tasks. Nonetheless, it is also possible to use Variational Quantum Circuits (VQCs) as function approximators in RL algorithms. This work empirically studies the performance and trainability of such VQC-based Deep Q-Learning models in classic control benchmark environments. More specifically, we research how data re-uploading affects both these metrics. We show that the magnitude and the variance of the gradients of these models remain substantial throughout training due to the moving targets of Deep Q-Learning. Moreover, we empirically show that increasing the number of qubits does not lead to an exponential vanishing behavior of the magnitude and variance of the gradients for a PQC approximating a 2-design, unlike what was expected due to the Barren Plateau Phenomenon. This hints at the possibility of VQCs being specially adequate for being used as function approximators in such a context.

Comments:	26 pages, 11 figures
Subjects:	Quantum Physics (quant-ph); Machine Learning (cs.LG)
Cite as:	arXiv:2401.11555 [quant-ph]
	(or arXiv:2401.11555v2 [quant-ph] for this version)
	https://doi.org/10.48550/arXiv.2401.11555
Journal reference:	Quantum Mach. Intell. 6, 53 (2024)
Related DOI:	https://doi.org/10.1007/s42484-024-00190-z

Submission history

From: Rodrigo Coelho [view email]
[v1] Sun, 21 Jan 2024 18:00:15 UTC (4,207 KB)
[v2] Tue, 12 Nov 2024 18:18:43 UTC (14,686 KB)

Quantum Physics

Title:VQC-Based Reinforcement Learning with Data Re-uploading: Performance and Trainability

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantum Physics

Title:VQC-Based Reinforcement Learning with Data Re-uploading: Performance and Trainability

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators