Reinforcement learning is one of three main techniques that allows a model to learn, notably focusing on creating an optimal agent able to reach an objective by interacting with an environment. This thesis tries to analyze the possible potentials and advantages that would derive from using quantum circuits with neural networks. To examine and explain how it is possible to create hybrid algorithms that exploit the improvements of the classical and quantum algorithms. The Cartpole environment tested uses the Deep Q-Network algorithm and is compared with the quantum version to see what kind of advantage is present. After demonstrating the quantum advantage on the Cartpole, a more com?plex environment with an industrial application, the robotic arm, is tested using another kind of algorithm called Soft-Actor Critic. Differently from the DQN version, it requires multiple components with different purposes increasing the possible configurations that needs to be tested. For this reason, multiple configurations were run, such as one where only a single component has the quantum variation and one with all components quantum variated. Finally, confronting the quantum variation and all "classical" models, a clear advantage can be extrapolated, showing possible future applications in the industry and other fields.

Prospetti sull'approcio della computazione quantistica all'apprendimento per rinforzo

CONTERNO, MATTEO
2021/2022

Abstract

Reinforcement learning is one of three main techniques that allows a model to learn, notably focusing on creating an optimal agent able to reach an objective by interacting with an environment. This thesis tries to analyze the possible potentials and advantages that would derive from using quantum circuits with neural networks. To examine and explain how it is possible to create hybrid algorithms that exploit the improvements of the classical and quantum algorithms. The Cartpole environment tested uses the Deep Q-Network algorithm and is compared with the quantum version to see what kind of advantage is present. After demonstrating the quantum advantage on the Cartpole, a more com?plex environment with an industrial application, the robotic arm, is tested using another kind of algorithm called Soft-Actor Critic. Differently from the DQN version, it requires multiple components with different purposes increasing the possible configurations that needs to be tested. For this reason, multiple configurations were run, such as one where only a single component has the quantum variation and one with all components quantum variated. Finally, confronting the quantum variation and all "classical" models, a clear advantage can be extrapolated, showing possible future applications in the industry and other fields.
ENG
IMPORT DA TESIONLINE
File in questo prodotto:
File Dimensione Formato  
834166_document.pdf

non disponibili

Tipologia: Altro materiale allegato
Dimensione 2.44 MB
Formato Adobe PDF
2.44 MB Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14240/86800