Prospetti sull'approcio della computazione quantistica all'apprendimento per rinforzo

Reinforcement learning is one of three main techniques that allows a model to learn, notably focusing on creating an optimal agent able to reach an objective by interacting with an environment. This thesis tries to analyze the possible potentials and advantages that would derive from using quantum circuits with neural networks. To examine and explain how it is possible to create hybrid algorithms that exploit the improvements of the classical and quantum algorithms. The Cartpole environment tested uses the Deep Q-Network algorithm and is compared with the quantum version to see what kind of advantage is present. After demonstrating the quantum advantage on the Cartpole, a more com?plex environment with an industrial application, the robotic arm, is tested using another kind of algorithm called Soft-Actor Critic. Differently from the DQN version, it requires multiple components with different purposes increasing the possible configurations that needs to be tested. For this reason, multiple configurations were run, such as one where only a single component has the quantum variation and one with all components quantum variated. Finally, confronting the quantum variation and all "classical" models, a clear advantage can be extrapolated, showing possible future applications in the industry and other fields.

Prospetti sull'approcio della computazione quantistica all'apprendimento per rinforzo

CONTERNO, MATTEO

2021/2022

Abstract

Reinforcement learning is one of three main techniques that allows a model to learn, notably focusing on creating an optimal agent able to reach an objective by interacting with an environment. This thesis tries to analyze the possible potentials and advantages that would derive from using quantum circuits with neural networks. To examine and explain how it is possible to create hybrid algorithms that exploit the improvements of the classical and quantum algorithms. The Cartpole environment tested uses the Deep Q-Network algorithm and is compared with the quantum version to see what kind of advantage is present. After demonstrating the quantum advantage on the Cartpole, a more com?plex environment with an industrial application, the robotic arm, is tested using another kind of algorithm called Soft-Actor Critic. Differently from the DQN version, it requires multiple components with different purposes increasing the possible configurations that needs to be tested. For this reason, multiple configurations were run, such as one where only a single component has the quantum variation and one with all components quantum variated. Finally, confronting the quantum variation and all "classical" models, a clear advantage can be extrapolated, showing possible future applications in the industry and other fields.

Scheda breve

	Facoltà/Dipartimento
	
				FISICA
			
	Corso di studio
	
				FISICA DEI SISTEMI COMPLESSI
			
	Lingua
	
				ENG
			
	Relatrice / Relatore
	
				CARLINI, Alberto
CASTELLANI, Leonardo
			
	Modalità consultazione tesi
	
				IMPORT DA TESIONLINE
			
	Appare nelle tipologie:
	
				Corso di Laurea Magistrale

File in questo prodotto:

File	Dimensione	Formato
834166_document.pdf non disponibili Tipologia: Altro materiale allegato Dimensione 2.44 MB Formato Adobe PDF	2.44 MB	Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14240/86800