Rilevamento di Anomalie in Serie Temporali usando le Shapelets

In time series classification literature, shapelets are subsequences able to discriminate the different classes. It has been shown that classifiers achieve high accuracy when taking in input the distances from the time series and a properly chosen set of shapelets. Additionally, these subsequences can be easily compared visually to input signals: this characteristic makes such algorithms naturally interpretable. More recent methods propose to parametrize the shapelets instead of searching them through all the possible subsequences, which is computationally expensive. The optimal shapelets are learned through the minimisation of a loss function. This approach typically sacrifices their interpretability in favour of a higher accuracy. This thesis has been carried on in collaboration with Addfor Industriale S.r.l. towards the goal of developing shapelets-based algorithms for unsupervised anomaly detection in univariate and multivariate time series datasets. We propose and analyse three algorithms and compare their results on five benchmark datasets, focusing on the difference between searching and learning approaches. The experiments confirm the main advantages of shapelets methods, which work well when recurring short patterns distinguish the shape of normal and anomalous time series. In particular, the proposed shapelets-learning algorithm manages to learn different discriminating shapelets, while maintaining their interpretable characteristic.

Rilevamento di Anomalie in Serie Temporali usando le Shapelets

BARTOLI, LUDOVICO

2021/2022

Abstract

In time series classification literature, shapelets are subsequences able to discriminate the different classes. It has been shown that classifiers achieve high accuracy when taking in input the distances from the time series and a properly chosen set of shapelets. Additionally, these subsequences can be easily compared visually to input signals: this characteristic makes such algorithms naturally interpretable. More recent methods propose to parametrize the shapelets instead of searching them through all the possible subsequences, which is computationally expensive. The optimal shapelets are learned through the minimisation of a loss function. This approach typically sacrifices their interpretability in favour of a higher accuracy. This thesis has been carried on in collaboration with Addfor Industriale S.r.l. towards the goal of developing shapelets-based algorithms for unsupervised anomaly detection in univariate and multivariate time series datasets. We propose and analyse three algorithms and compare their results on five benchmark datasets, focusing on the difference between searching and learning approaches. The experiments confirm the main advantages of shapelets methods, which work well when recurring short patterns distinguish the shape of normal and anomalous time series. In particular, the proposed shapelets-learning algorithm manages to learn different discriminating shapelets, while maintaining their interpretable characteristic.

Scheda breve

	Facoltà/Dipartimento
	
				MATEMATICA "GIUSEPPE PEANO"
			
	Corso di studio
	
				STOCHASTICS AND DATA SCIENCE
			
	Lingua
	
				ENG
			
	Relatrice / Relatore
	
				CORDERO, Elena
			
	Modalità consultazione tesi
	
				IMPORT DA TESIONLINE
			
	Appare nelle tipologie:
	
				Corso di Laurea Magistrale

File in questo prodotto:

File	Dimensione	Formato
962720A_shapelets_code_python.zip non disponibili Tipologia: Altro materiale allegato Dimensione 71.97 MB Formato Unknown	71.97 MB	Unknown
962720_thesis_sds_bartoli_october2022.pdf non disponibili Tipologia: Altro materiale allegato Dimensione 3.24 MB Formato Adobe PDF	3.24 MB	Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14240/84290