Capsule networks, a novel deep learning architecture proposed by Hinton, introduce the idea of encoding the instantiation parameters of the entities into groups of neurons, called capsules. These capsules are particularly suitable for handling different types of visual stimuli and coding features like pose (position, size, orientation), deformation, speed, etc. It is assumed that the brain has a mechanism for ¿routing¿ low-level visual information to the capsules best suited to manag- ing that data. Capsule networks aim to overcome the limitations of convolutional networks in object recognition. The present work aims to understand these limitations and to test the robustness of capsule networks to affine transformations and to novel viewpoints. This is achieved by implementing a PyTorch framework used to build deeper capsule networks and to conduct more experiments on the task of image classification on datasets more complex than MNIST. Despite their promising results, capsule networks require a large computational cost in terms of speed computation and memory usage. Further research in their optimization is probably needed for them to be a good replacement of convolutional networks for industrial applications.

Uno studio esplorativo sulle capsule networks e su come renderle più profonde

RENZULLI, RICCARDO
2017/2018

Abstract

Capsule networks, a novel deep learning architecture proposed by Hinton, introduce the idea of encoding the instantiation parameters of the entities into groups of neurons, called capsules. These capsules are particularly suitable for handling different types of visual stimuli and coding features like pose (position, size, orientation), deformation, speed, etc. It is assumed that the brain has a mechanism for ¿routing¿ low-level visual information to the capsules best suited to manag- ing that data. Capsule networks aim to overcome the limitations of convolutional networks in object recognition. The present work aims to understand these limitations and to test the robustness of capsule networks to affine transformations and to novel viewpoints. This is achieved by implementing a PyTorch framework used to build deeper capsule networks and to conduct more experiments on the task of image classification on datasets more complex than MNIST. Despite their promising results, capsule networks require a large computational cost in terms of speed computation and memory usage. Further research in their optimization is probably needed for them to be a good replacement of convolutional networks for industrial applications.
ENG
IMPORT DA TESIONLINE
File in questo prodotto:
File Dimensione Formato  
762661_riccardorenzullimasterthesis.pdf

non disponibili

Tipologia: Altro materiale allegato
Dimensione 4.67 MB
Formato Adobe PDF
4.67 MB Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14240/95549