Narrowing the Number of Training Cases in Genetic Programming

ZOPPI, GIACOMO
2018/2019

Abstract

A central question in Machine Learning is how many training cases are needed to train a model satisfactorily, that is, to obtain a model that is neither underfitted nor overfitted and that generalizes well. The question becomes even more important in Genetic Programming, where fitness evaluation is a major contributor to the running time: finding the minimum number of training cases that still reveals the underlying structure of the problem could therefore bring considerable benefits. In this thesis we use a concept borrowed from Statistics and Information Theory, the entropy of the target function in symbolic regression problems, to develop a possible problem-independent solution. We present several examples, both numerical and non-numerical, to show that the simulations confirm our theoretical results.

Documents in UNITESI are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14240/103033