Advances in error estimation and multi-dimensional supervised classification

  1. RODRIGUEZ FERNANDEZ, JUAN DIEGO
Supervised by:
  1. Aritz Pérez Martínez Supervisor
  2. José Antonio Lozano Alonso Supervisor

University of defense: Universidad del País Vasco - Euskal Herriko Unibertsitatea

Date of defense: 24 May 2013

Committee:
  1. José Antonio Gámez Martín Chair
  2. Borja Calvo Molinos Secretary
  3. Petri Myllymäki Member
  4. José Manuel Peña Palomar Member
  5. Amparo Alonso Betanzos Member

Type: Thesis

Teseo: 116039 DIALNET

Abstract

The first contribution is a novel decomposition of the variance of classification error estimators that takes into account their different sources of variance. We analyze the statistical properties (bias and variance) of the most popular error estimators. We present a general framework for analyzing this decomposition in terms of the nature of the variance (reducible/irreducible) and the different sources of sensitivity (internal/external sensitivity). Based on the results of an extensive empirical study, we propose the most appropriate error estimators for model selection under different experimental conditions.

The second contribution is a novel method for learning multi-dimensional Bayesian network classifiers via a multi-objective evolutionary algorithm. The multi-objective strategy treats the accuracy of each class variable as a separate function to optimize. To evaluate the proposed learning approach, this dissertation includes a study that compares it with the main alternatives for multi-dimensional classification.

Finally, a medical application of multi-dimensional Bayesian network classifiers to Multiple Sclerosis is presented. The application aims to help physicians predict the expected progression of the disease and plan the most suitable treatment.
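
As a rough illustration of what it means to optimize the accuracy of each class variable separately, the sketch below keeps only the candidate classifiers that are not Pareto-dominated. It is not the evolutionary algorithm from the thesis: the function names (`dominates`, `pareto_front`), the model names, and the accuracy values are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# One accuracy value per class variable (assumed layout, for illustration).
Accuracies = Tuple[float, ...]


def dominates(a: Accuracies, b: Accuracies) -> bool:
    """True if a is at least as accurate as b on every class variable
    and strictly more accurate on at least one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(candidates: Dict[str, Accuracies]) -> List[str]:
    """Return the names of candidates that no other candidate dominates."""
    return [
        name
        for name, acc in candidates.items()
        if not any(
            dominates(other_acc, acc)
            for other_name, other_acc in candidates.items()
            if other_name != name
        )
    ]


# Hypothetical accuracies on two class variables; not results from the thesis.
candidates = {
    "model_a": (0.80, 0.70),
    "model_b": (0.75, 0.78),
    "model_c": (0.74, 0.69),
    "model_d": (0.80, 0.65),
}

front = pareto_front(candidates)
print(front)
```

Here `model_a` and `model_b` each win on a different class variable, so neither dominates the other and both stay on the front. A multi-objective search would typically return such a set of trade-offs rather than a single best model.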