Miguel Angel Galindo Martín
Technical University of Madrid
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Miguel Angel Galindo Martín.
international conference on operations research and enterprise systems | 2017
Miguel Angel Galindo Martín; Antonio Jiménez-Martín; Alfonso Mateos
In this paper, we propose a novel allocation strategy based on possibilistic rewards for the multi-armed bandit problem. First, we use possibilistic reward distributions to model the uncertainty about the expected rewards from the arms, derived from a set of infinite confidence intervals nested around the expected value. They are then converted into probability distributions using a pignistic probability transformation. Finally, a simulation experiment is carried out to find out the one with the highest expected reward, which is then pulled. A parametric probability transformation of the proposed is then introduced together with a dynamic optimization. A numerical study proves that the proposed method outperforms other policies in the literature in five scenarios accounting for Bernoulli, Poisson and exponential distributions for the rewards. The regret analysis of the proposed methods suggests a logarithmic asymptotic convergence for the original possibilistic reward method, whereas a polynomial regret could be associated with the parametric extension and the dynamic optimization.
international conference on operations research and enterprise systems | 2017
Miguel Angel Galindo Martín; Antonio Jiménez-Martín; Alfonso Mateos
Different allocation strategies can be found in the literature to deal with the multi-armed bandit problem under a frequentist view or from a Bayesian perspective. In this paper, we propose a novel allocation strategy, the possibilistic reward method. First, possibilistic reward distributions are used to model the uncertainty about the arm expected rewards, which are then converted into probability distributions using a pignistic probability transformation. Finally, a simulation experiment is carried out to find out the one with the highest expected reward, which is then pulled. A parametric probability transformation of the proposed is then introduced together with a dynamic optimization, which implies that neither previous knowledge nor a simulation of the arm distributions is required. A numerical study proves that the proposed method outperforms other policies in the literature in five scenarios: a Bernoulli distribution with very low success probabilities, with success probabilities close to 0.5 and with success probabilities close to 0.5 and Gaussian rewards; and truncated in [0,10] Poisson and exponential distributions.
Neurocomputing | 2018
Miguel Angel Galindo Martín; Antonio Jiménez-Martín; Alfonso Mateos
Abstract In this paper, we propose a set of allocation strategies to deal with the multi-armed bandit problem, the possibilistic reward (PR) methods. First, we use possibilistic reward distributions to model the uncertainty about the expected rewards from the arm, derived from a set of infinite confidence intervals nested around the expected value. Depending on the inequality used to compute the confidence intervals, there are three possible PR methods with different features. Next, we use a pignistic probability transformation to convert these possibilistic functions into probability distributions following the insufficient reason principle. Finally, Thompson sampling techniques are used to identify the arm with the higher expected reward and play that arm. A numerical study analyses the performance of the proposed methods with respect to other policies in the literature. Two PR methods perform well in all representative scenarios under consideration, and are the best allocation strategies if truncated poisson or exponential distributions in [0,10] are considered for the arms.
Mathematics of Planet Earth. Proceedings of the 15th Annual Conference of the International Association for Mathematical Geosciences | Proceedings of the 15th Annual Conference of the International association for Mathematical Geosciences | 02/09/2013 - 06/09/2013 | Madrid | 2014
Carlos García-Gutiérrez; Miguel Angel Galindo Martín; Francisco Muñoz Ortega; Miguel Reyes; Francisco Javier Taguas
The study of granular systems is of great interest to many fields of science and technology. The packing of particles affects to the physical properties of the granular system. In particular, the crucial influence of particle size distribution (PSD) on the random packing structure increase the interest in relating both, either theoretically or by computational methods. A packing computational method is developed in order to estimate the void fraction corresponding to a fractal-like particle size distribution.
Atmospheric Environment | 2004
José Manuel Burón; José María López; Francisco Aparicio; Miguel Angel Galindo Martín; Alejandro García
Hacienda Publica Espanola | 1998
Lorenzo Escot Mangas; Miguel Angel Galindo Martín
Papeles de trabajo del Instituto de Estudios Fiscales. Serie economía | 1998
Lorenzo Escot Mangas; Miguel Angel Galindo Martín
Papeles de trabajo del Instituto de Estudios Fiscales. Serie economía | 1998
Farhang Niroomand; Miguel Angel Galindo Martín
Documentos de trabajo de la Facultad de Ciencias Económicas y Empresariales | 1997
Lorenzo Escot Mangas; Miguel Angel Galindo Martín
Papeles de trabajo del Instituto de Estudios Fiscales. Serie economía | 2000
Lorenzo Escot Mangas; Miguel Angel Galindo Martín