Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Masayuki Horiguchi is active.

Publication


Featured researches published by Masayuki Horiguchi.


Applied Mathematics and Computation | 2001

Controlled Markov set-chains under average criteria

Masanori Hosaka; Masayuki Horiguchi; Masami Kurano

In this paper, applying an interval arithmetic analysis, we consider the average case of controlled Markov set-chains, whose process allows for fluctuating transition matrices at each step in time. We introduce a v-step contractive property for the average case, under which a Pareto optimal periodic policy is characterized as a maximal solution of optimality equation. Also, in the class of stationary policies, the behavior of the expected reward over T-horizon as T approaches ~ is investigated and the left- and right-hand side optimality equations are given, by which a Pareto optimal stationary policy is found. As a numerical example, the Taxicab problem is considered.


Mathematical Methods of Operations Research | 2001

Markov decision processes with a stopping time constraint

Masayuki Horiguchi

Abstract. In this paper, the optimization problem for a stopped Markov decision process with finite states and actions is considered over stopping times τ constrained so that ?τ≦α for some fixed α>0. The problem is solved through randomization of stopping times and mathematical programming formulation by occupation measures. Another representation, called F-representation, of randomized stopping times is given, by which the concept of Markov or stationary randomized stopping times is introduced. We treat two types of occupation measures, running and stopped, but stopped occupation measure is shown to be expressed by running one. We study the properties of the set of running occupation measures achieved by different classes of pairs of policies and randomized stopping times. Analyzing the equivalent mathematical programming problem formulated by running occupation measures corresponding with stationary policies and stationary randomized stopping times, we prove the existence of an optimal constrained pair of stationary policy and stopping time requiring randomization in at most one state.


Mathematical Methods of Operations Research | 2001

Stopped Markov decision processes with multiple constraints

Masayuki Horiguchi

In this paper, a optimization problem for stopped Markov decision processes with vector-valued terminal reward and multiple running cost constraints is considered. Applying the idea of occupation measures and using the scalarization technique for vector maximization problems we obtain the equivalent Mathematical Programming problem and show the existence of a Pareto optimal pair of stationary policy and stopping time requiring randomization in at most k states, where k is the number of constraints. Moreover Lagrange multiplier approaches are considered. The saddle-point statements are given, whose results are applied to obtain a related parametric Mathematical Programming, by which the problem is solved. Numerical examples are given.


Mathematical Methods of Operations Research | 2007

A structured pattern matrix algorithm for multichain Markov decision processes

Tetsuichiro Iki; Masayuki Horiguchi; Masami Kurano

In this paper, we are concerned with a new algorithm for multichain finite state Markov decision processes which finds an average optimal policy through the decomposition of the state space into some communicating classes and a transient class. For each communicating class, a relatively optimal policy is found, which is used to find an optimal policy by applying the value iteration algorithm. Using a pattern matrix determining the behaviour pattern of the decision process, the decomposition of the state space is effectively done, so that the proposed algorithm simplifies the structured one given by the excellent Leizarowitz’s paper (Math Oper Res 28:553–586, 2003). Also, a numerical example is given to comprehend the algorithm.


RIMS Kokyuroku | 2014

Adaptive methods for multivariate Bayesian Control Chart (Theory and Application of Mathematical Decision Making under Uncertainty)

Minoru Sasaki; Masayuki Horiguchi; Masami Kurano


Archive | 2009

The best choice problem for random number of objects with a refusal probability

Masayuki Horiguchi; Masami Yasuda


数理解析研究所講究録 | 2008

A pattern-matrix learning algorithm for adaptive MDPs : The regularly communicating case (Theory and Application of Decision Analysis in Uncertain Situation)

Tetsuichiro Iki; Masayuki Horiguchi; Masami Yasuda; Masami Kurano


Bulletin of informatics and cybernetics | 2007

A LEARNING ALGORITHM FOR COMMUNICATING MARKOV DECISION PROCESSES WITH UNKNOWN TRANSITION MATRICES

Tetsuichiro Iki; Masayuki Horiguchi; Masami Yasuda; Masami Kurano; 哲一郎 伊喜; 正實 安田; 正美 蔵野


Archive | 2006

A learning algorithm for communicating Markov decision processes with unknown transition matrices(Mathematical Models and Decision Making under Uncertainty)

Tetsuichiro Iki; Masayuki Horiguchi; Masami Yasuda; Masami Kurano


Archive | 2006

A modified pattern matrix algorithm for multichain MDPs(The Development of Information and Decision Processes)

Tetsuichiro Iki; Masayuki Horiguchi

Collaboration


Dive into the Masayuki Horiguchi's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge