Sung Wook Yoon
Arizona State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sung Wook Yoon.
Journal of Artificial Intelligence Research | 2006
Alan Fern; Sung Wook Yoon; Robert Givan
We explore approximate policy iteration, replacing the usual cost-function learning step with a learning step in policy space. We give policy-language biases that enable solution of very large relational Markov decision processes (MDPs) that no previous technique can solve. In particular, we induce high-quality domain-specific planners for classical planning domains (both deterministic and stochastic variants) by solving such domains as extremely large MDPs.
ACM Transactions on Intelligent Systems and Technology | 2014
Nan Li; William Cushing; Subbarao Kambhampati; Sung Wook Yoon
We introduce an algorithm to automatically learn probabilistic hierarchical task networks (pHTNs) that capture a users preferences on plans by observing only the users behavior. HTNs are a common choice of representation for a variety of purposes in planning, including work on learning in planning. Our contributions are twofold. First, in contrast with prior work, which employs HTNs to represent domain physics or search control knowledge, we use HTNs to model user preferences. Second, while most prior work on HTN learning requires additional information (e.g., annotated traces or tasks) to assist the learning process, our system only takes plan traces as input. Initially, we will assume that users carry out preferred plans more frequently, and thus the observed distribution of plans is an accurate representation of user preference. We then generalize to the situation where feasibility constraints frequently prevent the execution of preferred plans. Taking the prevalent perspective of viewing HTNs as grammars over primitive actions, we adapt an expectation-maximization (EM) technique from the discipline of probabilistic grammar induction to acquire probabilistic context-free grammars (pCFG) that capture the distribution on plans. To account for the difference between the distributions of possible and preferred plans, we subsequently modify this core EM technique by rescaling its input. We empirically demonstrate that the proposed approaches are able to learn HTNs representing user preferences better than the inside-outside algorithm. Furthermore, when feasibility constraints are obfuscated, the algorithm with rescaled input performs better than the algorithm with the original input.
international conference on automated planning and scheduling | 2007
Sung Wook Yoon; Alan Fern; Robert Givan
neural information processing systems | 2003
Alan Fern; Sung Wook Yoon; Robert Givan
national conference on artificial intelligence | 2008
Sung Wook Yoon; Alan Fern; Robert Givan; Subbarao Kambhampati
uncertainty in artificial intelligence | 2002
Sung Wook Yoon; Alan Fern; Robert Givan
Journal of Machine Learning Research | 2008
Sung Wook Yoon; Alan Fern; Robert Givan
international conference on automated planning and scheduling | 2004
Alan Fern; Sung Wook Yoon; Robert Givan
international joint conference on artificial intelligence | 2007
Yuehua Xu; Alan Fern; Sung Wook Yoon
international conference on automated planning and scheduling | 2006
Sung Wook Yoon; Alan Fern; Robert Givan