Derek Hao Hu
Hong Kong University of Science and Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Derek Hao Hu.
ubiquitous computing | 2008
Derek Hao Hu; Sinno Jialin Pan; Vincent Wenchen Zheng; Nathan Nan Liu; Qiang Yang
Recognizing and understanding the activities of people from sensor readings is an important task in ubiquitous computing. Activity recognition is also a particularly difficult task because of the inherent uncertainty and complexity of the data collected by the sensors. Many researchers have tackled this problem in an overly simplistic setting by assuming that users often carry out single activities one at a time or multiple activities consecutively, one after another. However, so far there has been no formal exploration on the degree in which humans perform concurrent or interleaving activities, and no thorough study on how to detect multiple goals in a real world scenario. In this article, we ask the fundamental questions of whether users often carry out multiple concurrent and interleaving activities or single activities in their daily life, and if so, whether such complex behavior can be detected accurately using sensors. We define several classes of complexity levels under a goal taxonomy that describe different granularities of activities, and relate the recognition accuracy with different complexity levels or granularities. We present a theoretical framework for recognizing multiple concurrent and interleaving activities, and evaluate the framework in several real-world ubiquitous computing environments.
Artificial Intelligence | 2010
Hankz Hankui Zhuo; Qiang Yang; Derek Hao Hu; Lei Li
Automated planning requires action models described using languages such as the Planning Domain Definition Language (PDDL) as input, but building action models from scratch is a very difficult and time-consuming task, even for experts. This is because it is difficult to formally describe all conditions and changes, reflected in the preconditions and effects of action models. In the past, there have been algorithms that can automatically learn simple action models from plan traces. However, there are many cases in the real world where we need more complicated expressions based on universal and existential quantifiers, as well as logical implications in action models to precisely describe the underlying mechanisms of the actions. Such complex action models cannot be learned using many previous algorithms. In this article, we present a novel algorithm called LAMP (Learning Action Models from Plan traces), to learn action models with quantifiers and logical implications from a set of observed plan traces with only partially observed intermediate state information. The LAMP algorithm generates candidate formulas that are passed to a Markov Logic Network (MLN) for selecting the most likely subsets of candidate formulas. The selected subset of formulas is then transformed into learned action models, which can then be tweaked by domain experts to arrive at the final models. We evaluate our approach in four planning domains to demonstrate that LAMP is effective in learning complex action models. We also analyze the human effort saved by using LAMP in helping to create action models through a user study. Finally, we apply LAMP to a real-world application domain for software requirement engineering to help the engineers acquire software requirements and show that LAMP can indeed help experts a great deal in real-world knowledge-engineering applications.
IEEE Transactions on Knowledge and Data Engineering | 2010
Evan Wei Xiang; Bin Cao; Derek Hao Hu; Qiang Yang
A major problem of classification learning is the lack of ground-truth labeled data. It is usually expensive to label new data instances for training a model. To solve this problem, domain adaptation in transfer learning has been proposed to classify target domain data by using some other source domain data, even when the data may have different distributions. However, domain adaptation may not work well when the differences between the source and target domains are large. In this paper, we design a novel transfer learning approach, called BIG (Bridging Information Gap), to effectively extract useful knowledge in a worldwide knowledge base, which is then used to link the source and target domains for improving the classification performance. BIG works when the source and target domains share the same feature space but different underlying data distributions. Using the auxiliary source data, we can extract a ¿bridge¿ that allows cross-domain text classification problems to be solved using standard semisupervised learning algorithms. A major contribution of our work is that with BIG, a large amount of worldwide knowledge can be easily adapted and used for learning in the target domain. We conduct experiments on several real-world cross-domain text classification tasks and demonstrate that our proposed approach can outperform several existing domain adaptation approaches significantly.
international joint conference on artificial intelligence | 2011
Derek Hao Hu; Qiang Yang
Activity recognition aims to identify and predict human activities based on a series of sensor readings. In recent years, machine learning methods have become popular in solving activity recognition problems. A special difficulty for adopting machine learning methods is the workload to annotate a large number of sensor readings as training data. Labeling sensor readings for their corresponding activities is a time-consuming task. In practice, we often have a set of labeled training instances ready for an activity recognition task. If we can transfer such knowledge to a new activity recognition scenario that is different from, but related to, the source domain, it will ease our effort to perform manual labeling of training data for the new scenario. In this paper, we propose a transfer learning framework based on automatically learning a correspondence between different sets of sensors to solve this transfer-learning in activity recognition problem. We validate our framework on two different datasets and compare it against previous approaches of activity recognition, and demonstrate its effectiveness.
BMC Bioinformatics | 2009
Qian Xu; Derek Hao Hu; Hong Xue; Weichuan Yu; Qiang Yang
BackgroundProtein subcellular localization is concerned with predicting the location of a protein within a cell using computational method. The location information can indicate key functionalities of proteins. Accurate predictions of subcellular localizations of protein can aid the prediction of protein function and genome annotation, as well as the identification of drug targets. Computational methods based on machine learning, such as support vector machine approaches, have already been widely used in the prediction of protein subcellular localization. However, a major drawback of these machine learning-based approaches is that a large amount of data should be labeled in order to let the prediction system learn a classifier of good generalization ability. However, in real world cases, it is laborious, expensive and time-consuming to experimentally determine the subcellular localization of a protein and prepare instances of labeled data.ResultsIn this paper, we present an approach based on a new learning framework, semi-supervised learning, which can use much fewer labeled instances to construct a high quality prediction model. We construct an initial classifier using a small set of labeled examples first, and then use unlabeled instances to refine the classifier for future predictions.ConclusionExperimental results show that our methods can effectively reduce the workload for labeling data using the unlabeled data. Our method is shown to enhance the state-of-the-art prediction results of SVM classifiers by more than 10%.
ubiquitous computing | 2012
Yiqiang Chen; Zhenyu Chen; Junfa Liu; Derek Hao Hu; Qiang Yang
Bluetooth information can efficiently capture characteristics of user-centric surrounding contexts, such as formal meeting or chatting with friends, shopping with friends or alone, etc. In this paper, we extract novel features from Bluetooth traces and use these features for recognizing contextual behavior as well as inferring continuous episode transition. Evaluation results show that extracted novel features are very effective, which enable the model to achieve an average of 87% accuracy for specific context classification and the ability of episode inference from real-life Bluetooth traces.
pacific rim international conference on artificial intelligence | 2008
Hankui Zhuo; Qiang Yang; Derek Hao Hu; Lei Li
Learning action models is an important and difficult task for AI planning, since it is both time-consuming and tedious for a human to encode the action models by hand using a formal language such as PDDL. In this paper, we present a new algorithm to learn action models from plan traces by transferring useful knowledge from another domain whose action models are already known. We call this algorithm t -LAMP , (transfer Learning Action Models from Plan traces) which can learn action models in PDDL language with quantifiers from plan traces where the intermediate states can contain noise and partial information. We apply Markov Logic Network to enable knowledge transfer, and show that using the transfer learning framework, the quality of the learned action models are generally better than the case when not using an existing domain for transfer.
asian conference on machine learning | 2009
Derek Hao Hu; Dou Shen; Jian-Tao Sun; Qiang Yang; Zheng Chen
With more and more commercial activities moving onto the Internet, people tend to purchase what they need through Internet or conduct some online research before the actual transactions happen. For many Web users, their online commercial activities start from submitting a search query to search engines. Just like the common Web search queries, the queries with commercial intention are usually very short. Recognizing the queries with commercial intention against the common queries will help search engines provide proper search results and advertisements, help Web users obtain the right information they desire and help the advertisers benefit from the potential transactions. However, the intentions behind a query vary a lot for users with different background and interest. The intentions can even be different for the same user, when the query is issued in different contexts. In this paper, we present a new algorithm framework based on skip-chain conditional random field (SCCRF) for automatically classifying Web queries according to context-based online commercial intention . We analyze our algorithm performance both theoretically and empirically. Extensive experiments on several real search engine log datasets show that our algorithm can improve more than 10% on F1 score than previous algorithms on commercial intention detection.
international workshop on data mining and audience intelligence for advertising | 2008
Derek Hao Hu; Qiang Yang; Ying Li
With more and more commercial activities moving onto the Internet, people tend to purchase what they need through Internet or conduct some online research before the actual deals happen. For many Web users, their online commercial activities start from submitting a search query to search engines. Just like the common Web search queries, the queries with commercial intention are usually very short. Recognizing the queries with commercial intention against the common queries will help search engines provide proper search results and advertisements; help Web users obtain the right information they desire and help the advertisers benefit from the potential transactions. The only existing research work, as far as we know, has been done to automatically detect online commercial intention purely based on the issued queries, without considering the Web users information. However, the intentions behind a query vary a lot for users with different background and interest. The intentions can even be different for the same user, when the query is issued in different contexts. In this paper, we present a novel algorithm, which we name as POINT, for the Personalized Online-commercial INTention detection based on a skip-chain conditional random field model. To accurately detect the commercial intentions of a query, our method comprehensively considers the evidences from the target query, the profile of the user issuing the query, which is inferred from his search history, as well as the similarity of different queries in a personal query log. Our proposed method is validated through extensive experiments on a real search engine query log data set. The experimental results show that our algorithm can clearly improve the performance by more than 10% of personalized online-commercial intention detection.
ieee international conference on pervasive computing and communications | 2009
Yiqiang Chen; Zhuo Sun; Juan Qi; Derek Hao Hu; Qiang Yang
Understanding human intention and performing different activities automatically is one of the key problems in pervasive computing. In this paper, a new location-based search computing framework (LoSeCo) is proposed to allow ones pervasive device to augment search devices. The objective of our problem is to recognize the real-time user goal through goal inference from traditional Wi-Fi localization techniques. We use accelerometer-based tracking to reduce the effort we need to collect Wi-Fi signals and save battery power consumption effectively. With the help of short-range search, the goal recognition module is enhanced, compared to previous “locationonly” approaches. Therefore, we could augment our mobile devices by automatically analyzing our needs and connecting to corresponding devices. Experimental results on real-world wireless network environments validate the effectiveness of our approach and that even a rough localization accuracy can meet the need of QoS (Quality of Service) in search computing behaviors.