Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Mingxuan Yuan is active.

Publication


Featured researches published by Mingxuan Yuan.


IEEE Transactions on Knowledge and Data Engineering | 2013

Protecting Sensitive Labels in Social Network Data Anonymization

Mingxuan Yuan; Lei Chen; Philip S. Yu; Ting Yu

Privacy is one of the major concerns when publishing or sharing social network data for social science research and business analysis. Recently, researchers have developed privacy models similar to k-anonymity to prevent node reidentification through structure information. However, even when these privacy models are enforced, an attacker may still be able to infer ones private information if a group of nodes largely share the same sensitive labels (i.e., attributes). In other words, the label-node relationship is not well protected by pure structure anonymization methods. Furthermore, existing approaches, which rely on edge editing or node clustering, may significantly alter key graph properties. In this paper, we define a k-degree-l-diversity anonymity model that considers the protection of structural information as well as sensitive labels of individuals. We further propose a novel anonymization methodology based on adding noise nodes. We develop a new algorithm by adding noise nodes into the original graph with the consideration of introducing the least distortion to graph properties. Most importantly, we provide a rigorous analysis of the theoretical bounds on the number of noise nodes added and their impacts on an important graph property. We conduct extensive experiments to evaluate the effectiveness of the proposed technique.


international conference on management of data | 2015

Telco Churn Prediction with Big Data

Yiqing Huang; Fangzhou Zhu; Mingxuan Yuan; Ke Deng; Yanhua Li; Bing Ni; Wenyuan Dai; Qiang Yang; Jia Zeng

We show that telco big data can make churn prediction much more easier from the


very large data bases | 2015

Differential privacy in telco big data platform

Xueyang Hu; Mingxuan Yuan; Jianguo Yao; Yu Deng; Lei Chen; Qiang Yang; Haibing Guan; Jia Zeng

3


very large data bases | 2014

OceanST: a distributed analytic system for large-scale spatiotemporal mobile broadband data

Mingxuan Yuan; Ke Deng; Jia Zeng; Yanhua Li; Bing Ni; Xiuqiang He; Fei Wang; Wenyuan Dai; Qiang Yang

Vs perspectives: Volume, Variety, Velocity. Experimental results confirm that the prediction performance has been significantly improved by using a large volume of training data, a large variety of features from both business support systems (BSS) and operations support systems (OSS), and a high velocity of processing new coming data. We have deployed this churn prediction system in one of the biggest mobile operators in China. From millions of active customers, this system can provide a list of prepaid customers who are most likely to churn in the next month, having


conference on information and knowledge management | 2016

City-Scale Localization with Telco Big Data

Fangzhou Zhu; Chen Luo; Mingxuan Yuan; Yijian Zhu; Zhengqing Zhang; Tao Gu; Ke Deng; Weixiong Rao; Jia Zeng

0.96


ACM Transactions on Intelligent Systems and Technology | 2016

Telco User Activity Level Prediction with Massive Mobile Broadband Data

Chen Luo; Jia Zeng; Mingxuan Yuan; Wenyuan Dai; Qiang Yang

precision for the top


conference on information and knowledge management | 2015

Sampling Big Trajectory Data

Yanhua Li; Chi-Yin Chow; Ke Deng; Mingxuan Yuan; Jia Zeng; Jia Dong Zhang; Qiang Yang; Zhi Li Zhang

50000


IEEE Transactions on Knowledge and Data Engineering | 2011

Materialization and Decomposition of Dataspaces for Efficient Search

Shaoxu Song; Lei Chen; Mingxuan Yuan

predicted churners in the list. Automatic matching retention campaigns with the targeted potential churners significantly boost their recharge rates, leading to a big business value.


IEEE Transactions on Knowledge and Data Engineering | 2016

Enabling Scalable Geographic Service Sharing with Weighted Imprecise Voronoi Cells

Xike Xie; Peiquan Jin; Man Lung Yiu; Jiang Du; Mingxuan Yuan; Christian S. Jensen

Differential privacy (DP) has been widely explored in academia recently but less so in industry possibly due to its strong privacy guarantee. This paper makes the first attempt to implement three basic DP architectures in the deployed telecommunication (telco) big data platform for data mining applications. We find that all DP architectures have less than 5% loss of prediction accuracy when the weak privacy guarantee is adopted (e.g., privacy budget parameter e ≥ 3). However, when the strong privacy guarantee is assumed (e.g., privacy budget parameter e ≤ 0:1), all DP architectures lead to 15% ~ 30% accuracy loss, which implies that real-word industrial data mining systems cannot work well under such a strong privacy guarantee recommended by previous research works. Among the three basic DP architectures, the Hybridized DM (Data Mining) and DB (Database) architecture performs the best because of its complicated privacy protection design for the specific data mining algorithm. Through extensive experiments on big data, we also observe that the accuracy loss increases by increasing the variety of features, but decreases by increasing the volume of training data. Therefore, to make DP practically usable in large-scale industrial systems, our observations suggest that we may explore three possible research directions in future: (1) Relaxing the privacy guarantee (e.g., increasing privacy budget e) and studying its effectiveness on specific industrial applications; (2) Designing specific privacy scheme for specific data mining algorithms; and (3) Using large volume of data but with low variety for training the classification models.


international conference on data mining | 2015

Dissecting Regional Weather-Traffic Sensitivity Throughout a City

Ye Ding; Yanhua Li; Ke Deng; Haoyu Tan; Mingxuan Yuan; Lionel M. Ni

With the increasing prevalence of versatile mobile devices and the fast deployment of broadband mobile networks, a huge volume of Mobile Broadband (MBB) data has been generated over time. The MBB data naturally contain rich information of a large number of mobile users, covering a considerable fraction of whole population nowadays, including the mobile applications they are using at different locations and time; the MBB data may present the unprecedentedly large knowledge base of human behavior which has highly recognized commercial and social value. However, the storage, management and analysis of the huge and fast growing volume of MBB data post new and significant challenges to the industrial practitioners and research community. In this demonstration, we present a new, MBB data tailored, distributed analytic system named OceanST which has addressed a series of problems and weaknesses of the existing systems, originally designed for more general purpose and capable to handle MBB data to some extent. OceanST is featured by (i) efficiently loading of ever-growing MBB data, (ii) a bunch of spatiotemporal aggregate queries and basic analysis APIs frequently found in various MBB data application scenarios, and (iii) sampling-based approximate solution with provable accuracy bound to cope with huge volume of MBB data. The demonstration will show the advantage of OceanST in a cluster of 5 machines using 3TB data.

Collaboration


Dive into the Mingxuan Yuan's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Lei Chen

Hong Kong University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Yanhua Li

Worcester Polytechnic Institute

View shared research outputs
Top Co-Authors

Avatar

Qiang Yang

Harbin Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jianguo Yao

Shanghai Jiao Tong University

View shared research outputs
Top Co-Authors

Avatar

Wenyuan Dai

Shanghai Jiao Tong University

View shared research outputs
Top Co-Authors

Avatar

Philip S. Yu

University of Illinois at Chicago

View shared research outputs
Researchain Logo
Decentralizing Knowledge