Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where William Yang is active.

Publication


Featured researches published by William Yang.


BMC Bioinformatics | 2014

A new statistical approach to combining p-values using gamma distribution and its application to genome-wide association study

Zhongxue Chen; William Yang; Qingzhong Liu; Jack Y. Yang; Jing Li; Mary Qu Yang

BackgroundCombining information from different studies is an important and useful practice in bioinformatics, including genome-wide association study, rare variant data analysis and other set-based analyses. Many statistical methods have been proposed to combine p-values from independent studies. However, it is known that there is no uniformly most powerful test under all conditions; therefore, finding a powerful test in specific situation is important and desirable.ResultsIn this paper, we propose a new statistical approach to combining p-values based on gamma distribution, which uses the inverse of the p-value as the shape parameter in the gamma distribution.ConclusionsSimulation study and real data application demonstrate that the proposed method has good performance under some situations.


BMC Bioinformatics | 2014

Developing discriminate model and comparative analysis of differentially expressed genes and pathways for bloodstream samples of diabetes mellitus type 2

Chang Liu; Lili Lu; Quan Kong; Yan Li; Haihua Wu; William Yang; Shandan Xu; Xinyu Yang; Xiaolei Song; Jack Y. Yang; Mary Qu Yang; Youping Deng

BackgroundDiabetes mellitus of type 2 (T2D), also known as noninsulin-dependent diabetes mellitus (NIDDM) or adult-onset diabetes, is a common disease. It is estimated that more than 300 million people worldwide suffer from T2D. In this study, we investigated the T2D, pre-diabetic and healthy human (no diabetes) bloodstream samples using genomic, genealogical, and phonemic information. We identified differentially expressed genes and pathways. The study has provided deeper insights into the development of T2D, and provided useful information for further effective prevention and treatment of the disease.ResultsA total of 142 bloodstream samples were collected, including 47 healthy humans, 22 pre-diabetic and 73 T2D patients. Whole genome scale gene expression profiles were obtained using the Agilent Oligo chips that contain over 20,000 human genes. We identified 79 significantly differentially expressed genes that have fold change ≥ 2. We mapped those genes and pinpointed locations of those genes on human chromosomes. Amongst them, 3 genes were not mapped well on the human genome, but the rest of 76 differentially expressed genes were well mapped on the human genome. We found that most abundant differentially expressed genes are on chromosome one, which contains 9 of those genes, followed by chromosome two that contains 7 of the 76 differentially expressed genes. We performed gene ontology (GO) functional analysis of those 79 differentially expressed genes and found that genes involve in the regulation of cell proliferation were among most common pathways related to T2D. The expression of the 79 genes was combined with clinical information that includes age, sex, and race to construct an optimal discriminant model. The overall performance of the model reached 95.1% accuracy, with 91.5% accuracy on identifying healthy humans, 100% accuracy on pre-diabetic patients and 95.9% accuract on T2D patients. The higher performance on identifying pre-diabetic patients was resulted from more significant changes of gene expressions among this particular group of humans, which implicated that patients were having profound genetic changes towards disease development.ConclusionDifferentially expressed genes were distributed across chromosomes, and are more abundant on chromosomes 1 and 2 than the rest of the human genome. We found that regulation of cell proliferation actually plays an important role in the T2D disease development. The predictive model developed in this study has utilized the 79 significant genes in combination with age, sex, and racial information to distinguish pre-diabetic, T2D, and healthy humans. The study not only has provided deeper understanding of the disease molecular mechanisms but also useful information for pathway analysis and effective drug target identification.


bioinformatics and biomedicine | 2016

A hybrid iterative approach for microarray missing value estimation

Chong He; Changbo Zhao; Guo-Zheng Li; Wei Zhu; William Yang; Mary Qu Yang

BackgroundMissing data is an inevitable phenomenon in gene expression microarray experiments due to instrument failure or human error. It has a negative impact on performance of downstream analysis. Technically, most existing approaches suffer from this prevalent problem. Imputation is one of the frequently used methods for processing missing data. Actually many developments have been achieved in the research on estimating missing values. The challenging task is how to improve imputation accuracy for data with a large missing rate.MethodsIn this paper, induced by the thought of collaborative training, we propose a novel hybrid imputation method, called Recursive Mutual Imputation (RMI). Specifically, RMI exploits global correlation information and local structure in the data, captured by two popular methods, Bayesian Principal Component Analysis (BPCA) and Local Least Squares (LLS), respectively. Mutual strategy is implemented by sharing the estimated data sequences at each recursive process. Meanwhile, we consider the imputation sequence based on the number of missing entries in the target gene. Furthermore, a weight based integrated method is utilized in the final assembling step.ResultsWe evaluate RMI with three state-of-art algorithms (BPCA, LLS, Iterated Local Least Squares imputation (ItrLLS)) on four publicly available microarray datasets. Experimental results clearly demonstrate that RMI significantly outperforms comparative methods in terms of Normalized Root Mean Square Error (NRMSE), especially for datasets with large missing rates and less complete genes.ConclusionsIt is noted that our proposed hybrid imputation approach incorporates both global and local information of microarray genes, which achieves lower NRMSE values against to any single approach only. Besides, this study highlights the need for considering the imputing sequence of missing entries for imputation methods.


Genes | 2018

Transcription Factor and lncRNA Regulatory Networks Identify Key Elements in Lung Adenocarcinoma

Dan Li; William Yang; Jialing Zhang; Jack Y. Yang; Renchu Guan; Mary Yang

Lung cancer is the second most commonly diagnosed carcinoma and is the leading cause of cancer death. Although significant progress has been made towards its understanding and treatment, unraveling the complexities of lung cancer is still hampered by a lack of comprehensive knowledge on the mechanisms underlying the disease. High-throughput and multidimensional genomic data have shed new light on cancer biology. In this study, we developed a network-based approach integrating somatic mutations, the transcriptome, DNA methylation, and protein-DNA interactions to reveal the key regulators in lung adenocarcinoma (LUAD). By combining Bayesian network analysis with tissue-specific transcription factor (TF) and targeted gene interactions, we inferred 15 disease-related core regulatory networks in co-expression gene modules associated with LUAD. Through target gene set enrichment analysis, we identified a set of key TFs, including known cancer genes that potentially regulate the disease networks. These TFs were significantly enriched in multiple cancer-related pathways. Specifically, our results suggest that hepatitis viruses may contribute to lung carcinogenesis, highlighting the need for further investigations into the roles that viruses play in treating lung cancer. Additionally, 13 putative regulatory long non-coding RNAs (lncRNAs), including three that are known to be associated with lung cancer, and nine novel lncRNAs were revealed by our study. These lncRNAs and their target genes exhibited high interaction potentials and demonstrated significant expression correlations between normal lung and LUAD tissues. We further extended our study to include 16 solid-tissue tumor types and determined that the majority of these lncRNAs have putative regulatory roles in multiple cancers, with a few showing lung-cancer specific regulations. Our study provides a comprehensive investigation of transcription factor and lncRNA regulation in the context of LUAD regulatory networks and yields new insights into the regulatory mechanisms underlying LUAD. The novel key regulatory elements discovered by our research offer new targets for rational drug design and accompanying therapeutic strategies.


BMC Bioinformatics | 2014

Advances in translational bioinformatics facilitate revealing the landscape of complex disease mechanisms

Jack Y. Yang; A. Keith Dunker; Jun S. Liu; Xiang Qin; Hamid R. Arabnia; William Yang; Andrzej Niemierko; Zhongxue Chen; Zuojie Luo; Liangjiang Wang; Yunlong Liu; Dong Xu; Youping Deng; Weida Tong; Mary Qu Yang

Advances of high-throughput technologies have rapidly produced more and more data from DNAs and RNAs to proteins, especially large volumes of genome-scale data. However, connection of the genomic information to cellular functions and biological behaviours relies on the development of effective approaches at higher systems level. In particular, advances in RNA-Seq technology has helped the studies of transcriptome, RNA expressed from the genome, while systems biology on the other hand provides more comprehensive pictures, from which genes and proteins actively interact to lead to cellular behaviours and physiological phenotypes. As biological interactions mediate many biological processes that are essential for cellular function or disease development, it is important to systematically identify genomic information including genetic mutations from GWAS (genome-wide association study), differentially expressed genes, bidirectional promoters, intrinsic disordered proteins (IDP) and protein interactions to gain deep insights into the underlying mechanisms of gene regulations and networks. Furthermore, bidirectional promoters can co-regulate many biological pathways, where the roles of bidirectional promoters can be studied systematically for identifying co-regulating genes at interactive network level. Combining information from different but related studies can ultimately help revealing the landscape of molecular mechanisms underlying complex diseases such as cancer.


international conference on computational science | 2016

An Integrative Approach Revealing the Landscape of Long Noncoding RNAs in Human Brain

Dan Li; William Yang; Carolyn Arthur; Kenji Yoshigoe; Jun Liu; Weida Tong; Mary Yang

Long noncoding RNAs (lncRNAs) function as regulators and play critical roles in diverse biological processes, however, the majority of lncRNAs are not characterized, and their roles in regulation remain to be elucidated. Present RNA-seq assembly approaches are insufficient to identify complete full-length transcripts and often reveal excessive amount of single-exon lncRNAs, many of them tend to be the fragments of transcripts. Here we developed an integrated approach that combined the results from reference-guided and de novo assembly to systematically identify lncRNAs. Experiments on simulated data and real RNA-seq data showed that our method effectively yielded a more comprehensive set of lncRNAs. We applied the method on 299 human brain RNA-seq samples with a total of 4.5 billion raw reads. The results yield a comprehensive lncRNAs mapping in human brain, allowing discovery of novel lncRNAs and further function annotation.


Genes | 2018

Erratum: Dan Li et al.; Transcription Factor and lncRNA Regulatory Networks Identify Key Elements in Lung Adenocarcinoma. Genes 2018, 9, 12

Dan Li; William Yang; Jialing Zhang; Jack Y. Yang; Renchu Guan; Mary Yang

The authors wish to make the following change to their paper [...].


BMC Bioinformatics | 2014

Identification of genes and pathways involved in kidney renal clear cell carcinoma

William Yang; Kenji Yoshigoe; Xiang Qin; Jun S. Liu; Jack Y. Yang; Andrzej Niemierko; Youping Deng; Yunlong Liu; A. Keith Dunker; Zhongxue Chen; Liangjiang Wang; Dong Xu; Hamid R. Arabnia; Weida Tong; Mary Qu Yang


Archive | 2015

Developing Meta-Analysis Method for Identifying Biomarker of Gastric Cancer. Session Chair: William Yang http://www.worldacademyofscience.org/worldcomp15/ws/program/bic_big_him30.html

Xiaoya Chen; Jinjun Li; William Yang; Lili Lu; Hongyan Jin; Zexiong Wei; Wei Gu; Hamid R. Arabnia; Mary Qu Yang; Youping Deng


PMC | 2014

The emerging genomics and systems biology research lead to systems genomics studies

Mary Qu Yang; Kenji Yoshigoe; William Yang; Weida Tong; Xiang Qin; A. Keith Dunker; Zhongxue Chen; Hamid R. Arbania; Jun S. Liu; Andrzej Niemierko; Jack Y. Yang

Collaboration


Dive into the William Yang's collaboration.

Top Co-Authors

Avatar

Jack Y. Yang

University of Texas at San Antonio

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Weida Tong

Food and Drug Administration

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Zhongxue Chen

Children's Hospital of Philadelphia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kenji Yoshigoe

University of Arkansas for Medical Sciences

View shared research outputs
Top Co-Authors

Avatar

Dong Xu

Rush University Medical Center

View shared research outputs
Researchain Logo
Decentralizing Knowledge