Gane Ka-Shu Wong
University of Alberta
Publication
Featured research published by Gane Ka-Shu Wong.
Science | 2002
Jun Yu; Songnian Hu; Jun Wang; Gane Ka-Shu Wong; Songgang Li; Bin Liu; Yajun Deng; Yan Zhou; Xiuqing Zhang; Mengliang Cao; Jing Liu; Jiandong Sun; Jiabin Tang; Yanjiong Chen; Xiaobing Huang; Wei Lin; Chen Ye; Wei Tong; Lijuan Cong; Jianing Geng; Yujun Han; Lin Li; Wei Li; Guangqiang Hu; Xiangang Huang; Wenjie Li; Jian Li; Zhanwei Liu; Long Li; Jianping Liu
The genome of the japonica subspecies of rice, an important cereal and model monocot, was sequenced and assembled by whole-genome shotgun sequencing. The assembled sequence covers 93% of the 420-megabase genome. Gene predictions on the assembled sequence suggest that the genome contains 32,000 to 50,000 genes. Homologs of 98% of the known maize, wheat, and barley proteins are found in rice. Synteny and gene homology between rice and the other cereal genomes are extensive, whereas synteny with Arabidopsis is limited. Assignment of candidate rice orthologs to Arabidopsis genes is possible in many cases. The rice genome sequence provides a foundation for the improvement of cereals, our most important crops.
Nature | 2000
C. K. Stover; X. Q. Pham; A. L. Erwin; S. D. Mizoguchi; P. Warrener; M. J. Hickey; Fiona S. L. Brinkman; W. O. Hufnagle; D. J. Kowalik; M. Lagrou; R. L. Garber; L. Goltry; E. Tolentino; S. Westbrock-Wadman; Ye Yuan; L. L. Brody; S. N. Coulter; K. R. Folger; Arnold Kas; K. Larbig; Regina Lim; Kelly D. Smith; David H. Spencer; Gane Ka-Shu Wong; Zhigang Wu; Ian T. Paulsen; Jonathan Reizer; Milton H. Saier; Robert E. W. Hancock; Stephen Lory
Pseudomonas aeruginosa is a ubiquitous environmental bacterium that is one of the top three causes of opportunistic human infections. A major factor in its prominence as a pathogen is its intrinsic resistance to antibiotics and disinfectants. Here we report the complete sequence of P. aeruginosa strain PAO1. At 6.3 million base pairs, this is the largest bacterial genome sequenced, and the sequence provides insights into the basis of the versatility and intrinsic drug resistance of P. aeruginosa. Consistent with its larger genome size and environmental adaptability, P. aeruginosa contains the highest proportion of regulatory genes observed for a bacterial genome and a large number of genes involved in the catabolism, transport and efflux of organic compounds as well as four potential chemotaxis systems. We propose that the size and complexity of the P. aeruginosa genome reflect an evolutionary adaptation permitting it to thrive in diverse environments and resist the effects of a variety of antimicrobial substances.
Nature | 2008
Jun Wang; Wei Wang; Ruiqiang Li; Yingrui Li; Geng Tian; Laurie Goodman; Wei Fan; Junqing Zhang; Jun Li; Juanbin Zhang; Yiran Guo; Binxiao Feng; Heng Li; Yao Lu; Xiaodong Fang; Huiqing Liang; Z. Du; Dong Li; Yiqing Zhao; Yujie Hu; Zhenzhen Yang; Hancheng Zheng; Ines Hellmann; Michael Inouye; John E. Pool; Xin Yi; Jing Zhao; Jinjie Duan; Yan Zhou; Junjie Qin
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual’s genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.
Genomics, Proteomics & Bioinformatics | 2006
Zhang Zhang; Jun Li; Xiaoqian Zhao; Jun Wang; Gane Ka-Shu Wong; Jun Yu
KaKs_Calculator is a software package that calculates nonsynonymous (Ka) and synonymous (Ks) substitution rates through model selection and model averaging. Since existing methods for this estimation adopt their specific mutation (substitution) models that consider different evolutionary features, leading to diverse estimates, KaKs_Calculator implements a set of candidate models in a maximum likelihood framework and adopts the Akaike information criterion to measure fitness between models and data, aiming to include as many features as needed for accurately capturing evolutionary information in protein-coding sequences. In addition, several existing methods for calculating Ka and Ks are also incorporated into this software. KaKs_Calculator, including source codes, compiled executables, and documentation, is freely available for academic use at http://evolution.genomics.org.cn/software.htm.
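The model selection and model averaging described in the KaKs_Calculator abstract can be illustrated with a minimal Python sketch. This is not the package's own code: the model names, log-likelihoods, parameter counts, and Ka/Ks estimates below are hypothetical values chosen for illustration, and the sketch uses the plain Akaike information criterion (AIC = 2k - 2 ln L) with standard Akaike weights.

```python
import math


def akaike_weights(log_likelihoods, num_params):
    """Return (aic, weights) for a set of candidate models.

    log_likelihoods: dict of model name -> maximized log-likelihood
    num_params:      dict of model name -> number of free parameters
    """
    # AIC = 2k - 2 ln L; lower is better.
    aic = {m: 2 * num_params[m] - 2 * log_likelihoods[m] for m in log_likelihoods}
    best = min(aic.values())
    # Relative likelihood of each model given the data.
    rel = {m: math.exp(-0.5 * (a - best)) for m, a in aic.items()}
    total = sum(rel.values())
    weights = {m: r / total for m, r in rel.items()}
    return aic, weights


def model_average(estimates, weights):
    """Average a per-model estimate, weighting each model by its Akaike weight."""
    return sum(weights[m] * estimates[m] for m in estimates)


# Hypothetical inputs (assumptions, not values from the paper).
log_likelihoods = {"model_A": -1250.0, "model_B": -1240.0, "model_C": -1239.5}
num_params = {"model_A": 2, "model_B": 4, "model_C": 6}
ka_ks_estimates = {"model_A": 0.21, "model_B": 0.25, "model_C": 0.26}

aic, weights = akaike_weights(log_likelihoods, num_params)
selected = min(aic, key=aic.get)                      # model selection
averaged = model_average(ka_ks_estimates, weights)    # model averaging
print(selected, round(averaged, 4))
```

Model selection keeps only the lowest-AIC model, whereas model averaging lets every candidate contribute in proportion to its weight, so a near-tie between models does not force an arbitrary choice.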
Proceedings of the National Academy of Sciences of the United States of America | 2014
Norman J. Wickett; Siavash Mirarab; Nam Phuong Nguyen; Tandy J. Warnow; Eric J. Carpenter; Naim Matasci; Saravanaraj Ayyampalayam; Michael S. Barker; J. Gordon Burleigh; Matthew A. Gitzendanner; Brad R. Ruhfel; Eric Wafula; Joshua P. Der; Sean W. Graham; Sarah Mathews; Michael Melkonian; Douglas E. Soltis; Pamela S. Soltis; Nicholas W. Miles; Carl J. Rothfels; Lisa Pokorny; A. Jonathan Shaw; Lisa De Gironimo; Dennis W. Stevenson; Barbara Surek; Juan Carlos Villarreal; Béatrice Roure; Hervé Philippe; Claude W. de Pamphilis; Tao Chen
Significance: Early branching events in the diversification of land plants and closely related algal lineages remain fundamental and unresolved questions in plant evolutionary biology. Accurate reconstructions of these relationships are critical for testing hypotheses of character evolution: for example, the origins of the embryo, vascular tissue, seeds, and flowers. We investigated relationships among streptophyte algae and land plants using the largest set of nuclear genes that has been applied to this problem to date. Hypothesized relationships were rigorously tested through a series of analyses to assess systematic errors in phylogenetic inference caused by sampling artifacts and model misspecification. Results support some generally accepted phylogenetic hypotheses, while rejecting others. This work provides a new framework for studies of land plant evolution.
Reconstructing the origin and evolution of land plants and their algal relatives is a fundamental problem in plant phylogenetics, and is essential for understanding how critical adaptations arose, including the embryo, vascular tissue, seeds, and flowers. Despite advances in molecular systematics, some hypotheses of relationships remain weakly resolved. Inferring deep phylogenies with bouts of rapid diversification can be problematic; however, genome-scale data should significantly increase the number of informative characters for analyses. Recent phylogenomic reconstructions focused on the major divergences of plants have resulted in promising but inconsistent results. One limitation is sparse taxon sampling, likely resulting from the difficulty and cost of data generation. To address this limitation, transcriptome data for 92 streptophyte taxa were generated and analyzed along with 11 published plant genome sequences. Phylogenetic reconstructions were conducted using up to 852 nuclear genes and 1,701,170 aligned sites. Sixty-nine analyses were performed to test the robustness of phylogenetic inferences to permutations of the data matrix or to phylogenetic method, including supermatrix, supertree, and coalescent-based approaches, maximum-likelihood and Bayesian methods, partitioned and unpartitioned analyses, and amino acid versus DNA alignments. Among other results, we find robust support for a sister-group relationship between land plants and one group of streptophyte green algae, the Zygnematophyceae. Strong and robust support for a clade comprising liverworts and mosses is inconsistent with a widely accepted view of early land plant evolution, and suggests that phylogenetic hypotheses used to understand the evolution of fundamental plant traits should be reevaluated.
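Of the approaches named in this abstract, the supermatrix approach is the simplest to sketch: per-gene alignments are concatenated into one matrix, and taxa missing from a gene are padded with a missing-data character. The Python below is a minimal illustration of that idea only, not the pipeline used in the study; the function name, taxon labels, and sequences are hypothetical.

```python
def build_supermatrix(gene_alignments, missing="-"):
    """Concatenate per-gene alignments into one sequence per taxon.

    gene_alignments: list of dicts, each mapping taxon name -> aligned sequence.
    Taxa absent from a gene are padded with `missing` for that gene's width.
    """
    taxa = sorted({taxon for aln in gene_alignments for taxon in aln})
    rows = {taxon: [] for taxon in taxa}
    for aln in gene_alignments:
        widths = {len(seq) for seq in aln.values()}
        if len(widths) != 1:
            raise ValueError("each alignment must be non-empty with equal-length rows")
        width = widths.pop()
        for taxon in taxa:
            rows[taxon].append(aln.get(taxon, missing * width))
    return {taxon: "".join(parts) for taxon, parts in rows.items()}


# Hypothetical toy alignments for two genes and three taxa.
gene1 = {"taxon_A": "ATGC", "taxon_B": "ATGA"}
gene2 = {"taxon_A": "GG", "taxon_C": "GA"}
supermatrix = build_supermatrix([gene1, gene2])
for taxon, seq in supermatrix.items():
    print(taxon, seq)
```

The padding step is where sparse taxon sampling shows up in practice: the more genes a taxon lacks, the larger the share of its row that is missing data.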
Nucleic Acids Research | 2006
Heng Li; Avril Coghlan; Jue Ruan; Lachlan Coin; Jean-Karim Hériché; Lara Osmotherly; Ruiqiang Li; Tao Liu; Zhang Zhang; Lars Bolund; Gane Ka-Shu Wong; Wei-Mou Zheng; Paramvir Dehal; Jun Wang; Richard Durbin
TreeFam is a database of phylogenetic trees of gene families found in animals. It aims to develop a curated resource that presents the accurate evolutionary history of all animal gene families, as well as reliable ortholog and paralog assignments. Curated families are being added progressively, based on seed alignments and trees in a similar fashion to Pfam. Release 1.1 of TreeFam contains curated trees for 690 families and automatically generated trees for another 11 646 families. These represent over 128 000 genes from nine fully sequenced animal genomes and over 45 000 other animal proteins from UniProt; ∼40–85% of proteins encoded in the fully sequenced animal genomes are included in TreeFam. TreeFam is freely available online.
Bioinformatics | 2014
Yinlong Xie; Gengxiong Wu; Jingbo Tang; Ruibang Luo; Jordan Patterson; Shanlin Liu; Weihua Huang; Guangzhu He; Shengchang Gu; Shengkang Li; Xin Zhou; Tak Wah Lam; Yingrui Li; Xun Xu; Gane Ka-Shu Wong; Jun Wang
MOTIVATION: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining a large number of gene sequences from an organism with no reference genome. Owing to the rapid increase in throughput and decrease in costs of next-generation sequencing, RNA-Seq in particular has become the method of choice. However, the very short reads (e.g. 2 × 90 bp paired ends) from next-generation sequencing make de novo assembly to recover complete or full-length transcript sequences an algorithmic challenge. RESULTS: Here, we present SOAPdenovo-Trans, a de novo transcriptome assembler designed specifically for RNA-Seq. We evaluated its performance on transcriptome datasets from rice and mouse. Using as our benchmarks the known transcripts from these well-annotated genomes (sequenced a decade ago), we assessed how SOAPdenovo-Trans and two other popular transcriptome assemblers handled such practical issues as alternative splicing and variable expression levels. Our conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution. AVAILABILITY AND IMPLEMENTATION: Source code and user manual are available at http://sourceforge.net/projects/soapdenovotrans/.
Nature Methods | 2014
Daniel Hochbaum; Yongxin Zhao; Samouil L Farhi; Nathan Cao Klapoetke; Christopher A. Werley; Vikrant Kapoor; Peng Zou; Joel M. Kralj; Dougal Maclaurin; Niklas Smedemark-Margulies; Jessica L. Saulnier; Gabriella L. Boulting; Christoph Straub; Yong Ku Cho; Michael Melkonian; Gane Ka-Shu Wong; Venkatesh N. Murthy; Bernardo L. Sabatini; Edward S. Boyden; Robert E. Campbell; Adam E. Cohen
All-optical electrophysiology—spatially resolved simultaneous optical perturbation and measurement of membrane voltage—would open new vistas in neuroscience research. We evolved two archaerhodopsin-based voltage indicators, QuasAr1 and QuasAr2, which show improved brightness and voltage sensitivity, have microsecond response times and produce no photocurrent. We engineered a channelrhodopsin actuator, CheRiff, which shows high light sensitivity and rapid kinetics and is spectrally orthogonal to the QuasArs. A coexpression vector, Optopatch, enabled cross-talk–free genetically targeted all-optical electrophysiology. In cultured rat neurons, we combined Optopatch with patterned optical excitation to probe back-propagating action potentials (APs) in dendritic spines, synaptic transmission, subcellular microsecond-timescale details of AP propagation, and simultaneous firing of many neurons in a network. Optopatch measurements revealed homeostatic tuning of intrinsic excitability in human stem cell–derived neurons. In rat brain slices, Optopatch induced and reported APs and subthreshold events with high signal-to-noise ratios. The Optopatch platform enables high-throughput, spatially resolved electrophysiology without the use of conventional electrodes.
Genome Research | 2008
M.A.M. Groenen; Per Wahlberg; Mario Foglio; Hans H. Cheng; Hendrik-Jan Megens; R.P.M.A. Crooijmans; Francois Besnier; Mark Lathrop; William M. Muir; Gane Ka-Shu Wong; Ivo Gut; Leif Andersson
The resolution of the chicken consensus linkage map has been dramatically improved in this study by genotyping 12,945 single nucleotide polymorphisms (SNPs) on three existing mapping populations in chicken: the Wageningen (WU), East Lansing (EL), and Uppsala (UPP) mapping populations. As many as 8599 SNPs could be included, bringing the total number of markers in the current consensus linkage map to 9268. The total length of the sex average map is 3228 cM, considerably smaller than previous estimates using the WU and EL populations, reflecting the higher quality of the new map. The current map consists of 34 linkage groups and covers at least 29 of the 38 autosomes. Sex-specific analysis and comparisons of the maps based on the three individual populations showed prominent heterogeneity in recombination rates between populations, but no significant heterogeneity between sexes. The recombination rates in the F1 Red Jungle fowl/White Leghorn males and females were significantly lower compared with those in the WU broiler population, consistent with a higher recombination rate in purebred domestic animals under strong artificial selection. The recombination rate varied considerably among chromosomes as well as along individual chromosomes. An analysis of the sequence composition at recombination hot and cold spots revealed a strong positive correlation between GC-rich sequences and high recombination rates. The GC-rich cohesin binding sites in particular stood out from other GC-rich sequences with a 3.4-fold higher density at recombination hot spots versus cold spots, suggesting a functional relationship between recombination frequency and cohesin binding.
Plant Journal | 2012
Zhiwen Wang; Neil Hobson; Leonardo Galindo; Shilin Zhu; Daihu Shi; Joshua McDill; Linfeng Yang; Simon Hawkins; Godfrey Neutelings; Raju Datla; Georgina M. Lambert; David W. Galbraith; Christopher J. Grassa; Armando Geraldes; Quentin C. B. Cronk; Christopher A. Cullis; Prasanta K. Dash; Polumetla Ananda Kumar; Sylvie Cloutier; Andrew G. Sharpe; Gane Ka-Shu Wong; Jun Wang; Michael K. Deyholos
Flax (Linum usitatissimum) is an ancient crop that is widely cultivated as a source of fiber, oil and medicinally relevant compounds. To accelerate crop improvement, we performed whole-genome shotgun sequencing of the nuclear genome of flax. Seven paired-end libraries ranging in size from 300 bp to 10 kb were sequenced using an Illumina genome analyzer. A de novo assembly, comprised exclusively of deep-coverage (approximately 94× raw, approximately 69× filtered) short-sequence reads (44-100 bp), produced a set of scaffolds with N50 = 694 kb, including contigs with N50 = 20.1 kb. The contig assembly contained 302 Mb of non-redundant sequence representing an estimated 81% genome coverage. Up to 96% of published flax ESTs aligned to the whole-genome shotgun scaffolds. However, comparisons with independently sequenced BACs and fosmids showed some mis-assembly of regions at the genome scale. A total of 43,384 protein-coding genes were predicted in the whole-genome shotgun assembly, and up to 93% of published flax ESTs and 86% of A. thaliana genes aligned to these predicted genes, indicating excellent coverage and accuracy at the gene level. Analysis of the synonymous substitution rates (Ks) observed within duplicate gene pairs was consistent with a recent (5-9 MYA) whole-genome duplication in flax. Within the predicted proteome, we observed enrichment of many conserved domains (Pfam-A) that may contribute to the unique properties of this crop, including agglutinin proteins. Together these results show that de novo assembly, based solely on whole-genome shotgun short-sequence reads, is an efficient means of obtaining nearly complete genome sequence information for some plant species.
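The N50 statistic quoted for the flax scaffolds and contigs is a standard contiguity measure: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly. A minimal Python sketch of that definition follows; the function name and the toy lengths are illustrative assumptions, not data from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are taken from longest to shortest until their cumulative
    length reaches at least half of the total; the length of the last
    sequence taken is the N50.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("n50 requires at least one sequence length")


# Hypothetical contig lengths (arbitrary units), total = 300.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_lengths)
print(toy_n50)
```

Because N50 is driven by the longest sequences, joining contigs into scaffolds can raise it sharply without adding any new sequence, which is consistent with the scaffold N50 in the abstract being much larger than the contig N50.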