Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Christophe G. Lambert is active.

Publication


Featured researches published by Christophe G. Lambert.


Proceedings of the National Academy of Sciences of the United States of America | 2007

Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia

Todd Lencz; Christophe G. Lambert; Pamela DeRosse; Katherine E. Burdick; T. Vance Morgan; John Kane; Raju Kucherlapati; Anil K. Malhotra

Evolutionarily significant selective sweeps may result in long stretches of homozygous polymorphisms in individuals from outbred populations. We developed whole-genome homozygosity association (WGHA) methodology to characterize this phenomenon in healthy individuals and to use this genomic feature to identify genetic risk loci for schizophrenia (SCZ). Applying WGHA to 178 SCZ cases and 144 healthy controls genotyped at 500,000 markers, we found that runs of homozygosity (ROHs), ranging in size from 200 kb to 15 mb, were common in unrelated Caucasians. Properties of common ROHs in healthy subjects, including chromosomal location and presence of nonancestral haplotypes, converged with prior reports identifying regions under selective pressure. This interpretation was further supported by analysis of multiethnic HapMap samples genotyped with the same markers. ROHs were significantly more common in SCZ cases, and a set of nine ROHs significantly differentiated cases from controls. Four of these 9 “risk ROHs” contained or neighbored genes associated with SCZ (NOS1AP, ATF2, NSF, and PIK3C3). Several of these risk ROHs were very rare in healthy subjects, suggesting that recessive effects of relatively high penetrance may explain a proportion of the genetic liability for SCZ. Other risk ROHs feature haplotypes that are also common in healthy individuals, possibly indicating a source of balancing selection.


Journal of Chemical Information and Computer Sciences | 1999

Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning

Andrew Rusinko; Mark W. Farmen; Christophe G. Lambert; Paul L. Brown; S. Stanley Young

Combinatorial chemistry and high-throughput screening are revolutionizing the process of lead discovery in the pharmaceutical industry. Large numbers of structures and vast quantities of biological assay data are quickly being accumulated, overwhelming traditional structure/activity relationship (SAR) analysis technologies. Recursive partitioning is a method for statistically determining rules that classify objects into similar categories or, in this case, structures into groups of molecules with similar potencies. SCAM is a computer program implemented to make extremely efficient use of this methodology. Depending on the size of the data set, rules explaining biological data can be determined interactively. An example data set of 1650 monoamine oxidase inhibitors exemplifies the method, yielding substructural rules and leading to general classifications of these inhibitors. The method scales linearly with the number of descriptors, so hundreds of thousands of structures can be analyzed utilizing thousands to millions of molecular descriptors. There are currently no methods to deal with statistical analysis problems of this size. An important aspect of this analysis is the ability to deal with mixtures, i.e., identify SAR rules for classes of compounds in the same data set that might be binding in different ways. Most current quantitative structure/activity relationship methods require that the compounds follow a single mechanism. Advantages and limitations of this methodology are presented.


PLOS ONE | 2013

Identification of Rare Recurrent Copy Number Variants in High-Risk Autism Families and Their Prevalence in a Large ASD Population

Nori Matsunami; Dexter Hadley; Charles H. Hensel; G. Bryce Christensen; Cecilia Kim; Edward C. Frackelton; Kelly Thomas; Renata Pellegrino da Silva; Jeff Stevens; Lisa Baird; Brith Otterud; Karen Ho; Tena Varvil; Tami Leppert; Christophe G. Lambert; M. Leppert; Hakon Hakonarson

Structural variation is thought to play a major etiological role in the development of autism spectrum disorders (ASDs), and numerous studies documenting the relevance of copy number variants (CNVs) in ASD have been published since 2006. To determine if large ASD families harbor high-impact CNVs that may have broader impact in the general ASD population, we used the Affymetrix genome-wide human SNP array 6.0 to identify 153 putative autism-specific CNVs present in 55 individuals with ASD from 9 multiplex ASD pedigrees. To evaluate the actual prevalence of these CNVs as well as 185 CNVs reportedly associated with ASD from published studies many of which are insufficiently powered, we designed a custom Illumina array and used it to interrogate these CNVs in 3,000 ASD cases and 6,000 controls. Additional single nucleotide variants (SNVs) on the array identified 25 CNVs that we did not detect in our family studies at the standard SNP array resolution. After molecular validation, our results demonstrated that 15 CNVs identified in high-risk ASD families also were found in two or more ASD cases with odds ratios greater than 2.0, strengthening their support as ASD risk variants. In addition, of the 25 CNVs identified using SNV probes on our custom array, 9 also had odds ratios greater than 2.0, suggesting that these CNVs also are ASD risk variants. Eighteen of the validated CNVs have not been reported previously in individuals with ASD and three have only been observed once. Finally, we confirmed the association of 31 of 185 published ASD-associated CNVs in our dataset with odds ratios greater than 2.0, suggesting they may be of clinical relevance in the evaluation of children with ASDs. Taken together, these data provide strong support for the existence and application of high-impact CNVs in the clinical genetic evaluation of children with ASD.


Drug Safety | 2014

Bridging Islands of Information to Establish an Integrated Knowledge Base of Drugs and Health Outcomes of Interest

Richard D. Boyce; Patrick B. Ryan; G. Niklas Norén; Martijn J. Schuemie; Christian G. Reich; Jon D. Duke; Nicholas P. Tatonetti; Gianluca Trifirò; Rave Harpaz; J. Marc Overhage; Abraham G. Hartzema; Mark Khayter; Erica A. Voss; Christophe G. Lambert; Vojtech Huser; Michel Dumontier

The entire drug safety enterprise has a need to search, retrieve, evaluate, and synthesize scientific evidence more efficiently. This discovery and synthesis process would be greatly accelerated through access to a common framework that brings all relevant information sources together within a standardized structure. This presents an opportunity to establish an open-source community effort to develop a global knowledge base, one that brings together and standardizes all available information for all drugs and all health outcomes of interest (HOIs) from all electronic sources pertinent to drug safety. To make this vision a reality, we have established a workgroup within the Observational Health Data Sciences and Informatics (OHDSI, http://ohdsi.org) collaborative. The workgroup’s mission is to develop an open-source standardized knowledge base for the effects of medical products and an efficient procedure for maintaining and expanding it. The knowledge base will make it simpler for practitioners to access, retrieve, and synthesize evidence so that they can reach a rigorous and accurate assessment of causal relationships between a given drug and HOI. Development of the knowledge base will proceed with the measureable goal of supporting an efficient and thorough evidence-based assessment of the effects of 1,000 active ingredients across 100 HOIs. This non-trivial task will result in a high-quality and generally applicable drug safety knowledge base. It will also yield a reference standard of drug–HOI pairs that will enable more advanced methodological research that empirically evaluates the performance of drug safety analysis methods.


PLOS ONE | 2011

Genome-Wide Association Study in Bipolar Patients Stratified by Co-Morbidity

Berit Kerner; Christophe G. Lambert; Bengt Muthén

Background Bipolar disorder is a severe psychiatric disorder with high heritability. Co-morbid conditions are common and might define latent subgroups of patients that are more homogeneous with respect to genetic risk factors. Methodology In the Caucasian GAIN bipolar disorder sample of 1000 cases and 1034 controls, we tested the association of single nucleotide polymorphisms with patient subgroups defined by co-morbidity. Results Bipolar disorder with psychosis and/or substance abuse in the absence of alcohol dependence was associated with the rare variant rs1039002 in the vicinity of the gene phosphodiesterase 10A (PDE10A) on chromosome 6q27 (p = 1.7×10−8). PDE10A has been implicated in the pathophysiology of psychosis. Antagonists to the encoded protein are currently in clinical testing. Another rare variant, rs12563333 (p = 5.9×10−8) on chromosome 1q41 close to the MAP/microtubule affinity-regulating kinase 1 (MARK1) gene, approached the genome-wide level of significance in this subgroup. Homozygotes for the minor allele were present in cases and absent in controls. Bipolar disorder with alcohol dependence and other co-morbidities was associated with SNP rs2727943 (p = 3.3×10−8) on chromosome 3p26.3 located between the genes contactin-4 precursor (BIG-2) and contactin 6 (CNTN6). All three associations were found under the recessive genetic model. Bipolar disorder with low probability of co-morbid conditions did not show significant associations. Conclusion Conceptualizing bipolar disorder as a heterogeneous disorder with regard to co-morbid conditions might facilitate the identification of genetic risk alleles. Rare variants might contribute to the susceptibility to bipolar disorder.


PLOS ONE | 2012

Technical Reproducibility of Genotyping SNP Arrays Used in Genome-Wide Association Studies

Huixiao Hong; Lei Xu; Jie Liu; Wendell D. Jones; Zhenqiang Su; Baitang Ning; Roger Perkins; Weigong Ge; K Miclaus; Li Zhang; Kyung-Hee Park; Bridgett Green; Tao Han; Hong Fang; Christophe G. Lambert; Silvia C. Vega; Simon Lin; Nadereh Jafari; Wendy Czika; Russell D. Wolfinger; Federico Goodsaid; Weida Tong; Leming Shi

During the last several years, high-density genotyping SNP arrays have facilitated genome-wide association studies (GWAS) that successfully identified common genetic variants associated with a variety of phenotypes. However, each of the identified genetic variants only explains a very small fraction of the underlying genetic contribution to the studied phenotypic trait. Moreover, discordance observed in results between independent GWAS indicates the potential for Type I and II errors. High reliability of genotyping technology is needed to have confidence in using SNP data and interpreting GWAS results. Therefore, reproducibility of two widely genotyping technology platforms from Affymetrix and Illumina was assessed by analyzing four technical replicates from each of the six individuals in five laboratories. Genotype concordance of 99.40% to 99.87% within a laboratory for the sample platform, 98.59% to 99.86% across laboratories for the same platform, and 98.80% across genotyping platforms was observed. Moreover, arrays with low quality data were detected when comparing genotyping data from technical replicates, but they could not be detected according to venders’ quality control (QC) suggestions. Our results demonstrated the technical reliability of currently available genotyping platforms but also indicated the importance of incorporating some technical replicates for genotyping QC in order to improve the reliability of GWAS results. The impact of discordant genotypes on association analysis results was simulated and could explain, at least in part, the irreproducibility of some GWAS findings when the effect size (i.e. the odds ratio) and the minor allele frequencies are low.


Algorithmica | 1999

Efficient on-line nonparametric kernel density estimation

Christophe G. Lambert; S. E. Harrington; C. R. Harvey; A. Glodjo

Abstract. Nonparametric density estimation has broad applications in computational finance especially in cases where high frequency data are available. However, the technique is often intractable, given the run times necessary to evaluate a density. We present a new and efficient algorithm based on multipole techniques. Given the n kernels that estimate the density, current methods take O(n) time directly to sum the kernels to perform a single density query. In an on-line algorithm where points are continually added to the density, the cumulative O(n2) running time for n queries makes it very costly, if not impractical, to compute the density for large n . Our new Multipole-accelerated On-line Density Estimation (MODE) algorithm is general in that it can be applied to any kernel (in arbitrary dimensions) that admits a Taylor series expansion. The running time for a density query reduces to O (logn) or even constant time, depending on the kernel chosen, and, hence, the cumulative running time is reduced to O (n logn) or O(n) , respectively. Our results show that the MODE algorithm provides dramatic advantages over the direct approach to density evaluation. For example, we show using a modest computing platform that on-line density updates and queries for 1 million points and two dimensions take 8 days to compute using the direct approach versus 40 seconds with the MODE approach.


Chemometrics and Intelligent Laboratory Systems | 2002

Mixture deconvolution and analysis of Ames mutagenicity data

S. Stanley Young; Vijay K. Gombar; Michael R. Emptage; Neal F. Cariello; Christophe G. Lambert

Abstract Mixtures abound in chemistry: two or more compounds may be present in the same sample, the same biological effect may be produced by two different mechanisms, or two compounds might bind to a receptor in different orientations or even in different places. Sometimes, results are given in summary form. For example, a chemical may be declared a mutagen due to any of several assay results from an Ames test. Clearly, a single mathematical model is not going to hold for data sets where such multiplicity of phenomena are represented. We need molecular descriptors and statistical methods which enable us to deconvolute such mixtures. Our idea is to combine topological chemical descriptors—augmented atoms and through-bond distance measures—with a statistical technique, segmentation recursive partitioning, that is capable of dealing with mixtures. The benefit is the ability to develop structure–activity relationships for large, heterogeneous data sets. We successfully demonstrate the effectiveness of the above descriptors and the technique of recursive partitioning with Ames test results taken from public sources.


Biostatistics | 2012

Learning from our GWAS mistakes: from experimental design to scientific method.

Christophe G. Lambert; Laura J. Black

Many public and private genome-wide association studies that we have analyzed include flaws in design, with avoidable confounding appearing as a norm rather than the exception. Rather than recognizing flawed research design and addressing that, a category of quality-control statistical methods has arisen to treat only the symptoms. Reflecting more deeply, we examine elements of current genomic research in light of the traditional scientific method and find that hypotheses are often detached from data collection, experimental design, and causal theories. Association studies independent of causal theories, along with multiple testing errors, too often drive health care and public policy decisions. In an era of large-scale biological research, we ask questions about the role of statistical analyses in advancing coherent theories of diseases and their mechanisms. We advocate for reinterpretation of the scientific method in the context of large-scale data analysis opportunities and for renewed appreciation of falsifiable hypotheses, so that we can learn more from our best mistakes.


Pharmacogenomics Journal | 2010

Assessing sources of inconsistencies in genotypes and their effects on genome-wide association studies with HapMap samples

Huixiao Hong; Leming Shi; Zhenqiang Su; Weigong Ge; Wendell D. Jones; Wendy Czika; K Miclaus; Christophe G. Lambert; Silvia C. Vega; J. Zhang; Baitang Ning; Jie Liu; Bridgett Green; Lei Xu; Hong Fang; Roger Perkins; Simon Lin; Nadereh Jafari; Kyung-Hee Park; T. Ahn; Marco Chierici; Cesare Furlanello; Lu Zhang; Russell D. Wolfinger; Federico Goodsaid; Weida Tong

The discordance in results of independent genome-wide association studies (GWAS) indicates the potential for Type I and Type II errors. We assessed the repeatibility of current Affymetrix technologies that support GWAS. Reasonable reproducibility was observed for both raw intensity and the genotypes/copy number variants. We also assessed consistencies between different SNP arrays and between genotype calling algorithms. We observed that the inconsistency in genotypes was generally small at the specimen level. To further examine whether the differences from genotyping and genotype calling are possible sources of variation in GWAS results, an association analysis was applied to compare the associated SNPs. We observed that the inconsistency in genotypes not only propagated to the association analysis, but was amplified in the associated SNPs. Our studies show that inconsistencies between SNP arrays and between genotype calling algorithms are potential sources for the lack of reproducibility in GWAS results.

Collaboration


Dive into the Christophe G. Lambert's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Federico Goodsaid

Food and Drug Administration

View shared research outputs
Top Co-Authors

Avatar

Huixiao Hong

Food and Drug Administration

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge