Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Nicholas F. Lahens is active.

Publication


Featured researches published by Nicholas F. Lahens.


Proceedings of the National Academy of Sciences of the United States of America | 2014

A circadian gene expression atlas in mammals: Implications for biology and medicine

Ray Zhang; Nicholas F. Lahens; Heather I. Ballance; Michael E. Hughes; John B. Hogenesch

Significance We generated high-resolution multiorgan expression data showing that nearly half of all genes in the mouse genome oscillate with circadian rhythm somewhere in the body. Such widespread transcriptional oscillations have not been previously reported in mammals. Applying pathway analysis, we observed new clock-mediated spatiotemporal relationships. Moreover, we found a majority of best-selling drugs in the United States target circadian gene products. Many of these drugs have relatively short half-lives, and our data predict which may benefit from timed dosing. To characterize the role of the circadian clock in mouse physiology and behavior, we used RNA-seq and DNA arrays to quantify the transcriptomes of 12 mouse organs over time. We found 43% of all protein coding genes showed circadian rhythms in transcription somewhere in the body, largely in an organ-specific manner. In most organs, we noticed the expression of many oscillating genes peaked during transcriptional “rush hours” preceding dawn and dusk. Looking at the genomic landscape of rhythmic genes, we saw that they clustered together, were longer, and had more spliceforms than nonoscillating genes. Systems-level analysis revealed intricate rhythmic orchestration of gene pathways throughout the body. We also found oscillations in the expression of more than 1,000 known and novel noncoding RNAs (ncRNAs). Supporting their potential role in mediating clock function, ncRNAs conserved between mouse and human showed rhythmic expression in similar proportions as protein coding genes. Importantly, we also found that the majority of best-selling drugs and World Health Organization essential medicines directly target the products of rhythmic genes. Many of these drugs have short half-lives and may benefit from timed dosage. In sum, this study highlights critical, systemic, and surprising roles of the mammalian circadian clock and provides a blueprint for advancement in chronotherapy.


Bioinformatics | 2011

Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM)

Gregory R. Grant; Michael H. Farkas; Angel Pizarro; Nicholas F. Lahens; Jonathan Schug; Brian P. Brunk; Christian J. Stoeckert; John B. Hogenesch; Eric A. Pierce

MOTIVATION A critical task in high-throughput sequencing is aligning millions of short reads to a reference genome. Alignment is especially complicated for RNA sequencing (RNA-Seq) because of RNA splicing. A number of RNA-Seq algorithms are available, and claim to align reads with high accuracy and efficiency while detecting splice junctions. RNA-Seq data are discrete in nature; therefore, with reasonable gene models and comparative metrics RNA-Seq data can be simulated to sufficient accuracy to enable meaningful benchmarking of alignment algorithms. The exercise to rigorously compare all viable published RNA-Seq algorithms has not been performed previously. RESULTS We developed an RNA-Seq simulator that models the main impediments to RNA alignment, including alternative splicing, insertions, deletions, substitutions, sequencing errors and intron signal. We used this simulator to measure the accuracy and robustness of available algorithms at the base and junction levels. Additionally, we used reverse transcription-polymerase chain reaction (RT-PCR) and Sanger sequencing to validate the ability of the algorithms to detect novel transcript features such as novel exons and alternative splicing in RNA-Seq data from mouse retina. A pipeline based on BLAT was developed to explore the performance of established tools for this problem, and to compare it to the recently developed methods. This pipeline, the RNA-Seq Unified Mapper (RUM), performs comparably to the best current aligners and provides an advantageous combination of accuracy, speed and usability. AVAILABILITY The RUM pipeline is distributed via the Amazon Cloud and for computing clusters using the Sun Grid Engine (http://cbil.upenn.edu/RUM). CONTACT [email protected]; [email protected] SUPPLEMENTARY INFORMATION The RNA-Seq sequence reads described in the article are deposited at GEO, accession GSE26248.


Nucleic Acids Research | 2012

CircaDB: a database of mammalian circadian gene expression profiles

Angel Pizarro; Katharina E. Hayer; Nicholas F. Lahens; John B. Hogenesch

CircaDB (http://circadb.org) is a new database of circadian transcriptional profiles from time course expression experiments from mice and humans. Each transcript’s expression was evaluated by three separate algorithms, JTK_Cycle, Lomb Scargle and DeLichtenberg. Users can query the gene annotations using simple and powerful full text search terms, restrict results to specific data sets and provide probability thresholds for each algorithm. Visualizations of the data are intuitive charts that convey profile information more effectively than a table of probabilities. The CircaDB web application is open source and available at http://github.com/itmat/circadb.


Genome Biology | 2014

IVT-seq reveals extreme bias in RNA sequencing

Nicholas F. Lahens; Ibrahim Halil Kavakli; Ray Zhang; Katharina E. Hayer; Michael B. Black; Hannah Dueck; Angel Pizarro; Junhyong Kim; Rafael A. Irizarry; Russell S. Thomas; Gregory R. Grant; John B. Hogenesch

BackgroundRNA-seq is a powerful technique for identifying and quantifying transcription and splicing events, both known and novel. However, given its recent development and the proliferation of library construction methods, understanding the bias it introduces is incomplete but critical to realizing its value.ResultsWe present a method, in vitro transcription sequencing (IVT-seq), for identifying and assessing the technical biases in RNA-seq library generation and sequencing at scale. We created a pool of over 1,000 in vitro transcribed RNAs from a full-length human cDNA library and sequenced them with polyA and total RNA-seq, the most common protocols. Because each cDNA is full length, and we show in vitro transcription is incredibly processive, each base in each transcript should be equivalently represented. However, with common RNA-seq applications and platforms, we find 50% of transcripts have more than two-fold and 10% have more than 10-fold differences in within-transcript sequence coverage. We also find greater than 6% of transcripts have regions of dramatically unpredictable sequencing coverage between samples, confounding accurate determination of their expression. We use a combination of experimental and computational approaches to show rRNA depletion is responsible for the most significant variability in coverage, and several sequence determinants also strongly influence representation.ConclusionsThese results show the utility of IVT-seq for promoting better understanding of bias introduced by RNA-seq. We find rRNA depletion is responsible for substantial, unappreciated biases in coverage introduced during library preparation. These biases suggest exon-level expression analysis may be inadvisable, and we recommend caution when interpreting RNA-seq results.


eLife | 2016

A new view of transcriptome complexity and regulation through the lens of local splicing variations

Jorge Vaquero-Garcia; Alejandro Barrera; Matthew R. Gazzara; Juan González-Vallinas; Nicholas F. Lahens; John B. Hogenesch; Kristen W. Lynch; Yoseph Barash

Alternative splicing (AS) can critically affect gene function and disease, yet mapping splicing variations remains a challenge. Here, we propose a new approach to define and quantify mRNA splicing in units of local splicing variations (LSVs). LSVs capture previously defined types of alternative splicing as well as more complex transcript variations. Building the first genome wide map of LSVs from twelve mouse tissues, we find complex LSVs constitute over 30% of tissue dependent transcript variations and affect specific protein families. We show the prevalence of complex LSVs is conserved in humans and identify hundreds of LSVs that are specific to brain subregions or altered in Alzheimers patients. Amongst those are novel isoforms in the Camk2 family and a novel poison exon in Ptbp1, a key splice factor in neurogenesis. We anticipate the approach presented here will advance the ability to relate tissue-specific splice variation to genetic variation, phenotype, and disease. DOI: http://dx.doi.org/10.7554/eLife.11752.001


Bioinformatics | 2015

Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data

Katharina E. Hayer; Angel Pizarro; Nicholas F. Lahens; John B. Hogenesch; Gregory R. Grant

Motivation: Because of the advantages of RNA sequencing (RNA-Seq) over microarrays, it is gaining widespread popularity for highly parallel gene expression analysis. For example, RNA-Seq is expected to be able to provide accurate identification and quantification of full-length splice forms. A number of informatics packages have been developed for this purpose, but short reads make it a difficult problem in principle. Sequencing error and polymorphisms add further complications. It has become necessary to perform studies to determine which algorithms perform best and which if any algorithms perform adequately. However, there is a dearth of independent and unbiased benchmarking studies. Here we take an approach using both simulated and experimental benchmark data to evaluate their accuracy. Results: We conclude that most methods are inaccurate even using idealized data, and that no method is highly accurate once multiple splice forms, polymorphisms, intron signal, sequencing errors, alignment errors, annotation errors and other complicating factors are present. These results point to the pressing need for further algorithm development. Availability and implementation: Simulated datasets and other supporting information can be found at http://bioinf.itmat.upenn.edu/BEERS/bp2 Supplementary information: Supplementary data are available at Bioinformatics online. Contact: [email protected]


Genome Research | 2015

Ribosome profiling reveals an important role for translational control in circadian gene expression

Christopher Jang; Nicholas F. Lahens; John B. Hogenesch; Amita Sehgal

Physiological and behavioral circadian rhythms are driven by a conserved transcriptional/translational negative feedback loop in mammals. Although most core clock factors are transcription factors, post-transcriptional control introduces delays that are critical for circadian oscillations. Little work has been done on circadian regulation of translation, so to address this deficit we conducted ribosome profiling experiments in a human cell model for an autonomous clock. We found that most rhythmic gene expression occurs with little delay between transcription and translation, suggesting that the lag in the accumulation of some clock proteins relative to their mRNAs does not arise from regulated translation. Nevertheless, we found that translation occurs in a circadian fashion for many genes, sometimes imposing an additional level of control on rhythmically expressed mRNAs and, in other cases, conferring rhythms on noncycling mRNAs. Most cyclically transcribed RNAs are translated at one of two major times in a 24-h day, while rhythmic translation of most noncyclic RNAs is phased to a single time of day. Unexpectedly, we found that the clock also regulates the formation of cytoplasmic processing (P) bodies, which control the fate of mRNAs, suggesting circadian coordination of mRNA metabolism and translation.


Nature Genetics | 2017

The Nephila clavipes genome highlights the diversity of spider silk genes and their complex expression

Paul L. Babb; Nicholas F. Lahens; Sandra M. Correa-Garhwal; David N Nicholson; Eun Ji Kim; John B. Hogenesch; Matjaž Kuntner; Linden Higgins; Cheryl Y. Hayashi; Ingi Agnarsson; Benjamin F. Voight

Spider silks are the toughest known biological materials, yet are lightweight and virtually invisible to the human immune system, and they thus have revolutionary potential for medicine and industry. Spider silks are largely composed of spidroins, a unique family of structural proteins. To investigate spidroin genes systematically, we constructed the first genome of an orb-weaving spider: the golden orb-weaver (Nephila clavipes), which builds large webs using an extensive repertoire of silks with diverse physical properties. We cataloged 28 Nephila spidroins, representing all known orb-weaver spidroin types, and identified 394 repeated coding motif variants and higher-order repetitive cassette structures unique to specific spidroins. Characterization of spidroin expression in distinct silk gland types indicates that glands can express multiple spidroin types. We find evidence of an alternatively spliced spidroin, a spidroin expressed only in venom glands, evolutionary mechanisms for spidroin diversification, and non-spidroin genes with expression patterns that suggest roles in silk production.


Scientific Reports | 2017

A Pilot Characterization of the Human Chronobiome

Carsten Skarke; Nicholas F. Lahens; Seth D. Rhoades; Amy E. Campbell; Kyle Bittinger; Aubrey Bailey; Christian Hoffmann; Randal S. Olson; Lihong Chen; Guangrui Yang; Thomas S. Price; Jason H. Moore; Frederic D. Bushman; Casey S. Greene; Gregory R. Grant; Aalim M. Weljie; Garret A. FitzGerald

Physiological function, disease expression and drug effects vary by time-of-day. Clock disruption in mice results in cardio-metabolic, immunological and neurological dysfunction; circadian misalignment using forced desynchrony increases cardiovascular risk factors in humans. Here we integrated data from remote sensors, physiological and multi-omics analyses to assess the feasibility of detecting time dependent signals - the chronobiome – despite the “noise” attributable to the behavioral differences of free-living human volunteers. The majority (62%) of sensor readouts showed time-specific variability including the expected variation in blood pressure, heart rate, and cortisol. While variance in the multi-omics is dominated by inter-individual differences, temporal patterns are evident in the metabolome (5.4% in plasma, 5.6% in saliva) and in several genera of the oral microbiome. This demonstrates, despite a small sample size and limited sampling, the feasibility of characterizing at scale the human chronobiome “in the wild”. Such reference data at scale are a prerequisite to detect and mechanistically interpret discordant data derived from patients with temporal patterns of disease expression, to develop time-specific therapeutic strategies and to refine existing treatments.


BMC Genomics | 2017

A comparison of Illumina and Ion Torrent sequencing platforms in the context of differential gene expression

Nicholas F. Lahens; Emanuela Ricciotti; Olga Smirnova; Erik Toorens; Eun Ji Kim; Giacomo Baruzzo; Katharina E. Hayer; Tapan Ganguly; Jonathan Schug; Gregory R. Grant

BackgroundThough Illumina has largely dominated the RNA-Seq field, the simultaneous availability of Ion Torrent has left scientists wondering which platform is most effective for differential gene expression (DGE) analysis. Previous investigations of this question have typically used reference samples derived from cell lines and brain tissue, and do not involve biological variability. While these comparisons might inform studies of tissue-specific expression, marked by large-scale transcriptional differences, this is not the common use case.ResultsHere we employ a standard treatment/control experimental design, which enables us to evaluate these platforms in the context of the expression differences common in differential gene expression experiments. Specifically, we assessed the hepatic inflammatory response of mice by assaying liver RNA from control and IL-1β treated animals with both the Illumina HiSeq and the Ion Torrent Proton sequencing platforms. We found the greatest difference between the platforms at the level of read alignment, a moderate level of concordance at the level of DGE analysis, and nearly identical results at the level of differentially affected pathways. Interestingly, we also observed a strong interaction between sequencing platform and choice of aligner. By aligning both real and simulated Illumina and Ion Torrent data with the twelve most commonly-cited aligners in the literature, we observed that different aligner and platform combinations were better suited to probing different genomic features; for example, disentangling the source of expression in gene-pseudogene pairs.ConclusionsTaken together, our results indicate that while Illumina and Ion Torrent have similar capacities to detect changes in biology from a treatment/control experiment, these platforms may be tailored to interrogate different transcriptional phenomena through careful selection of alignment software.

Collaboration


Dive into the Nicholas F. Lahens's collaboration.

Top Co-Authors

Avatar

Gregory R. Grant

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

John B. Hogenesch

Cincinnati Children's Hospital Medical Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Eun Ji Kim

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Katharina E. Hayer

Children's Hospital of Philadelphia

View shared research outputs
Top Co-Authors

Avatar

Angel Pizarro

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jonathan Schug

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Aalim M. Weljie

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge