Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Xianfeng Chen is active.

Publication


Featured researches published by Xianfeng Chen.


Scientific Reports | 2016

Multiple sclerosis patients have a distinct gut microbiota compared to healthy controls.

Jun Chen; Nicholas Chia; Krishna R. Kalari; Janet Yao; Martina Novotna; M. Mateo Paz Soldán; David Luckey; Eric V. Marietta; Patricio Jeraldo; Xianfeng Chen; Brian G. Weinshenker; Moses Rodriguez; Heidi Nelson; Joseph A. Murray; Ashutosh Mangalam

Multiple sclerosis (MS) is an immune-mediated disease, the etiology of which involves both genetic and environmental factors. The exact nature of the environmental factors responsible for predisposition to MS remains elusive; however, it’s hypothesized that gastrointestinal microbiota might play an important role in pathogenesis of MS. Therefore, this study was designed to investigate whether gut microbiota are altered in MS by comparing the fecal microbiota in relapsing remitting MS (RRMS) (n = 31) patients to that of age- and gender-matched healthy controls (n = 36). Phylotype profiles of the gut microbial populations were generated using hypervariable tag sequencing of the V3–V5 region of the 16S ribosomal RNA gene. Detailed fecal microbiome analyses revealed that MS patients had distinct microbial community profile compared to healthy controls. We observed an increased abundance of Psuedomonas, Mycoplana, Haemophilus, Blautia, and Dorea genera in MS patients, whereas control group showed increased abundance of Parabacteroides, Adlercreutzia and Prevotella genera. Thus our study is consistent with the hypothesis that MS patients have gut microbial dysbiosis and further study is needed to better understand their role in the etiopathogenesis of MS.


Gastroenterology | 2016

Relationship Between Microbiota of the Colonic Mucosa vs Feces and Symptoms, Colonic Transit, and Methane Production in Female Patients With Chronic Constipation.

Gopanandan Parthasarathy; Jun Chen; Xianfeng Chen; Nicholas Chia; Helen M. O'Connor; Patricia G. Wolf; H. Rex Gaskins; Adil E. Bharucha

BACKGROUND & AIMS In fecal samples from patients with chronic constipation, the microbiota differs from that of healthy subjects. However, the profiles of fecal microbiota only partially replicate those of the mucosal microbiota. It is not clear whether these differences are caused by variations in diet or colonic transit, or are associated with methane production (measured by breath tests). We compared the colonic mucosal and fecal microbiota in patients with chronic constipation and in healthy subjects to investigate the relationships between microbiota and other parameters. METHODS Sigmoid colonic mucosal and fecal microbiota samples were collected from 25 healthy women (controls) and 25 women with chronic constipation and evaluated by 16S ribosomal RNA gene sequencing (average, 49,186 reads/sample). We assessed associations between microbiota (overall composition and operational taxonomic units) and demographic variables, diet, constipation status, colonic transit, and methane production (measured in breath samples after oral lactulose intake). RESULTS Fourteen patients with chronic constipation had slow colonic transit. The profile of the colonic mucosal microbiota differed between constipated patients and controls (P < .05). The overall composition of the colonic mucosal microbiota was associated with constipation, independent of colonic transit (P < .05), and discriminated between patients with constipation and controls with 94% accuracy. Genera from Bacteroidetes were more abundant in the colonic mucosal microbiota of patients with constipation. The profile of the fecal microbiota was associated with colonic transit before adjusting for constipation, age, body mass index, and diet; genera from Firmicutes (Faecalibacterium, Lactococcus, and Roseburia) correlated with faster colonic transit. Methane production was associated with the composition of the fecal microbiota, but not with constipation or colonic transit. CONCLUSIONS After adjusting for diet and colonic transit, the profile of the microbiota in the colonic mucosa could discriminate patients with constipation from healthy individuals. The profile of the fecal microbiota was associated with colonic transit and methane production (measured in breath), but not constipation.


PLOS ONE | 2014

IM-TORNADO: A tool for comparison of 16S reads from paired-end libraries

Patricio Jeraldo; Krishna R. Kalari; Xianfeng Chen; Jaysheel D. Bhavsar; Ashutosh Mangalam; Bryan A. White; Heidi Nelson; Jean Pierre A Kocher; Nicholas Chia

Motivation 16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads. Results We created a workflow known as Illinois Mayo Taxon Organization from RNA Dataset Operations (IM-TORNADO) for processing non-overlapping reads while retaining maximal information content. Using synthetic mock datasets, we show that the use of both reads produced answers with greater correlation to those from full length 16S rDNA when looking at taxonomy, phylogeny, and beta-diversity. Availability and Implementation IM-TORNADO is freely available at http://sourceforge.net/projects/imtornado and produces BIOM format output for cross compatibility with other pipelines such as QIIME, mothur, and phyloseq.


BMC Bioinformatics | 2014

HiChIP: a high-throughput pipeline for integrative analysis of ChIP-Seq data

Huihuang Yan; Jared M. Evans; Mike Kalmbach; Raymond Moore; Sumit Middha; Stanislav Luban; Liguo Wang; Aditya Bhagwate; Ying Li; Zhifu Sun; Xianfeng Chen; Jean-Pierre A. Kocher

BackgroundChromatin immunoprecipitation (ChIP) followed by next-generation sequencing (ChIP-Seq) has been widely used to identify genomic loci of transcription factor (TF) binding and histone modifications. ChIP-Seq data analysis involves multiple steps from read mapping and peak calling to data integration and interpretation. It remains challenging and time-consuming to process large amounts of ChIP-Seq data derived from different antibodies or experimental designs using the same approach. To address this challenge, there is a need for a comprehensive analysis pipeline with flexible settings to accelerate the utilization of this powerful technology in epigenetics research.ResultsWe have developed a highly integrative pipeline, termed HiChIP for systematic analysis of ChIP-Seq data. HiChIP incorporates several open source software packages selected based on internal assessments and published comparisons. It also includes a set of tools developed in-house. This workflow enables the analysis of both paired-end and single-end ChIP-Seq reads, with or without replicates for the characterization and annotation of both punctate and diffuse binding sites. The main functionality of HiChIP includes: (a) read quality checking; (b) read mapping and filtering; (c) peak calling and peak consistency analysis; and (d) result visualization. In addition, this pipeline contains modules for generating binding profiles over selected genomic features, de novo motif finding from transcription factor (TF) binding sites and functional annotation of peak associated genes.ConclusionsHiChIP is a comprehensive analysis pipeline that can be configured to analyze ChIP-Seq data derived from varying antibodies and experiment designs. Using public ChIP-Seq data we demonstrate that HiChIP is a fast and reliable pipeline for processing large amounts of ChIP-Seq data.


Frontiers in Microbiology | 2016

Capturing One of the Human Gut Microbiome’s Most Wanted: Reconstructing the Genome of a Novel Butyrate-Producing, Clostridial Scavenger from Metagenomic Sequence Data

Patricio Jeraldo; Alvaro G. Hernandez; Henrik Bjørn Nielsen; Xianfeng Chen; Bryan A. White; Nigel Goldenfeld; Heidi Nelson; David Alhquist; Lisa A. Boardman; Nicholas Chia

The role of the microbiome in health and disease is attracting great attention, yet we still know little about some of the most prevalent microorganisms inside our bodies. Several years ago, Human Microbiome Project (HMP) researchers generated a list of “most wanted” taxa: bacteria both prevalent among healthy volunteers and distantly related to any sequenced organisms. Unfortunately, the challenge of assembling high-quality genomes from a tangle of metagenomic reads has slowed progress in learning about these uncultured bacteria. Here, we describe how recent advances in sequencing and analysis allowed us to assemble “most wanted” genomes from metagenomic data collected from four stool samples. Using a combination of both de novo and guided assembly methods, we assembled and binned over 100 genomes from an initial data set of over 1,300 Gbp. One of these genome bins, which met HMP’s criteria for a “most wanted” taxa, contained three essentially complete genomes belonging to a previously uncultivated species. This species is most closely related to Eubacterium desmolans and the clostridial cluster IV/Clostridium leptum subgroup species Butyricicoccus pullicaecorum (71–76% average nucleotide identity). Gene function analysis indicates that the species is an obligate anaerobe, forms spores, and produces the anti-inflammatory short-chain fatty acids acetate and butyrate. It also appears to take up metabolically costly molecules such as cobalamin, methionine, and branch-chained amino acids from the environment, and to lack virulence genes. Thus, the evidence is consistent with a secondary degrader that occupies a host-dependent, nutrient-scavenging niche within the gut; its ability to produce butyrate, which is thought to play an anti-inflammatory role, makes it intriguing for the study of diseases such as colon cancer and inflammatory bowel disease. In conclusion, we have assembled essentially complete genomes from stool metagenomic data, yielding valuable information about uncultured organisms’ metabolic and ecologic niches, factors that may be required to successfully culture these bacteria, and their role in maintaining health and causing disease.


GigaScience | 2018

Hybrid-denovo: a de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags

Xianfeng Chen; Stephen Johnson; Patricio Jeraldo; Junwen Wang; Nicholas Chia; Jean-Pierre A. Kocher; Jun Chen

Abstract Background Illumina paired-end sequencing has been increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. However, the reverse read (R2) often has significant low base quality, and a large proportion of R2s will be discarded after quality control, resulting in a mixture of paired-end and single-end reads. A typical 16S analysis pipeline usually processes either paired-end or single-end reads but not a mixture. Thus, the quantification accuracy and statistical power will be reduced due to the loss of a large amount of reads. As a result, rare taxa may not be detectable with the paired-end approach, or low taxonomic resolution will result in a single-end approach. Results To have both the higher phylogenetic resolution provided by paired-end reads and the higher sequence coverage by single-end reads, we propose a novel OTU-picking pipeline, hybrid-denovo, that can process a hybrid of single-end and paired-end reads. Using high-quality paired-end reads as a gold standard, we show that hybrid-denovo achieved the highest correlation with the gold standard and performed better than the approaches based on paired-end or single-end reads in terms of quantifying the microbial diversity and taxonomic abundances. By applying our method to a rheumatoid arthritis (RA) data set, we demonstrated that hybrid-denovo captured more microbial diversity and identified more RA-associated taxa than a paired-end or single-end approach. Conclusions Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data.


Scientific Reports | 2017

UClncR: Ultrafast and comprehensive long non-coding RNA detection from RNA-seq.

Zhifu Sun; Asha Nair; Xianfeng Chen; Naresh Prodduturi; Junwen Wang; Jean Pierre A Kocher

Long non-coding RNA (lncRNA) is a large class of gene transcripts with regulatory functions discovered in recent years. Many more are expected to be revealed with accumulation of RNA-seq data from diverse types of normal and diseased tissues. However, discovering novel lncRNAs and accurately quantifying known lncRNAs is not trivial from massive RNA-seq data. Herein we describe UClncR, an Ultrafast and Comprehensive lncRNA detection pipeline to tackle the challenge. UClncR takes standard RNA-seq alignment file, performs transcript assembly, predicts lncRNA candidates, quantifies and annotates both known and novel lncRNA candidates, and generates a convenient report for downstream analysis. The pipeline accommodates both un-stranded and stranded RNA-seq so that lncRNAs overlapping with other genes can be predicted and quantified. UClncR is fully parallelized in a cluster environment yet allows users to run samples sequentially without a cluster. The pipeline can process a typical RNA-seq sample in a matter of minutes and complete hundreds of samples in a matter of hours. Analysis of predicted lncRNAs from two test datasets demonstrated UClncR’s accuracy and their relevance to sample clinical phenotypes. UClncR would facilitate researchers’ novel lncRNA discovery significantly and is publically available at http://bioinformaticstools.mayo.edu/research/UClncR.


Scientific Reports | 2018

Author Correction: UClncR: Ultrafast and comprehensive long non-coding RNA detection from RNA-seq

Zhifu Sun; Asha Nair; Xianfeng Chen; Naresh Prodduturi; Junwen Wang; Jean-Pierre A. Kocher

A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.


Cancer Research | 2015

Abstract P6-07-03: An exhaustive algorithm for detecting copy number aberrations and large structural variants in whole-genome mate-pair sequencing data

Yan W. Asmann; Chen Wang; Brian M. Necela; Xianfeng Chen; Jean-Pierre A. Kocher; Matthew J. Maurer; Thomas M. Habermann; Susan L. Slager; Andrew L. Feldman; Anne J. Novak; James R. Cerhan; Edith A. Perez; E. Aubrey Thompson

Objectives and Rationale: Structural variants (SV) including large copy number aberrations (CNV), translocations, inversions, and large insertions and deletions (INDEL) play a critical role in tumorigenesis and progression. In fact, we now know that tumors can be categorized according to the size of mutations harbored. In several cancers, including ovarian and breast cancer, the large structural mutations, rather than single site mutations, play a dominate role in tumor etiology. Therefore, it9s critical to implement reliable algorithm for the detection of structural variants in DNA sequencing data. Mate-pair sequencing is a protocol specifically implemented for detection of the whole-genome level structural variants. It requires less sequencing depth therefor is cost effective, and enables the detection of CNVs, translocations, and inversions simultaneously. However, so far there has been no reliable bioinformatics pipeline for the analyses of the mate-pair sequencing data. Methods: Our novel algorithm, the SnowShoes-SV, is an exhaustive search algorithm designed specifically for mate-pair DNA sequencing data analyses. It calls the SVs based on disconcordant read pairs. The false SVs are filtered according to the following criteria: (i) the number of the supporting read pairs; (ii) the lack of reads from control data that implicate SV at the same region; (iii) the mappability and uniqueness of the region based on data from the ENCODE project; (iv) consistencies of the mapping orientations of the supporting read pairs; (v) the similar sizes between sequencing library and the two end read clusters. Results: Using a set of samples previously genotyped by aCGH, the SnowShoes-SV successfully detected all known CNVs and other SVs. It also identified copy number neutral translocations and inversions previously not identified by aCGH. In addition, the algorithm nominated novel SVs which are to be validated by PCR. Conclusions: SnowShoes-SV is a highly sensitive and specific algorithm for SV detection from the mate-pair DNA sequencing data. Citation Format: Yan W Asmann, Chen Wang, Brian M Necela, Xianfeng Chen, Jean-Pierre A Kocher, Matthew J Maurer, Thomas M Habermann, Susan L Slager, Andrew L Feldman, Anne J Novak, James R Cerhan, Edith A Perez, E Aubrey Thompson. An exhaustive algorithm for detecting copy number aberrations and large structural variants in whole-genome mate-pair sequencing data [abstract]. In: Proceedings of the Thirty-Seventh Annual CTRC-AACR San Antonio Breast Cancer Symposium: 2014 Dec 9-13; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2015;75(9 Suppl):Abstract nr P6-07-03.


Journal of Clinical Oncology | 2017

An exhaustive algorithm for detecting copy number aberrations and large structural variants in whole-genome, mate-pair sequencing data.

Yan W. Asmann; Chen Wang; Brian M. Necela; Xianfeng Chen; Jean-Pierre A. Kocher; Matthew J. Maurer; Thomas M. Habermann; Susan L. Slager; Andew L. Feldman; Ahmet Dogan; Anne J. Novak; James R. Cerhan; Edith A. Perez; E. Aubrey Thompson

Collaboration


Dive into the Xianfeng Chen's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge