Is this you? Create Your Porfile

Perry B. Cregan

United States Department of Agriculture

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Perry B. Cregan is active.

Explore More

Publication

Featured researches published by Perry B. Cregan.

Nature | 2010

Genome sequence of the palaeopolyploid soybean

Jeremy Schmutz; Steven B. Cannon; Jessica A. Schlueter; Jianxin Ma; Therese Mitros; William Nelson; David L. Hyten; Qijian Song; Jay J. Thelen; Jianlin Cheng; Dong Xu; Uffe Hellsten; Gregory D. May; Yeisoo Yu; Tetsuya Sakurai; Taishi Umezawa; Madan K. Bhattacharyya; Devinder Sandhu; Babu Valliyodan; Erika Lindquist; Myron Peto; David Grant; Shengqiang Shu; David Goodstein; Kerrie Barry; Montona Futrell-Griggs; Brian Abernathy; Jianchang Du; Zhixi Tian; Liucun Zhu

Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

Nature Genetics | 2014

A reference genome for common bean and genome-wide analysis of dual domestications

Jeremy Schmutz; Phillip E. McClean; Sujan Mamidi; G Albert Wu; Steven B. Cannon; Jane Grimwood; Jerry Jenkins; Shengqiang Shu; Qijian Song; Carolina Chavarro; Mirayda Torres-Torres; Valérie Geffroy; Samira Mafi Moghaddam; Dongying Gao; Brian Abernathy; Kerrie Barry; Matthew W. Blair; Mark A. Brick; Mansi Chovatia; Paul Gepts; David Goodstein; Michael Gonzales; Uffe Hellsten; David L. Hyten; Gaofeng Jia; James D. Kelly; Dave Kudrna; Rian Lee; Manon M. S. Richard; Phillip N. Miklas

Common bean (Phaseolus vulgaris L.) is the most important grain legume for human consumption and has a role in sustainable agriculture owing to its ability to fix atmospheric nitrogen. We assembled 473 Mb of the 587-Mb genome and genetically anchored 98% of this sequence in 11 chromosome-scale pseudomolecules. We compared the genome for the common bean against the soybean genome to find changes in soybean resulting from polyploidy. Using resequencing of 60 wild individuals and 100 landraces from the genetically differentiated Mesoamerican and Andean gene pools, we confirmed 2 independent domestications from genetic pools that diverged before human colonization. Less than 10% of the 74 Mb of sequence putatively involved in domestication was shared by the two domestication events. We identified a set of genes linked with increased leaf and seed size and combined these results with quantitative trait locus data from Mesoamerican cultivars. Genes affected by domestication may be useful for genomics-enabled crop improvement.

Proceedings of the National Academy of Sciences of the United States of America | 2006

Impacts of genetic bottlenecks on soybean genome diversity

David L. Hyten; Qijian Song; Youlin Zhu; Ik Young Choi; Randall L. Nelson; Jose M. Costa; James E. Specht; Randy C. Shoemaker; Perry B. Cregan

Soybean has undergone several genetic bottlenecks. These include domestication in Asia to produce numerous Asian landraces, introduction of relatively few landraces to North America, and then selective breeding over the past 75 years. It is presumed that these three human-mediated events have reduced genetic diversity. We sequenced 111 fragments from 102 genes in four soybean populations representing the populations before and after genetic bottlenecks. We show that soybean has lost many rare sequence variants and has undergone numerous allele frequency changes throughout its history. Although soybean genetic diversity has been eroded by human selection after domestication, it is notable that modern cultivars have retained 72% of the sequence diversity present in the Asian landraces but lost 79% of rare alleles (frequency ≤0.10) found in the Asian landraces. Simulations indicated that the diversity lost through the genetic bottlenecks of introduction and plant breeding was mostly due to the small number of Asian introductions and not the artificial selection subsequently imposed by selective breeding. The bottleneck with the most impact was domestication; when the low sequence diversity present in the wild species was halved, 81% of the rare alleles were lost, and 60% of the genes exhibited evidence of significant allele frequency changes.

Theoretical and Applied Genetics | 2001

DNA markers for Fusarium head blight resistance QTLs in two wheat populations

James A. Anderson; R. W. Stack; Sixin Liu; B. L. Waldron; A. D. Fjeld; Clarice J. Coyne; B. Moreno-Sevilla; J. Mitchell Fetch; Qijian Song; Perry B. Cregan; R. C. Frohberg

Abstract Genetic resistance to Fusarium head blight (FHB), caused by Fusarium graminearum, is necessary to reduce the wheat grain yield and quality losses caused by this disease. Development of resistant cultivars has been slowed by poorly adapted and incomplete resistance sources and confounding environmental effects that make screening of germplasm difficult. DNA markers for FHB resistance QTLs have been identified and may be used to speed the introgression of resistance genes into adapted germplasm. This study was conducted to identify and map additional DNA markers linked to genes controlling FHB resistance in two spring wheat recombinant inbred populations, both segregating for genes from the widely used resistance source ’Sumai 3’. The first population was from the cross of Sumai 3/Stoa in which we previously identified five resistance QTLs. The second population was from the cross of ND2603 (Sumai 3/Wheaton) (resistant)/ Butte 86 (moderately susceptible). Both populations were evaluated for reaction to inoculation with F. graminearum in two greenhouse experiments. A combination of 521 RFLP, AFLP, and SSR markers were mapped in the Sumai 3/Stoa population and all DNA markers associated with resistance were screened on the ND2603/Butte 86 population. Two new QTL on chromosomes 3AL and 6AS wer found in the ND2603/Butte 86 population, and AFLP and SSR markers were identified that explained a greater portion of the phenotypic variation compared to the previous RFLP markers. Both of the Sumai 3-derived QTL regions (on chromosomes 3BS, and 6BS) from the Sumai 3/Stoa population were associated with FHB resistance in the ND2603/Butte 86 population. Markers in the 3BS QTL region (Qfhs.ndsu-3BS) alone explain 41.6 and 24.8% of the resistance to FHB in the Sumai 3/Stoa and ND2603/Butte 86 populations, respectively. This region contains a major QTL for resistance to FHB and should be useful in marker-assisted selection.

Theoretical and Applied Genetics | 1995

The use of microsatellite DNA markers for soybean genotype identification

J. Rongwen; Mahinur S. Akkaya; A. A. Bhagwat; Uri Lavi; Perry B. Cregan

Conventional morphological and pigementation traits, as well as disease resistance, have been used to distinguish the uniqueness of new soybean cultivars for purposes of plant variety protection. With increasing numbers of cultivars and a finite number of conventional characters, it has become apparent that such traits will not suffice to establish uniqueness. The objective of this work was to provide an initial evaluation of microsatellite or simple-sequence-repeat (SSR) DNA markers to develop unique DNA profiles of soybean genotypes. Microsatellites are DNA sequences such as (AT)n/(TA)n and (ATT)n/(TAA)n that are composed of tandemly repeated 2–5-basepair DNA core sequences. The DNA sequences flanking microsatellites are generally conserved allowing the selection of polymerase chain reaction (PCR) primers that will amplify the intervening SSR. Variation in the number of tandem repeats, “n”, results in PCR product length differences. The SSR alleles present at three (AT)n/(TA)n and four (ATT)n/(TAA)n loci were determined in each of 96 diverse soybean genotypes. Between 11 and 26 alleles were found at each of the seven loci. Only two genotypes had identical SSR allelic profiles and these had very similar pedigrees. The gene diversity for the seven markers averaged 0.87 for all 96 genotypes and 0.74 for a subset of 26 North American cultivars. These are much higher than soybean gene diversity values obtained using RFLP markers, and are similar to the average values obtained for human microsatellite markers. SSR markers provide an excellent complement to the conventional markers that are currently used to characterize soybean genotypes.

Genetics | 2007

A Soybean Transcript Map: Gene Distribution, Haplotype and Single-Nucleotide Polymorphism Analysis

Ik Young Choi; David L. Hyten; Lakshmi K. Matukumalli; Qijian Song; Julian M. Chaky; Charles V. Quigley; Kevin Chase; K. Gordon Lark; Robert Reiter; Mun Sup Yoon; Eun Young Hwang; Seung In Yi; Nevin D. Young; Randy C. Shoemaker; Curtis P. Van Tassell; James E. Specht; Perry B. Cregan

The first genetic transcript map of the soybean genome was created by mapping one SNP in each of 1141 genes in one or more of three recombinant inbred line mapping populations, thus providing a picture of the distribution of genic sequences across the mapped portion of the genome. Single-nucleotide polymorphisms (SNPs) were discovered via the resequencing of sequence-tagged sites (STSs) developed from expressed sequence tag (EST) sequence. From an initial set of 9459 polymerase chain reaction primer sets designed to a diverse set of genes, 4240 STSs were amplified and sequenced in each of six diverse soybean genotypes. In the resulting 2.44 Mbp of aligned sequence, a total of 5551 SNPs were discovered, including 4712 single-base changes and 839 indels for an average nucleotide diversity of θ = 0.000997. The analysis of the observed genetic distances between adjacent genes vs. the theoretical distribution based upon the assumption of a random distribution of genes across the 20 soybean linkage groups clearly indicated that genes were clustered. Of the 1141 genes, 291 mapped to 72 of the 112 gaps of 5–10 cM in the preexisting simple sequence repeat (SSR)-based map, while 111 genes mapped in 19 of the 26 gaps >10 cM. The addition of 1141 sequence-based genic markers to the soybean genome map will provide an important resource to soybean geneticists for quantitative trait locus discovery and map-based cloning, as well as to soybean breeders who increasingly depend upon marker-assisted selection in cultivar improvement.

BMC Genomics | 2014

A genome-wide association study of seed protein and oil content in soybean.

Eun-Young Hwang; Qijian Song; Gaofeng Jia; James E. Specht; David L. Hyten; Jose M. Costa; Perry B. Cregan

BackgroundAssociation analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify quantitative trait loci (QTL) controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content.ResultsA total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r2) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil.ConclusionsThis research indicated that GWAS not only identified most of the previously reported QTL controlling seed protein and oil, but also resulted in narrower genomic regions than the regions reported as containing these QTL. The narrower GWAS-defined genome regions will allow more precise marker-assisted allele selection and will expedite positional cloning of the causal gene(s).

BMC Genomics | 2010

High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence

David L. Hyten; Steven B. Cannon; Qijian Song; Nathan Weeks; Edward W. Fickus; Randy C. Shoemaker; James E. Specht; Andrew D. Farmer; Gregory D. May; Perry B. Cregan

BackgroundThe Soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy, but its marker density was only sufficient to properly orient 66% of the sequence scaffolds. The discovery and genetic mapping of more single nucleotide polymorphism (SNP) markers were needed to anchor and orient the remaining genome sequence. To that end, next generation sequencing and high-throughput genotyping were combined to obtain a much higher resolution genetic map that could be used to anchor and orient most of the remaining sequence and to help validate the integrity of the existing scaffold builds.ResultsA total of 7,108 to 25,047 predicted SNPs were discovered using a reduced representation library that was subsequently sequenced by the Illumina sequence-by-synthesis method on the clonal single molecule array platform. Using multiple SNP prediction methods, the validation rate of these SNPs ranged from 79% to 92.5%. A high resolution genetic map using 444 recombinant inbred lines was created with 1,790 SNP markers. Of the 1,790 mapped SNP markers, 1,240 markers had been selectively chosen to target existing unanchored or un-oriented sequence scaffolds, thereby increasing the amount of anchored sequence to 97%.ConclusionWe have demonstrated how next generation sequencing was combined with high-throughput SNP detection assays to quickly discover large numbers of SNPs. Those SNPs were then used to create a high resolution genetic map that assisted in the assembly of scaffolds from the 8× whole genome shotgun sequences into pseudomolecules corresponding to chromosomes of the organism.

PLOS ONE | 2013

Development and Evaluation of SoySNP50K, a High-Density Genotyping Array for Soybean

Qijian Song; David L. Hyten; Gaofeng Jia; Charles V. Quigley; Edward W. Fickus; Randall L. Nelson; Perry B. Cregan

The objective of this research was to identify single nucleotide polymorphisms (SNPs) and to develop an Illumina Infinium BeadChip that contained over 50,000 SNPs from soybean (Glycine max L. Merr.). A total of 498,921,777 reads 35–45bp in length were obtained from DNA sequence analysis of reduced representation libraries from several soybean accessions which included six cultivated and two wild soybean (G. soja Sieb. et Zucc.) genotypes. These reads were mapped to the soybean whole genome sequence and 209,903 SNPs were identified. After applying several filters, a total of 146,161 of the 209,903 SNPs were determined to be ideal candidates for Illumina Infinium II BeadChip design. To equalize the distance between selected SNPs, increase assay success rate, and minimize the number of SNPs with low minor allele frequency, an iteration algorithm based on a selection index was developed and used to select 60,800 SNPs for Infinium BeadChip design. Of the 60,800 SNPs, 50,701 were targeted to euchromatic regions and 10,000 to heterochromatic regions of the 20 soybean chromosomes. In addition, 99 SNPs were targeted to unanchored sequence scaffolds. Of the 60,800 SNPs, a total of 52,041 passed Illumina’s manufacturing phase to produce the SoySNP50K iSelect BeadChip. Validation of the SoySNP50K chip with 96 landrace genotypes, 96 elite cultivars and 96 wild soybean accessions showed that 47,337 SNPs were polymorphic and generated successful SNP allele calls. In addition, 40,841 of the 47,337 SNPs (86%) had minor allele frequencies ≥10% among the landraces, elite cultivars and the wild soybean accessions. A total of 620 and 42 candidate regions which may be associated with domestication and recent selection were identified, respectively. The SoySNP50K iSelect SNP beadchip will be a powerful tool for characterizing soybean genetic diversity and linkage disequilibrium, and for constructing high resolution linkage maps to improve the soybean whole genome sequence assembly.

Theoretical and Applied Genetics | 1997

Automated sizing of fluorescent-labeled simple sequence repeat (SSR) markers to assay genetic variation in soybean

N. Diwan; Perry B. Cregan

Abstract Simple Sequence Repeat (SSR) allele sizing provides a useful tool for genotype identification, pedigree analysis, and for estimating genetic distance between organisms. Soybean [Glycine max (L.) Merr.] cultivars are identified for Plant Variety Protection (PVP) purposes by standard pigmentation and morphological traits. However, many commercial soybeans arise from a limited number of elite lines and are often indistinguishable based on these traits. A system based on SSR markers would provide unique DNA profiles of cultivars. Fluorescent labeling of alleles combined with automated sizing with internal size standards in each gel lane was used as an alternative to standard [32P] labeling to assess genetic variability in soybean. Allelic frequencies at 20 SSR loci were determined in 35 soybean genotypes that account for greater than 95% of the alleles in North American soybean cultivars based upon pedigree analysis. An average of 10.1 alleles per locus (range: 5–17), with a mean gene diversity of 0.80 (range: 0.50 to 0.87) were observed at the 20 SSR loci. The 20 loci successfully distinguished modern soybean cultivars that are identical for morphological and pigmentation traits, as well as 7 soybean genotypes reported to be indistinguishable using 17 RFLP probes. Pedigrees of 7 cultivars were studied to estimate stability of SSRs in soybean across generations. Of the 7 pedigrees 6 had one locus in the progeny with an allele(s) that was not present in either parent. These new alleles are most likely the result of mutation. The mutation rate of SSR alleles in soybean was similar to that reported in humans. To avoid difficulty associated with mutation, DNA fingerprint data should be determined from the bulk of 30-50 plants of a cultivar.

Explore More