Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Alencar Xavier is active.

Publication


Featured researches published by Alencar Xavier.


G3: Genes, Genomes, Genetics | 2016

Assessing Predictive Properties of Genome-Wide Selection in Soybeans

Alencar Xavier; William M. Muir; Katy Martin Rainey

Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set.


Theoretical and Applied Genetics | 2016

Walking through the statistical black boxes of plant breeding

Alencar Xavier; William M. Muir; Bruce A. Craig; Katy Martin Rainey

Key messageThe main statistical procedures in plant breeding are based on Gaussian process and can be computed through mixed linear models.AbstractIntelligent decision making relies on our ability to extract useful information from data to help us achieve our goals more efficiently. Many plant breeders and geneticists perform statistical analyses without understanding the underlying assumptions of the methods or their strengths and pitfalls. In other words, they treat these statistical methods (software and programs) like black boxes. Black boxes represent complex pieces of machinery with contents that are not fully understood by the user. The user sees the inputs and outputs without knowing how the outputs are generated. By providing a general background on statistical methodologies, this review aims (1) to introduce basic concepts of machine learning and its applications to plant breeding; (2) to link classical selection theory to current statistical approaches; (3) to show how to solve mixed models and extend their application to pedigree-based and genomic-based prediction; and (4) to clarify how the algorithms of genome-wide association studies work, including their assumptions and limitations.


G3: Genes, Genomes, Genetics | 2017

Genome-Wide Analysis of Grain Yield Stability and Environmental Interactions in a Multiparental Soybean Population

Alencar Xavier; Diego Jarquin; Reka Howard; Vishnu Ramasubramanian; James E. Specht; George L. Graef; William D. Beavis; Brian W. Diers; Qijian Song; Perry B. Cregan; Randall L. Nelson; Rouf Mian; J. Grover Shannon; Leah K. McHale; Dechun Wang; William T. Schapaugh; Aaron J. Lorenz; Shizhong Xu; William M. Muir; Katy Martin Rainey

Genetic improvement toward optimized and stable agronomic performance of soybean genotypes is desirable for food security. Understanding how genotypes perform in different environmental conditions helps breeders develop sustainable cultivars adapted to target regions. Complex traits of importance are known to be controlled by a large number of genomic regions with small effects whose magnitude and direction are modulated by environmental factors. Knowledge of the constraints and undesirable effects resulting from genotype by environmental interactions is a key objective in improving selection procedures in soybean breeding programs. In this study, the genetic basis of soybean grain yield responsiveness to environmental factors was examined in a large soybean nested association population. For this, a genome-wide association to performance stability estimates generated from a Finlay-Wilkinson analysis and the inclusion of the interaction between marker genotypes and environmental factors was implemented. Genomic footprints were investigated by analysis and meta-analysis using a recently published multiparent model. Results indicated that specific soybean genomic regions were associated with stability, and that multiplicative interactions were present between environments and genetic background. Seven genomic regions in six chromosomes were identified as being associated with genotype-by-environment interactions. This study provides insight into genomic assisted breeding aimed at achieving a more stable agronomic performance of soybean, and documented opportunities to exploit genomic regions that were specifically associated with interactions involving environments and subpopulations.


Euphytica | 2017

Using unsupervised learning techniques to assess interactions among complex traits in soybeans

Alencar Xavier; Benjamin Hall; Shaun N. Casteel; William M. Muir; Katy Martin Rainey

Soybean yield components and agronomic traits are connected through physiological pathways that impose tradeoffs through genetic and environmental constraints. Our primary aim is to assess the interdependence of soybean traits by using unsupervised machine learning techniques to divide phenotypic associations into environmental and genetic associations. This study was performed on large scale, jointly analyzing 14 quantitative traits in a large multi-parental population designed for genetic studies. We collected phenotypes from 2012 to 2015 from a soybean nested association panel with 40 families of approximately 140 individuals each. Pearson and Spearman correlations measured phenotypic associations. A multivariate mixed linear model provided genotypic and environmental correlations. To evaluate relationships among traits, the study used principal component and undirected graphical models from phenotypic, genotypic, and environmental correlation matrices. Results indicate that high phenotypic correlation occurs when traits display both genetic and environmental correlations. In genetic terms, length of reproductive period, node number, and canopy coverage play important roles in determining yield potential. Optimal grain yield production occurs when the growing environment favors faster canopy closure and extended reproductive length. Environmental associations found among yield components give insight into the nature of yield component compensation. The use of unsupervised learning methods provides a good framework for investigating interactions among various quantitative traits and defining target traits for breeding.


Plant Genetic Resources | 2018

Population and quantitative genomic properties of the USDA soybean germplasm collection

Alencar Xavier; Rima Thapa; William M. Muir; Katy Martin Rainey

This study is the first assessment of the entire soybean [ Glycine max (L.) Merr] collection of the United State Department of Agriculture National Plant Germplasm System (USDA) reporting quantitative and population genomic parameters. It also provides a new insight into soybean germplasm structure. Germplasm studies enable plant breeders to incorporate novel genetic resources into breeding pipelines to improve valuable agronomic traits. We conducted comprehensive analyses on the 19,652 soybean accessions in the USDA-ARS germplasm collection, genotyped with the SoySNP50 K iSelect BeadChip SNP array, to elucidate the quantitative properties of existing subpopulations inferred through hierarchical clustering performed with Wards D agglomeration method and Neis standard genetic distance. We found the effective population size to be approximately 106 individuals based on the linkage disequilibrium of unlinked loci. The cladogram indicated the existence of eight major clusters. Each cluster displays particular properties with regard to major quantitative traits. Among those, cluster 3 represents the tropical and semi-tropical genetic material, cluster 5 displays large seeds and may represent food-grade germplasm, and cluster 7 represents the undomesticated material in the germplasm collection. The average F ST among clusters was 0.22 and a total of 914 SNPs were exclusive to specific clusters. Our classification and characterization of the germplasm collection into major clusters provides valuable information about the genetic resources available to soybean breeders and researchers.


BMC Bioinformatics | 2017

Erratum to: Genomic prediction using subsampling

Alencar Xavier; Shizhong Xu; William M. Muir; Katy Martin Rainey

Erratum Following publication of this article [1], it has come to our attention that some of the equations included were distorted by a formatting issue in the PDF form. This issue affected Eqs. 8, 9 and 11. The correct forms of the equations are shown below.


Bioinformatics | 2015

NAM: association studies in multiple populations

Alencar Xavier; Shizhong Xu; William M. Muir; Katy Martin Rainey


BMC Bioinformatics | 2016

Impact of imputation methods on the amount of genetic variation captured by a single-nucleotide polymorphism panel in soybeans

Alencar Xavier; William M. Muir; Katy Martin Rainey


Genetics | 2017

Genetic Architecture of Phenomic-Enabled Canopy Coverage in Glycine max

Alencar Xavier; Benjamin Hall; Anthony Hearst; Keith A. Cherkauer; Katy Martin Rainey


BMC Bioinformatics | 2017

Genomic prediction using subsampling

Alencar Xavier; Shizhong Xu; William M. Muir; Katy Martin Rainey

Collaboration


Dive into the Alencar Xavier's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Shizhong Xu

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Diego Jarquin

University of Nebraska–Lincoln

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Reka Howard

University of Nebraska–Lincoln

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge