Benjamin J. Hayes
La Trobe University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Benjamin J. Hayes.
BMC Genomics | 2014
Kathryn E. Kemper; Sarah J Saxton; S. Bolormaa; Benjamin J. Hayes; Michael E. Goddard
BackgroundSelection signatures aim to identify genomic regions underlying recent adaptations in populations. However, the effects of selection in the genome are difficult to distinguish from random processes, such as genetic drift. Often associations between selection signatures and selected variants for complex traits is assumed even though this is rarely (if ever) tested. In this paper, we use 8 breeds of domestic cattle under strong artificial selection to investigate if selection signatures are co-located in genomic regions which are likely to be under selection.ResultsOur approaches to identify selection signatures (haplotype heterozygosity, integrated haplotype score and FST) identified strong and recent selection near many loci with mutations affecting simple traits under strong selection, such as coat colour. However, there was little evidence for a genome-wide association between strong selection signatures and regions affecting complex traits under selection, such as milk yield in dairy cattle. Even identifying selection signatures near some major loci was hindered by factors including allelic heterogeneity, selection for ancestral alleles and interactions with nearby selected loci.ConclusionsSelection signatures detect loci with large effects under strong selection. However, the methodology is often assumed to also detect loci affecting complex traits where the selection pressure at an individual locus is weak. We present empirical evidence to suggests little discernible ‘selection signature’ for complex traits in the genome of dairy cattle despite very strong and recent artificial selection.
Genetics Research | 2011
Kathryn E. Kemper; D.L. Emery; Stephen Bishop; Hutton Oddy; Benjamin J. Hayes; Sonja Dominik; John M. Henshall; Michael E. Goddard
SummaryGenetic resistance to gastrointestinal worms is a complex trait of great importance in both livestock and humans. In order to gain insights into the genetic architecture of this trait, a mixed breed population of sheep was artificially infected with Trichostrongylus colubriformis (n=3326) and then Haemonchus contortus (n=2669) to measure faecal worm egg count (WEC). The population was genotyped with the Illumina OvineSNP50 BeadChip and 48 640 single nucleotide polymorphism (SNP) markers passed the quality controls. An independent population of 316 sires of mixed breeds with accurate estimated breeding values for WEC were genotyped for the same SNP to assess the results obtained from the first population. We used principal components from the genomic relationship matrix among genotyped individuals to account for population stratification, and a novel approach to directly account for the sampling error associated with each SNP marker regression. The largest marker effects were estimated to explain an average of 0·48% (T. colubriformis) or 0·08% (H. contortus) of the phenotypic variance in WEC. These effects are small but consistent with results from other complex traits. We also demonstrated that methods which use all markers simultaneously can successfully predict genetic merit for resistance to worms, despite the small effects of individual markers. Correlations of genomic predictions with breeding values of the industry sires reached a maximum of 0·32. We estimate that effective across-breed predictions of genetic merit with multi-breed populations will require an average marker spacing of approximately 10 kbp.
Genetics Selection Evolution | 2015
Kathryn E. Kemper; C. M. Reich; P.J. Bowman; Christy Vander Jagt; Amanda J. Chamberlain; B. A. Mason; Benjamin J. Hayes; Michael E. Goddard
BackgroundGenomic selection is increasingly widely practised, particularly in dairy cattle. However, the accuracy of current predictions using GBLUP (genomic best linear unbiased prediction) decays rapidly across generations, and also as selection candidates become less related to the reference population. This is likely caused by the effects of causative mutations being dispersed across many SNPs (single nucleotide polymorphisms) that span large genomic intervals. In this paper, we hypothesise that the use of a nonlinear method (BayesR), combined with a multi-breed (Holstein/Jersey) reference population will map causative mutations with more precision than GBLUP and this, in turn, will increase the accuracy of genomic predictions for selection candidates that are less related to the reference animals.ResultsBayesR improved the across-breed prediction accuracy for Australian Red dairy cattle for five milk yield and composition traits by an average of 7% over the GBLUP approach (Australian Red animals were not included in the reference population). Using the multi-breed reference population with BayesR improved accuracy of prediction in Australian Red cattle by 2 – 5% compared to using BayesR with a single breed reference population. Inclusion of 8478 Holstein and 3917 Jersey cows in the reference population improved accuracy of predictions for these breeds by 4 and 5%. However, predictions for Holstein and Jersey cattle were similar using within-breed and multi-breed reference populations. We propose that the improvement in across-breed prediction achieved by BayesR with the multi-breed reference population is due to more precise mapping of quantitative trait loci (QTL), which was demonstrated for several regions. New candidate genes with functional links to milk synthesis were identified using differential gene expression in the mammary gland.ConclusionsQTL detection and genomic prediction are usually considered independently but persistence of genomic prediction accuracies across breeds requires accurate estimation of QTL effects. We show that accuracy of across-breed genomic predictions was higher with BayesR than with GBLUP and that BayesR mapped QTL more precisely. Further improvements of across-breed accuracy of genomic predictions and QTL mapping could be achieved by increasing the size of the reference population, including more breeds, and possibly by exploiting pleiotropic effects to improve mapping efficiency for QTL with small effects.
Journal of Dairy Science | 2016
Thuy T.T. Nguyen; Phil J. Bowman; M. Haile-Mariam; J.E. Pryce; Benjamin J. Hayes
Temperature and humidity levels above a certain threshold decrease milk production in dairy cattle, and genetic variation is associated with the amount of lost production. To enable selection for improved heat tolerance, the aim of this study was to develop genomic estimated breeding values (GEBV) for heat tolerance in dairy cattle. Heat tolerance was defined as the rate of decline in production under heat stress. We combined herd test-day recording data from 366,835 Holstein and 76,852 Jersey cows with daily temperature and humidity measurements from weather stations closest to the tested herds for test days between 2003 and 2013. We used daily mean values of temperature-humidity index averaged for the day of test and the 4 previous days as the measure of heat stress. Tolerance to heat stress was estimated for each cow using a random regression model with a common threshold of temperature-humidity index=60 for all cows. The slope solutions for cows from this model were used to define the daughter trait deviations of their sires. Genomic best linear unbiased prediction was used to calculate GEBV for heat tolerance for milk, fat, and protein yield. Two reference populations were used, the first consisted of genotyped sires only (2,300 Holstein and 575 Jersey sires), and the other included genotyped sires and cows (2,189 Holstein and 1,188 Jersey cows). The remainder of the genotyped sires were used as a validation set. All animals had genotypes for 632,003 single nucleotide polymorphisms. When using only genotyped sires in the reference set and only the first parity data, the accuracy of GEBV for heat tolerance in relation to changes in milk, fat, and protein yield were 0.48, 0.50, and 0.49 in the Holstein validation sires and 0.44, 0.61, and 0.53 in the Jersey validation sires, respectively. Some slight improvement in the accuracy of prediction was achieved when cows were included in the reference population for Holsteins. No clear improvements in the accuracy of genomic prediction were observed when data from the second and third parities were included. Correlations of GEBV for heat tolerance with Australian Breeding Values for other traits suggested heat tolerance had a favorable genetic correlation with fertility (0.29-0.39 in Holsteins and 0.15-0.27 in Jerseys), but unfavorable correlations for some production traits. Options to improve heat tolerance with genomic selection in Australian dairy cattle are discussed.
BMC Genomics | 2012
Rasmus Froberg Brøndum; Guosheng Su; Mogens Sandø Lund; P.J. Bowman; Michael E. Goddard; Benjamin J. Hayes
BackgroundThe accuracy of genomic prediction is highly dependent on the size of the reference population. For small populations, including information from other populations could improve this accuracy. The usual strategy is to pool data from different populations; however, this has not proven as successful as hoped for with distantly related breeds. BayesRS is a novel approach to share information across populations for genomic predictions. The approach allows information to be captured even where the phase of SNP alleles and casuative mutation alleles are reversed across populations, or the actual casuative mutation is different between the populations but affects the same gene. Proportions of a four-distribution mixture for SNP effects in segments of fixed size along the genome are derived from one population and set as location specific prior proportions of distributions of SNP effects for the target population. The model was tested using dairy cattle populations of different breeds: 540 Australian Jersey bulls, 2297 Australian Holstein bulls and 5214 Nordic Holstein bulls. The traits studied were protein-, fat- and milk yield. Genotypic data was Illumina 777K SNPs, real or imputed.ResultsResults showed an increase in accuracy of up to 3.5% for the Jersey population when using BayesRS with a prior derived from Australian Holstein compared to a model without location specific priors. The increase in accuracy was however lower than was achieved when reference populations were combined to estimate SNP effects, except in the case of fat yield. The small size of the Jersey validation set meant that these improvements in accuracy were not significant using a Hotelling-Williams t-test at the 5% level. An increase in accuracy of 1-2% for all traits was observed in the Australian Holstein population when using a prior derived from the Nordic Holstein population compared to using no prior information. These improvements were significant (P<0.05) using the Hotelling Williams t-test for protein- and fat yield.ConclusionFor some traits the method might be advantageous compared to pooling of reference data for distantly related populations, but further investigation is needed to confirm the results. For closely related populations the method does not perform better than pooling reference data. However, it does give an increased accuracy compared to analysis based on only one reference population, without an increased computational burden. The approach described here provides a general setup for inclusion of location specific priors: the approach could be used to include biological information in genomic predictions.
The Plant Genome | 2016
Anthony T. Slater; Noel O. I. Cogan; John W. Forster; Benjamin J. Hayes; Hans D. Daetwyler
Progress in conventional potato breeding can be complex and slow. Genomic selection can be applied to autotetraploid potato. A number of factors that will affect genomic selection need to be considered. Genomic selection will accelerate genetic gain in potato.
Journal of Proteome Research | 2013
Anthony D. Maher; Benjamin J. Hayes; Benjamin G. Cocks; L. C. Marett; W. J. Wales; Simone Rochfort
A detailed understanding of the relationships between the distinct metabolic compartments of blood and milk would be of potential benefit to our understanding of the physiology of lactation, and potentially for development of biomarkers for health and commercially relevant traits in dairy cattle. NMR methods were used to measure metabolic profiles from blood and milk samples from Holstein cows. Data were analyzed using PLS regression to identify quantitative relationships between metabolic profiles and important traits. Statistical Heterospectroscopy (SHY), a powerful approach to recovering latent biological information in NMR spectroscopic data sets from multiple complementary samples, was employed to explore the metabolic relationships between blood and milk from these animals. The study confirms milk is a distinct metabolic compartment with a metabolite composition largely not influenced by plasma composition under normal circumstances. However, several significant relationships were identified, including a high correlation for trimethylamine (TMA) and dimethylsulfone (DMSO(2)) across plasma and milk compartments, and evidence plasma valine levels are linked to differences in amino acid catabolism in the mammary gland. The findings provide insights into the physiological mechanisms underlying lactation and identification of links between key metabolites and milk traits such as the protein and fat content of milk. The approach has the potential to enable measurement of health, metabolic status and other important phenotypes with milk sampling.
Journal of Dairy Science | 2017
Thuy T. T. Nguyen; Phil J. Bowman; M. Haile-Mariam; Gert J. Nieuwhof; Benjamin J. Hayes; J.E. Pryce
Excessive ambient temperature and humidity can impair milk production and fertility of dairy cows. Selection for heat-tolerant animals is one possible option to mitigate the effects of heat stress. To enable selection for this trait, we describe the development of a heat tolerance breeding value for Australian dairy cattle. We estimated the direct genomic values of decline in milk, fat, and protein yield per unit increase of temperature-humidity index (THI) using 46,726 single nucleotide polymorphisms and a reference population of 2,236 sires and 11,853 cows for Holsteins and 506 sires and 4,268 cows for Jerseys. This new direct genomic value is the Australian genomic breeding value for heat tolerance (HT ABVg). The components of the HT ABVg are the decline in milk, fat, and protein per unit increase in THI when THI increases above the threshold of 60. These components are weighted by their respective economic values, assumed to be equivalent to the weights applied to milk, fat, and protein yield in the Australian selection indices. Within each breed, the HT ABVg is then standardized to have a mean of 100 and standard deviation (SD) of 5, which is consistent with the presentation of breeding values for many other traits in Australia. The HT ABVg ranged from -4 to +3 SD in Holsteins and -3 to +4 SD in Jerseys. The mean reliabilities of HT ABVg among validation sires, calculated from the prediction error variance and additive genetic variance, were 38% in both breeds. The range in ABVg and their reliability suggests that HT can be improved using genomic selection. There has been a deterioration in the genetic trend of HT, and to moderate the decline it is suggested that the HT ABVg should be included in a multitrait economic index with other traits that contribute to farm profit.
Genetics Selection Evolution | 2017
Min Wang; Timothy P. Hancock; Iona M. MacLeod; J.E. Pryce; Benjamin G. Cocks; Benjamin J. Hayes
BackgroundEnhancers are non-coding DNA sequences, which when they are bound by specific proteins increase the level of gene transcription. Enhancers activate unique gene expression patterns within cells of different types or under different conditions. Enhancers are key contributors to gene regulation, and causative variants that affect quantitative traits in humans and mice have been located in enhancer regions. However, in the bovine genome, enhancers as well as other regulatory elements are not yet well defined. In this paper, we sought to improve the annotation of bovine enhancer regions by using publicly available mammalian enhancer information. To test if the identified putative bovine enhancer regions are enriched with functional variants that affect milk production traits, we performed genome-wide association studies using imputed whole-genome sequence data followed by meta-analysis and enrichment analysis.ResultsWe produced a library of candidate bovine enhancer regions by using publicly available bovine ChIP-Seq enhancer data in combination with enhancer data that were identified based on sequence homology with human and mouse enhancer databases. We found that imputed whole-genome sequence variants associated with milk production traits in 16,581 dairy cattle were enriched with enhancer regions that were marked by bovine-liver H3K4me3 and H3K27ac histone modifications from both permutation tests and gene set enrichment analysis. Enhancer regions that were identified based on sequence homology with human and mouse enhancer regions were not as strongly enriched with trait-associated sequence variants as the bovine ChIP-Seq candidate enhancer regions. The bovine ChIP-Seq enriched enhancer regions were located near genes and quantitative trait loci that are associated with pregnancy, growth, disease resistance, meat quality and quantity, and milk quality and quantity traits in dairy and beef cattle.ConclusionsOur results suggest that sequence variants within enhancer regions that are located in bovine non-coding genomic regions contribute to the variation in complex traits. The level of enrichment was higher in bovine-specific enhancer regions that were identified by detecting histone modifications H3K4me3 and H3K27ac in bovine liver tissues than in enhancer regions identified by sequence homology with human and mouse data. These results highlight the need to use bovine-specific experimental data for the identification of enhancer regions.
GigaScience | 2018
Quan Nguyen; Ross L. Tellam; Marina Naval-Sanchez; Laercio R. Porto-Neto; W. Barendse; Antonio Reverter; Benjamin J. Hayes; James W. Kijas; Brian P. Dalrymple
Abstract Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets.