Scientific Reports | 2021

Rare and common variant discovery by whole-genome sequencing of 101 Thoroughbred racehorses

 
 
 
 
 
 
 
 

Abstract


The Thoroughbred breed was formed by crossing Oriental horse breeds and British native horses and is currently used in horseracing worldwide. In this study, we constructed a single-nucleotide variant (SNV) database using data from 101 Thoroughbred racehorses. Whole genome sequencing (WGS) revealed 11,570,312 and 602,756 SNVs in autosomal (1–31) and X chromosomes, respectively, yielding a total of 12,173,068 SNVs. About 6.9% of identified SNVs were rare variants observed only in one allele in 101 horses. The number of SNVs detected in individual horses ranged from 4.8 to 5.3 million. Individual horses had a maximum of 25,554 rare variants; several of these were functional variants, such as non-synonymous substitutions, start-gained, start-lost, stop-gained, and stop-lost variants. Therefore, these rare variants may affect differences in traits and phenotypes among individuals. When observing the distribution of rare variants among horses, one breeding stallion had a smaller number of rare variants compared to other horses, suggesting that the frequency of rare variants in the Japanese Thoroughbred population increases through breeding. In addition, our variant database may provide useful basic information for industrial applications, such as the detection of genetically modified racehorses in gene-doping control and pedigree-registration of racehorses using SNVs as markers.

Volume 11
Pages None
DOI 10.1038/s41598-021-95669-1
Language English
Journal Scientific Reports

Full Text