Human Mutation | 2021

Prediction of disease‐associated functional variants in noncoding regions through a comprehensive analysis by integrating datasets and features


Abstract


One of the greatest challenges in human genetics is deciphering the link between functional variants in noncoding sequences and the pathophysiology of complex diseases. To address this issue, many methods have been developed to distinguish functional single‐nucleotide variants (SNVs) from neutral SNVs in noncoding regions. In this study, we integrated well‐established features and commonly used datasets, merging them into large‐scale datasets on which we built a random forest model; the model yielded promising performance and outperformed some cutting‐edge approaches. Our analyses of feature importance and data coverage also provide clues for future research on improving the prediction of functional noncoding SNVs.

Volume 42
Pages 667 - 684
DOI 10.1002/humu.24203
Language English
Journal Human Mutation
