Genome Biology | 2019

scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data

 
 

Abstract


Technical variation in feature measurements, such as gene expression and locus accessibility, is a key challenge of large-scale single-cell genomic datasets. We show that this technical variation in both scRNA-seq and scATAC-seq datasets can be mitigated by analyzing feature detection patterns alone and ignoring feature quantification measurements. This result holds when datasets have low detection noise relative to quantification noise. We demonstrate state-of-the-art performance of detection pattern models using our new framework, scBFA, for both cell type identification and trajectory inference. Performance gains can also be realized in one line of R code in existing pipelines.

Volume 20
Pages None
DOI 10.1186/s13059-019-1806-0
Language English
Journal Genome Biology

Full Text