Genomics, proteomics & bioinformatics | 2021

DTFLOW: Inference and Visualization of Single-cell Pseudotime Trajectory Using Diffusion Propagation.

 
 
 
 

Abstract


One of the major challenges in single-cell data analysis is the determination of cellular developmental trajectories using single-cell data. Although substantial studies have been conducted in recent years, more effective methods are still strongly needed to infer the developmental processes accurately. This work devises a new method, named DTFLOW, for determining the pseudo-temporal trajectories with multiple branches. DTFLOW consists of two major steps: namely, a new method called Bhattacharyya kernel feature decomposition (BKFD) to reduce the data dimensions, and a novel approach, named reverse searching on k-nearest neighbor graph (RSKG), to identify the multi-branching processes of cellular differentiation. In BKFD we first establish a stationary distribution for each cell to represent the transition of cellular developmental states based on the random walk with restart algorithm, and then propose a new distance metric for calculating pseudotime of single cells by introducing the Bhattacharyya kernel matrix. The effectiveness of DTFLOW is rigorously examined by using four single-cell datasets. We compare the efficiency of DTFLOW with the published state-of-the-art methods. Simulation results suggest that DTFLOW has superior accuracy and strong robustness properties for constructing pseudotime trajectories. The Python source code of DTFLOW can be freely accessed at https://github.com/statway/DTFLOW.

Volume None
Pages None
DOI 10.1016/j.gpb.2020.08.003
Language English
Journal Genomics, proteomics & bioinformatics

Full Text