Science China Chemistry | 2019

Identify crystal structures by a new paradigm based on graph theory for building materials big data

 
 
 
 
 
 
 
 

Abstract


Material identification technique is crucial to the development of structure chemistry and materials genome project. Current methods are promising candidates to identify structures effectively, but have limited ability to deal with all structures accurately and automatically in the big materials database because different material resources and various measurement errors lead to variation of bond length and bond angle. To address this issue, we propose a new paradigm based on graph theory (GTscheme) to improve the efficiency and accuracy of material identification, which focuses on processing the “topological relationship” rather than the value of bond length and bond angle among different structures. By using this method, automatic deduplication for big materials database is achieved for the first time, which identifies 626,772 unique structures from 865,458 original structures. Moreover, the graph theory scheme has been modified to solve some advanced problems such as identifying highly distorted structures, distinguishing structures with strong similarity and classifying complex crystal structures in materials big data.

Volume None
Pages 1-5
DOI 10.1007/s11426-019-9502-5
Language English
Journal Science China Chemistry

Full Text