In biochemistry and structural biology research, Chemical Shift Index (CSI) is a widely used technique specifically for analyzing protein nuclear magnetic resonance (NMR) spectroscopy. This technique can visualize and identify the sites (e.g., start and end positions) and types (β-strands, α-helices, and random coil regions) of protein secondary structures by using only backbone chemical shift data. David S. Wishart began developing this technique in 1992, initially focusing on the analysis of 1Hα chemical shifts and in 1994 expanding it to include 13C backbone chemical shifts.
The basic principle of this method is that the 1Hα chemical shift is usually shifted upward in α-helices (i.e., to the right of the NMR spectrum) and downward in β-sheets (i.e., to the left of the NMR spectrum). on the left). Similar trends can also be found in the dorsal 13C chemical shifts.The core of chemical shift index technology is that it utilizes the characteristics of chemical shift changes of amino acid residues in α-helix and β-sheet.
The CSI method is a graph-based technique that uses amino acid-specific digital filters to convert each assigned backbone chemical shift value into a simple three-state index (-1, 0, +1). The charts generated by this method become more visually clear and easier to understand. If the upshift 1Hα chemical shift of an amino acid residue (relative to its amino acid-specific random coil value) was greater than 0.1 ppm, the residue was assigned a value of -1; if the downshift was greater than 0.1 ppm, it was assigned a value of + 1; if the chemical shift change is less than 0.1 ppm, it is assigned to 0.
By plotting this three-state index as a bar graph, β-strands (clusters of +1 values), α-helices (clusters of -1 values), and random coil segments (clusters of 0 values) can be easily identified.
Such diagrams make it easier to identify protein secondary structure. When identifying the types of secondary structures, simple observation can identify structures such as β-chains and α-helices.
By combining the CSI patterns of 1H and 13C chemical shifts, a composite index is generated with an accuracy of 85% to 90%.
As the research progressed, scientists discovered that not only is there a correlation between the chemical shift of the α-helix and the secondary structure, but the structure of the β-sheet also shows such chemical shift changes.
Historical BackgroundThe connection between chemical shift and protein secondary structure was first described in 1967 by John Markley and colleagues. With the development of modern two-dimensional NMR technology, it has become possible to measure more protein chemical shifts. By the 1990s, after collecting enough 13C and 15N chemical shift assignments, scientists found that the trends of these chemical shift changes could provide strong support for the development of CSI.
Due to these deficiencies, many alternative CSI-based methods have been proposed to provide more comprehensive secondary structure identification methods.
Since it was first described in 1992, the CSI method has been used to characterize the secondary structure of thousands of peptides and proteins. It is popular in the scientific community because it is easy to understand and can be implemented without specialized computing programs. Many commonly used NMR data processing programs, such as NMRView and various web servers, have incorporated CSI methods into these tool frameworks to promote their application.
This method has broad application prospects in protein research. It is not only limited to the identification of secondary structures, but can also further promote our understanding and exploration of protein functions. Looking to the future, can new technologies be developed to make up for the shortcomings of the CSI method?