Si-Min Yan
China Jiliang University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Si-Min Yan.
Food Chemistry | 2013
Lu Xu; Peng-Tao Shi; Zihong Ye; Si-Min Yan; Xiao-Ping Yu
This paper develops a rapid analysis method for adulteration identification of a popular traditional Chinese food, lotus root powder (LRP), by near-infrared spectroscopy and chemometrics. 85 pure LRP samples were collected from 7 main lotus producing areas of China to include most if not all of the significant variations likely to be encountered in unknown authentic materials. To evaluate the model specificity, 80 adulterated LRP samples prepared by blending pure LRP with different levels of four cheaper and commonly used starches were measured and predicted. For multivariate quality models, two class modeling methods, the traditional soft independent modeling of class analogy (SIMCA) and a recently proposed partial least squares class model (PLSCM) were used. Different data preprocessing techniques, including smoothing, taking derivative and standard normal variate (SNV) transformation were used to improve the classification performance. The results indicate that smoothing, taking second-order derivatives and SNV can improve the class models by enhancing signal-to-noise ratio, reducing baseline and background shifts. The most accurate and stable models were obtained with SNV spectra for both SIMCA (sensitivity 0.909 and specificity 0.938) and PLSCM (sensitivity 0.909 and specificity 0.925). Moreover, both SIMCA and PLSCM could detect LRP samples mixed with 5% (w/w) or more other cheaper starches, including cassava, sweet potato, potato and maize starches. Although it is difficult to perform an exhaustive collection of all pure LRP samples and possible adulterations, NIR spectrometry combined with class modeling techniques provides a reliable and effective method to detect most of the current LRP adulterations in Chinese market.
Food Chemistry | 2013
Lu Xu; Si-Min Yan; Chen-Bo Cai; Xiao-Ping Yu
This paper investigates the feasibility of using FT-NIR spectroscopy and chemometrics for rapid analysis of poplar balata (PB) in Chinese propolis. Because practical adulterations usually involve addition of certain known active components, together with commercial PB, the commonly targeted analysis methods are insufficient to identify PB-adulterated propolis. Untargeted analysis of PB was performed by developing class models of pure propolis using one-class partial least squares (OCPLS). Quantitative analysis of PB was performed using partial least squares regression (PLSR). For untargeted analysis, the most accurate OCPLS model was obtained with SNV spectra with sensitivity 0.960 and specificity 0.941. OCPLS could detect adulterations with 2% (w/w) or more PB. For quantitative analysis, the root mean squared error of prediction (RMSEP) value of PB was 0.902 (w/w, %) with SNV-PLS. FT-NIR spectrometry and chemometrics demonstrate potential for rapid analysis of PB adulterations in Chinese propolis.
Analytica Chimica Acta | 2012
Lu Xu; Zihong Ye; Si-Min Yan; Peng-Tao Shi; Haifeng Cui; Xian-Shu Fu; Xiaoping Yu
Class modeling techniques are required to tackle various one-class problems. Because the training of class models is based on the target class and the origins of future test objects usually cannot be exactly predefined, the criteria for feature selection of class models are not very straightforward. Although feature reduction can be expected to improve class models performance, more features retained can provide a sufficient description of the sought-for class. This paper suggests a strategy to balance class description and model specificity by ensemble learning of sub-models based on separate local wavelength intervals. The acceptance or rejection of a future object can be explicitly determined by examining its acceptance frequency by sub-models. Considering the lack of information about sub-model independence, we propose to use a data-driven method to control the sensitivity of the ensemble model by cross validation. In this way, all the wavelength intervals are used for class description and the local wavelength intervals are highlighted to enhance the ability to detect out-of-class objects. The proposed strategy was performed on one-class partial least squares (OCPLS) and soft independent modeling of class analogy (SIMCA). By analysis of two infrared spectral data sets, one for geographical origin identification of white tea and the other for discrimination of adulterations in pure sesame oil, the proposed ensemble class modeling method was demonstrated to have similar sensitivity and better specificity compared with total-spectrum SIMCA and OCPLS models. The results indicate local spectral information can be extracted to enhance class model specificity.
Journal of Automated Methods & Management in Chemistry | 2014
Si-Min Yan; Junping Liu; Lu Xu; Xian-Shu Fu; Haifeng Cui; Zhen-Yu Yun; Xiaoping Yu; Zihong Ye
This paper focuses on a rapid and nondestructive way to discriminate the geographical origin of Anxi-Tieguanyin tea by near-infrared (NIR) spectroscopy and chemometrics. 450 representative samples were collected from Anxi County, the original producing area of Tieguanyin tea, and another 120 Tieguanyin samples with similar appearance were collected from unprotected producing areas in China. All these samples were measured by NIR. The Stahel-Donoho estimates (SDE) outlyingness diagnosis was used to remove the outliers. Partial least squares discriminant analysis (PLSDA) was performed to develop a classification model and predict the authenticity of unknown objects. To improve the sensitivity and specificity of classification, the raw data was preprocessed to reduce unwanted spectral variations by standard normal variate (SNV) transformation, taking second-order derivatives (D2) spectra, and smoothing. As the best model, the sensitivity and specificity reached 0.931 and 1.000 with SNV spectra. Combination of NIR spectrometry and statistical model selection can provide an effective and rapid method to discriminate the geographical producing area of Anxi-Tieguanyin.
Journal of Automated Methods & Management in Chemistry | 2013
Lu Xu; Si-Min Yan; Chen-Bo Cai; Zhen-Ji Wang; Xiaoping Yu
Untargeted detection of protein adulteration in Chinese yogurt was performed using near-infrared (NIR) spectroscopy and chemometrics class modelling techniques. sixty yogurt samples were prepared with pure and fresh milk from local market, and 197 adulterated yogurt samples were prepared by blending the pure yogurt objects with different levels of edible gelatin, industrial gelatin, and soy protein powder, which have been frequently used for yogurt adulteration. A recently proposed one-class partial least squares (OCPLS) model was used to model the NIR spectra of pure yogurt objects and analyze those of future objects. To improve the raw spectra, orthogonal projection (OP) of raw spectra onto the spectrum of pure water and standard normal variate (SNV) transformation were used to remove unwanted spectral variations. The best model was obtained with OP preprocessing with sensitivity of 0.900 and specificity of 0.949. Moreover, adulterations of yogurt with 1% (w/w) edible gelatin, 2% (w/w) industrial gelatin, and 2% (w/w) soy protein powder can be safely detected by the proposed method. This study demonstrates the potential of combining NIR spectroscopy and OCPLS as an untargeted detection tool for protein adulteration in yogurt.
Journal of Automated Methods & Management in Chemistry | 2013
Lu Xu; Si-Min Yan; Zihong Ye; Xian-Shu Fu; Xiaoping Yu
The feasibility of electronic tongue and multivariate analysis was investigated for discriminating the specific geographical origins of a Chinese green tea with Protected Designation of Origin (PDO). 155 Longjing tea samples from three subareas were collected and analyzed by an electronic tongue array of 7 sensors. To remove the influence of abnormal measurements and samples, robust principal component analysis (ROBPCA) was used to detect outliers in each class. Partial least squares discriminant analysis (PLSDA) was then used to develop a classification model. The prediction sensitivity/specificity of PLSDA was 1.000/1.000, 1.000/0.967, and 0.950/1.000 for longjing from Xihu, Qiantang, and Yuezhou, respectively. Electronic tongue and chemometrics can provide a rapid and reliable tool for discriminating the specific producing areas of Longjing.
Spectroscopy | 2013
Lu Xu; Si-Min Yan; Chen-Bo Cai; Wei Zhong; Xiaoping Yu
This paper develops a rapid method for discriminating the geographical origins and age of roasted Torreya grandis seeds by near infrared (NIR) spectroscopic analysis and pattern recognition. 337 samples were collected from three main producing areas and produced in the last two years. The objective of geographical origins analysis is to discriminate the seeds from Fengqiao with a protected geographical indication (PGI) from those of another two provinces. Age classification is aimed to detect the old seeds produced in the last year from the freshly produced ones. Partial least squares discriminant analysis (PLSDA) was used to develop classification models, and the influence of data preprocessing methods on classification performance was also investigated. Taking second-order derivatives of the raw spectra proves to be the most proper and effective preprocessing method, which can remove baselines and backgrounds and reduce model complexity. With second derivative spectra, the sensitivity and specificity were 0.939 and 0.871 for age discrimination, respectively. Perfect classification was obtained, and both sensitivity and specificity were 1 for discrimination of geographical origins.
Spectroscopy | 2013
Lu Xu; Si-Min Yan; Chen-Bo Cai; Xiaoping Yu; Jian-Hui Jiang; Hai-Long Wu; Ru-Qin Yu
This paper aims at developing a rapid and nondestructive method for analyzing the shelf life of preserved eggs (pidan) by near infrared (NIR) spectroscopy and nonlinear multivariate calibration. A major concern with a nonlinear model is that the noncomposition-correlated spectral variations among pidan objects of different batches and production dates would unnecessarily increase model complexity and cause overfitting and degradation of prediction. To reduce the negative influence of unwanted spectral variations, stacked least squares support vector machine (LS-SVM) with an ensemble of 62 commonly used preprocessing methods is proposed to automatically optimize data preprocessing and develop the nonlinear model. The analysis results indicate that stacked LS-SVM can obtain stable calibration model, and the prediction accuracy is improved compared with models with single-preprocessing methods. Since LS-SVM is much faster than its ordinary counterparts, stacked LS-SVM with ensemble preprocessing can be performed within an acceptable computational time. When the objects and spectral variations are very complex, the proposed method can provide a useful tool for data preprocessing and nonlinear multivariate calibration.
Spectroscopy | 2014
Lu Xu; Si-Min Yan; Chen-Bo Cai; Xiaoping Yu
A major safety concern with pidan (preserved eggs) has been the usage of lead (II) oxide (PbO) during its processing. This paper develops a rapid and nondestructive method for discrimination of lead (Pb) in preserved eggs with different processing methods by near-infrared (NIR) spectroscopy and chemometrics. Ten batches of 331 unleaded eggs and six batches of 147 eggs processed with usage of PbO were collected and analyzed by NIR spectroscopy. Inductively coupled plasma mass spectrometry (ICP-MS) analysis was used as a reference method for Pb identification. The Pb contents of leaded eggs ranged from 1.2 to 12.8 ppm. Linear partial least squares discriminant analysis (PLSDA) and nonlinear least squares support vector machine (LS-SVM) were used to classify samples based on NIR spectra. Different preprocessing methods were studied to improve the classification performance. With second-order derivative spectra, PLSDA and LS-SVM obtained accurate and reliable classification of leaded and unleaded preserved eggs. The sensitivity and specificity of PLSDA were 0.975 and 1.000, respectively. Because the strictest safety standard of Pb content in traditional pidan is 2 ppm, the proposed method shows the feasibility for rapid and nondestructive discrimination of Pb in Chinese preserved eggs.
Chemometrics and Intelligent Laboratory Systems | 2013
Lu Xu; Si-Min Yan; Chen-Bo Cai; Xiao-Ping Yu