Chinese Journal of Chromatography ›› 2018, Vol. 36 ›› Issue (8): 772-779.DOI: 10.3724/SP.J.1123.2018.03001

Previous Articles     Next Articles

Gas chromatography-mass spectrometry data analysis algorithm based on sparse model and its application in resolving severely overlapped peaks

WU Yizi1, WEI Weiwei1, YANG Huawu1, CHEN Zengping2, ZOU You3, LI Yanchun1, TUO Suxing1, YIN Shuangfeng2, ZHONG Kejun1   

  1. 1. Hunan Tobacco Industry Co., Ltd., Changsha 410014, China;
    2. College of Chemistry and Chemistry Engineering, Hunan University, Changsha 410082, China;
    3. College of Network Education, Central South University, Changsha 410083, China
  • Received:2018-03-05 Online:2018-08-08 Published:2014-06-28
  • Supported by:

    National Natural Science Foundation of China (No.21775038);Postdoctoral Fund of Hunan Tobacco Industry Co., Ltd.(No.KY2016JC0014).

Abstract:

A Gas chromatography-mass spectrometry (GC-MS) data analysis algorithm is proposed. The mass spectrum at the top of the chromatographic peak is the spectrum to be solved. A certain amount of related reference spectra is retrieved from the spectral library, then, the equation of the chromatographic response value of each pure component is solved. A step by step strategy is used for the mass spectra retrieval. Firstly, an efficient indexing technology is used for rough selection, then, the "strong peak out with high probability" and the "extrusion" rules are used to exclude more unrelated mass spectra. A regression algorithm based on a sparse model is proposed to solve the equation of the chromatographic response value. Compared with the traditional algorithm, this algorithm can extract the main structure of the spectrum to be solved, and avoid over-fitting. The experimental results show that the proposed algorithm has a higher accuracy and smaller residual reference spectrum set, and the sparse model achieves satisfactory experimental results in the analysis of severely overlapped peaks. The proposed method provides an effective solution for resolving overlapped peaks, especially severely overlapped peaks, in GC-MS data.

Key words: gas chromatography-mass spectrometry (GC-MS), mass spectrometry retrieval, severely overlapped peaks, sparse model

CLC Number: