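A review and research prospects on the application of the XCMS mass-spectrometry data-processing software in the environmental science field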

doi:10.3724/SP.J.1123.2025.01019

Chinese Journal of Chromatography, 2025, Vol. 43, Issue 6: 585-593. DOI: 10.3724/SP.J.1123.2025.01019

杨丞¹, 张奥¹, 高占啟², 苏冠勇¹,*

1.南京理工大学环境与生物工程学院，江苏省化工污染控制与资源化高校重点实验室，江苏南京 210094
2.江苏省环境监测中心，生态环境部地表水环境有机污染物监测分析重点实验室，江苏南京 210019

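Received: 2025-01-14  Publication date: 2025-06-08  Release date: 2025-05-21
Corresponding author: * E-mail: sugy@njust.edu.cn
Funding:
General Project of the Natural Science Foundation of Jiangsu Province (BK20242011); General Project of the National Natural Science Foundation of China (42477387)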

A review and research prospects on the application of the XCMS mass-spectrometry data-processing software in the environmental science field

YANG Cheng¹, ZHANG Ao¹, GAO Zhanqi², SU Guanyong¹,*

1. Jiangsu Province Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
2. Key Laboratory of Environment Monitoring and Analysis for Organic Pollutants in Surface Water, Ministry of Ecology and Environment, Jiangsu Province Environmental Monitoring Center, Nanjing 210019, China

Received: 2025-01-14  Online: 2025-05-21  Published: 2025-06-08
Supported by:
General Project of the Natural Science Foundation of Jiangsu Province (BK20242011); General Project of the National Natural Science Foundation of China (42477387)

Abstract

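Summary:

Biological and environmental samples contain a wide variety of compounds with complex compositions. Analyzing them by chromatography-high-resolution mass spectrometry produces large volumes of chromatography-mass spectrometry data consisting of mass-to-charge ratios (m/z), retention times (RT), peak intensities, and related information. Processing these data demands considerable time and effort, so mass-spectrometry data-processing software is needed to identify and analyze them. Among the many available packages, XCMS (various forms (X) of chromatography mass spectrometry) is an efficient, accurate, and freely available tool that is widely used in environmental science. This paper focuses on the application of XCMS in environmental science and reviews its workflow, working principles, and parameter-optimization measures. The XCMS workflow mainly comprises data import, data processing, and data export. Data import relies on format-conversion tools such as MSConvert to convert data generated by different instruments into a format that XCMS accepts, and data processing broadly comprises peak detection, peak alignment, and peak filling. In terms of applications, XCMS has enabled notable progress in the non-targeted screening of environmental pollutants, the identification of exogenous metabolic transformations of pollutants, and the study of endogenous biomolecular metabolism. For example, in the non-targeted screening of environmental pollutants, XCMS efficiently extracts mass-spectral features from complex samples, providing a reliable data basis for subsequent identification. Although XCMS has achieved some success in environmental science, limitations remain; for instance, user interaction and the degree of automation still need improvement. XCMS has great development potential in environmental science: as algorithms are further optimized and databases expanded, and as algorithmic robustness, data compatibility, and user experience improve, XCMS is expected to provide stronger support for environmental science research.

Keywords: XCMS, environmental science, non-targeted screening, unknown pollutants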

Abstract:

Biological and environmental samples are complex and contain a highly diverse range of compounds. Analyzing these samples by chromatography-high-resolution mass spectrometry generates a substantial volume of data composed of mass-to-charge ratio (m/z), retention time (RT), and peak-intensity information, which requires considerable time and effort to process. Consequently, software for processing mass-spectrometry data is needed for identification and analysis. Among the many available options, XCMS (various forms (X) of chromatography mass spectrometry), a highly efficient, precise, and freely accessible package for processing mass-spectrometry data, is broadly used in the environmental science field. This review explores the use of XCMS in environmental science applications by comprehensively reviewing its workflow, underlying principles, and parameter-optimization measures. The workflow mainly includes importing, processing, and exporting data. Importing data requires format-conversion tools, such as MSConvert, that convert data generated by various instruments into a format accepted by XCMS, while data processing includes peak detection, alignment, and filling. The various XCMS functions are mainly realized through its built-in algorithms, of which Matched Filter, CentWave, Obiwarp, and Peak Density are the most commonly used. The first two implement peak detection, while the latter two implement peak alignment. During peak detection, XCMS identifies compound peaks in the mass-spectrometry data: it first filters noise and corrects the baseline, and an algorithm then detects peaks based on their shapes and intensities. XCMS can also de-emphasize and de-distort each peak signal to filter out interfering information.
The CentWave algorithm is particularly effective for processing high-resolution mass-spectrometry data, where it improves detection accuracy and recall. Peak detection is followed by alignment. Here, XCMS uses kernel density estimation to match peaks between samples by estimating the retention-time distribution of matched peaks, which corrects for nonlinear deviations in retention time. This step is critical for accurately comparing samples. The peak-filling step resolves missing peaks in the data: XCMS uses information from other samples to fill these gaps, which enhances the integrity of the dataset and improves analysis accuracy. In terms of applications, XCMS has enabled significant progress in the non-targeted screening of environmental pollutants, the identification of exogenous metabolic transformations of pollutants, and the exploration of endogenous biomolecular metabolism. For example, during the non-targeted screening of environmental pollutants, XCMS efficiently extracts mass-spectral features from complex samples, thereby providing a reliable data basis for subsequent identification. Although the use of XCMS in the environmental science field has achieved some success, limitations remain. These include high memory consumption, software crashes when handling large-scale data, and the misclassification of noise as valid signals during feature detection, which leads to large numbers of false positives, errors, and missed detections when processing data for compounds with complex chemical compositions and structural types. In addition, the degree of user interaction and automation requires further improvement. XCMS offers significant developmental potential in the environmental science field.
With continued algorithmic optimization and database expansion, and with improvements in algorithmic robustness, data compatibility, and user experience, XCMS is expected to develop broadly and provide more powerful support for the environmental science field in the future.

Key words: XCMS, environmental science, non-targeted screening, unknown contaminants
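
The peak detection, cross-sample matching, and peak-filling steps described in the abstract can be illustrated with a toy sketch. It is written in plain Python, not with the XCMS package, and it does not reproduce the Matched Filter, CentWave, Obiwarp, or Peak Density implementations. The function names, the thresholds, the bandwidth, and the two synthetic traces are all assumptions chosen for illustration, and retention-time warping is left out entirely.

```python
import math


def detect_peaks(rt, intensity, min_height=50.0):
    """Toy peak detection on one trace (assumed thresholds, not CentWave)."""
    n = len(intensity)
    # Noise filter: 3-point moving average (edges use the points available).
    smooth = []
    for i in range(n):
        window = intensity[max(0, i - 1):i + 2]
        smooth.append(sum(window) / len(window))
    # Baseline correction: subtract the (upper) median as a crude flat baseline.
    baseline = sorted(smooth)[n // 2]
    corrected = [s - baseline for s in smooth]
    # Shape and intensity: local maxima that clear the height threshold.
    return [rt[i] for i in range(1, n - 1)
            if corrected[i] >= min_height
            and corrected[i] > corrected[i - 1]
            and corrected[i] >= corrected[i + 1]]


def group_by_density(peaks, bandwidth=2.0, step=0.5):
    """Match peaks across samples at the maxima of a Gaussian kernel density over RT.

    peaks is a list of (sample, rt). Each sample keeps one peak per group
    (a later peak from the same sample overwrites an earlier one).
    """
    rts = [r for _, r in peaks]
    lo, hi = min(rts) - bandwidth, max(rts) + bandwidth
    grid = [lo + k * step for k in range(int((hi - lo) / step) + 1)]
    dens = [sum(math.exp(-0.5 * ((g - r) / bandwidth) ** 2) for r in rts)
            for g in grid]
    centers = [grid[k] for k in range(1, len(grid) - 1)
               if dens[k] > dens[k - 1] and dens[k] >= dens[k + 1]]
    groups = [{"center": c, "members": {}} for c in centers]
    for sample, r in peaks:
        nearest = min(groups, key=lambda g: abs(g["center"] - r))
        nearest["members"][sample] = r
    return groups


def fill_gaps(groups, rt, traces, half_width=2.0):
    """Build a feature table; fill missing peaks from the raw trace of that sample."""
    table = []
    for g in groups:
        row = {"rt": g["center"], "filled": []}
        for sample, intensity in traces.items():
            if sample in g["members"]:
                # Detected peak: report the raw apex intensity.
                row[sample] = intensity[rt.index(g["members"][sample])]
            else:
                # Missing peak: take the raw maximum inside the group's RT window.
                window = [y for x, y in zip(rt, intensity)
                          if abs(x - g["center"]) <= half_width]
                row[sample] = max(window, default=0.0)
                row["filled"].append(sample)
        table.append(row)
    return table


def make_trace(peaks, n=20, floor=10.0):
    """Synthetic trace: flat floor plus triangular peaks given as (apex index, height)."""
    y = [floor] * n
    for apex, height in peaks:
        y[apex - 1], y[apex], y[apex + 1] = height / 2, height, height / 2
    return y


RT = [float(i) for i in range(20)]
TRACES = {
    "A": make_trace([(5, 1000.0), (14, 500.0)]),
    "B": make_trace([(6, 800.0), (14, 40.0)]),  # weak second peak, below threshold
}

detected = [(s, r) for s, y in TRACES.items() for r in detect_peaks(RT, y)]
groups = group_by_density(detected)
table = fill_gaps(groups, RT, TRACES)
for row in table:
    print(row)
```

In this synthetic case the second peak of sample B is too weak to be detected, so its entry in the feature table is recovered from the raw trace inside the retention-time window defined by sample A. This mirrors the idea of using information from other samples to fill gaps, without claiming to match how XCMS integrates the filled signal.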

CLC number: O658

YANG Cheng, ZHANG Ao, GAO Zhanqi, SU Guanyong. A review and research prospects on the application of the XCMS mass-spectrometry data-processing software in the environmental science field[J]. Chinese Journal of Chromatography, 2025, 43(6): 585-593.

