色谱 ›› 2022, Vol. 40 ›› Issue (9): 788-796.DOI: 10.3724/SP.J.1123.2022.03025

• 研究论文 • 上一篇    下一篇

基于修饰组和探针分子的重要途径代谢物筛选和注释新方法

李在芳1,2,#, 郑福建1,2,#, 夏悦怡1,2, 张秀琼1,2, 王鑫欣1,2, 赵春霞1,2, 赵欣捷1,2, 路鑫1,2,*(), 许国旺1,2   

  1. 1.中国科学院大连化学物理研究所, 中国科学院分离分析化学重点实验室, 辽宁省代谢组学重点实验室, 辽宁 大连 116023
    2.中国科学院大学, 北京 100049
  • 收稿日期:2022-03-17 出版日期:2022-09-08 发布日期:2022-09-26
  • 通讯作者: 路鑫
  • 作者简介:第一联系人:

    #共同第一作者.

  • 基金资助:
    国家自然科学基金项目(21874132);国家自然科学基金项目(22074145);中国科学院大连化学物理研究所科研创新基金项目(DICP I202004)

A novel method for efficient screening and annotation of important pathway-associated metabolites based on the modified metabolome and probe molecules

LI Zaifang1,2,#, ZHENG Fujian1,2,#, XIA Yueyi1,2, ZHANG Xiuqiong1,2, WANG Xinxin1,2, ZHAO Chunxia1,2, ZHAO Xinjie1,2, LU Xin1,2,*(), XU Guowang1,2   

  1. 1. CAS Key Laboratory of Separation Science for Analytical Chemistry, Liaoning Province Key Laboratory of Metabolomics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
    2. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2022-03-17 Online:2022-09-08 Published:2022-09-26
  • Contact: LU Xin
  • Supported by:
    National Natural Science Foundation of China(21874132);National Natural Science Foundation of China(22074145);Innovation Program of Science and Research from Dalian Institute of Chemical Physics, Chinese Academy of Sciences(DICP I202004)

摘要:

植物次生代谢物在抵御生物/非生物胁迫、生物间互作以及信息传递等方面发挥重要作用,次生代谢途径解析对植物分子育种、天然产物合成等方面具有重要意义。液相色谱-高分辨串联质谱(LC-HRMS/MS)为次生代谢物鉴定及途径表征提供了技术手段。非靶向LC-HRMS/MS方法可获得丰富的质谱信号,包括一级质谱和二级质谱(MS, MS/MS),但受质谱数据库规模以及次生代谢物复杂性的制约,次生代谢物注释十分困难。该研究以玉米叶片中苯丙烷途径代谢物为例,发展用于非靶向代谢组数据中重要途径代谢物的高效筛选和注释新方法。首先,利用公共代谢途径数据库及文献获取参与苯丙烷代谢途径的61种修饰反应类型,进而从非靶向实验数据中筛选出修饰代谢组。其次,获取开源串联质谱数据中的苯丙烷类化合物作为探针分子,构建探针分子质谱数据库。将探针分子与修饰代谢组共建分子网络,锁定目标途径代谢物并注释结构。该方法在正、负离子模式下分别筛选出玉米叶片中392个和417个苯丙烷途径候选代谢物,去冗余后共注释出129个代谢物,涉及苯丙烷代谢的主要分支途径,如黄酮途径的8个类黄酮、19个氧苷类黄酮和32个碳苷类黄酮,31个羟基肉桂酸途径代谢物以及22个木脂素途径代谢物;其中26个在PubChem和SciFinder数据库中未见收录。该研究利用探针分子结合修饰组可快速锁定途径代谢物,且有助于快速、准确的网络传播注释,可显著提高目标途径代谢物筛选与注释效率,为植物次生代谢途径的深入解析提供分析手段。

关键词: 液相色谱-高分辨串联质谱, 修饰代谢组, 次生代谢物, 探针分子, 注释

Abstract:

Plants produce a wide variety of secondary metabolites in the process of evolution. Secondary metabolites have highly diverse structures due to the modification of the basic skeletons of metabolites. They are required for interaction with the environment and are produced in response to abiotic/biotic stress. Characterization of secondary metabolic pathways is significant to plant molecular breeding and natural product biosynthesis. The liquid chromatography-high resolution tandem mass spectrometry (LC-HRMS/MS) is one of the major techniques for untargeted metabolomics study. The LC-HRMS/MS method could detect tens of thousands of metabolic features and provide abundant structural information. It has been widely used in the discovery and characterization of the secondary metabolome. However, due to the largely diverse structure and limited records in the mass spectral library, the annotation of the secondary metabolome is very difficult.

To address the analytical challenges associated with the vast structural diversity and the large numbers of secondary metabolites, particularly those previously unknown structural metabolites, a novel method for the efficient characterization of pathway-associated metabolites was developed. Modification reactions and MS/MS spectral information were collected using the metabolic pathways database and mass spectral library. Screening and annotation of metabolites involved in phenylpropanoid metabolism in maize leaves were used as an example. First, a database of modified groups was established via pathway-associated modifications from open access metabolic pathway database and literature. Here, pathway databases included the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Plant Metabolic Pathways (PlantCyc). A total of 61 modification types were enrolled, including 10 generic and 51 pathway-specific modifications. Modified metabolomes were filtered from untargeted LC-HRMS/MS metabolomics data. Next, MS/MS spectra of the pathway-associated compounds (probe molecules) were collected in the Global Natural Products Social Molecular Networking (GNPS) MS/MS spectral library. The MS/MS of compounds assigned to chemical classes of phenylpropanoids were kept. An MS/MS spectral database of the probe molecules was constructed. It included 2677 spectra of 1542 phenylpropanoid compounds in the positive mode and 814 spectra of 661 phenylpropanoid compounds in the negative mode. Then, an MS/MS molecular network was generated by modified metabolome and probe molecules. The clusters comprising both probe molecules and modified metabolites were kept. To explore more previously unknown structural metabolites, the clusters with one more pathway-specific modified metabolite were retained even though they didn’t contain any probe molecule. A total of 392 and 417 phenylpropanoid pathway-related metabolic metabolites were obtained in positive and negative ion modes, respectively. The pathway-associated metabolites were annotated based on the propagation of the molecular network. For the metabolites within the co-cluster, annotations were performed using the probe molecules as the initial seed. The modification group’s substructure information was used for network propagation annotation. For the clusters containing only pathway-specific modified metabolites, the annotation is similar to the above process if identified nodes were present within the cluster. Otherwise, de novo annotation was manually executed based on substructure information. Finally, 129 unique metabolites were annotated after integration and removal of redundancy. Ten annotated metabolites were validated using commercially available or synthesized reference compounds. The other annotation results were validated using predicted chemical classes, in silico MS/MS, and predicted retention time. They are mainly involved in the downstream branch of phenylpropanoid pathways, including the flavonoid pathway (8 flavonoids, 19 flavonoid O-glycosides, 32 flavonoid C-glycosides), the hydroxycinnamic acid pathway (31 hydroxycinnamic acids and derivatives), and the lignan pathway (22 neo-lignans/lignan/lignan glycosides). All the annotated structures were searched against the PubChem and SciFinder databases. Among them, 26 metabolites were previously unreported in both the databases. In this study, the pathway-associated metabolites could be quickly discovered and annotated by the integration of probe molecules and modified metabolome. It provides a method for the in-depth study of the phenylpropanoid pathway.

Key words: liquid chromatography-high resolution tandem mass spectrometry (LC-HRMS/MS), modified metabolome, secondary metabolites, probe molecule, annotation

中图分类号: