Mass spectral reconstruction of LC/MS data with entropy minimization
Graphical abstract
Introduction
Liquid chromatography-mass spectrometry (LC/MS) is a widely used technique in biochemical and biomedical research [[1], [2], [3], [4], [5]]. The technique offers a combination of sensitivity and selectivity as well as enables the analysis of a wide range of non-volatile compounds. However, LC/MS produces complex data and demands systematic, stringent and innovative data processing steps to extract meaningful information. The major challenges include resolving asymmetric, co-eluted and trace-level peaks. Current solutions to this challenge rely mainly on the optimization of experimental protocols. However, this can be very time-consuming and may not always resolve the problem, especially for complex biological samples.
Spectral deconvolution is a viable solution to resolve co-eluted and trace-level peaks in LC/MS data. Numerous empirical peak shape modeling techniques have been introduced over the years, such as the bi-Gaussian model and exponentially modified Gaussian model for this purpose [[6], [7], [8], [9]]. Other approaches apply feature selection algorithms (e.g., CODA, CAMERA, centWave) and its variants [[10], [11], [12], [13]]. However, a one-size-fit-all modeling solution may not be available since the peak shape and width in LC/MS data can vary from one equipment to another or even across repeated data from the same equipment. The analysis often requires fine adjustments of various parameter settings to ensure accurate deconvolution and is thus not easily accessible to non-experts.
In recent years, information entropy minimization approach has been implemented as a deconvolution technique [14,15]. As a blind source separation algorithm, the family of band-target entropy minimization (BTEM) algorithms can extract pure signals from mixture signals by finding the linear combination of pure components and without the need for any a priori information [[16], [17], [18], [19]]. In comparison to empirical peak shape modeling techniques, BTEM approach does not require any parameter settings and is robust in handling different forms of peak shape and width. Recently, rBTEM was introduced for the deconvolution of GC/MS data and the result showed that it was able to increase the sensitivity and accuracy for analysis of co-eluting and trace-level components in comparison to empirical-based approach [20].
In this work, we extend the information entropy minimization approach to deconvolute co-eluted and trace-level peaks in LC/MS scan data. The rBTEM variant herein, known as lc-BTEM, considers the higher resolution of data generated by a triple quadrupole system to enable the extraction of pure mass spectra without the need for any parameter settings. We demonstrate its application on LC/MS scan data of American ginseng (Panax quinquefolius) extract. The scan data contains valuable information related to molecular ions and their fragment ions resulting from in-source collisional activation dissociation at the ESI interface, which enables the application of the lc-BTEM algorithm which is based on finding the linear combination of pure components in a mixture system. The availability of pure mass spectra and inherent in-source fragments information enabled a seamless recognition of [M+H]+, [M+H-Neu]+ and [M-H]- ions and subsequently led to the putative annotation of ginsenoside subtypes for more than 50 ginsenosides.
Section snippets
Material and instrumentations
Methanol (LCMS grade), acetonitrile (LCMS grade) and formic acid (98–100%, analytical grade) were purchased from Sigma Aldirch, Merck, Singapore. American ginseng (Panax quinquefolius) which was harvested from Canada and three years old, as well as Cordyceps militaris were purchased from Hockhua Tonic, Singapore.
LC/MS analyses were performed on the Shimadzu LCMS-8040 and Agilent G6550 QTOF LC/MS. Sonic dismembrator (Fisherbrand Model 505) was used for ultra-sonicating the samples.
Sample preparation
Panax
Results and discussion
Full-scan positive- and negative-ion mode LC/MS analysis were performed on the ginseng extract. The positive-ion mode TIC is shown in Fig. 1, with the inset indicating two regions of interests for analysis, specifically region A and B. Region A consists of a trace-level peak while region B contains a subtle overlapping peak which can be easily mistaken as a pure component peak. These regions were systematically analyzed with lc-BTEM to extract the pure component spectra for putative annotation.
Conclusion
In this work, a spectral deconvolution approach for LC/MS data is introduced for the analysis of co-eluting and trace-level components in full-scan mode. The extracted pure component spectra provide crucial information of clean in-source fragmentation patterns and characteristics fragment ions which are especially useful for a quick survey on the total amount and subtypes of ginsenosides present in the ginseng extract of this work.
More importantly, this study demonstrated the capability of
CRediT authorship contribution statement
Hua Jun Zhang: Supervision, Conceptualization, Software. Yunbo Lv: Conceptualization, Data curation, Project administration, Writing - original draft. Chun Kiang Chua: Conceptualization, Visualization, Writing - original draft. Tai Guo: Software. Zhe Sun: Writing - review & editing. Zhaoqi Zhan: Supervision, Writing - review & editing.
Declaration of competing interest
The SmartDalton software is from ChemoPower Technology Pte Ltd in Singapore. Hua Jun Zhang, Yunbo Lv, Chun Kiang Chua and Tai Guo worked for ChemoPower Technology Pte Ltd at the time of submission.
Acknowledgment
We would like to thank Ms. Li Limin for preparing the ginseng extract.
References (37)
- et al.
Current approaches and challenges for the metabolite profiling of complex natural extracts
J. Chromatogr. A
(2015) Foundation review: principles and applications of LC-MS in new drug discovery
Drug Discov. Today
(2005)- et al.
Mathematical functions for the representation of chromatographic peaks
J. Chromatogr. A
(2001) - et al.
Chemometric analysis of complex hyphenated data - improvements of the component detection algorithm
J. Chromatogr. A
(2007) - et al.
Weighted two-band target entropy minimization for the reconstruction of pure component mass spectra: simulation studies and the application to real systems
J. Am. Soc. Mass Spectrom.
(2003) - et al.
A strategy for efficient discovery of new natural compounds by integrating orthogonal column chromatography and liquid chromatography/mass spectrometry analysis: its application in Panax ginseng, Panax quinquefolium and Panax notoginseng to characterize 437 potential new ginsenosides
Anal. Chim. Acta
(2012) - et al.
Ginsenosides from American ginseng: chemical and pharmacological diversity
Phytochemistry
(2011) - et al.
Chemical analysis of Panax quinquefolius (North American ginseng): a review
J. Chromatogr. A
(2015) - et al.
Generation of ginsenosides Rg3 and Rh2 from north American ginseng
Phytochemistry
(2004) - et al.
Ginsenoside content and variation among and within American ginseng (Panax quinquefolius L.) populations
Phytochemistry
(2006)
Determination of ginsenosides in Asian and American ginsengs by liquid chromatography–quadrupole/time-of-flight MS: assessing variations based on morphological characteristics
J. Ginseng Res.
The integration of GC–MS and LC–MS to assay the metabolomics profiling in Panax ginseng and Panax quinquefolius reveals a tissue-and species-specific connectivity of primary metabolites and ginsenosides accumulation
J. Pharmaceut. Biomed. Anal.
A metabolomics approach for authentication of Ophiocordyceps sinensis by liquid chromatography coupled with quadrupole time-of-flight mass spectrometry
Food Res. Int.
Electrospray ionization for mass spectrometry of large biomolecules
Science
Electrospray ionization mass spectrometry: a technique to access the information beyond the molecular weight of the analyte
Int. J. Anal. Chem.
Proposed quantitative and alphanumeric metabolite identification metrics
Metabolomics
apLCMS—adaptive processing of high-resolution LC/MS data
Bioinformatics
MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data
BMC Bioinf.
Cited by (3)
An authentic assessment method for cordyceps sinensis
2024, Journal of Pharmaceutical and Biomedical AnalysisProfiling complex volatile components by HS-GC-MS and entropy minimization software: An example on Ligusticum chuanxiong Hort.
2022, Journal of Pharmaceutical and Biomedical AnalysisCitation Excerpt :For such cases, mathematical algorithms may provide possible solutions for many problems that cannot be solved by experiments in the field of chemical analysis. To obtain pure mass spectra, mathematical deconvolution of mass spectral data is proven a more practical solution [5–7]. The empirical-based solution, as one class of the mathematical algorithms, was an early strategy for mass spectral deconvolution.
Reconstruction of ultra-high vacuum mass spectra using genetic algorithms
2021, Applied Sciences (Switzerland)