Estimating influence of cofragmentation on peptide quantification and identification in itraq experiments by simulating multiplexed spectra

Honglan Li, Kyu Baek Hwang, Dong Gi Mun, Hokeun Kim, Hangyeore Lee, Sang-Won Lee, Eunok Paek

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

Isobaric tag-based quantification such as iTRAQ and TMT is a promising approach to mass spectrometry-based quantification in proteomics as it provides wide proteome coverage with greatly increased experimental throughput. However, it is known to suffer from inaccurate quantification and identification of a target peptide due to cofragmentation of multiple peptides, which likely leads to under-estimation of differentially expressed peptides (DEPs). A simple method of filtering out cofragmented spectra with less than 100% precursor isolation purity (PIP) would decrease the coverage of iTRAQ/TMT experiments. In order to estimate the impact of cofragmentation on quantification and identification of iTRAQ-labeled peptide samples, we generated multiplexed spectra with varying degrees of PIP by mixing the two MS/MS spectra of 100% PIP obtained in global proteome profiling experiments on gastric tumor-normal tissue pair proteomes labeled by 4-plex iTRAQ. Despite cofragmentation, the simulation experiments showed that more than 99% of multiplexed spectra with PIP greater than 80% were correctly identified by three different database search engines-MODa, MS-GF+, and Proteome Discoverer. Using the multiplexed spectra that have been correctly identified, we estimated the effect of cofragmentation on peptide quantification. In 74% of the multiplexed spectra, however, the cancer-to-normal expression ratio was compressed, and a fair number of spectra showed the "ratio inflation" phenomenon. On the basis of the estimated distribution of distortions on quantification, we were able to calculate cutoff values for DEP detection from cofragmented spectra, which were corrected according to a specific PIP and probability of type I (or type II) error. When we applied these corrected cutoff values to real cofragmented spectra with PIP larger than or equal to 70%, we were able to identify reliable DEPs by removing about 25% of DEPs, which are highly likely to be false positives. Our experimental results provide useful insight into the effect of cofragmentation on isobaric tag-based quantification methods. The simulation procedure as well as the corrected cutoff calculation method could be adopted for quantifying the effect of cofragmentation and reducing false positives (or false negatives) in the DEP identification with general quantification experiments based on isobaric labeling techniques.

Original languageEnglish
Pages (from-to)3488-3497
Number of pages10
JournalJournal of Proteome Research
Volume13
Issue number7
DOIs
Publication statusPublished - 2014 Jul 3

Fingerprint

Peptides
Proteome
Experiments
Search Engine
Economic Inflation
Search engines
Proteomics
Labeling
Mass spectrometry
Tumors
Mass Spectrometry
Neoplasms
Stomach
Throughput
Databases
Tissue

ASJC Scopus subject areas

  • Biochemistry
  • Chemistry(all)

Cite this

Estimating influence of cofragmentation on peptide quantification and identification in itraq experiments by simulating multiplexed spectra. / Li, Honglan; Hwang, Kyu Baek; Mun, Dong Gi; Kim, Hokeun; Lee, Hangyeore; Lee, Sang-Won; Paek, Eunok.

In: Journal of Proteome Research, Vol. 13, No. 7, 03.07.2014, p. 3488-3497.

Research output: Contribution to journalArticle

Li, Honglan ; Hwang, Kyu Baek ; Mun, Dong Gi ; Kim, Hokeun ; Lee, Hangyeore ; Lee, Sang-Won ; Paek, Eunok. / Estimating influence of cofragmentation on peptide quantification and identification in itraq experiments by simulating multiplexed spectra. In: Journal of Proteome Research. 2014 ; Vol. 13, No. 7. pp. 3488-3497.
@article{8b937197c74d4b52b1af30d699432bad,
title = "Estimating influence of cofragmentation on peptide quantification and identification in itraq experiments by simulating multiplexed spectra",
abstract = "Isobaric tag-based quantification such as iTRAQ and TMT is a promising approach to mass spectrometry-based quantification in proteomics as it provides wide proteome coverage with greatly increased experimental throughput. However, it is known to suffer from inaccurate quantification and identification of a target peptide due to cofragmentation of multiple peptides, which likely leads to under-estimation of differentially expressed peptides (DEPs). A simple method of filtering out cofragmented spectra with less than 100{\%} precursor isolation purity (PIP) would decrease the coverage of iTRAQ/TMT experiments. In order to estimate the impact of cofragmentation on quantification and identification of iTRAQ-labeled peptide samples, we generated multiplexed spectra with varying degrees of PIP by mixing the two MS/MS spectra of 100{\%} PIP obtained in global proteome profiling experiments on gastric tumor-normal tissue pair proteomes labeled by 4-plex iTRAQ. Despite cofragmentation, the simulation experiments showed that more than 99{\%} of multiplexed spectra with PIP greater than 80{\%} were correctly identified by three different database search engines-MODa, MS-GF+, and Proteome Discoverer. Using the multiplexed spectra that have been correctly identified, we estimated the effect of cofragmentation on peptide quantification. In 74{\%} of the multiplexed spectra, however, the cancer-to-normal expression ratio was compressed, and a fair number of spectra showed the {"}ratio inflation{"} phenomenon. On the basis of the estimated distribution of distortions on quantification, we were able to calculate cutoff values for DEP detection from cofragmented spectra, which were corrected according to a specific PIP and probability of type I (or type II) error. When we applied these corrected cutoff values to real cofragmented spectra with PIP larger than or equal to 70{\%}, we were able to identify reliable DEPs by removing about 25{\%} of DEPs, which are highly likely to be false positives. Our experimental results provide useful insight into the effect of cofragmentation on isobaric tag-based quantification methods. The simulation procedure as well as the corrected cutoff calculation method could be adopted for quantifying the effect of cofragmentation and reducing false positives (or false negatives) in the DEP identification with general quantification experiments based on isobaric labeling techniques.",
keywords = "cofragmentation, differentially expressed peptides, isobaric tag-based quantification, iTRAQ, multiplexed spectra, peptide quantification, simulation",
author = "Honglan Li and Hwang, {Kyu Baek} and Mun, {Dong Gi} and Hokeun Kim and Hangyeore Lee and Sang-Won Lee and Eunok Paek",
year = "2014",
month = "7",
day = "3",
doi = "10.1021/pr500060d",
language = "English",
volume = "13",
pages = "3488--3497",
journal = "Journal of Proteome Research",
issn = "1535-3893",
publisher = "American Chemical Society",
number = "7",

}

TY - JOUR

T1 - Estimating influence of cofragmentation on peptide quantification and identification in itraq experiments by simulating multiplexed spectra

AU - Li, Honglan

AU - Hwang, Kyu Baek

AU - Mun, Dong Gi

AU - Kim, Hokeun

AU - Lee, Hangyeore

AU - Lee, Sang-Won

AU - Paek, Eunok

PY - 2014/7/3

Y1 - 2014/7/3

N2 - Isobaric tag-based quantification such as iTRAQ and TMT is a promising approach to mass spectrometry-based quantification in proteomics as it provides wide proteome coverage with greatly increased experimental throughput. However, it is known to suffer from inaccurate quantification and identification of a target peptide due to cofragmentation of multiple peptides, which likely leads to under-estimation of differentially expressed peptides (DEPs). A simple method of filtering out cofragmented spectra with less than 100% precursor isolation purity (PIP) would decrease the coverage of iTRAQ/TMT experiments. In order to estimate the impact of cofragmentation on quantification and identification of iTRAQ-labeled peptide samples, we generated multiplexed spectra with varying degrees of PIP by mixing the two MS/MS spectra of 100% PIP obtained in global proteome profiling experiments on gastric tumor-normal tissue pair proteomes labeled by 4-plex iTRAQ. Despite cofragmentation, the simulation experiments showed that more than 99% of multiplexed spectra with PIP greater than 80% were correctly identified by three different database search engines-MODa, MS-GF+, and Proteome Discoverer. Using the multiplexed spectra that have been correctly identified, we estimated the effect of cofragmentation on peptide quantification. In 74% of the multiplexed spectra, however, the cancer-to-normal expression ratio was compressed, and a fair number of spectra showed the "ratio inflation" phenomenon. On the basis of the estimated distribution of distortions on quantification, we were able to calculate cutoff values for DEP detection from cofragmented spectra, which were corrected according to a specific PIP and probability of type I (or type II) error. When we applied these corrected cutoff values to real cofragmented spectra with PIP larger than or equal to 70%, we were able to identify reliable DEPs by removing about 25% of DEPs, which are highly likely to be false positives. Our experimental results provide useful insight into the effect of cofragmentation on isobaric tag-based quantification methods. The simulation procedure as well as the corrected cutoff calculation method could be adopted for quantifying the effect of cofragmentation and reducing false positives (or false negatives) in the DEP identification with general quantification experiments based on isobaric labeling techniques.

AB - Isobaric tag-based quantification such as iTRAQ and TMT is a promising approach to mass spectrometry-based quantification in proteomics as it provides wide proteome coverage with greatly increased experimental throughput. However, it is known to suffer from inaccurate quantification and identification of a target peptide due to cofragmentation of multiple peptides, which likely leads to under-estimation of differentially expressed peptides (DEPs). A simple method of filtering out cofragmented spectra with less than 100% precursor isolation purity (PIP) would decrease the coverage of iTRAQ/TMT experiments. In order to estimate the impact of cofragmentation on quantification and identification of iTRAQ-labeled peptide samples, we generated multiplexed spectra with varying degrees of PIP by mixing the two MS/MS spectra of 100% PIP obtained in global proteome profiling experiments on gastric tumor-normal tissue pair proteomes labeled by 4-plex iTRAQ. Despite cofragmentation, the simulation experiments showed that more than 99% of multiplexed spectra with PIP greater than 80% were correctly identified by three different database search engines-MODa, MS-GF+, and Proteome Discoverer. Using the multiplexed spectra that have been correctly identified, we estimated the effect of cofragmentation on peptide quantification. In 74% of the multiplexed spectra, however, the cancer-to-normal expression ratio was compressed, and a fair number of spectra showed the "ratio inflation" phenomenon. On the basis of the estimated distribution of distortions on quantification, we were able to calculate cutoff values for DEP detection from cofragmented spectra, which were corrected according to a specific PIP and probability of type I (or type II) error. When we applied these corrected cutoff values to real cofragmented spectra with PIP larger than or equal to 70%, we were able to identify reliable DEPs by removing about 25% of DEPs, which are highly likely to be false positives. Our experimental results provide useful insight into the effect of cofragmentation on isobaric tag-based quantification methods. The simulation procedure as well as the corrected cutoff calculation method could be adopted for quantifying the effect of cofragmentation and reducing false positives (or false negatives) in the DEP identification with general quantification experiments based on isobaric labeling techniques.

KW - cofragmentation

KW - differentially expressed peptides

KW - isobaric tag-based quantification

KW - iTRAQ

KW - multiplexed spectra

KW - peptide quantification

KW - simulation

UR - http://www.scopus.com/inward/record.url?scp=84903697568&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84903697568&partnerID=8YFLogxK

U2 - 10.1021/pr500060d

DO - 10.1021/pr500060d

M3 - Article

VL - 13

SP - 3488

EP - 3497

JO - Journal of Proteome Research

JF - Journal of Proteome Research

SN - 1535-3893

IS - 7

ER -