Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer

Emma Yue Zhang, Massimo Cristofanilli, Fredika Robertson, James M. Reuben, Zhaomei Mu, Ronald C. Beavis, Hogune Im, Michael Snyder, Matan Hofree, Trey Ideker, Gilbert S. Omenn, Susan Fanayan, Seul Ki Jeong, Young-Ki Paik, Anna Fan Zhang, Shiaw Lin Wu, William S. Hancock

Research output: Contribution to journalArticle

27 Citations (Scopus)

Abstract

In this study we selected three breast cancer cell lines (SKBR3, SUM149 and SUM190) with different oncogene expression levels involved in ERBB2 and EGFR signaling pathways as a model system for the evaluation of selective integration of subsets of transcriptomic and proteomic data. We assessed the oncogene status with reads per kilobase per million mapped reads (RPKM) values for ERBB2 (14.4, 400, and 300 for SUM149, SUM190, and SKBR3, respectively) and for EGFR (60.1, not detected, and 1.4 for the same 3 cell lines). We then used RNA-Seq data to identify those oncogenes with significant transcript levels in these cell lines (total 31) and interrogated the corresponding proteomics data sets for proteins with significant interaction values with these oncogenes. The number of observed interactors for each oncogene showed a significant range, e.g., 4.2% (JAK1) to 27.3% (MYC). The percentage is measured as a fraction of the total protein interactions in a given data set vs total interactors for that oncogene in STRING (Search Tool for the Retrieval of Interacting Genes/Proteins, version 9.0) and I2D (Interologous Interaction Database, version 1.95). This approach allowed us to focus on 4 main oncogenes, ERBB2, EGFR, MYC, and GRB2, for pathway analysis. We used bioinformatics sites GeneGo, PathwayCommons and NCI receptor signaling networks to identify pathways that contained the four main oncogenes and had good coverage in the transcriptomic and proteomic data sets as well as a significant number of oncogene interactors. The four pathways identified were ERBB signaling, EGFR1 signaling, integrin outside-in signaling, and validated targets of C-MYC transcriptional activation. The greater dynamic range of the RNA-Seq values allowed the use of transcript ratios to correlate observed protein values with the relative levels of the ERBB2 and EGFR transcripts in each of the four pathways. This provided us with potential proteomic signatures for the SUM149 and 190 cell lines, growth factor receptor-bound protein 7 (GRB7), Crk-like protein (CRKL) and Catenin delta-1 (CTNND1) for ERBB signaling; caveolin 1 (CAV1), plectin (PLEC) for EGFR signaling; filamin A (FLNA) and actinin alpha1 (ACTN1) (associated with high levels of EGFR transcript) for integrin signalings; branched chain amino-acid transaminase 1 (BCAT1), carbamoyl-phosphate synthetase (CAD), nucleolin (NCL) (high levels of EGFR transcript); transferrin receptor (TFRC), metadherin (MTDH) (high levels of ERBB2 transcript) for MYC signaling; S100-A2 protein (S100A2), caveolin 1 (CAV1), Serpin B5 (SERPINB5), stratifin (SFN), PYD and CARD domain containing (PYCARD), and EPH receptor A2 (EPHA2) for PI3K signaling, p53 subpathway. Future studies of inflammatory breast cancer (IBC), from which the cell lines were derived, will be used to explore the significance of these observations.

Original languageEnglish
Pages (from-to)2805-2817
Number of pages13
JournalJournal of Proteome Research
Volume12
Issue number6
DOIs
Publication statusPublished - 2013 Jul 1

Fingerprint

Inflammatory Breast Neoplasms
Oncogenes
Proteomics
Genes
Genome
Cells
Cell Line
Caveolin 1
Proteins
GRB7 Adaptor Protein
varespladib methyl
Integrins
Plectin
Filamins
RNA
Carbamyl Phosphate
Serpins
Actinin
Transferrin Receptors
S100 Proteins

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Chemistry(all)

Cite this

Zhang, E. Y., Cristofanilli, M., Robertson, F., Reuben, J. M., Mu, Z., Beavis, R. C., ... Hancock, W. S. (2013). Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer. Journal of Proteome Research, 12(6), 2805-2817. https://doi.org/10.1021/pr4001527
Zhang, Emma Yue ; Cristofanilli, Massimo ; Robertson, Fredika ; Reuben, James M. ; Mu, Zhaomei ; Beavis, Ronald C. ; Im, Hogune ; Snyder, Michael ; Hofree, Matan ; Ideker, Trey ; Omenn, Gilbert S. ; Fanayan, Susan ; Jeong, Seul Ki ; Paik, Young-Ki ; Zhang, Anna Fan ; Wu, Shiaw Lin ; Hancock, William S. / Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer. In: Journal of Proteome Research. 2013 ; Vol. 12, No. 6. pp. 2805-2817.
@article{7420e371fa8e49648d94a8a5ba14a9c3,
title = "Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer",
abstract = "In this study we selected three breast cancer cell lines (SKBR3, SUM149 and SUM190) with different oncogene expression levels involved in ERBB2 and EGFR signaling pathways as a model system for the evaluation of selective integration of subsets of transcriptomic and proteomic data. We assessed the oncogene status with reads per kilobase per million mapped reads (RPKM) values for ERBB2 (14.4, 400, and 300 for SUM149, SUM190, and SKBR3, respectively) and for EGFR (60.1, not detected, and 1.4 for the same 3 cell lines). We then used RNA-Seq data to identify those oncogenes with significant transcript levels in these cell lines (total 31) and interrogated the corresponding proteomics data sets for proteins with significant interaction values with these oncogenes. The number of observed interactors for each oncogene showed a significant range, e.g., 4.2{\%} (JAK1) to 27.3{\%} (MYC). The percentage is measured as a fraction of the total protein interactions in a given data set vs total interactors for that oncogene in STRING (Search Tool for the Retrieval of Interacting Genes/Proteins, version 9.0) and I2D (Interologous Interaction Database, version 1.95). This approach allowed us to focus on 4 main oncogenes, ERBB2, EGFR, MYC, and GRB2, for pathway analysis. We used bioinformatics sites GeneGo, PathwayCommons and NCI receptor signaling networks to identify pathways that contained the four main oncogenes and had good coverage in the transcriptomic and proteomic data sets as well as a significant number of oncogene interactors. The four pathways identified were ERBB signaling, EGFR1 signaling, integrin outside-in signaling, and validated targets of C-MYC transcriptional activation. The greater dynamic range of the RNA-Seq values allowed the use of transcript ratios to correlate observed protein values with the relative levels of the ERBB2 and EGFR transcripts in each of the four pathways. This provided us with potential proteomic signatures for the SUM149 and 190 cell lines, growth factor receptor-bound protein 7 (GRB7), Crk-like protein (CRKL) and Catenin delta-1 (CTNND1) for ERBB signaling; caveolin 1 (CAV1), plectin (PLEC) for EGFR signaling; filamin A (FLNA) and actinin alpha1 (ACTN1) (associated with high levels of EGFR transcript) for integrin signalings; branched chain amino-acid transaminase 1 (BCAT1), carbamoyl-phosphate synthetase (CAD), nucleolin (NCL) (high levels of EGFR transcript); transferrin receptor (TFRC), metadherin (MTDH) (high levels of ERBB2 transcript) for MYC signaling; S100-A2 protein (S100A2), caveolin 1 (CAV1), Serpin B5 (SERPINB5), stratifin (SFN), PYD and CARD domain containing (PYCARD), and EPH receptor A2 (EPHA2) for PI3K signaling, p53 subpathway. Future studies of inflammatory breast cancer (IBC), from which the cell lines were derived, will be used to explore the significance of these observations.",
author = "Zhang, {Emma Yue} and Massimo Cristofanilli and Fredika Robertson and Reuben, {James M.} and Zhaomei Mu and Beavis, {Ronald C.} and Hogune Im and Michael Snyder and Matan Hofree and Trey Ideker and Omenn, {Gilbert S.} and Susan Fanayan and Jeong, {Seul Ki} and Young-Ki Paik and Zhang, {Anna Fan} and Wu, {Shiaw Lin} and Hancock, {William S.}",
year = "2013",
month = "7",
day = "1",
doi = "10.1021/pr4001527",
language = "English",
volume = "12",
pages = "2805--2817",
journal = "Journal of Proteome Research",
issn = "1535-3893",
publisher = "American Chemical Society",
number = "6",

}

Zhang, EY, Cristofanilli, M, Robertson, F, Reuben, JM, Mu, Z, Beavis, RC, Im, H, Snyder, M, Hofree, M, Ideker, T, Omenn, GS, Fanayan, S, Jeong, SK, Paik, Y-K, Zhang, AF, Wu, SL & Hancock, WS 2013, 'Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer', Journal of Proteome Research, vol. 12, no. 6, pp. 2805-2817. https://doi.org/10.1021/pr4001527

Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer. / Zhang, Emma Yue; Cristofanilli, Massimo; Robertson, Fredika; Reuben, James M.; Mu, Zhaomei; Beavis, Ronald C.; Im, Hogune; Snyder, Michael; Hofree, Matan; Ideker, Trey; Omenn, Gilbert S.; Fanayan, Susan; Jeong, Seul Ki; Paik, Young-Ki; Zhang, Anna Fan; Wu, Shiaw Lin; Hancock, William S.

In: Journal of Proteome Research, Vol. 12, No. 6, 01.07.2013, p. 2805-2817.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Genome wide proteomics of ERBB2 and EGFR and other oncogenic pathways in inflammatory breast cancer

AU - Zhang, Emma Yue

AU - Cristofanilli, Massimo

AU - Robertson, Fredika

AU - Reuben, James M.

AU - Mu, Zhaomei

AU - Beavis, Ronald C.

AU - Im, Hogune

AU - Snyder, Michael

AU - Hofree, Matan

AU - Ideker, Trey

AU - Omenn, Gilbert S.

AU - Fanayan, Susan

AU - Jeong, Seul Ki

AU - Paik, Young-Ki

AU - Zhang, Anna Fan

AU - Wu, Shiaw Lin

AU - Hancock, William S.

PY - 2013/7/1

Y1 - 2013/7/1

N2 - In this study we selected three breast cancer cell lines (SKBR3, SUM149 and SUM190) with different oncogene expression levels involved in ERBB2 and EGFR signaling pathways as a model system for the evaluation of selective integration of subsets of transcriptomic and proteomic data. We assessed the oncogene status with reads per kilobase per million mapped reads (RPKM) values for ERBB2 (14.4, 400, and 300 for SUM149, SUM190, and SKBR3, respectively) and for EGFR (60.1, not detected, and 1.4 for the same 3 cell lines). We then used RNA-Seq data to identify those oncogenes with significant transcript levels in these cell lines (total 31) and interrogated the corresponding proteomics data sets for proteins with significant interaction values with these oncogenes. The number of observed interactors for each oncogene showed a significant range, e.g., 4.2% (JAK1) to 27.3% (MYC). The percentage is measured as a fraction of the total protein interactions in a given data set vs total interactors for that oncogene in STRING (Search Tool for the Retrieval of Interacting Genes/Proteins, version 9.0) and I2D (Interologous Interaction Database, version 1.95). This approach allowed us to focus on 4 main oncogenes, ERBB2, EGFR, MYC, and GRB2, for pathway analysis. We used bioinformatics sites GeneGo, PathwayCommons and NCI receptor signaling networks to identify pathways that contained the four main oncogenes and had good coverage in the transcriptomic and proteomic data sets as well as a significant number of oncogene interactors. The four pathways identified were ERBB signaling, EGFR1 signaling, integrin outside-in signaling, and validated targets of C-MYC transcriptional activation. The greater dynamic range of the RNA-Seq values allowed the use of transcript ratios to correlate observed protein values with the relative levels of the ERBB2 and EGFR transcripts in each of the four pathways. This provided us with potential proteomic signatures for the SUM149 and 190 cell lines, growth factor receptor-bound protein 7 (GRB7), Crk-like protein (CRKL) and Catenin delta-1 (CTNND1) for ERBB signaling; caveolin 1 (CAV1), plectin (PLEC) for EGFR signaling; filamin A (FLNA) and actinin alpha1 (ACTN1) (associated with high levels of EGFR transcript) for integrin signalings; branched chain amino-acid transaminase 1 (BCAT1), carbamoyl-phosphate synthetase (CAD), nucleolin (NCL) (high levels of EGFR transcript); transferrin receptor (TFRC), metadherin (MTDH) (high levels of ERBB2 transcript) for MYC signaling; S100-A2 protein (S100A2), caveolin 1 (CAV1), Serpin B5 (SERPINB5), stratifin (SFN), PYD and CARD domain containing (PYCARD), and EPH receptor A2 (EPHA2) for PI3K signaling, p53 subpathway. Future studies of inflammatory breast cancer (IBC), from which the cell lines were derived, will be used to explore the significance of these observations.

AB - In this study we selected three breast cancer cell lines (SKBR3, SUM149 and SUM190) with different oncogene expression levels involved in ERBB2 and EGFR signaling pathways as a model system for the evaluation of selective integration of subsets of transcriptomic and proteomic data. We assessed the oncogene status with reads per kilobase per million mapped reads (RPKM) values for ERBB2 (14.4, 400, and 300 for SUM149, SUM190, and SKBR3, respectively) and for EGFR (60.1, not detected, and 1.4 for the same 3 cell lines). We then used RNA-Seq data to identify those oncogenes with significant transcript levels in these cell lines (total 31) and interrogated the corresponding proteomics data sets for proteins with significant interaction values with these oncogenes. The number of observed interactors for each oncogene showed a significant range, e.g., 4.2% (JAK1) to 27.3% (MYC). The percentage is measured as a fraction of the total protein interactions in a given data set vs total interactors for that oncogene in STRING (Search Tool for the Retrieval of Interacting Genes/Proteins, version 9.0) and I2D (Interologous Interaction Database, version 1.95). This approach allowed us to focus on 4 main oncogenes, ERBB2, EGFR, MYC, and GRB2, for pathway analysis. We used bioinformatics sites GeneGo, PathwayCommons and NCI receptor signaling networks to identify pathways that contained the four main oncogenes and had good coverage in the transcriptomic and proteomic data sets as well as a significant number of oncogene interactors. The four pathways identified were ERBB signaling, EGFR1 signaling, integrin outside-in signaling, and validated targets of C-MYC transcriptional activation. The greater dynamic range of the RNA-Seq values allowed the use of transcript ratios to correlate observed protein values with the relative levels of the ERBB2 and EGFR transcripts in each of the four pathways. This provided us with potential proteomic signatures for the SUM149 and 190 cell lines, growth factor receptor-bound protein 7 (GRB7), Crk-like protein (CRKL) and Catenin delta-1 (CTNND1) for ERBB signaling; caveolin 1 (CAV1), plectin (PLEC) for EGFR signaling; filamin A (FLNA) and actinin alpha1 (ACTN1) (associated with high levels of EGFR transcript) for integrin signalings; branched chain amino-acid transaminase 1 (BCAT1), carbamoyl-phosphate synthetase (CAD), nucleolin (NCL) (high levels of EGFR transcript); transferrin receptor (TFRC), metadherin (MTDH) (high levels of ERBB2 transcript) for MYC signaling; S100-A2 protein (S100A2), caveolin 1 (CAV1), Serpin B5 (SERPINB5), stratifin (SFN), PYD and CARD domain containing (PYCARD), and EPH receptor A2 (EPHA2) for PI3K signaling, p53 subpathway. Future studies of inflammatory breast cancer (IBC), from which the cell lines were derived, will be used to explore the significance of these observations.

UR - http://www.scopus.com/inward/record.url?scp=84879327240&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84879327240&partnerID=8YFLogxK

U2 - 10.1021/pr4001527

DO - 10.1021/pr4001527

M3 - Article

VL - 12

SP - 2805

EP - 2817

JO - Journal of Proteome Research

JF - Journal of Proteome Research

SN - 1535-3893

IS - 6

ER -