Serveur d'exploration sur Pittsburgh

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Network Analysis of Breast Cancer Progression and Reversal Using a Tree-Evolving Network Algorithm

Identifieur interne : 000260 ( Pmc/Corpus ); précédent : 000259; suivant : 000261

Network Analysis of Breast Cancer Progression and Reversal Using a Tree-Evolving Network Algorithm

Auteurs : Ankur P. Parikh ; Ross E. Curtis ; Irene Kuhn ; Sabine Becker-Weimann ; Mina Bissell ; Eric P. Xing ; Wei Wu

Source :

RBID : PMC:4109850

Abstract

The HMT3522 progression series of human breast cells have been used to discover how tissue architecture, microenvironment and signaling molecules affect breast cell growth and behaviors. However, much remains to be elucidated about malignant and phenotypic reversion behaviors of the HMT3522-T4-2 cells of this series. We employed a “pan-cell-state” strategy, and analyzed jointly microarray profiles obtained from different state-specific cell populations from this progression and reversion model of the breast cells using a tree-lineage multi-network inference algorithm, Treegl. We found that different breast cell states contain distinct gene networks. The network specific to non-malignant HMT3522-S1 cells is dominated by genes involved in normal processes, whereas the T4-2-specific network is enriched with cancer-related genes. The networks specific to various conditions of the reverted T4-2 cells are enriched with pathways suggestive of compensatory effects, consistent with clinical data showing patient resistance to anticancer drugs. We validated the findings using an external dataset, and showed that aberrant expression values of certain hubs in the identified networks are associated with poor clinical outcomes. Thus, analysis of various reversion conditions (including non-reverted) of HMT3522 cells using Treegl can be a good model system to study drug effects on breast cancer.


Url:
DOI: 10.1371/journal.pcbi.1003713
PubMed: 25057922
PubMed Central: 4109850

Links to Exploration step

PMC:4109850

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Network Analysis of Breast Cancer Progression and Reversal Using a Tree-Evolving Network Algorithm</title>
<author>
<name sortKey="Parikh, Ankur P" sort="Parikh, Ankur P" uniqKey="Parikh A" first="Ankur P." last="Parikh">Ankur P. Parikh</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Curtis, Ross E" sort="Curtis, Ross E" uniqKey="Curtis R" first="Ross E." last="Curtis">Ross E. Curtis</name>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kuhn, Irene" sort="Kuhn, Irene" uniqKey="Kuhn I" first="Irene" last="Kuhn">Irene Kuhn</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Becker Weimann, Sabine" sort="Becker Weimann, Sabine" uniqKey="Becker Weimann S" first="Sabine" last="Becker-Weimann">Sabine Becker-Weimann</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Bissell, Mina" sort="Bissell, Mina" uniqKey="Bissell M" first="Mina" last="Bissell">Mina Bissell</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Xing, Eric P" sort="Xing, Eric P" uniqKey="Xing E" first="Eric P." last="Xing">Eric P. Xing</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4">
<addr-line>Joint Carnegie Mellon University-University of Pittsburgh PhD Program in Computational Biology, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wu, Wei" sort="Wu, Wei" uniqKey="Wu W" first="Wei" last="Wu">Wei Wu</name>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25057922</idno>
<idno type="pmc">4109850</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4109850</idno>
<idno type="RBID">PMC:4109850</idno>
<idno type="doi">10.1371/journal.pcbi.1003713</idno>
<date when="2014">2014</date>
<idno type="wicri:Area/Pmc/Corpus">000260</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000260</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Network Analysis of Breast Cancer Progression and Reversal Using a Tree-Evolving Network Algorithm</title>
<author>
<name sortKey="Parikh, Ankur P" sort="Parikh, Ankur P" uniqKey="Parikh A" first="Ankur P." last="Parikh">Ankur P. Parikh</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Curtis, Ross E" sort="Curtis, Ross E" uniqKey="Curtis R" first="Ross E." last="Curtis">Ross E. Curtis</name>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kuhn, Irene" sort="Kuhn, Irene" uniqKey="Kuhn I" first="Irene" last="Kuhn">Irene Kuhn</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Becker Weimann, Sabine" sort="Becker Weimann, Sabine" uniqKey="Becker Weimann S" first="Sabine" last="Becker-Weimann">Sabine Becker-Weimann</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Bissell, Mina" sort="Bissell, Mina" uniqKey="Bissell M" first="Mina" last="Bissell">Mina Bissell</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Xing, Eric P" sort="Xing, Eric P" uniqKey="Xing E" first="Eric P." last="Xing">Eric P. Xing</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4">
<addr-line>Joint Carnegie Mellon University-University of Pittsburgh PhD Program in Computational Biology, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wu, Wei" sort="Wu, Wei" uniqKey="Wu W" first="Wei" last="Wu">Wei Wu</name>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">PLoS Computational Biology</title>
<idno type="ISSN">1553-734X</idno>
<idno type="eISSN">1553-7358</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>The HMT3522 progression series of human breast cells have been used to discover how tissue architecture, microenvironment and signaling molecules affect breast cell growth and behaviors. However, much remains to be elucidated about malignant and phenotypic reversion behaviors of the HMT3522-T4-2 cells of this series. We employed a “pan-cell-state” strategy, and analyzed jointly microarray profiles obtained from different state-specific cell populations from this progression and reversion model of the breast cells using a tree-lineage multi-network inference algorithm,
<italic>Treegl</italic>
. We found that different breast cell states contain distinct gene networks. The network specific to non-malignant HMT3522-S1 cells is dominated by genes involved in normal processes, whereas the T4-2-specific network is enriched with cancer-related genes. The networks specific to various conditions of the reverted T4-2 cells are enriched with pathways suggestive of compensatory effects, consistent with clinical data showing patient resistance to anticancer drugs. We validated the findings using an external dataset, and showed that aberrant expression values of certain hubs in the identified networks are associated with poor clinical outcomes. Thus, analysis of various reversion conditions (including non-reverted) of HMT3522 cells using
<italic>Treegl</italic>
can be a good model system to study drug effects on breast cancer.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Luscombe, Nm" uniqKey="Luscombe N">NM Luscombe</name>
</author>
<author>
<name sortKey="Babu, Mm" uniqKey="Babu M">MM Babu</name>
</author>
<author>
<name sortKey="Yu, H" uniqKey="Yu H">H Yu</name>
</author>
<author>
<name sortKey="Snyder, M" uniqKey="Snyder M">M Snyder</name>
</author>
<author>
<name sortKey="Teichmann, Sa" uniqKey="Teichmann S">SA Teichmann</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ernst, J" uniqKey="Ernst J">J Ernst</name>
</author>
<author>
<name sortKey="Vainas, O" uniqKey="Vainas O">O Vainas</name>
</author>
<author>
<name sortKey="Harbison, Ct" uniqKey="Harbison C">CT Harbison</name>
</author>
<author>
<name sortKey="Simon, I" uniqKey="Simon I">I Simon</name>
</author>
<author>
<name sortKey="Bar Joseph, Z" uniqKey="Bar Joseph Z">Z Bar-Joseph</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Schulz, Mh" uniqKey="Schulz M">MH Schulz</name>
</author>
<author>
<name sortKey="Devanny, We" uniqKey="Devanny W">WE Devanny</name>
</author>
<author>
<name sortKey="Gitter, A" uniqKey="Gitter A">A Gitter</name>
</author>
<author>
<name sortKey="Zhong, S" uniqKey="Zhong S">S Zhong</name>
</author>
<author>
<name sortKey="Ernst, J" uniqKey="Ernst J">J Ernst</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
<author>
<name sortKey="Weaver, Vm" uniqKey="Weaver V">VM Weaver</name>
</author>
<author>
<name sortKey="Lelievre, Sa" uniqKey="Lelievre S">SA Lelievre</name>
</author>
<author>
<name sortKey="Wang, F" uniqKey="Wang F">F Wang</name>
</author>
<author>
<name sortKey="Petersen, Ow" uniqKey="Petersen O">OW Petersen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
<author>
<name sortKey="Radisky, D" uniqKey="Radisky D">D Radisky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Weaver, Vm" uniqKey="Weaver V">VM Weaver</name>
</author>
<author>
<name sortKey="Petersen, Ow" uniqKey="Petersen O">OW Petersen</name>
</author>
<author>
<name sortKey="Wang, F" uniqKey="Wang F">F Wang</name>
</author>
<author>
<name sortKey="Larabell, Ca" uniqKey="Larabell C">CA Larabell</name>
</author>
<author>
<name sortKey="Briand, P" uniqKey="Briand P">P Briand</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wang, F" uniqKey="Wang F">F Wang</name>
</author>
<author>
<name sortKey="Weaver, Vm" uniqKey="Weaver V">VM Weaver</name>
</author>
<author>
<name sortKey="Petersen, Ow" uniqKey="Petersen O">OW Petersen</name>
</author>
<author>
<name sortKey="Larabell, Ca" uniqKey="Larabell C">CA Larabell</name>
</author>
<author>
<name sortKey="Dedhar, S" uniqKey="Dedhar S">S Dedhar</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Liu, H" uniqKey="Liu H">H Liu</name>
</author>
<author>
<name sortKey="Radisky, Dc" uniqKey="Radisky D">DC Radisky</name>
</author>
<author>
<name sortKey="Wang, F" uniqKey="Wang F">F Wang</name>
</author>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Briand, P" uniqKey="Briand P">P Briand</name>
</author>
<author>
<name sortKey="Petersen, Ow" uniqKey="Petersen O">OW Petersen</name>
</author>
<author>
<name sortKey="Van Deurs, B" uniqKey="Van Deurs B">B Van Deurs</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Briand, P" uniqKey="Briand P">P Briand</name>
</author>
<author>
<name sortKey="Nielsen, Kv" uniqKey="Nielsen K">KV Nielsen</name>
</author>
<author>
<name sortKey="Madsen, Mw" uniqKey="Madsen M">MW Madsen</name>
</author>
<author>
<name sortKey="Petersen, Ow" uniqKey="Petersen O">OW Petersen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Inman, Jl" uniqKey="Inman J">JL Inman</name>
</author>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kenny, Pa" uniqKey="Kenny P">PA Kenny</name>
</author>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Itoh, M" uniqKey="Itoh M">M Itoh</name>
</author>
<author>
<name sortKey="Nelson, Cm" uniqKey="Nelson C">CM Nelson</name>
</author>
<author>
<name sortKey="Myers, Ca" uniqKey="Myers C">CA Myers</name>
</author>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Becker Weimann, S" uniqKey="Becker Weimann S">S Becker-Weimann</name>
</author>
<author>
<name sortKey="Xiong, G" uniqKey="Xiong G">G Xiong</name>
</author>
<author>
<name sortKey="Furuta, S" uniqKey="Furuta S">S Furuta</name>
</author>
<author>
<name sortKey="Han, J" uniqKey="Han J">J Han</name>
</author>
<author>
<name sortKey="Kuhn, I" uniqKey="Kuhn I">I Kuhn</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Marbach, D" uniqKey="Marbach D">D Marbach</name>
</author>
<author>
<name sortKey="Costello, Jc" uniqKey="Costello J">JC Costello</name>
</author>
<author>
<name sortKey="Kuffner, R" uniqKey="Kuffner R">R Kuffner</name>
</author>
<author>
<name sortKey="Vega, Nm" uniqKey="Vega N">NM Vega</name>
</author>
<author>
<name sortKey="Prill, Rj" uniqKey="Prill R">RJ Prill</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bar Joseph, Z" uniqKey="Bar Joseph Z">Z Bar-Joseph</name>
</author>
<author>
<name sortKey="Gitter, A" uniqKey="Gitter A">A Gitter</name>
</author>
<author>
<name sortKey="Simon, I" uniqKey="Simon I">I Simon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Friedman, N" uniqKey="Friedman N">N Friedman</name>
</author>
<author>
<name sortKey="Linial, M" uniqKey="Linial M">M Linial</name>
</author>
<author>
<name sortKey="Nachman, I" uniqKey="Nachman I">I Nachman</name>
</author>
<author>
<name sortKey="Pe Er, D" uniqKey="Pe Er D">D Pe'er</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Margolin, Aa" uniqKey="Margolin A">AA Margolin</name>
</author>
<author>
<name sortKey="Nemenman, I" uniqKey="Nemenman I">I Nemenman</name>
</author>
<author>
<name sortKey="Basso, K" uniqKey="Basso K">K Basso</name>
</author>
<author>
<name sortKey="Wiggins, C" uniqKey="Wiggins C">C Wiggins</name>
</author>
<author>
<name sortKey="Stolovitzky, G" uniqKey="Stolovitzky G">G Stolovitzky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yeung, Mk" uniqKey="Yeung M">MK Yeung</name>
</author>
<author>
<name sortKey="Tegner, J" uniqKey="Tegner J">J Tegner</name>
</author>
<author>
<name sortKey="Collins, Jj" uniqKey="Collins J">JJ Collins</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ahmed, A" uniqKey="Ahmed A">A Ahmed</name>
</author>
<author>
<name sortKey="Xing, Ep" uniqKey="Xing E">EP Xing</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Song, L" uniqKey="Song L">L Song</name>
</author>
<author>
<name sortKey="Kolar, M" uniqKey="Kolar M">M Kolar</name>
</author>
<author>
<name sortKey="Xing, Ep" uniqKey="Xing E">EP Xing</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Grzegorczyk, M" uniqKey="Grzegorczyk M">M Grzegorczyk</name>
</author>
<author>
<name sortKey="Husmeier, D" uniqKey="Husmeier D">D Husmeier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Parikh, Ap" uniqKey="Parikh A">AP Parikh</name>
</author>
<author>
<name sortKey="Wu, W" uniqKey="Wu W">W Wu</name>
</author>
<author>
<name sortKey="Curtis, Re" uniqKey="Curtis R">RE Curtis</name>
</author>
<author>
<name sortKey="Xing, Ep" uniqKey="Xing E">EP Xing</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Meinshausen, N" uniqKey="Meinshausen N">N Meinshausen</name>
</author>
<author>
<name sortKey="Buhlmann, P" uniqKey="Buhlmann P">P Buhlmann</name>
</author>
<author>
<name sortKey="Zurich, E" uniqKey="Zurich E">E Zurich</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ravikumar, P" uniqKey="Ravikumar P">P Ravikumar</name>
</author>
<author>
<name sortKey="Wainwright, Mj" uniqKey="Wainwright M">MJ Wainwright</name>
</author>
<author>
<name sortKey="Lafferty, J" uniqKey="Lafferty J">J Lafferty</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kolar, M" uniqKey="Kolar M">M Kolar</name>
</author>
<author>
<name sortKey="Xing, Ep" uniqKey="Xing E">EP Xing</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kolar, M" uniqKey="Kolar M">M Kolar</name>
</author>
<author>
<name sortKey="Song, L" uniqKey="Song L">L Song</name>
</author>
<author>
<name sortKey="Xing, Ep" uniqKey="Xing E">EP Xing</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Segal, E" uniqKey="Segal E">E Segal</name>
</author>
<author>
<name sortKey="Shapira, M" uniqKey="Shapira M">M Shapira</name>
</author>
<author>
<name sortKey="Regev, A" uniqKey="Regev A">A Regev</name>
</author>
<author>
<name sortKey="Pe Er, D" uniqKey="Pe Er D">D Pe'er</name>
</author>
<author>
<name sortKey="Botstein, D" uniqKey="Botstein D">D Botstein</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Baselga, J" uniqKey="Baselga J">J Baselga</name>
</author>
<author>
<name sortKey="Albanell, J" uniqKey="Albanell J">J Albanell</name>
</author>
<author>
<name sortKey="Ruiz, A" uniqKey="Ruiz A">A Ruiz</name>
</author>
<author>
<name sortKey="Lluch, A" uniqKey="Lluch A">A Lluch</name>
</author>
<author>
<name sortKey="Gascon, P" uniqKey="Gascon P">P Gascon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nahta, R" uniqKey="Nahta R">R Nahta</name>
</author>
<author>
<name sortKey="Yu, D" uniqKey="Yu D">D Yu</name>
</author>
<author>
<name sortKey="Hung, Mc" uniqKey="Hung M">MC Hung</name>
</author>
<author>
<name sortKey="Hortobagyi, Gn" uniqKey="Hortobagyi G">GN Hortobagyi</name>
</author>
<author>
<name sortKey="Esteva, Fj" uniqKey="Esteva F">FJ Esteva</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Di Cosimo, S" uniqKey="Di Cosimo S">S Di Cosimo</name>
</author>
<author>
<name sortKey="Baselga, J" uniqKey="Baselga J">J Baselga</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Huang, S" uniqKey="Huang S">S Huang</name>
</author>
<author>
<name sortKey="Bjornsti, Ma" uniqKey="Bjornsti M">MA Bjornsti</name>
</author>
<author>
<name sortKey="Houghton, Pj" uniqKey="Houghton P">PJ Houghton</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nakayama, Ki" uniqKey="Nakayama K">KI Nakayama</name>
</author>
<author>
<name sortKey="Nakayama, K" uniqKey="Nakayama K">K Nakayama</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Coussens, Lm" uniqKey="Coussens L">LM Coussens</name>
</author>
<author>
<name sortKey="Fingleton, B" uniqKey="Fingleton B">B Fingleton</name>
</author>
<author>
<name sortKey="Matrisian, Lm" uniqKey="Matrisian L">LM Matrisian</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kessenbrock, K" uniqKey="Kessenbrock K">K Kessenbrock</name>
</author>
<author>
<name sortKey="Plaks, V" uniqKey="Plaks V">V Plaks</name>
</author>
<author>
<name sortKey="Werb, Z" uniqKey="Werb Z">Z Werb</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Van De Vijver, Mj" uniqKey="Van De Vijver M">MJ van de Vijver</name>
</author>
<author>
<name sortKey="He, Yd" uniqKey="He Y">YD He</name>
</author>
<author>
<name sortKey="Van T Veer, Lj" uniqKey="Van T Veer L">LJ van't Veer</name>
</author>
<author>
<name sortKey="Dai, H" uniqKey="Dai H">H Dai</name>
</author>
<author>
<name sortKey="Hart, Aa" uniqKey="Hart A">AA Hart</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Han, Jd" uniqKey="Han J">JD Han</name>
</author>
<author>
<name sortKey="Bertin, N" uniqKey="Bertin N">N Bertin</name>
</author>
<author>
<name sortKey="Hao, T" uniqKey="Hao T">T Hao</name>
</author>
<author>
<name sortKey="Goldberg, Ds" uniqKey="Goldberg D">DS Goldberg</name>
</author>
<author>
<name sortKey="Berriz, Gf" uniqKey="Berriz G">GF Berriz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Deng, Xa" uniqKey="Deng X">XA Deng</name>
</author>
<author>
<name sortKey="Norris, A" uniqKey="Norris A">A Norris</name>
</author>
<author>
<name sortKey="Panaviene, Z" uniqKey="Panaviene Z">Z Panaviene</name>
</author>
<author>
<name sortKey="Moncman, Cl" uniqKey="Moncman C">CL Moncman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Citri, A" uniqKey="Citri A">A Citri</name>
</author>
<author>
<name sortKey="Yarden, Y" uniqKey="Yarden Y">Y Yarden</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Revillion, F" uniqKey="Revillion F">F Revillion</name>
</author>
<author>
<name sortKey="Lhotellier, V" uniqKey="Lhotellier V">V Lhotellier</name>
</author>
<author>
<name sortKey="Hornez, L" uniqKey="Hornez L">L Hornez</name>
</author>
<author>
<name sortKey="Bonneterre, J" uniqKey="Bonneterre J">J Bonneterre</name>
</author>
<author>
<name sortKey="Peyrat, Jp" uniqKey="Peyrat J">JP Peyrat</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Miyamoto, S" uniqKey="Miyamoto S">S Miyamoto</name>
</author>
<author>
<name sortKey="Yagi, H" uniqKey="Yagi H">H Yagi</name>
</author>
<author>
<name sortKey="Yotsumoto, F" uniqKey="Yotsumoto F">F Yotsumoto</name>
</author>
<author>
<name sortKey="Horiuchi, S" uniqKey="Horiuchi S">S Horiuchi</name>
</author>
<author>
<name sortKey="Yoshizato, T" uniqKey="Yoshizato T">T Yoshizato</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Vanacova, S" uniqKey="Vanacova S">S Vanacova</name>
</author>
<author>
<name sortKey="Stefl, R" uniqKey="Stefl R">R Stefl</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jeong, H" uniqKey="Jeong H">H Jeong</name>
</author>
<author>
<name sortKey="Mason, Sp" uniqKey="Mason S">SP Mason</name>
</author>
<author>
<name sortKey="Barabasi, Al" uniqKey="Barabasi A">AL Barabasi</name>
</author>
<author>
<name sortKey="Oltvai, Zn" uniqKey="Oltvai Z">ZN Oltvai</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yu, H" uniqKey="Yu H">H Yu</name>
</author>
<author>
<name sortKey="Greenbaum, D" uniqKey="Greenbaum D">D Greenbaum</name>
</author>
<author>
<name sortKey="Xin Lu, H" uniqKey="Xin Lu H">H Xin Lu</name>
</author>
<author>
<name sortKey="Zhu, X" uniqKey="Zhu X">X Zhu</name>
</author>
<author>
<name sortKey="Gerstein, M" uniqKey="Gerstein M">M Gerstein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Petersen, Ow" uniqKey="Petersen O">OW Petersen</name>
</author>
<author>
<name sortKey="Ronnov Jessen, L" uniqKey="Ronnov Jessen L">L Ronnov-Jessen</name>
</author>
<author>
<name sortKey="Howlett, Ar" uniqKey="Howlett A">AR Howlett</name>
</author>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Beliveau, A" uniqKey="Beliveau A">A Beliveau</name>
</author>
<author>
<name sortKey="Mott, Jd" uniqKey="Mott J">JD Mott</name>
</author>
<author>
<name sortKey="Lo, A" uniqKey="Lo A">A Lo</name>
</author>
<author>
<name sortKey="Chen, Ei" uniqKey="Chen E">EI Chen</name>
</author>
<author>
<name sortKey="Koller, Aa" uniqKey="Koller A">AA Koller</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lee, Gy" uniqKey="Lee G">GY Lee</name>
</author>
<author>
<name sortKey="Kenny, Pa" uniqKey="Kenny P">PA Kenny</name>
</author>
<author>
<name sortKey="Lee, Eh" uniqKey="Lee E">EH Lee</name>
</author>
<author>
<name sortKey="Bissell, Mj" uniqKey="Bissell M">MJ Bissell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fournier, Mv" uniqKey="Fournier M">MV Fournier</name>
</author>
<author>
<name sortKey="Martin, Kj" uniqKey="Martin K">KJ Martin</name>
</author>
<author>
<name sortKey="Kenny, Pa" uniqKey="Kenny P">PA Kenny</name>
</author>
<author>
<name sortKey="Xhaja, K" uniqKey="Xhaja K">K Xhaja</name>
</author>
<author>
<name sortKey="Bosch, I" uniqKey="Bosch I">I Bosch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bolstad, Bm" uniqKey="Bolstad B">BM Bolstad</name>
</author>
<author>
<name sortKey="Irizarry, Ra" uniqKey="Irizarry R">RA Irizarry</name>
</author>
<author>
<name sortKey="Astrand, M" uniqKey="Astrand M">M Astrand</name>
</author>
<author>
<name sortKey="Speed, Tp" uniqKey="Speed T">TP Speed</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Irizarry, Ra" uniqKey="Irizarry R">RA Irizarry</name>
</author>
<author>
<name sortKey="Hobbs, B" uniqKey="Hobbs B">B Hobbs</name>
</author>
<author>
<name sortKey="Collin, F" uniqKey="Collin F">F Collin</name>
</author>
<author>
<name sortKey="Beazer Barclay, Yd" uniqKey="Beazer Barclay Y">YD Beazer-Barclay</name>
</author>
<author>
<name sortKey="Antonellis, Kj" uniqKey="Antonellis K">KJ Antonellis</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhou, X" uniqKey="Zhou X">X Zhou</name>
</author>
<author>
<name sortKey="Wu, W" uniqKey="Wu W">W Wu</name>
</author>
<author>
<name sortKey="Hu, H" uniqKey="Hu H">H Hu</name>
</author>
<author>
<name sortKey="Milosevic, J" uniqKey="Milosevic J">J Milosevic</name>
</author>
<author>
<name sortKey="Konishi, K" uniqKey="Konishi K">K Konishi</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tibshirani, R" uniqKey="Tibshirani R">R Tibshirani</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kolar, M" uniqKey="Kolar M">M Kolar</name>
</author>
<author>
<name sortKey="Xing, Ep" uniqKey="Xing E">EP Xing</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Liu, H" uniqKey="Liu H">H Liu</name>
</author>
<author>
<name sortKey="Roeder, K" uniqKey="Roeder K">K Roeder</name>
</author>
<author>
<name sortKey="Wasserman, L" uniqKey="Wasserman L">L Wasserman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Benjamini, Y" uniqKey="Benjamini Y">Y Benjamini</name>
</author>
<author>
<name sortKey="Hochberg, Y" uniqKey="Hochberg Y">Y Hochberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Beissbarth, T" uniqKey="Beissbarth T">T Beissbarth</name>
</author>
<author>
<name sortKey="Speed, Tp" uniqKey="Speed T">TP Speed</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">PLoS Comput Biol</journal-id>
<journal-id journal-id-type="iso-abbrev">PLoS Comput. Biol</journal-id>
<journal-id journal-id-type="publisher-id">plos</journal-id>
<journal-id journal-id-type="pmc">ploscomp</journal-id>
<journal-title-group>
<journal-title>PLoS Computational Biology</journal-title>
</journal-title-group>
<issn pub-type="ppub">1553-734X</issn>
<issn pub-type="epub">1553-7358</issn>
<publisher>
<publisher-name>Public Library of Science</publisher-name>
<publisher-loc>San Francisco, USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">25057922</article-id>
<article-id pub-id-type="pmc">4109850</article-id>
<article-id pub-id-type="publisher-id">PCOMPBIOL-D-13-01348</article-id>
<article-id pub-id-type="doi">10.1371/journal.pcbi.1003713</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research Article</subject>
</subj-group>
<subj-group subj-group-type="Discipline-v2">
<subject>Biology and Life Sciences</subject>
<subj-group>
<subject>Cell Biology</subject>
<subj-group>
<subject>Molecular Cell Biology</subject>
</subj-group>
</subj-group>
<subj-group>
<subject>Computational Biology</subject>
</subj-group>
<subj-group>
<subject>Systems Biology</subject>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v2">
<subject>Computer and Information Sciences</subject>
<subj-group>
<subject>Computer Applications</subject>
</subj-group>
<subj-group>
<subject>Computer Modeling</subject>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v2">
<subject>Physical Sciences</subject>
<subj-group>
<subject>Mathematics</subject>
<subj-group>
<subject>Applied Mathematics</subject>
<subj-group>
<subject>Algorithms</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Network Analysis of Breast Cancer Progression and Reversal Using a Tree-Evolving Network Algorithm</article-title>
<alt-title alt-title-type="running-head">Network Analysis of Breast Cancer Cells</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Parikh</surname>
<given-names>Ankur P.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Curtis</surname>
<given-names>Ross E.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kuhn</surname>
<given-names>Irene</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Becker-Weimann</surname>
<given-names>Sabine</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bissell</surname>
<given-names>Mina</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Xing</surname>
<given-names>Eric P.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
<xref ref-type="corresp" rid="cor1">
<sup>*</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wu</surname>
<given-names>Wei</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="cor1">
<sup>*</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<label>1</label>
<addr-line>Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America</addr-line>
</aff>
<aff id="aff4">
<label>4</label>
<addr-line>Joint Carnegie Mellon University-University of Pittsburgh PhD Program in Computational Biology, Pittsburgh, Pennsylvania, United States of America</addr-line>
</aff>
<contrib-group>
<contrib contrib-type="editor">
<name>
<surname>Leslie</surname>
<given-names>Christina</given-names>
</name>
<role>Editor</role>
<xref ref-type="aff" rid="edit1"></xref>
</contrib>
</contrib-group>
<aff id="edit1">
<addr-line>Memorial Sloan-Kettering Cancer Center, United States of America</addr-line>
</aff>
<author-notes>
<corresp id="cor1">* E-mail:
<email>epxing@cs.cmu.edu</email>
(EPX);
<email>weiwu2@cs.cmu.edu</email>
(WW)</corresp>
<fn fn-type="conflict">
<p>The authors have declared that no competing interests exist.</p>
</fn>
<fn fn-type="con">
<p>Conceived and designed the experiments: APP EPX WW. Performed the experiments: APP WW. Analyzed the data: APP WW. Contributed reagents/materials/analysis tools: REC IK SBW MB EPX. Wrote the paper: APP EPX WW. Edited the manuscript: IK SBW MB.</p>
</fn>
</author-notes>
<pub-date pub-type="collection">
<month>7</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="epub">
<day>24</day>
<month>7</month>
<year>2014</year>
</pub-date>
<volume>10</volume>
<issue>7</issue>
<elocation-id>e1003713</elocation-id>
<history>
<date date-type="received">
<day>29</day>
<month>7</month>
<year>2013</year>
</date>
<date date-type="accepted">
<day>10</day>
<month>4</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>© 2014 Parikh et al</copyright-statement>
<copyright-year>2014</copyright-year>
<copyright-holder>Parikh et al</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.</license-p>
</license>
</permissions>
<abstract>
<p>The HMT3522 progression series of human breast cells have been used to discover how tissue architecture, microenvironment and signaling molecules affect breast cell growth and behaviors. However, much remains to be elucidated about malignant and phenotypic reversion behaviors of the HMT3522-T4-2 cells of this series. We employed a “pan-cell-state” strategy, and analyzed jointly microarray profiles obtained from different state-specific cell populations from this progression and reversion model of the breast cells using a tree-lineage multi-network inference algorithm,
<italic>Treegl</italic>
. We found that different breast cell states contain distinct gene networks. The network specific to non-malignant HMT3522-S1 cells is dominated by genes involved in normal processes, whereas the T4-2-specific network is enriched with cancer-related genes. The networks specific to various conditions of the reverted T4-2 cells are enriched with pathways suggestive of compensatory effects, consistent with clinical data showing patient resistance to anticancer drugs. We validated the findings using an external dataset, and showed that aberrant expression values of certain hubs in the identified networks are associated with poor clinical outcomes. Thus, analysis of various reversion conditions (including non-reverted) of HMT3522 cells using
<italic>Treegl</italic>
can be a good model system to study drug effects on breast cancer.</p>
</abstract>
<abstract abstract-type="summary">
<title>Author Summary</title>
<p>The HMT3522 isogenic human breast cancer progression series has been used to study the effect of various drugs on the reversion of the breast cancer cells. Despite significant efforts to delineate key signaling events responsible for phenotypic reversion of the malignant HMT3522-T4-2 (T4-2) breast cells in this series, many questions remain. For example, what is involved in the phenotypic reversion of T4-2 cells at the systems level? In order to answer this question, we analyzed gene expression microarray data obtained from these cells using our recently developed tree-evolving network inference algorithm
<italic>Treegl</italic>
. We reconstructed cell-state-specific gene networks using
<italic>Treegl</italic>
. Our functional analysis results show that we can not only unravel cell-state specific information characteristic of non-malignant HMT3522-S1 (S1) and malignant T4-2 cells in the series, but can also provide insight into the T4-2 cells reverted by various agents. We found that the networks specific to various conditions of the T4-2 reverted cells are all suggestive of compensatory signaling effects, which, however, are mediated by different signaling pathways to antagonize different drug effects in the reverted cells. Our results demonstrate that the HMT3522 system when analyzed with
<italic>Treegl</italic>
may potentially become an effective tool for novel drug-target discovery and identification.</p>
</abstract>
<funding-group>
<funding-statement>This research was made possible by grants NIH R01 GM093156-01 to EPX and WW, NSF DBI-0546594, IIS-0713379, and an Alfred P. Sloan Fellowship to EPX. APP is partly supported by an NSF Graduate Research Fellowship (under Grant No. 0750271). MB, IK and SBW are supported by NIH R37 CA064786. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</funding-statement>
</funding-group>
<counts>
<page-count count="18"></page-count>
</counts>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>A major challenge in systems biology is to uncover dynamic changes in cellular pathways that either respond to the changing microenvironment of cells, or drive cellular transformation during various biological processes such as cell cycle, differentiation, and development. These changes may involve rewiring of transcriptional regulatory circuitry or signal transduction pathways that control cellular behaviors. Such information is of particular importance for seeking a deep mechanistic understanding of cellular responses to drug treatments in various diseases, offering a more holistic view of both microscopic and macroscopic changes in the cellular functional machinery than has been available from traditional analyses which usually focus only on finding differential markers or close-up analysis of changes in a handful of molecules constituting parts of some selected pathways of interest.</p>
<p>Network-based differential analysis naturally requires the availability of multiple networks each in principle corresponding to a specific biological condition in question, that are then topologically rewired across conditions
<xref rid="pcbi.1003713-Luscombe1" ref-type="bibr">[1]</xref>
. However, most existing computational techniques for reconstructing molecular networks based on high-throughput data cannot capture such dynamic aspects of the network topology; instead, they represent the networks as an invariant graph. For example, it is common to infer a single invariant gene network using microarray data obtained from samples collected over time or multiple conditions. More sophisticated methods such as a trace-back algorithm
<xref rid="pcbi.1003713-Luscombe1" ref-type="bibr">[1]</xref>
and DREM
<xref rid="pcbi.1003713-Ernst1" ref-type="bibr">[2]</xref>
,
<xref rid="pcbi.1003713-Schulz1" ref-type="bibr">[3]</xref>
do emphasize uncovering the dynamic changes of a network over time using time series data, but limitations in these algorithms allow only certain kinds of dynamic behaviors, such as “active path”
<xref rid="pcbi.1003713-Luscombe1" ref-type="bibr">[1]</xref>
or bifurcating sequence of transcriptional activations
<xref rid="pcbi.1003713-Ernst1" ref-type="bibr">[2]</xref>
. Moreover, such methods are heuristic in nature and do not offer statistical guarantees on the asymptotic correctness of the inferred “transient” components in the network, making the results difficult to withstand the harsh standard on stability and robustness when sample quality and size become less ideal, as we face in the analysis to be conducted in this paper.</p>
<p>Indeed, a number of in-depth investigations of disease models have suggested that over the course of cellular transformation in response to microenvironmental changes due to disease progression or drug-induced reversion, there may exist multiple underlying “themes” that determine each molecule's function and relationship with other molecules
<xref rid="pcbi.1003713-Bissell1" ref-type="bibr">[4]</xref>
,
<xref rid="pcbi.1003713-Bissell2" ref-type="bibr">[5]</xref>
. As a result, molecular networks at each cellular stage are context-dependent and can undergo systematic rewiring (
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
). For example, strong evidence of alterations of various pathways have been reported in the HMT3522 progression series of breast cells when malignant T4-2 cells were phenotypically reverted by various drugs, albeit only manifested by a small number of well-known signaling molecules as discussed below
<xref rid="pcbi.1003713-Weaver1" ref-type="bibr">[6]</xref>
<xref rid="pcbi.1003713-Liu1" ref-type="bibr">[8]</xref>
.</p>
<fig id="pcbi-1003713-g001" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pcbi.1003713.g001</object-id>
<label>Figure 1</label>
<caption>
<title>A schematic representation of the relationship of the non-reverted and various conditions of the reverted HMT3522 breast cells.</title>
<p>The nonmalignant S1 cell is the root of the tree. It is also the parent of the malignant T4-2 cell since T4-2 cells were derived from S1. The T4-2 cells can be reverted to phenotypically normal-looking structures by treatment with various agents, such as: i) either EGFR or β1-integrin inhibitor, ii) either PI3K or MAPKK inhibitor, or iii) MMP inhibitors, they are thus represented as the parent of the various conditions of the reverted T4-2 cells. Microarray profiles were generated from each cell state represented in the tree, and gene networks specific to each state were reverse engineered using
<italic>Treegl</italic>
.</p>
</caption>
<graphic xlink:href="pcbi.1003713.g001"></graphic>
</fig>
<p>In this paper, we conduct an in-depth study of the structural changes in the gene regulatory networks underlying each cell state in both the non-reverted and the reverted HMT3522 progression series of breast cells. The HMT3522 cells have been shown to be an excellent model system for studying the roles of tissue architecture, microenvironment and signaling molecules involved in the nonmalignant and malignant growth and behaviors of breast cells, including the potential of various factors to cause phenotypic reversion of malignant cells to nonmalignant states. These cells originated from a nonmalignant human breast epithelial sample, HMT3522
<xref rid="pcbi.1003713-Briand1" ref-type="bibr">[9]</xref>
,
<xref rid="pcbi.1003713-Briand2" ref-type="bibr">[10]</xref>
. HMT3522-S1_LBNL (S1) cells are from early passages which are nonmalignant and dependent on exogenous epidermal growth factor (EGF) to grow. HMT3522-T4-2_LBNL (T4-2) cells were generated from S1 cells by a multi-step process: 238 passages in medium without EGF followed by transplantation into a mouse which generated a tumor, and T4-2 cells were isolated from the serial passage of this tumor; thus T4-2 cells are malignant and tumorigenic
<xref rid="pcbi.1003713-Briand2" ref-type="bibr">[10]</xref>
. Interestingly, when cultured in three-dimensional (3D) laminin-rich extracellular matrices (lrECM), S1 cells form polarized acinus structures with a central lumen which resemble the terminal milk-secreting alveolar units in normal breasts
<xref rid="pcbi.1003713-Weaver1" ref-type="bibr">[6]</xref>
,
<xref rid="pcbi.1003713-Inman1" ref-type="bibr">[11]</xref>
, whereas T4-2 cells form disorganized structures under the same conditions. Signaling molecules such as EGFR, β1-integrin, PI3K, and MAPK are overexpressed in T4-2 cells relative to their levels in S1. Crosstalk between these molecules plays pivotal roles in defining malignant behaviors of T4-2 cells, and downmodulation of them causes phenotypic reversion of T4-2 cells into growth-arrested, normal-looking cells (also called T4R cells later) which form structures resembling S1 acini but often without the lumen
<xref rid="pcbi.1003713-Liu1" ref-type="bibr">[8]</xref>
. Other molecules, such as TACE or Rap1, have also been shown to be important for reversion of T4-2 cells
<xref rid="pcbi.1003713-Kenny1" ref-type="bibr">[12]</xref>
,
<xref rid="pcbi.1003713-Itoh1" ref-type="bibr">[13]</xref>
. NFkappaB was identified as one of the transcriptional regulators involved in disorganization of T4-2 cells
<xref rid="pcbi.1003713-BeckerWeimann1" ref-type="bibr">[14]</xref>
. Despite significant efforts to delineate key signaling events responsible for phenotypic reversion of these malignant breast cells, many questions remain. For example, are T4-2 cells reverted by inhibitors of different molecules intrinsically the same? What is involved in the phenotypic reversion of T4-2 cells at the systems level other than a few genes directly related to the signaling molecules mentioned above?</p>
<p>One classical approach to address these questions is to identify genes differentially expressed between different cell states. While this can lead to some information about marginal effects of the genes in a particular stage of cancer progression or reversion, it cannot yield insight into the underlying regulatory mechanisms that govern interaction of genes with one another to carry out complex cellular processes. Instead, we propose a network-based differential analysis, by reverse engineering gene regulatory networks of various conditions of the breast cells to depict a fuller picture of regulatory mechanisms of the cells.</p>
<p>Many methods, as reviewed in
<xref rid="pcbi.1003713-Marbach1" ref-type="bibr">[15]</xref>
,
<xref rid="pcbi.1003713-BarJoseph1" ref-type="bibr">[16]</xref>
, have been proposed for reconstructing gene networks using gene expression microarray data. Most of them
<xref rid="pcbi.1003713-Friedman1" ref-type="bibr">[17]</xref>
<xref rid="pcbi.1003713-Yeung1" ref-type="bibr">[19]</xref>
, however, rely on the statistical assumption that the samples in question were independent and identically distributed (
<italic>i.i.d</italic>
), and thus they either lead to estimation of a single network by pooling data from all the samples together, or lead to estimation of a network for each cell state independently. Since the breast cells in this study came from non-reverted HMT3522 cells as well as various conditions of the reverted cells, the regulatory mechanisms in different cell states can be significantly different; therefore, pooling data from different cell states together to estimate one single network does not reveal networks in their full depth. On the other hand, reconstructing a network specific to each cell state independently of the other ones can be statistically inaccurate due to a small sample size for each cell state. Recently, time-varying network detection methods have been proposed that allow information sharing across time and can thus recover a sequence of networks even with small sample sizes
<xref rid="pcbi.1003713-Ahmed1" ref-type="bibr">[20]</xref>
<xref rid="pcbi.1003713-Grzegorczyk1" ref-type="bibr">[23]</xref>
. For example, Song
<italic>et al.</italic>
proposed a time-varying dynamic Bayesian network method to estimate a chain of evolving networks over time
<xref rid="pcbi.1003713-Song2" ref-type="bibr">[22]</xref>
. However, these methods estimate networks that evolve as a chain of networks over time, not as a series of networks shared by the tree-shaped phenotypic relationships as shown in
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
.</p>
<p>Due to the unique challenges we encountered to reconstruct networks that rewire over the tree-shaped phenotypic relationships, we recently proposed
<italic>Treegl</italic>
<xref rid="pcbi.1003713-Parikh1" ref-type="bibr">[24]</xref>
, a network reconstruction algorithm that can effectively and jointly recover rewiring regulatory networks present in multiple related cell states. Our approach can not only recover a distinct network for each cell state and reveal sharp differences among networks for different cell states, but also capture and leverage similarities of the networks in the cell states nearby in the phenotypic tree, thereby leading to more accurate estimation of gene interactions in small sample size scenarios. This new angle of estimating networks can reveal information that has not been mined in traditional analysis.</p>
<p>In this paper, we conduct an extensive network analysis of non-reverted HMT3522 cells (normal S1 and malignant T4-2 cells) as well as three different conditions of reverted T4-2 cells using gene expression microarray data obtained from these cells. It is notable that the same set of the gene expression data was first described and used in our previous work published in
<xref rid="pcbi.1003713-Parikh1" ref-type="bibr">[24]</xref>
, however, our focus then was to report the novel methodology behind the
<italic>Treegl</italic>
algorithm, but not a thorough biological analysis of the HMT3522 series of cells from which the gene expression data was generated. In this current work, we focus more on the biological findings discovered by a more comprehensive network analysis of the data using
<italic>Treegl</italic>
and other bioinformatics tools, and aim to provide better biological insights and understandings of the various breast cell states in the HMT3522 series.</p>
<p>In particular, we estimated the network specific to each cell state using
<italic>Treegl</italic>
. Our results showed that while the S1-specific network contains predominantly nonmalignant pathways, the T4-specific network contains various cancer-related pathways, both findings consistent with biological evidence
<xref rid="pcbi.1003713-Bissell1" ref-type="bibr">[4]</xref>
. Furthermore, we found that the networks specific to various conditions of the T4-2 reverted cells are enriched with pathways suggestive of compensatory effects. In the T4-2 cells reverted by inhibition of either EGFR or β1-integrin, signaling pathways downstream of EGFR or β1-integrin, mainly via the PI3K-AKT-mTOR axis, are upregulated. Similarly, in the T4-2 cells reverted by either PI3K or MAPKK, we observed upregulation of the pathways both upstream and downstream of PI3K. These results are supported by clinical evidence showing patient resistance to the same anti-breast cancer drugs as we used in the study. Moreover, the compensatory signaling is also observed in the differential network of the T4-2 cells reverted by MMPIs, which involves genes participating in protein catabolic processes. Together, our findings suggest a common resistance mechanism employed by breast cancer cells to antagonize drug effects. Finally, in order to identify potential novel drug targets, we also investigated hubs (i.e., genes with high degrees, see details in
<xref ref-type="sec" rid="s4">Materials and Methods</xref>
) in the differential networks of the breast cells, and characterized specifically three hubs (NEBL, HBEGF, and PAPD7) whose aberrant expression values are linked with the worst survival outcomes in the breast cancer patients to provide insight into their functional significance on the growth and development of breast cancer cells. Our data suggest that
<italic>Treegl</italic>
when applied to an effective disease model system, such as the HMT3522 cells, may potentially become an effective tool for elucidating disease mechanism and discovering novel drug targets, and thus help make personalized medicine possible.</p>
</sec>
<sec id="s2">
<title>Results</title>
<p>We model a gene network as a Markov network
<xref rid="pcbi.1003713-Koller1" ref-type="bibr">[25]</xref>
, which is a graph
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e001.jpg"></inline-graphic>
</inline-formula>
where
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e002.jpg"></inline-graphic>
</inline-formula>
is the set of vertices (genes), and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e003.jpg"></inline-graphic>
</inline-formula>
is the set of edges. Genes
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e004.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e005.jpg"></inline-graphic>
</inline-formula>
do
<italic>not</italic>
have an edge between them if and only if they are conditionally independent given the values of all other genes. We contrast this with a correlation network (a common approach for modeling gene networks), in which
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e006.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e007.jpg"></inline-graphic>
</inline-formula>
are connected if their marginal pairwise correlation is greater than a certain threshold. Correlation can be effective when analyzing a pair of genes in isolation. However, when studying the dependence between two genes in the
<italic>context</italic>
of other genes, correlation can confound direct/indirect relationships, thus producing undesirable results (
<xref ref-type="fig" rid="pcbi-1003713-g002">Figure 2</xref>
).</p>
<fig id="pcbi-1003713-g002" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pcbi.1003713.g002</object-id>
<label>Figure 2</label>
<caption>
<title>An example demonstrating the difference between a Markov network and a correlation network.</title>
<p>(A) A true network, in which
<bold>
<italic>R</italic>
</bold>
is the regulator of
<bold>
<italic>A</italic>
</bold>
,
<bold>
<italic>B</italic>
</bold>
, and
<bold>
<italic>C</italic>
</bold>
. (B) A clique graph produced by a correlation network. Since all the genes in (A) are correlated with one another, the correlation network cannot distinguish between indirect and direct relationships and thus connects all the nodes. (C) A correct graph recovered by a Markov network. The Markov network recovers true relationship of the nodes because it uses conditional independence to determine the presence of an edge.</p>
</caption>
<graphic xlink:href="pcbi.1003713.g002"></graphic>
</fig>
<p>Specifically, we model the gene network for each cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e008.jpg"></inline-graphic>
</inline-formula>
as a
<italic>Gaussian</italic>
Markov network. Gaussian distributions are special in that the inverse of the covariance matrix, called the precision matrix
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e009.jpg"></inline-graphic>
</inline-formula>
, completely encodes the structure of the Markov network. In particular, an edge
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e010.jpg"></inline-graphic>
</inline-formula>
exists in the Markov network if and only if the corresponding precision matrix element is non-zero.</p>
<p>Thus, under our model, the problem of learning the structure of a gene network reduces to estimating
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e011.jpg"></inline-graphic>
</inline-formula>
. This is straightforward when the number of genes
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e012.jpg"></inline-graphic>
</inline-formula>
is smaller than that of microarray samples
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e013.jpg"></inline-graphic>
</inline-formula>
: one can compute the sample covariance matrix and simply get its inverse. However, in high dimensional settings, when
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e014.jpg"></inline-graphic>
</inline-formula>
(as is our case), the sample covariance matrix is not invertible, and thus the problem becomes substantially more challenging. A statistically principled solution is the graphical lasso
<xref rid="pcbi.1003713-Meinshausen1" ref-type="bibr">[26]</xref>
, which estimates the neighborhood of gene
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e015.jpg"></inline-graphic>
</inline-formula>
by using regularized regression. The regression coefficients can be interpreted as estimates of the precision matrix elements up to a proportionality constant (see
<xref ref-type="sec" rid="s4">Materials and Methods</xref>
for more details). After the neighborhood of each gene is estimated independently, the results are combined to form a network.</p>
<sec id="s2a">
<title>Tree-evolving network detection algorithm,
<italic>Treegl</italic>
</title>
<p>However, our goal is not estimate a single network, but rather a collection of networks, one for each cell state. One simple solution is to estimate the network for each state independently of the others using the graphical lasso. However, this approach can result in poor quality of the networks due to the small sample size per cell state. To overcome this challenge, our recently proposed algorithm,
<italic>Treegl</italic>
<xref rid="pcbi.1003713-Parikh1" ref-type="bibr">[24]</xref>
, utilizes the following strategy. Similar to the graphical lasso
<xref rid="pcbi.1003713-Meinshausen1" ref-type="bibr">[26]</xref>
,
<xref rid="pcbi.1003713-Ravikumar1" ref-type="bibr">[27]</xref>
,
<italic>Treegl</italic>
estimates the neighborhood of each gene independently of those of other genes using regularized regression. However, unlike previous methods learning only a single network,
<italic>Treegl</italic>
simultaneously estimates neighborhoods of a gene in multiple networks each corresponding to a unique state in the phenotypic tree of the breast cells. It is unique in that
<italic>Treegl</italic>
makes use of a total variation regularizer based on the progression and reversal relationships between pairs of cell states in question to bias the amount of topological differences between networks underlying the related states, and to allow information regarding probabilistic independencies between genes to propagate across all states either directly or indirectly related by phenotypes. Such a strategy can lead to highly statistically confident estimation of a gene Markov network
<xref rid="pcbi.1003713-Kolar1" ref-type="bibr">[28]</xref>
, even under small sample size scenarios. In the
<xref ref-type="sec" rid="s4">Materials and Methods</xref>
section, we will offer details of a novel statistical regularization technique that makes this possible.</p>
<p>From the theoretical standpoint,
<italic>Treegl</italic>
is an instance of the general varying coefficient varying structure (VCVS) formalism analyzed in
<xref rid="pcbi.1003713-Kolar2" ref-type="bibr">[29]</xref>
. The VCVS model encodes changing structures of gene networks in different cell states as a function of regression coefficients in regularized regression problems. Estimating these regression coefficients, and thus the associated network structures then reduces to solving a convex optimization problem jointly over all cell states. The global optimal solution for such a problem can be found using standard convex solvers.</p>
<p>Moreover, the VCVS formalism allows one to theoretically examine and prove the statistical conditions under which changes in structures can be correctly estimated even in the high dimensional setting when
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e016.jpg"></inline-graphic>
</inline-formula>
. This distinguishes our approach from other methods
<xref rid="pcbi.1003713-Friedman1" ref-type="bibr">[17]</xref>
,
<xref rid="pcbi.1003713-Segal1" ref-type="bibr">[30]</xref>
that are highly non-convex and therefore rely on local search heuristics that only find local optima. These existing methods also do not offer sound statistical machinery for addressing difficult conditions such as nonstationarity (e.g., time-evolving) and high-dimensionality under small sample size as we encountered in our study.</p>
<p>Having a theoretical framework allows us to trade-off model expressivity and learning complexity in a principled manner. For example, if we allow a complex and arbitrary network model (i.e., a dense network), then there would be no guarantees on the quality of the recovered network structure in small-sample size scenarios. Instead, by enforcing a restricted model (i.e., a sparse network), its likelihood function is by definition convex and an optimal solution may be found. Thus, the quality of the resulting solution can be theoretically characterized, and it can be determined under which conditions the correct underlying parameters (network structure in this case) are discovered. Fortunately, sparsity is also biologically justifiable. For example, it is common to find a transcription factor regulating a limited number of genes under specific conditions
<xref rid="pcbi.1003713-Davidson1" ref-type="bibr">[31]</xref>
.</p>
</sec>
<sec id="s2b">
<title>Simulation results</title>
<p>We first evaluate
<italic>Treegl</italic>
's performance on simulated microarray data. In order to find out how effectively
<italic>Treegl</italic>
can detect change points of multiple networks while sharing information among related cell states at the same time, we design the simulated networks as illustrated in
<xref ref-type="supplementary-material" rid="pcbi.1003713.s001">Figure S1</xref>
. In particular, for each experiment, an artificial collection of 70 networks related by a tree-shaped lineage are generated, in which a sequence of 10 identical networks is connected to a network of different topology via a change/branching point (see details in
<xref ref-type="sec" rid="s4">Materials and Methods</xref>
). Then, a small number of samples are generated from each of the networks. It is important to note that
<italic>Treegl</italic>
does not know a priori which of the networks are identical and which are different and thus has to discover this based on the samples.</p>
<p>In order to evaluate how well
<italic>Treegl</italic>
can recover the underlying network structures for the samples in the simulation data, we compare
<italic>Treegl</italic>
with the static method estimating a single network and the method estimating each network independently by plotting the precision-recall curves which show the recall for different values of precision based on the network estimated by the three methods. As illustrated in
<xref ref-type="fig" rid="pcbi-1003713-g003">Figures 3</xref>
&
<xref ref-type="supplementary-material" rid="pcbi.1003713.s002">S2</xref>
,
<italic>Treegl</italic>
performs favorably to the other two methods. It should also be noted that compared to the static method which produces only one network,
<italic>Treegl</italic>
can produce different networks related by the tree lineage. The independent method also produces different networks but it performs poorly compared to
<italic>Treegl</italic>
.</p>
<fig id="pcbi-1003713-g003" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pcbi.1003713.g003</object-id>
<label>Figure 3</label>
<caption>
<title>Simulation results comparing the performance of
<italic>Treegl</italic>
to a method estimating a single static network and a method estimating each network independently.</title>
<p>In all cases, 70 networks were generated that are related by a tree lineage (See
<xref ref-type="sec" rid="s4">Materials and Methods</xref>
for details). (A) Each network has 30 nodes and 5 samples. (B) Each network has 30 nodes and 10 samples. (C) Each network has 50 nodes and 5 samples. (D) Each network has 50 nodes and 10 samples. In all cases,
<italic>Treegl</italic>
(shown in blue) performs favorably to the method estimating a single static network (red) or the method estimating each network independently (green).</p>
</caption>
<graphic xlink:href="pcbi.1003713.g003"></graphic>
</fig>
</sec>
<sec id="s2c">
<title>A phenotypic tree representation of the HMT3522 series of the breast cells</title>
<p>In order to reverse engineer gene networks of the breast cells, we first used a phenotypic tree to represent the relationships of the cells (
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
). Due to the small sample size of the microarray data and imbalance of the sample abundance for different cell states (see
<xref ref-type="sec" rid="s4">Materials and Methods</xref>
for details) — both of the problems pose significant challenges to network reconstruction — we used what is known of the interrelatedness of signaling pathways affecting phenotypic reversion to pool data derived from various samples in order to increase the power of the network inference. In particular, since EGFR and β1-integrin are cross-modulated in the HMT3522 cells
<xref rid="pcbi.1003713-Liu1" ref-type="bibr">[8]</xref>
, we assumed that the gene networks in the T4-2 cells reverted by inhibiting either of the molecules share reasonable similarity, and hence we grouped data from these reverted cells together to form the EGFR/ITGB1-T4R group. Likewise, we grouped together data from T4-2 cells reverted by either a PI3K inhibitor, a MAPK inhibitor, or a dominant-negative Rap1 to form the PI3K/MAPKK-T4R group, because PI3K and MAPK are also cross-modulated in the breast cells and Rap1 signals through PI3K. A tree diagram illustrating the relationships of the cells are shown in
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
. Based on these relationships, we reverse engineered gene networks for the HMT3522 cells using
<italic>Treegl</italic>
.</p>
<p>It is important to point out that it would be nearly statistically impossible to reconstruct a cell-state-specific gene network using existing methodology based on three microarray samples per group as in the dataset we used here. Note that we will also refer to different groups of the cells in the phenotypic tree in
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
as different cell conditions or states.</p>
</sec>
<sec id="s2d">
<title>The reconstructed gene networks for the HMT3522 cells</title>
<p>The reconstructed networks for non-reverted and various conditions of the reverted HMT3522 cells are illustrated in
<xref ref-type="supplementary-material" rid="pcbi.1003713.s003">Figure S3</xref>
. They share many topological similarities as well as differences. About 60% of the network edges are common to all cell conditions represented in the phenotypic tree diagram, consistent with underlying biological similarities shared between them. In the following, we concentrate on only the edges specific to each cell state, which we call the
<italic>differential network</italic>
for each cell state.</p>
<sec id="s2d1">
<title>S1 differential network</title>
<p>Our pathway and GO analysis showed that genes in the S1 differential network are significantly enriched with those involved in normal cellular processes, such as cell cycle, TCA cycle, and cellular respiration (
<xref ref-type="fig" rid="pcbi-1003713-g004">Figure 4A</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Tables S1A</xref>
&
<xref ref-type="supplementary-material" rid="pcbi.1003713.s009">S2A</xref>
). Notably, genes involved in tube lumen formation are enriched only in S1 cells (
<xref ref-type="supplementary-material" rid="pcbi.1003713.s009">Table S2A</xref>
) while absent in the other cell states, consistent with the observation that a central lumen is always present in the acinus structure formed by S1 cells but often absent from spheres formed by reverted T4-2 cells. Furthermore, our disease relevance analysis did not find association of the genes in the S1 differential network with any disease (
<xref ref-type="supplementary-material" rid="pcbi.1003713.s010">Table S3A</xref>
). Together, these results agree with the biological fact that S1 cells are nonmalignant.</p>
<fig id="pcbi-1003713-g004" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pcbi.1003713.g004</object-id>
<label>Figure 4</label>
<caption>
<title>Illustration of selected enriched pathways in the differential network of each breast cell state.</title>
<p>(A) S1; (B) T4; (C) the EGFR/ITGB1-T4R group; (D) the PI3K/MAPKK-T4R group; and (E) the MMP-T4R group. In each plot, the differential network for the corresponding cell state is shown. Nodes represent genes; edges represent interaction of the genes. Selected pathways enriched in the network (unadjusted p-values<0.05) are color-coded and shown on the right; genes participating in the selected pathways are also colored based on the color code for the corresponding pathway. An asterisk indicates a pathway that is significantly enriched in the corresponding differential network. The pathway names are shortened to save space. See
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Table S1</xref>
for detailed information about the enriched pathways in each cell state.</p>
</caption>
<graphic xlink:href="pcbi.1003713.g004"></graphic>
</fig>
</sec>
<sec id="s2d2">
<title>T4-2 differential network</title>
<p>The T4-2 differential network, on the other hand, is significantly enriched with genes involved in a number of pathways important for tumor growth and progression, such as ErbB and MAPK signaling pathways, ECM-receptor interaction, and regulation of actin cytoskeleton pathways (
<xref ref-type="fig" rid="pcbi-1003713-g004">Figure 4B</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Table S1B</xref>
). Moreover, disease relevance analysis showed that genes in the T4-2 differential network are associated with various cancers, such as small cell lung cancer, renal cell carcinoma, colorectal cancer, among others (
<xref ref-type="fig" rid="pcbi-1003713-g005">Figure 5B</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s010">Table S3B</xref>
), and that genes involved in the pathways such as ErbB and MAPK signaling pathways, focal adhesion, and ECM-receptor interaction (
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Table S1B</xref>
) have significant association with pathways in cancer (FDR adjusted p-values<0.1). These data are supported by biological evidence showing that ErbB and MAPK signaling pathways, microenvironment, and integrity of tissue architecture play significant roles in the malignant T4-2 cells
<xref rid="pcbi.1003713-Bissell1" ref-type="bibr">[4]</xref>
,
<xref rid="pcbi.1003713-Weaver1" ref-type="bibr">[6]</xref>
<xref rid="pcbi.1003713-Liu1" ref-type="bibr">[8]</xref>
. Together, these functional results suggest that
<italic>Treegl</italic>
can indeed reveal biological characteristic that is specific to the states in the HMT3522 cells.</p>
<fig id="pcbi-1003713-g005" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pcbi.1003713.g005</object-id>
<label>Figure 5</label>
<caption>
<title>Diseases associated with the genes in the differential network of each breast cell state.</title>
<p>(A) S1; (B) T4; (C) the EGFR/ITGB1-T4R group; (D) the PI3K/MAPKK-T4R group; and (E) the MMP-T4R group. In each plot, the differential network for the corresponding cell state is shown. Nodes represent genes; edges represent interaction of the genes. Diseases associated with the genes in the network (unadjusted p-values<0.05) are color-coded and shown on the right; genes associated with a certain disease are also colored based on the color code for the corresponding disease. An asterisk indicates a disease that is significantly enriched in the corresponding differential network.</p>
</caption>
<graphic xlink:href="pcbi.1003713.g005"></graphic>
</fig>
</sec>
<sec id="s2d3">
<title>Differential networks of the three conditions of the reverted T4-2 cells</title>
<p>In order to find out whether T4-2 cells reverted by various agents are merely phenotypically or intrinsically normal, and also whether these different conditions of the reverted cells are identical to one another at the systems level, we examined the differential networks specific to each condition of the reverted T4-2 cells.</p>
</sec>
<sec id="s2d4">
<title>EGFR/ITGB1-T4R differential network</title>
<p>Our pathway analysis suggests that there are no pathways significantly enriched in the differential network for this group of the cells (
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Table S1C</xref>
). However, a close examination of the enriched pathways (unadjusted p values<0.05) revealed that genes involved in the pathways downstream of EGFR, such as phosphatidylinositol and mTOR pathways are enriched (
<xref ref-type="fig" rid="pcbi-1003713-g004">Figure 4C</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Tables S1C</xref>
&
<xref ref-type="supplementary-material" rid="pcbi.1003713.s009">S2C</xref>
). Since PI3K is a component of the phosphatidylinositol pathway, the enrichment of the pathway suggests upmodulation of PI3K in these cells (
<xref ref-type="supplementary-material" rid="pcbi.1003713.s004">Figure S4</xref>
). This is confirmed by the results showing PIK3R2 (encoding a regulatory subunit of PI3K) and AKT1, but not mTOR itself, are among the genes upmodulated in the enriched mTOR pathway (
<xref ref-type="supplementary-material" rid="pcbi.1003713.s005">Figure S5</xref>
). Together, these results suggest PI3K-AKT-mTOR signaling is activated in the EGFR/ITGB1-T4R cells. Further, genes in the EGFR/ITGB1-T4R differential network are connected with various cancers, including prostate, and small cell lung cancer (
<xref ref-type="fig" rid="pcbi-1003713-g005">Figure 5C</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s010">Table S3C</xref>
), suggesting that despite phenotypic reversion to normal, some malignant genes remain active in the EGFR/ITGB1-T4R cells.</p>
<p>Interestingly, activation of PI3K-AKT-mTOR signaling in our data agrees well with clinical evidence showing that some breast cancer patients develop drug resistance after being treated with EGFR inhibitors, and that compensatory signaling via the PI3K-AKT-mTOR pathway has been implicated in such resistance
<xref rid="pcbi.1003713-Baselga1" ref-type="bibr">[32]</xref>
<xref rid="pcbi.1003713-DiCosimo1" ref-type="bibr">[34]</xref>
. Furthermore, our data suggest that mTOR signaling is not mediated directly by mTOR, but rather by other genes in the mTOR pathway, because mTOR per se is not involved in the differential network of this condition of the reverted cells; multiple other genes in the pathway, however, are upregulated. These results also agree with previous evidence in the literature; despite a close link between mTOR signaling and cancer, overexpression of mTOR has not been reported in human tumors; instead, signaling events related to mTOR occur frequently
<xref rid="pcbi.1003713-Huang1" ref-type="bibr">[35]</xref>
. Our results may thus identify genes directly involved in such drug resistance.</p>
</sec>
<sec id="s2d5">
<title>PI3K/MAPKK-T4R differential network</title>
<p>As expected, the enrichment of the PI3K-related pathways is absent in the differential network of the PI3K/MAPKK-T4R group of the reverted cells due to the blockage of intracellular signaling (
<xref ref-type="fig" rid="pcbi-1003713-g004">Figure 4D</xref>
). Our data, however, show that pathways both upstream (insulin pathway) and downstream (mTOR pathway) of PI3K are enriched (unadjusted p-values<0.05,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Table S1D</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s006">Figures S6</xref>
&
<xref ref-type="supplementary-material" rid="pcbi.1003713.s005">S5</xref>
), suggesting activation of compensatory signaling due to the loss of PI3K signaling in the cells. Moreover, genes in the differential network, particularly those involved in the pathways such as mTOR, insulin, and apoptotic signaling, showed association with cancer (
<xref ref-type="fig" rid="pcbi-1003713-g005">Figure 5D</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s010">Table S3D</xref>
). Even though the p-values of the pathways are not significant after the FDR adjustment, these results agree with recent clinical findings that some breast cancer patients exhibit resistance to treatment by PI3K inhibitors, and that mTOR and insulin growth factor (IGF) signaling pathways have been implicated for such resistance
<xref rid="pcbi.1003713-DiCosimo1" ref-type="bibr">[34]</xref>
. However, no genes in these pathways have been identified for this resistance yet. A close connection between insulin and IGF pathways can be seen in
<xref ref-type="supplementary-material" rid="pcbi.1003713.s005">Figure S5</xref>
.</p>
</sec>
<sec id="s2d6">
<title>MMP-T4R differential network</title>
<p>Unlike the differential networks in the other phenotypic reversion conditions mentioned above, the MMP-T4R differential network has far fewer edges, suggesting that fewer biological processes take place in the MMP-T4R group of the reverted cells. Functional pathway and GO analysis revealed that genes involved in protein catabolic processes and transfer, such as ubiquitin-dependent and proteasomal protein catabolic processes and receptor-mediated endocytosis, are significantly enriched (FDR p-values<0.05,
<xref ref-type="fig" rid="pcbi-1003713-g004">Figure 4E</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s008">Tables S1E</xref>
&
<xref ref-type="supplementary-material" rid="pcbi.1003713.s009">S2E</xref>
). Interestingly, MMPs are known to play key roles in protein catabolic processes, these results, therefore, suggest that when MMPs are inhibited in T4-2 cells, genes participating in protein catabolic processes and transfer are upregulated as compensatory pathways to make up for the loss of the MMP function. Since protein catabolic processes have been implicated in tumor development
<xref rid="pcbi.1003713-Nakayama1" ref-type="bibr">[36]</xref>
, our data suggest the MMP-T4R cells still possess tumor-developing potentials, consistent with clinical data showing MMP inhibitors (MMPIs) delivered disappointing results as anticancer treatments in clinical trials
<xref rid="pcbi.1003713-Coussens1" ref-type="bibr">[37]</xref>
,
<xref rid="pcbi.1003713-Kessenbrock1" ref-type="bibr">[38]</xref>
. Furthermore, disease relevance analysis showed that some pathways, such as oxidative phosphorylation and metabolic pathways, in the MMP-T4R differential network are significantly associated with diseases, such as Alzheimer's and Parkinson's diseases (
<xref ref-type="fig" rid="pcbi-1003713-g005">Figure 5E</xref>
,
<xref ref-type="supplementary-material" rid="pcbi.1003713.s010">Table S3E</xref>
). This suggests that despite being not as malignant as T4-2 cells, the MMP-T4R group of the reverted cells are not intrinsically normal.</p>
</sec>
</sec>
<sec id="s2e">
<title>Survival analysis of hubs</title>
<p>In order to validate our network reconstruction results, we used an external microarray dataset. Since previous evidence suggests that there is a high correlation between the degrees and essentiality of genes in yeast networks, we hypothesize that i) if a gene is indeed involved in the networks of the breast cells, its abnormal expression would have higher impact on the survival of breast cancer patients than those genes which are not in the networks; and ii) if a gene is a hub (with a high degree) in the differential networks of breast cells, its abnormal expression would have higher impact on the survival of breast cancer patients than those with low degrees. We therefore investigated the effect of the aberrant expression values of the hubs in the differential networks of the HMT3522 cells on the survival of human breast cancer patients. The external dataset we used is a gene expression microarray dataset obtained from 295 primary human breast tumors
<xref rid="pcbi.1003713-vandeVijver1" ref-type="bibr">[39]</xref>
, employed previously to identify gene expression signatures which may be predictive of patient clinical outcomes. The same dataset was also used previously
<xref rid="pcbi.1003713-Kenny1" ref-type="bibr">[12]</xref>
to demonstrate the impact of abnormal expression of TACE, TGFA, and AREG, which were shown to play important roles in the HMT3522 series of the breast cancer cells grown in the 3D culture, on the survival of the same cohort of the breast cancer patients.</p>
<p>In order to define hubs, we examined the distribution of genes with varying degrees in the differential networks.
<xref ref-type="supplementary-material" rid="pcbi.1003713.s007">Figure S7</xref>
shows that while a majority of the genes have degrees of 1–3, much fewer genes have a degree greater than 5, and therefore, we designated hubs to be genes with degree greater than 5 in the differential networks. Note that the same criterion was also used to define hubs in
<xref rid="pcbi.1003713-Han1" ref-type="bibr">[40]</xref>
.</p>
<p>Indeed, we found that 18% of the genes in the networks of the five breast cell states affect patient survival significantly, whereas that only 6% of the genes which are not in the breast cell networks but are present in the external dataset affect patient survival significantly. Our results also showed that 22% of the hubs in the differential networks of the breast cells affect patient survival significantly. GO analysis revealed that these significant hubs are enriched with genes involved in regulation of cell migration, mobility, growth, and proliferation (see
<xref ref-type="supplementary-material" rid="pcbi.1003713.s011">Table S4</xref>
for a list of the hubs), and all of these biological activities are known to be essential for cancer cell development and progression. Similarly, 23% of the genes with degree >10 and also with degree >20 affect patient survival significantly. However, for genes with degrees equal to one to five in the differential networks, the percentage of them affecting patient survival drops to 17%. Together, these results indicate that i) genes in the breast cell networks indeed have higher tendency of influencing patient survival significantly than those not in the networks, and also that ii) hubs in the differential networks are more likely to affect patient survival significantly than those with low degrees, suggesting the structures of the reconstructed networks are valid.</p>
</sec>
<sec id="s2f">
<title>Neighborhood analysis of the hubs significantly affecting patient survival</title>
<p>In order to identify potential novel drug targets, we examined three hubs, NEBL, HBEGF, and PAPD7, whose extreme (i.e., either lower or higher) expression values are correlated with the lowest 15-year patient survival rates (35%, 30% and 34%, respectively) and also with low 10-year survival rates (60%, 42% and 56%, respectively) in the examined dataset (
<xref ref-type="fig" rid="pcbi-1003713-g006">Figure 6</xref>
). Previous evidence has shown that abnormal expression of TACE, TGFA, and AREG are associated with 62%, 61% and 54% of the 10-year survival rates respectively, and associated with 57%, 50% and 54% of the 15-year survival rates respectively, in the same cohort of the patients
<xref rid="pcbi.1003713-Kenny1" ref-type="bibr">[12]</xref>
. Our results, therefore, suggest that NEBL, HBEGF, and PAPD7, similar to TACE, TGFA, and AREG, also play important roles in breast cells.</p>
<fig id="pcbi-1003713-g006" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pcbi.1003713.g006</object-id>
<label>Figure 6</label>
<caption>
<title>Kaplan–Meier curves estimating the association of different expression values of three hubs in the differential networks of the breast cell states with survival of breast cancer patients.</title>
<p>(A) NEBL in the S1 differential network; (B) HBEGF in the T4-2 differential network; (C) PAPD7 in the MMP-T4R differential network.</p>
</caption>
<graphic xlink:href="pcbi.1003713.g006"></graphic>
</fig>
<p>Since little is known about NEBL, HBEGF, and PAPD7, we examined their neighbors in the corresponding differential networks (which we call neighborhood analysis) to shed light on their functions in breast cancer.</p>
<p>
<xref ref-type="fig" rid="pcbi-1003713-g007">Figure 7A</xref>
shows the NEBL subnetwork in the S1 differential network. NEBL encodes a member of the nebulin family of proteins, which bind actin and are components of focal adhesion complex. Our data showed that decreased expression of NEBL is associated with 36% of 15-year survival rate for breast cancer, suggesting a protective role of this protein when overexpressed. Genes interacting with NEBL in the NEBL subnetwork are mainly involved in energy production by oxidation of organic compounds, actin and cytoskeletal protein binding, regulation of growth, and anatomical structure morphogenesis, all of which are consistent with the biological evidence suggesting involvement of nebulin in migratory cells
<xref rid="pcbi.1003713-Deng1" ref-type="bibr">[41]</xref>
.</p>
<fig id="pcbi-1003713-g007" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pcbi.1003713.g007</object-id>
<label>Figure 7</label>
<caption>
<title>Selected pathways or GO groups in the neighborhood of the three hubs in the differential networks of the breast cell states.</title>
<p>(A) NEBL in the S1 differential network; (B) HBEGF in the T4-2 differential network; and (C) PAPD7 in the MMP-T4R differential network. An asterisk indicates a pathway or a GO group that is significantly enriched in the corresponding differential network.</p>
</caption>
<graphic xlink:href="pcbi.1003713.g007"></graphic>
</fig>
<p>
<xref ref-type="fig" rid="pcbi-1003713-g007">Figure 7B</xref>
shows the HBEGF subnetwork in the T4-2 differential network. HBEGF encodes a heparin-binding EGF-like growth factor, which is an EGFR ligand
<xref rid="pcbi.1003713-Citri1" ref-type="bibr">[42]</xref>
. We found that higher expression of HBEGF is correlated with 34% of 15-year survival rate (
<xref ref-type="fig" rid="pcbi-1003713-g006">Figure 6B</xref>
), and that the neighbors of HBEGF in the HBEGF subnetwork are involved in diverse biological processes and functions, such as glycolysis, apoptosis, participating in laminin-5 complex, notch signaling pathway, and angiogenesis, all of which are in line with previous findings showing the high expression level of HBEGF is positively related to the aggressiveness of the breast tumors
<xref rid="pcbi.1003713-Revillion1" ref-type="bibr">[43]</xref>
and that HBEGF plays key roles in tumorigenicity and invasiveness of ovarian cancer
<xref rid="pcbi.1003713-Miyamoto1" ref-type="bibr">[44]</xref>
.</p>
<p>Finally, we examined the PAPD7 subnetwork in the MMP-T4R differential network (
<xref ref-type="fig" rid="pcbi-1003713-g007">Figure 7C</xref>
). PAPD7 encodes DNA polymerase sigma. Our data indicated that overexpression of PAPD7 is associated with 30% of 15-year survival rate and 8.5 years of median survival time (
<xref ref-type="fig" rid="pcbi-1003713-g006">Figure 6C</xref>
), both of which are the worst patient outcomes correlated with all the hubs in the differential networks, suggesting that PAPD7 plays significant roles in breast cancer cells. Previous evidence showed that a homolog of PAPD7 in
<italic>Saccharomyces cerevisiae</italic>
Trf4 plays a key role in RNA quality control by degrading aberrant or unwanted RNAs in the nucleus
<xref rid="pcbi.1003713-Vanacova1" ref-type="bibr">[45]</xref>
. Interestingly, our functional analysis revealed that genes interacting with PAPD7 in the MMP-T4R differential network are significantly enriched with those involved in RNA degradation and metabolic process, as well as regulation of Ras and small GTPase mediated signal transduction, and phosphatidylinositol signaling system (
<xref ref-type="fig" rid="pcbi-1003713-g007">Figure 7C</xref>
), which implicates that similar to Trf4, PAPD7 also participates in crucial functions such as RNA quality control in human cells.</p>
<p>Taken together, these findings suggest that the three hubs, NEBL, HBEGF, and PAPD7, in the differential networks play important roles in growth and development of breast cancer cells, and may thus become potential novel therapeutic targets. More important, these results also suggest that our reconstructed networks can not only reveal genes which have high impact on patient survival in specific cell conditions, but also can provide insight into their functions by neighborhood analysis, and thus facilitate personalized drug target discovery and identification, and help make personalized breast cancer therapy possible.</p>
</sec>
</sec>
<sec id="s3">
<title>Discussion</title>
<p>The problem of estimating rewiring networks simultaneously from multiple cell states in the phenotypic tree, as solved by
<italic>Treegl</italic>
, is fundamentally different from either estimating a single “average” network from the samples pooled from all states and subsequently “trace-out” active subnetworks corresponding to each state
<xref rid="pcbi.1003713-Luscombe1" ref-type="bibr">[1]</xref>
, or estimating multiple networks independently. The latter strategies are common practices in the system biology community, which either directly or indirectly assume the network in question is static, and samples of the nodal states in the phenotypic tree are
<italic>i.i.d.</italic>
across (when pooled) or within cell states. In reality, such an assumption is biologically invalid as well as statistically unsubstantiated. The
<italic>Treegl</italic>
algorithm elegantly couples all the inference problems pertained to each network in the tree of multiple conditions, and achieves a globally optimal and statistically well behaving solution based on a principled VCVS model and a convex optimization formulation.</p>
<p>In our analysis of the HMT3522 breast cancer cell lines, we reverse engineered 5 different gene networks specific to each cell state represented in the phenotypic tree. The S1 differential network contains genes predominantly involved in normal cellular activities, while the T4-2 differential network is enriched with pathways playing active roles in cancers. Interestingly, compensatory signaling appears to be a recurring theme of the T4-2 cells phenotypically reverted by different agents. In the T4-2 cells reverted by inhibition of either EGFR or β1-integrin (i.e., the EGFR/ITGB1-T4R group), despite the absence of the ErbB pathway, signaling events downstream of EGFR or β1-integrin, mainly via the PI3K-AKT-mTOR axis, seem to be upregulated. These results are supported by clinical evidence showing that some breast cancer patients exhibit drug resistance after being treated with EGFR inhibitors. Similarly, in the PI3K/MAPKK-T4R cells, their differential network is enriched with genes closely connected to PI3K, suggesting they are upmodulated to make up for the loss of PI3K signaling, also agreeing with clinical findings showing patient resistance to PI3K inhibitors. Likewise, the compensatory effect is observed in the differential network of the T4-2 cells reverted by MMPIs, which involves genes participating in protein catabolic processes presumably to make up for the loss of the MMP function. The effect of MMPIs for treating breast cancer patients was disappointing in clinical trials, but no conclusive evidence for ineffectiveness has been put forward
<xref rid="pcbi.1003713-Kessenbrock1" ref-type="bibr">[38]</xref>
. Our results suggest that the failure of treating breast cancer patients by MMPIs involves upmodulation of the catabolic processes in the treated patients due to compensatory effect. Together, these results suggest despite phenotypic similarities, T4-2 cells reverted by various drugs are intrinsically different from one another; similar compensatory mechanisms, however, appear to be utilized by the T4-2 cells to antagonize effects of the different drugs.</p>
<p>In order to compare our network-based approach with traditional statistical test-based approach, we also analyzed the gene expression data using ANOVA, and identified 1432 genes significantly differentially expressed (FDR p-value<0.05) across different cell states; then we used pairwise t-tests to further identify significant differences between cell states. We found that due to small sample size problems, these traditional approaches are too stringent to reveal interesting signals. For example, we examined the genes differentially expressed between the T4-2 cells reverted by MMP inhibitors (MMP-T4R) and other cell states, in particularly between MMP-T4R and S1, as well as between MMP-T4R and T4. Our results show that there are 473 genes significantly differentially expressed in MMP-T4R, comparing to S1, and the only two GO functional groups significantly enriched (FDR p-value<0.05) among these genes are “mitotic cell cycle” and “sterol biosynthesis process.” Comparing to T4, there are 375 genes differentially expressed in MMP-T4R, and there are no GO groups significantly enriched among these genes.</p>
<p>Moreover, we examined genes in the differential network of MMP-T4R which are involved in some of the significantly enriched GO groups, e.g., “proteasome complex” and “cellular catabolic process”, both of which suggest compensatory signaling in the MMP-T4R cells. We found that among 12 genes in the differential network of MMP-T4R (“PSME3, PSMA4, PSMB8, PSMD10, PSMA3, PSMB9, PSME2, PSMD7, PSMA6, PSMC2, PSMA2, PSMD6”) which are involved in “proteasome complex”, only two of them (PSMA3, PSMB9) significantly differ between MMP-T4R and S1 as identified by ANOVA, and two (PSMC2, PSMB9) significantly differ between MMP-T4R and T4. Likewise, among 33 genes in the differential network of MMP-T4R which are involved in “cellular catabolic process”, only 5 genes (“PSMB9, ANAPC5, USP18, IDH1, PSMA3”) significantly differ between MMP-T4R and S1 as identified by ANOVA, and 3 genes (“PSMB9, USP18, IDH1”) differ between MMP-T4R and T4.</p>
<p>Furthermore, we looked into the 22 hubs in the differential networks which significantly affect patient survival, and found that only 8 (36%) of them are differentially expressed across 5 cell states as identified by ANOVA and a majority (64%) of them are not differentially expressed. Similarly, among 99 hubs in the differential networks, 43% are differentially expressed, while 57% are not. These results suggest that under small-sample-size scenarios, traditional statistical tests are too stringent to capture interesting signals, while our network-based differential analysis can leverage on similarities among different samples while revealing key differences which set them apart.</p>
<p>In order to identify potential novel drug targets, we also investigated hubs in the breast cells whose aberrant expression values are significantly associated with survival outcomes of breast cancer patients. We found that genes in the networks of the breast cells have 2 times higher tendency than those not in the networks to affect patient survival in the cohort we studied. Also, hubs in the breast networks appear more likely to influence patient survival than genes with low degrees. Indeed, the proportion of the hubs with high degrees which are significant survival genes (22% for hubs with degree >5, and 23% for hubs with degree >10 and also for those with degree >20) is not much higher than that (17%) of the genes with low degrees. The reasons for this can be explained as follows.</p>
<p>When previous evidence suggests that in yeast networks, a gene with a higher degree is more likely to be an essential gene
<xref rid="pcbi.1003713-Jeong1" ref-type="bibr">[46]</xref>
,
<xref rid="pcbi.1003713-Yu1" ref-type="bibr">[47]</xref>
, an essential gene is defined as “the cell is unviable when the gene is knocked off”
<xref rid="pcbi.1003713-Yu1" ref-type="bibr">[47]</xref>
. However, it is difficult to know/determine which genes are essential in humans. Nevertheless, in light of the definition of ‘essentiality’ in yeast, we think it is plausible to believe that the actual percentage of the hubs (with degree >5) in the differential networks of the breast cells, which can affect patients significantly, is 22%+x%, rather than 22%, and the reasons why we cannot see the phenotypic effect of the x% of the hubs on patient survival may include: i) these hubs are so essential to humans that any abnormality would lead to death, even before breast tumors were formed or diagnosed; and/or ii) there are some redundant genes which can make up for the loss/gain of functions of these essential hubs.</p>
<p>Despite the fact that our results suggest that the genes in the breast cell networks are more likely to affect patient survival than those which are not, and also that hubs in the differential networks tend to affect patient survival more than genes with low degrees, our data show that the distributions of the patient survival rates (5-year, 10-year or 15-years) associated with these different groups of genes are not significantly different, suggesting that the patient survival rates are not only affected by degrees of genes in the breast cell networks, but also affected by the functionalities of the genes.</p>
<p>We have also characterized the three hubs in the cell-state-specific differential networks whose aberrant expression values are linked with the worst survival outcomes in the breast cancer patients: NEBL in S1 cells, HBEGF in T4-2 cells, and PAPD7 in the MMP-T4R group of the reverted cells. Our results are not only in line with existing information known about these genes, but also provide insight into their functional significance on the growth and development of breast cancer cells. These hubs are promising to serve as potential drug targets for personalized breast cancer therapy.</p>
<p>The major challenge of this work is the small sample size of the microarray data we have used for the network inference. The data was from 15 microarrays in total, and the T4-2 cells reverted by different agents had to be pooled together in order to increase the power of the network inference. Even though the sample grouping strategy is biologically justifiable (see details in the
<xref ref-type="sec" rid="s2">Results</xref>
section), our abilities to find differences between T4-2 cells reverted by different agents are limited due to mixed samples in the EGFR/ITGB1-T4R and PI3K/MAPKK-T4R groups of the reversion cells. For example, it is difficult to dissect which specific pathways are abnormally regulated (compared to S1 cells) in which reversion cell state: T4-2 reverted by EGFR inhibitors or by ITGB1 inhibitors. Likewise, it is also difficult to reveal differences in the T4-2 cells reverted by different agents in the PI3K/MAPKK-T4R group. Moreover, mixed samples can reduce power to detect interesting signals in the data. Despite suggesting compensatory events in the reversion cells, the enriched pathways in the EGFR/ITGB1-T4R and the PI3K/MAPKK-T4R cells are not significant (unadjusted p-values<0.05, but FDR p-values>0.1). However, since our data agree well with clinical evidence, they may facilitate clinicians to identify specific molecules which lead to resistance in the drug-treated breast cancer patients. In order to overcome the limitations of the mixed samples, we also focus on finding similarities of the different T4-2 reversion cells. Our results show that we were able to discover a significant amount of information that agrees with the facts and evidence previously known in the literature. Moreover, we were also able to delineate a mechanistic framework at the systems level that can facilitate further elucidation of the mechanisms underlying different states of the breast cells in the progression and reversion model. Experimental validations are nevertheless needed to further verify our findings.</p>
<p>In summary, this work demonstrates our recently developed
<italic>Treegl</italic>
algorithm can not only provide a holistic view (i.e., the so-called “pan-cell-state” view that echoes the emerging “pan-cancer” or “pan-disease” approach nowadays to biomedical analysis) of the progression and reversion model of the breast cells worthy of further exploration, but also allows us to gain a deeper and systems-level understanding about the behaviors of nonmalignant and malignant breast cells, which may help novel drug target discovery and make personalized breast cancer therapy possible.</p>
</sec>
<sec sec-type="materials|methods" id="s4">
<title>Materials and Methods</title>
<sec id="s4a">
<title>Cell culture and microarray hybridization</title>
<p>HMT3522 S1 and T4-2 cells were grown in 3D lrECM as previously described
<xref rid="pcbi.1003713-Weaver1" ref-type="bibr">[6]</xref>
,
<xref rid="pcbi.1003713-Petersen1" ref-type="bibr">[48]</xref>
. The T4-2 cells were reverted using each of the following reverting agents as described previously: an EGFR inhibitor Tyrphostin AG 1478 and a human EGFR-blocking monoclonal antibody mAb225
<xref rid="pcbi.1003713-Wang1" ref-type="bibr">[7]</xref>
, a β1-integrin inhibitor AIIB2
<xref rid="pcbi.1003713-Weaver1" ref-type="bibr">[6]</xref>
, a MAPK inhibitor PD98059
<xref rid="pcbi.1003713-Wang1" ref-type="bibr">[7]</xref>
, a PI3K inhibitor LY294002
<xref rid="pcbi.1003713-Liu1" ref-type="bibr">[8]</xref>
, dominant-negative Rap1
<xref rid="pcbi.1003713-Itoh1" ref-type="bibr">[13]</xref>
; an MMP inhibitor GM6001
<xref rid="pcbi.1003713-Beliveau1" ref-type="bibr">[49]</xref>
, and a broad-range inhibitor of MMPs and ADAMs, TNF protease inhibitor–2 (TAPI-2)
<xref rid="pcbi.1003713-Kenny1" ref-type="bibr">[12]</xref>
.</p>
<p>S1, T4-2 and reverted T4-2 cells were isolated from 3D cultures with PBS/EDTA as previously described
<xref rid="pcbi.1003713-Lee1" ref-type="bibr">[50]</xref>
. Total cellular RNA was extracted using RNeasy Mini Kit with on column DNase digestion (Qiagen). RNA was quantified by measuring optical density at A260 and quality was verified by agarose gel electrophoresis. Purified total cellular RNA was biotin labeled and hybridized to the Affymetrix GeneChip human genome HG-U133A arrays as previously described
<xref rid="pcbi.1003713-Fournier1" ref-type="bibr">[51]</xref>
.</p>
</sec>
<sec id="s4b">
<title>Gene expression microarray data and the sample grouping strategy</title>
<p>Gene expression microarray data was obtained from 15 total RNA samples prepared from the HMT3522 breast cells grown in 3D lrECM and treated with various reverting agents or vehicle controls as mentioned above. Unfortunately, T4-2 cells reverted by some agents have only one sample per each reversion cell state. Even though our method,
<italic>Treegl</italic>
, is designed for small sample size scenarios, having only one sample per state is not enough for network inference — as it is known that it takes at least two samples to measure even a simple quantity like correlation. Thus, in order to increase the power of the network inference, we grouped the arrays into the following five categories with each having 3 samples: (i) S1 cells (3 arrays); (ii) T4-2 cells (3 arrays); (iii) the EGFR/ITGB1-T4R group, which contains two arrays of the T4-2 cells reverted by the EGFR inhibitor Tyrphostin AG 1478 and the human EGFR-blocking monoclonal antibody mAb225, respectively, and one array of the T4-2 cells reverted by a β1-integrin inhibitor AIIB2; vi) the PI3K/MAPKK-T4R group, which contains one array of the T4-2 cells reverted by a MAPK inhibitor PD98059, one array of the T4-2 cells reverted by a PI3K inhibitor LY294002, and one array of the T4-2 cells reverted by dominant-negative Rap1; and (v) the MMP-T4R group, which contains two arrays of the T4-2 cells reverted by an MMP inhibitor GM6001, and one array of the T4-2 cells reverted by a broad-range inhibitor of MMPs and ADAMs, TAPI-2. The biological justification on this grouping strategy is provided in the
<xref ref-type="sec" rid="s2">Results</xref>
section.</p>
<p>In order to identify networks specific to each state of the breast cells, we utilized a phenotypic tree model to represent the relationships of different states of the breast cells (
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
). In particular, since the HMT3522 series were originated from S1 cells, we positioned S1 cells as the root of the phenotypic tree. Then we made S1 cells the parent of T4-2 cells, since T4-2 cells were derived from S1 cells. Finally, we made T4-2 cells the parent of the three conditions of the T4-2 cells reverted by various agents (the EGFR/ITGB1-T4R group, the PI3K/MAPKK-T4R group, and the MMP-T4R group).</p>
</sec>
<sec id="s4c">
<title>Microarray data preprocessing</title>
<p>Raw gene expression data was preprocessed using the following procedure. The data from the
<italic>perfect match</italic>
(PM) probes on the Affymetrix arrays was first log2-transformed, and normalized using the
<italic>CyclicLoess</italic>
normalization method to minimize unwanted noise in the data
<xref rid="pcbi.1003713-Bolstad1" ref-type="bibr">[52]</xref>
. We did not use the difference between the values from the PM probes and those from the mismatch (MM) probes (i.e., PM – MM) to represent values of the probes for each gene, because it has been shown that the MM values can pick up both non-specific and specific signal of the probes, and thus PM-MM values may attenuate real signal values from the PM probes
<xref rid="pcbi.1003713-Irizarry1" ref-type="bibr">[53]</xref>
. The normalized PM values were then summarized into gene expression values using the median polish technique
<xref rid="pcbi.1003713-Holder1" ref-type="bibr">[54]</xref>
. For some transcripts, multiple probes on an array target the same transcript; the values of the probes were combined by taking the median of the values to represent the expression level of the corresponding transcript. There are 12,977 unique genes on the arrays. The complete microarray dataset is available at the Gene Expression Omnibus (GEO) database (
<ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/geo">http://www.ncbi.nlm.nih.gov/geo</ext-link>
- GSE42125).</p>
<p>To reduce biological noise in the data, we removed genes whose expression values showed low variability across different groups of the breast cells. In particular, for each gene, we calculated its median expression values in five different groups of the breast cell states. If the fold change value of a gene between any of the two groups was larger than 1.3, we included the gene for the downstream analysis. The reasons why we used the fold change of 1.3 as the threshold value to filter genes are as follows: i) based on our previous experience with human lung disease studies
<xref rid="pcbi.1003713-Zhou1" ref-type="bibr">[55]</xref>
, we found that a fold change of 1.2–1.3 is enough to elicit significant biological changes in humans; and ii) When using the threshold value of 1.3, 5,440 genes passed the filter, which we consider is a reasonable number for the downstream network analysis by
<italic>Treegl</italic>
. Then we applied
<italic>Treegl</italic>
to reconstruct gene networks in the five breast cell states using expression values of the qualified genes.</p>
</sec>
<sec id="s4d">
<title>A mathematical representation of the gene networks for the cell states in the phenotypic tree</title>
<p>We now give a mathematical formulation of representing the gene networks in order to introduce our algorithm. Consider the problem of modeling
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e017.jpg"></inline-graphic>
</inline-formula>
different gene networks, each corresponding to a unique cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e018.jpg"></inline-graphic>
</inline-formula>
. Each cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e019.jpg"></inline-graphic>
</inline-formula>
has
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e020.jpg"></inline-graphic>
</inline-formula>
<italic>i.i.d.</italic>
microarray replicates. All the arrays in the dataset have the same set of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e021.jpg"></inline-graphic>
</inline-formula>
genes. In our case, we have 5 different conditions of the breast cells in the phenotypic tree: S1, T4, EGFR/ITGB1-T4R, PI3K/MAPKK-T4R, and MMP-T4R.</p>
<p>As commonly done, we model each gene network as a weighted undirected graph, where the vertices represent genes and the edges represent interactions in the network. Let
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e022.jpg"></inline-graphic>
</inline-formula>
represent a network in cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e023.jpg"></inline-graphic>
</inline-formula>
, where
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e024.jpg"></inline-graphic>
</inline-formula>
denotes the set of genes that is fixed for all cell states and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e025.jpg"></inline-graphic>
</inline-formula>
denotes the set of edges specific to the network for cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e026.jpg"></inline-graphic>
</inline-formula>
. Let
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e027.jpg"></inline-graphic>
</inline-formula>
where
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e028.jpg"></inline-graphic>
</inline-formula>
be the vector of expression values of genes on array
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e029.jpg"></inline-graphic>
</inline-formula>
in cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e030.jpg"></inline-graphic>
</inline-formula>
. We assume
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e031.jpg"></inline-graphic>
</inline-formula>
, i.e. that the vector of expression values follows a multivariate Gaussian distribution.</p>
<p>We are interested in reconstructing
<italic>a set of networks</italic>
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e032.jpg"></inline-graphic>
</inline-formula>
that are related by the phenotypic tree as shown in
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
. For each cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e033.jpg"></inline-graphic>
</inline-formula>
, let
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e034.jpg"></inline-graphic>
</inline-formula>
be the parent of the cell state in the tree; alternatively, we can also view
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e035.jpg"></inline-graphic>
</inline-formula>
as a
<italic>descendant</italic>
of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e036.jpg"></inline-graphic>
</inline-formula>
. In our case,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e037.jpg"></inline-graphic>
</inline-formula>
,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e038.jpg"></inline-graphic>
</inline-formula>
,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e039.jpg"></inline-graphic>
</inline-formula>
. We generally let
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e040.jpg"></inline-graphic>
</inline-formula>
correspond to S1,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e041.jpg"></inline-graphic>
</inline-formula>
correspond to T4, and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e042.jpg"></inline-graphic>
</inline-formula>
correspond to the EGFR/ITGB1-T4R group, the PI3K/MAPKK-T4R group, and the MMP-T4R group, respectively. Thus, in our formulation, recovering the structures of the gene regulatory networks in different breast cell states corresponds to estimating the network structure for each cell state.</p>
</sec>
<sec id="s4e">
<title>Estimating a gene network in a cell state</title>
<p>Consider first estimating the edge set of a single network
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e043.jpg"></inline-graphic>
</inline-formula>
from the data. As described in the
<xref ref-type="sec" rid="s2">Results</xref>
section, we model the gene network for each cell state
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e044.jpg"></inline-graphic>
</inline-formula>
as a
<italic>Gaussian</italic>
Markov network. Therefore the inverse of the covariance matrix, called the precision matrix,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e045.jpg"></inline-graphic>
</inline-formula>
, completely encodes the structure of the Markov network. An edge
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e046.jpg"></inline-graphic>
</inline-formula>
exists in the Markov network if and only if the corresponding precision matrix element is non-zero.</p>
<p>A Gaussian Markov network, encoded via the precision matrix, allows us to model more sophisticated dependencies than a correlation network, which is encoded by the covariance matrix. In particular, the precision matrix elements
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e047.jpg"></inline-graphic>
</inline-formula>
are related to the
<italic>partial</italic>
correlation between
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e048.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e049.jpg"></inline-graphic>
</inline-formula>
(denoted as
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e050.jpg"></inline-graphic>
</inline-formula>
, see below for details). Formally, partial correlation between a pair of random variables
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e051.jpg"></inline-graphic>
</inline-formula>
given a set
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e052.jpg"></inline-graphic>
</inline-formula>
of controlling variables is defined as follows. Let
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e053.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e054.jpg"></inline-graphic>
</inline-formula>
denote the residuals from performing linear regression of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e055.jpg"></inline-graphic>
</inline-formula>
with
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e056.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e057.jpg"></inline-graphic>
</inline-formula>
with
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e058.jpg"></inline-graphic>
</inline-formula>
, respectively. The partial correlation is then defined as the correlation between
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e059.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e060.jpg"></inline-graphic>
</inline-formula>
. Unlike correlation, which simply measures the association between a pair of random variables, partial correlation intuitively measures the association between a pair of variables with a set
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e061.jpg"></inline-graphic>
</inline-formula>
of controlling variables removed (where here
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e062.jpg"></inline-graphic>
</inline-formula>
is all the other genes). The partial correlation, due to its close relationship with the elements of the precision matrix, makes the latter much more suitable than the covariance matrix for distinguishing between indirect and direct relationships as shown in
<xref ref-type="fig" rid="pcbi-1003713-g002">Figure 2</xref>
.</p>
<p>Since our goal is to learn the structure of the Markov network, we are only concerned with estimating which precision matrix elements are zero and which are not (rather than the exact precision matrix values). Therefore it suffices to estimate the partial correlation coefficients, which are proportional to the precision matrix elements by the equation
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e063.jpg"></inline-graphic>
</inline-formula>
. An estimation algorithm can be constructed by exploiting the relationships between the partial correlation coefficients and a linear regression model
<xref rid="pcbi.1003713-Lauritzen1" ref-type="bibr">[56]</xref>
. Specifically, consider a linear regression model where gene
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e064.jpg"></inline-graphic>
</inline-formula>
is treated as the response variable and all the other genes are covariates. The regression coefficient
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e065.jpg"></inline-graphic>
</inline-formula>
of covariate
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e066.jpg"></inline-graphic>
</inline-formula>
is then proportional to the partial correlation
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e067.jpg"></inline-graphic>
</inline-formula>
.</p>
<p>The above facts enable us to use regression-based methods to estimate the elements of the precision matrix (up to a proportionality constant), and thus the underlying network structure. In particular, our method is based on an efficient neighborhood selection algorithm
<xref rid="pcbi.1003713-Meinshausen1" ref-type="bibr">[26]</xref>
based on
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e068.jpg"></inline-graphic>
</inline-formula>
-norm regularized regression that works well in practice and has strong theoretical guarantees. In this approach, the neighborhood of each gene
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e069.jpg"></inline-graphic>
</inline-formula>
(the set of edges incident to
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e070.jpg"></inline-graphic>
</inline-formula>
) is estimated independently of the neighborhoods of other genes. After estimating each neighborhood, the results are then combined to produce the estimated network. In every neighborhood estimation step, gene
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e071.jpg"></inline-graphic>
</inline-formula>
is treated as a response variable, and all the other genes are the covariates. An
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e072.jpg"></inline-graphic>
</inline-formula>
penalized linear regression (also known as the lasso
<xref rid="pcbi.1003713-Tibshirani1" ref-type="bibr">[57]</xref>
) is performed to give an estimate of the regression coefficients
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e073.jpg"></inline-graphic>
</inline-formula>
.</p>
<p>Then by leveraging the relationship between the regression coefficients and the partial correlation, the estimated gene network
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e074.jpg"></inline-graphic>
</inline-formula>
is constructed by adding an edge
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e075.jpg"></inline-graphic>
</inline-formula>
to
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e076.jpg"></inline-graphic>
</inline-formula>
if either
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e077.jpg"></inline-graphic>
</inline-formula>
or
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e078.jpg"></inline-graphic>
</inline-formula>
is non-zero (max-symmetrization).</p>
</sec>
<sec id="s4f">
<title>Estimating a tree-shaped genealogy of gene networks in the breast cells</title>
<p>Obviously, networks for each cell state can be estimated independently by using the method described above. However, this can lead to very poor estimates of the edge sets, because in common laboratory settings only a few replicates of gene expression data can be obtained. To overcome this limitation, we estimated the networks by assuming that the networks share similarities due to their relationships as suggested by the phenotypic tree, but also have some sharp differences. For example, for S1 and T4-2 cells, we assume they have considerable differences as the former is nonmalignant while the latter is tumorigenic; however, since T4-2 were derived from S1, we also assume that these cells share substantial similarity. This is the motivation behind
<italic>Treegl</italic>
, the algorithm that we first presented in
<xref rid="pcbi.1003713-Parikh1" ref-type="bibr">[24]</xref>
.</p>
<p>
<italic>Treegl</italic>
is unique in that it makes use of a total variation regularizer, which allows information to be shared across different cell states, and thus encourages the resulting networks to be similar while allowing differences in the networks to be revealed. More specifically,
<italic>Treegl</italic>
adopts the idea of neighborhood selection and additionally penalizes the differences between the neighborhoods of adjacent states in the breast cell phenotypic tree. This makes
<italic>Treegl</italic>
more effective in small-sample-size settings than existing approaches since it can estimate a collection of networks more robustly by leveraging the similarities among them.</p>
<p>In summary,
<italic>Treegl</italic>
proposes the following optimization problem for jointly recovering the neighborhoods of genes for all the cell states in the phenotypic tree of the breast cells:
<disp-formula id="pcbi.1003713.e079">
<graphic xlink:href="pcbi.1003713.e079.jpg" position="anchor" orientation="portrait"></graphic>
</disp-formula>
</p>
<p>In the equation above, the first term corresponds to the residual sum of squares as in normal linear regression.
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e080.jpg"></inline-graphic>
</inline-formula>
indicates the
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e081.jpg"></inline-graphic>
</inline-formula>
vector of the expression values of all genes except
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e082.jpg"></inline-graphic>
</inline-formula>
, and similarly,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e083.jpg"></inline-graphic>
</inline-formula>
.
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e084.jpg"></inline-graphic>
</inline-formula>
is defined as
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e085.jpg"></inline-graphic>
</inline-formula>
. The second term (corresponding to
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e086.jpg"></inline-graphic>
</inline-formula>
) is a
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e087.jpg"></inline-graphic>
</inline-formula>
penalty on the edge weights (similar to
<xref rid="pcbi.1003713-Meinshausen1" ref-type="bibr">[26]</xref>
), where
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e088.jpg"></inline-graphic>
</inline-formula>
denotes the
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e089.jpg"></inline-graphic>
</inline-formula>
norm of vector
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e090.jpg"></inline-graphic>
</inline-formula>
, which is the sum of the absolute values of the components of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e091.jpg"></inline-graphic>
</inline-formula>
. This penalty promotes
<bold>sparsity</bold>
in the edge weights by enforcing most of the edge weights to be zero. The assumption of sparsity is biologically justifiable. For example, it is common to find a transcription factor regulating a limited number of genes under specific conditions
<xref rid="pcbi.1003713-Davidson1" ref-type="bibr">[31]</xref>
. The details of the
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e092.jpg"></inline-graphic>
</inline-formula>
regularization can be seen in
<xref rid="pcbi.1003713-Tibshirani1" ref-type="bibr">[57]</xref>
. The third term (also called the total variation penalty) associated with
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e093.jpg"></inline-graphic>
</inline-formula>
enforces sparsity of differences between S1 and T4-2 as well as between T4-2 and each of the T4R groups (as illustrated in the tree structure in
<xref ref-type="fig" rid="pcbi-1003713-g001">Figure 1</xref>
), but not between T4Rs and S1. This encourages many (but not all) of the elements of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e094.jpg"></inline-graphic>
</inline-formula>
to be identical to those of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e095.jpg"></inline-graphic>
</inline-formula>
. The fourth term (also associated with
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e096.jpg"></inline-graphic>
</inline-formula>
) additionally penalizes the differences between each of the T4R groups and S1, while allowing for sharp differences to be revealed between the two groups. Note that if the fourth term was not used, the T4R networks would be biased to be more similar to the T4-2 network than the S1 network. This would be undesirable, since it is unknown a priori whether each of the T4R states are more similar to T4-2 or S1 cells.</p>
<p>
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e097.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e098.jpg"></inline-graphic>
</inline-formula>
are regularization parameters that control the amount of penalization (see below for details on how we selected these parameters). Because the minimization problem is convex, we solved it using the CVX solver
<xref rid="pcbi.1003713-Grant1" ref-type="bibr">[58]</xref>
, as we described in
<xref rid="pcbi.1003713-Parikh1" ref-type="bibr">[24]</xref>
.</p>
<p>In this work, we focus on genes linked by positive edges, because interaction of these genes is easier to interpret. For example, suppose genes
<italic>X</italic>
and
<italic>Y</italic>
are linked by positive edges, and genes
<italic>Y</italic>
and
<italic>Z</italic>
are also linked by positive edges. Intuitively, this suggests that genes
<italic>X</italic>
and
<italic>Y</italic>
are regulated in the same direction, that is, when gene
<italic>X</italic>
is up (or down)-regulated, gene
<italic>Y</italic>
is also up (or down)-regulated. Same is true for genes
<italic>Y</italic>
and
<italic>Z</italic>
which are also regulated in the same direction. As a result, we can also decide that genes
<italic>X</italic>
and
<italic>Z</italic>
are regulated in the same direction.</p>
<p>On the other hand, interpreting interaction of genes linked by negative edges is more complicated. For example, suppose genes
<italic>A</italic>
and
<italic>B</italic>
are linked by a negative edge and genes
<italic>B</italic>
and
<italic>C</italic>
also linked by a negative edge. This intuitively means that
<italic>A</italic>
and
<italic>B</italic>
are regulated in the opposite direction, and that
<italic>B</italic>
and
<italic>C</italic>
are also regulated in the opposite direction; it is, however, unclear what is the relationship of
<italic>A</italic>
and
<italic>C</italic>
, which may be regulated in either the same or the opposite direction.</p>
<p>Due to the reasons stated above, we chose to limit the scope of this work by focusing only on the positive edges to simplify our interpretation of the results.</p>
</sec>
<sec id="s4g">
<title>Selecting regularization parameters</title>
<p>Choosing the regularization parameters
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e099.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e100.jpg"></inline-graphic>
</inline-formula>
is a challenging problem in high dimensional statistics. Kolar and Xing proposed to use the Bayesian information criterion (BIC) score to select these parameters
<xref rid="pcbi.1003713-Kolar3" ref-type="bibr">[59]</xref>
. This approach can be useful in low dimensional settings; however, it does not perform well in high dimensional settings
<xref rid="pcbi.1003713-Liu2" ref-type="bibr">[60]</xref>
. In this work, since we have a good knowledge of the biological properties of the S1 and T4-2 cells in the HTM3522 system, we employed a knowledge-based approach to tune
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e101.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e102.jpg"></inline-graphic>
</inline-formula>
, namely, we tuned these parameters based on our prior knowledge about S1 and T4-2 cells, which turned out to be highly effective in the high dimensional, small sample size setting as we encountered in this work. Specifically, we first varied
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e103.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e104.jpg"></inline-graphic>
</inline-formula>
in the set {4, 4.5, 5, 5.5, 6, 6.5, 7} and the set {0.5, 1, 1.5, 2, 2.5}, respectively, and generated cell-state-specific networks for each possible pair of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e105.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e106.jpg"></inline-graphic>
</inline-formula>
. These sets of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e107.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e108.jpg"></inline-graphic>
</inline-formula>
were chosen because the networks can be generated with reasonable sparsity. Then we examined the biological pathways significantly enriched in the differential network of the S1 and T4-2 cells, and found that when
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e109.jpg"></inline-graphic>
</inline-formula>
and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e110.jpg"></inline-graphic>
</inline-formula>
, almost all of the enriched pathways in the T4-2 network make the best biological sense in that they are either well described in previous studies or are known pathways active in cancers. Since we used S1 and T4-2 cells to help tune the regularization parameters, we present and discuss mainly biological findings we made from the networks of the T4-2 reversion cells to avoid circular reasoning.</p>
</sec>
<sec id="s4h">
<title>Synthetic network generation and evaluation</title>
<p>We describe below how the networks in our simulation experiments were generated. Consider the following artificial collection of 70 networks, related by a tree:</p>
<list list-type="order">
<list-item>
<p>A network A with
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e111.jpg"></inline-graphic>
</inline-formula>
nodes, with an average degree 4, and maximum degree 6 is randomly generated. For the first 10 states,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e112.jpg"></inline-graphic>
</inline-formula>
,
<bold>
<italic>A</italic>
</bold>
remains unchanged, and thus, the networks for cell states
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e113.jpg"></inline-graphic>
</inline-formula>
are identical.</p>
</list-item>
<list-item>
<p>After
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e114.jpg"></inline-graphic>
</inline-formula>
, the network branches into two child networks,
<bold>
<italic>B</italic>
</bold>
and
<bold>
<italic>C</italic>
</bold>
. To generate each child network, 25% of the edges are randomly deleted and the same number of the edges are randomly added. This represents a sharp, sparse change in the network. These child networks remain unchanged for another 10 states (
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e115.jpg"></inline-graphic>
</inline-formula>
for
<bold>
<italic>B</italic>
</bold>
,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e116.jpg"></inline-graphic>
</inline-formula>
for
<bold>
<italic>C</italic>
</bold>
).</p>
</list-item>
<list-item>
<p>
<bold>
<italic>B</italic>
</bold>
and
<bold>
<italic>C</italic>
</bold>
then branch (similar to step 2) to generate networks
<bold>
<italic>D</italic>
</bold>
and
<bold>
<italic>E</italic>
</bold>
from
<bold>
<italic>B</italic>
</bold>
, and
<bold>
<italic>F</italic>
</bold>
and
<bold>
<italic>G</italic>
</bold>
from
<bold>
<italic>C</italic>
</bold>
. These networks remain unchanged for another 10 states
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e117.jpg"></inline-graphic>
</inline-formula>
for
<bold>
<italic>D</italic>
</bold>
,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e118.jpg"></inline-graphic>
</inline-formula>
for
<bold>
<italic>E</italic>
</bold>
,
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e119.jpg"></inline-graphic>
</inline-formula>
for
<bold>
<italic>F</italic>
</bold>
, and
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e120.jpg"></inline-graphic>
</inline-formula>
for
<bold>
<italic>G</italic>
</bold>
.</p>
</list-item>
</list>
<p>
<italic>Treegl</italic>
does not know a priori which networks are identical and which are not. The number (
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e121.jpg"></inline-graphic>
</inline-formula>
) of samples are then generated for each network under the Gaussian Graphical Model assumption. We vary the values of
<inline-formula>
<inline-graphic xlink:href="pcbi.1003713.e122.jpg"></inline-graphic>
</inline-formula>
in the simulation experiments, and the results presented in
<xref ref-type="fig" rid="pcbi-1003713-g003">Figures 3</xref>
are based on the values indicated in the figure. In each scenario, the number of edges is twice as much as the number of nodes.</p>
<p>To evaluate
<italic>Treegl</italic>
, we conduct a total of 10 simulation experiments, and plot the precision-recall curves showing the recall for different values of precision based on the networks reconstructed by
<italic>Treegl</italic>
. The error bars in the curves indicate the first and third quartiles of the results. Details on how we generated the precision-recall curves and selected the regularization parameters can be found in
<xref rid="pcbi.1003713-Parikh1" ref-type="bibr">[24]</xref>
.</p>
</sec>
<sec id="s4i">
<title>Pathway analysis</title>
<p>To identify pathways significantly enriched in the gene networks of the 5 breast cell states estimated by
<italic>Treegl</italic>
, we performed pathway analysis on the list of the genes involved in each network using the
<italic>Category</italic>
Bioconductor package with minor modification (
<ext-link ext-link-type="uri" xlink:href="http://www.bioconductor.org">http://www.bioconductor.org</ext-link>
). The
<italic>Category</italic>
package uses hypergeometric tests to assess overrepresentation of the KEGG pathways among genes of interest. A list of 12,977 unique genes on the Affymetrix GeneChip Human Genome U133A was used as the reference gene list for the pathway analysis. A pathway is considered to be significant if
<italic>p</italic>
<0.1 with the FDR controlling procedure of Benjamini & Hochberg
<xref rid="pcbi.1003713-Benjamini1" ref-type="bibr">[61]</xref>
.</p>
</sec>
<sec id="s4j">
<title>Disease relevance analysis</title>
<p>To find out genes significantly associated with certain diseases in the differential networks of the breast cell states, we performed pathway analysis as described above. For each differential network, pathways related to diseases and significantly enriched in the network were singled out; genes in the network that are involved in the enriched disease-related pathways were reported as the genes significantly associated with the diseases in the network.</p>
</sec>
<sec id="s4k">
<title>GO analysis</title>
<p>To identify functional groups of genes significantly enriched in the gene networks of the breast cells estimated by
<italic>Treegl</italic>
, we performed GO analysis on the list of the genes involved in each network using the
<italic>GOstat</italic>
program
<xref rid="pcbi.1003713-Beissbarth1" ref-type="bibr">[62]</xref>
. The
<italic>GOstat</italic>
program finds the enriched functional groups using Fisher's exact tests. The
<italic>GOstat</italic>
program was also used to identify functional groups of genes enriched among the neighborhoods (or the subnetworks) of the hubs significantly affecting patient survival. A functional group is considered to be significant if
<italic>p</italic>
<0.05 with the FDR controlling procedure of Benjamini & Hochberg. A list of 12,977 unique genes on the Affymetrix GeneChip Human Genome U133A was used as the reference gene list for the
<italic>GOstat</italic>
program.</p>
</sec>
<sec id="s4l">
<title>Survival analysis of hubs</title>
<p>We define hubs as genes with positive degree greater than 5 in the differential networks of the breast cell states. Survival analysis was performed using microarray expression values of the hubs extracted from a gene expression microarray data set obtained from 295 primary human breast tumors
<xref rid="pcbi.1003713-vandeVijver1" ref-type="bibr">[39]</xref>
. For each hub, its expression values across all patients were divided into three groups: lower quartile, interquartile, and upper quartile groups. Kaplan–Meier curves were used to estimate the association of expression values of the hubs in the three groups with patient survival. The log-rank test was used to calculate
<italic>p</italic>
-values of the survival curves. A hub was considered as significant if the p value of its associated survival curve <0.05 after controlling for multiple testing using the Bonferroni procedure.</p>
</sec>
</sec>
<sec sec-type="supplementary-material" id="s5">
<title>Supporting Information</title>
<supplementary-material content-type="local-data" id="pcbi.1003713.s001">
<label>Figure S1</label>
<caption>
<p>Diagram depicting relationship of the networks in the simulation experiment. Networks with the same color have an identical network structure, i.e., networks 1–10 are identical, networks 11–20 are identical, etc. The black solid lines represent change-points where the network structure changes.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pcbi.1003713.s001.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s002">
<label>Figure S2</label>
<caption>
<p>Precision-recall plots for each of the 70 individual networks in the simulation experiment for 50 nodes, 10 samples per network as shown in
<xref ref-type="fig" rid="pcbi-1003713-g003">Figure 3D</xref>
. Row 1 represents networks 1–10, Row 2 represents networks 11–20, Row 3 represents networks 21–30, …, and Row 7 represents networks 61–70. Treegl is shown in blue. The static approach of pooling all the samples to estimate only one network is shown in red, while the results for estimating each network independently without any information sharing is shown in green. The scales of the x-axis and y-axis are identical to those in
<xref ref-type="fig" rid="pcbi-1003713-g003">Figure 3</xref>
.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pcbi.1003713.s002.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s003">
<label>Figure S3</label>
<caption>
<p>(Figure from
<xref rid="pcbi.1003713-Parikh1" ref-type="bibr">[24]</xref>
). An overview of the identified cell-state-specific networks: (A) S1, (B) T4, (C) EGFR/ITGB1-T4R, (D) PI3K/MAPKK-T4R, and (E) MMP-T4R. Only edges of absolute weight >0.1 are shown. Hubs (i.e., nodes with >5 edges) are in orange and enlarged proportional to their degrees.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pcbi.1003713.s003.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s004">
<label>Figure S4</label>
<caption>
<p>A KEGG diagram of the phosphatidylinositol signaling pathway enriched in the differential network of the EGFR/ITGB1-T4R cells. PI3K is identified by red arrows. Only a section of the pathway is shown.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pcbi.1003713.s004.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s005">
<label>Figure S5</label>
<caption>
<p>A KEGG diagram of the mTOR signaling pathway. This pathway is enriched in the differential networks of both the EGFR/ITGB1-T4R cells and the PI3K/MAPKK-T4R cells. PI3K and mTOR are identified by red and blue arrows, respectively. Insulin signaling pathway and INS/IGF are identified by purple and pink arrows, respectively. Notice that IGF is intimately connected with both insulin and mTOR pathways.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pcbi.1003713.s005.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s006">
<label>Figure S6</label>
<caption>
<p>A KEGG diagram of the Insulin signaling pathway enriched in the differential network of the PI3K/MAPKK-T4R cells. PI3K, mTOR, and INS/IGF are identified by red, blue, and pink arrows, respectively.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pcbi.1003713.s006.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s007">
<label>Figure S7</label>
<caption>
<p>A plot showing the number of genes that have degree d for various values of d. The plot in the inset displays the same data, except that the y-axis is shown in log scale. The red arrow points to the number of the genes with degree = 6. Since comparing to genes with degree = 5, there is a noticeable decreased number of genes with degree = 6, thus we designate all the genes with degree >5 to be hubs.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pcbi.1003713.s007.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s008">
<label>Table S1</label>
<caption>
<p>Significantly enriched pathways in the differential networks of the breast cell states in the progression and reversion model of the HMT3522 cells. (A) S1 differential network; (B) T4-2 differential network; (C) EGFR/ITGB1-T4R differential network; (D) PI3K/MAPKK-T4R differential network; (E) MMP-T4R differential network.</p>
<p>(DOCX)</p>
</caption>
<media xlink:href="pcbi.1003713.s008.docx">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s009">
<label>Table S2</label>
<caption>
<p>Significantly enriched GO groups in the differential networks of the breast cell states in the progression and reversion model of the HMT3522 cells. (A) S1 differential network; (B) T4-2 differential network; (C) EGFR/ITGB1-T4R differential network; (D) PI3K/MAPKK-T4R differential network; (E) MMP-T4R differential network.</p>
<p>(DOCX)</p>
</caption>
<media xlink:href="pcbi.1003713.s009.docx">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s010">
<label>Table S3</label>
<caption>
<p>Diseases significantly associated with the genes in the differential networks of the breast cell states in the progression and reversion model of the HMT3522 cells. (A) S1 differential network; (B) T4-2 differential network; (C) EGFR/ITGB1-T4R differential network; (D) PI3K/MAPKK-T4R differential network; (E) MMP-T4R differential network.</p>
<p>(DOCX)</p>
</caption>
<media xlink:href="pcbi.1003713.s010.docx">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pcbi.1003713.s011">
<label>Table S4</label>
<caption>
<p>Hubs in the differential networks of the breast cell states significantly affecting survival of breast cancer patients.</p>
<p>(DOCX)</p>
</caption>
<media xlink:href="pcbi.1003713.s011.docx">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<p>We thank Dr. Ren Xu for providing us information about the gene expression microarray dataset used in this work.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="pcbi.1003713-Luscombe1">
<label>1</label>
<mixed-citation publication-type="journal">
<name>
<surname>Luscombe</surname>
<given-names>NM</given-names>
</name>
,
<name>
<surname>Babu</surname>
<given-names>MM</given-names>
</name>
,
<name>
<surname>Yu</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Snyder</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Teichmann</surname>
<given-names>SA</given-names>
</name>
,
<etal>et al</etal>
(
<year>2004</year>
)
<article-title>Genomic analysis of regulatory network dynamics reveals large topological changes</article-title>
.
<source>Nature</source>
<volume>431</volume>
:
<fpage>308</fpage>
<lpage>312</lpage>
.
<pub-id pub-id-type="pmid">15372033</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Ernst1">
<label>2</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ernst</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Vainas</surname>
<given-names>O</given-names>
</name>
,
<name>
<surname>Harbison</surname>
<given-names>CT</given-names>
</name>
,
<name>
<surname>Simon</surname>
<given-names>I</given-names>
</name>
,
<name>
<surname>Bar-Joseph</surname>
<given-names>Z</given-names>
</name>
(
<year>2007</year>
)
<article-title>Reconstructing dynamic regulatory maps</article-title>
.
<source>Mol Syst Biol</source>
<volume>3</volume>
:
<fpage>74</fpage>
.
<pub-id pub-id-type="pmid">17224918</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Schulz1">
<label>3</label>
<mixed-citation publication-type="journal">
<name>
<surname>Schulz</surname>
<given-names>MH</given-names>
</name>
,
<name>
<surname>Devanny</surname>
<given-names>WE</given-names>
</name>
,
<name>
<surname>Gitter</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Zhong</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Ernst</surname>
<given-names>J</given-names>
</name>
,
<etal>et al</etal>
(
<year>2012</year>
)
<article-title>DREM 2.0: Improved reconstruction of dynamic regulatory networks from time-series expression data</article-title>
.
<source>BMC Syst Biol</source>
<volume>6</volume>
:
<fpage>104</fpage>
.
<pub-id pub-id-type="pmid">22897824</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Bissell1">
<label>4</label>
<mixed-citation publication-type="journal">
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
,
<name>
<surname>Weaver</surname>
<given-names>VM</given-names>
</name>
,
<name>
<surname>Lelievre</surname>
<given-names>SA</given-names>
</name>
,
<name>
<surname>Wang</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Petersen</surname>
<given-names>OW</given-names>
</name>
,
<etal>et al</etal>
(
<year>1999</year>
)
<article-title>Tissue structure, nuclear organization, and gene expression in normal and malignant breast</article-title>
.
<source>Cancer Res</source>
<volume>59</volume>
:
<fpage>1757</fpage>
<lpage>1763s</lpage>
discussion 1763s–1764s.
<pub-id pub-id-type="pmid">10197593</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Bissell2">
<label>5</label>
<mixed-citation publication-type="journal">
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
,
<name>
<surname>Radisky</surname>
<given-names>D</given-names>
</name>
(
<year>2001</year>
)
<article-title>Putting tumours in context</article-title>
.
<source>Nat Rev Cancer</source>
<volume>1</volume>
:
<fpage>46</fpage>
<lpage>54</lpage>
.
<pub-id pub-id-type="pmid">11900251</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Weaver1">
<label>6</label>
<mixed-citation publication-type="journal">
<name>
<surname>Weaver</surname>
<given-names>VM</given-names>
</name>
,
<name>
<surname>Petersen</surname>
<given-names>OW</given-names>
</name>
,
<name>
<surname>Wang</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Larabell</surname>
<given-names>CA</given-names>
</name>
,
<name>
<surname>Briand</surname>
<given-names>P</given-names>
</name>
,
<etal>et al</etal>
(
<year>1997</year>
)
<article-title>Reversion of the malignant phenotype of human breast cells in three-dimensional culture and in vivo by integrin blocking antibodies</article-title>
.
<source>J Cell Biol</source>
<volume>137</volume>
:
<fpage>231</fpage>
<lpage>245</lpage>
.
<pub-id pub-id-type="pmid">9105051</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Wang1">
<label>7</label>
<mixed-citation publication-type="journal">
<name>
<surname>Wang</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Weaver</surname>
<given-names>VM</given-names>
</name>
,
<name>
<surname>Petersen</surname>
<given-names>OW</given-names>
</name>
,
<name>
<surname>Larabell</surname>
<given-names>CA</given-names>
</name>
,
<name>
<surname>Dedhar</surname>
<given-names>S</given-names>
</name>
,
<etal>et al</etal>
(
<year>1998</year>
)
<article-title>Reciprocal interactions between beta1-integrin and epidermal growth factor receptor in three-dimensional basement membrane breast cultures: a different perspective in epithelial biology</article-title>
.
<source>Proc Natl Acad Sci U S A</source>
<volume>95</volume>
:
<fpage>14821</fpage>
<lpage>14826</lpage>
.
<pub-id pub-id-type="pmid">9843973</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Liu1">
<label>8</label>
<mixed-citation publication-type="journal">
<name>
<surname>Liu</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Radisky</surname>
<given-names>DC</given-names>
</name>
,
<name>
<surname>Wang</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
(
<year>2004</year>
)
<article-title>Polarity and proliferation are controlled by distinct signaling pathways downstream of PI3-kinase in breast epithelial tumor cells</article-title>
.
<source>J Cell Biol</source>
<volume>164</volume>
:
<fpage>603</fpage>
<lpage>612</lpage>
.
<pub-id pub-id-type="pmid">14769856</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Briand1">
<label>9</label>
<mixed-citation publication-type="journal">
<name>
<surname>Briand</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Petersen</surname>
<given-names>OW</given-names>
</name>
,
<name>
<surname>Van Deurs</surname>
<given-names>B</given-names>
</name>
(
<year>1987</year>
)
<article-title>A new diploid nontumorigenic human breast epithelial cell line isolated and propagated in chemically defined medium</article-title>
.
<source>In Vitro Cell Dev Biol</source>
<volume>23</volume>
:
<fpage>181</fpage>
<lpage>188</lpage>
.
<pub-id pub-id-type="pmid">3558253</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Briand2">
<label>10</label>
<mixed-citation publication-type="journal">
<name>
<surname>Briand</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Nielsen</surname>
<given-names>KV</given-names>
</name>
,
<name>
<surname>Madsen</surname>
<given-names>MW</given-names>
</name>
,
<name>
<surname>Petersen</surname>
<given-names>OW</given-names>
</name>
(
<year>1996</year>
)
<article-title>Trisomy 7p and malignant transformation of human breast epithelial cells following epidermal growth factor withdrawal</article-title>
.
<source>Cancer Res</source>
<volume>56</volume>
:
<fpage>2039</fpage>
<lpage>2044</lpage>
.
<pub-id pub-id-type="pmid">8616848</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Inman1">
<label>11</label>
<mixed-citation publication-type="journal">
<name>
<surname>Inman</surname>
<given-names>JL</given-names>
</name>
,
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
(
<year>2010</year>
)
<article-title>Apical polarity in three-dimensional culture systems: where to now?</article-title>
<source>J Biol</source>
<volume>9</volume>
:
<fpage>2</fpage>
.
<pub-id pub-id-type="pmid">20092610</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Kenny1">
<label>12</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kenny</surname>
<given-names>PA</given-names>
</name>
,
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
(
<year>2007</year>
)
<article-title>Targeting TACE-dependent EGFR ligand shedding in breast cancer</article-title>
.
<source>J Clin Invest</source>
<volume>117</volume>
:
<fpage>337</fpage>
<lpage>345</lpage>
.
<pub-id pub-id-type="pmid">17218988</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Itoh1">
<label>13</label>
<mixed-citation publication-type="journal">
<name>
<surname>Itoh</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Nelson</surname>
<given-names>CM</given-names>
</name>
,
<name>
<surname>Myers</surname>
<given-names>CA</given-names>
</name>
,
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
(
<year>2007</year>
)
<article-title>Rap1 integrates tissue polarity, lumen formation, and tumorigenic potential in human breast epithelial cells</article-title>
.
<source>Cancer Res</source>
<volume>67</volume>
:
<fpage>4759</fpage>
<lpage>4766</lpage>
.
<pub-id pub-id-type="pmid">17510404</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-BeckerWeimann1">
<label>14</label>
<mixed-citation publication-type="journal">
<name>
<surname>Becker-Weimann</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Xiong</surname>
<given-names>G</given-names>
</name>
,
<name>
<surname>Furuta</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Han</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Kuhn</surname>
<given-names>I</given-names>
</name>
,
<etal>et al</etal>
(
<year>2013</year>
)
<article-title>NFkB disrupts tissue polarity in 3D by preventing integration of microenvironmental signals</article-title>
.
<source>Oncotarget</source>
<volume>4</volume>
:
<fpage>2010</fpage>
<lpage>2020</lpage>
.
<pub-id pub-id-type="pmid">24243820</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Marbach1">
<label>15</label>
<mixed-citation publication-type="journal">
<name>
<surname>Marbach</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Costello</surname>
<given-names>JC</given-names>
</name>
,
<name>
<surname>Kuffner</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Vega</surname>
<given-names>NM</given-names>
</name>
,
<name>
<surname>Prill</surname>
<given-names>RJ</given-names>
</name>
,
<etal>et al</etal>
(
<year>2012</year>
)
<article-title>Wisdom of crowds for robust gene network inference</article-title>
.
<source>Nat Methods</source>
<volume>9</volume>
:
<fpage>796</fpage>
<lpage>804</lpage>
.
<pub-id pub-id-type="pmid">22796662</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-BarJoseph1">
<label>16</label>
<mixed-citation publication-type="journal">
<name>
<surname>Bar-Joseph</surname>
<given-names>Z</given-names>
</name>
,
<name>
<surname>Gitter</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Simon</surname>
<given-names>I</given-names>
</name>
(
<year>2012</year>
)
<article-title>Studying and modelling dynamic biological processes using time-series gene expression data</article-title>
.
<source>Nat Rev Genet</source>
<volume>13</volume>
:
<fpage>552</fpage>
<lpage>564</lpage>
.
<pub-id pub-id-type="pmid">22805708</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Friedman1">
<label>17</label>
<mixed-citation publication-type="journal">
<name>
<surname>Friedman</surname>
<given-names>N</given-names>
</name>
,
<name>
<surname>Linial</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Nachman</surname>
<given-names>I</given-names>
</name>
,
<name>
<surname>Pe'er</surname>
<given-names>D</given-names>
</name>
(
<year>2000</year>
)
<article-title>Using Bayesian networks to analyze expression data</article-title>
.
<source>J Comput Biol</source>
<volume>7</volume>
:
<fpage>601</fpage>
<lpage>620</lpage>
.
<pub-id pub-id-type="pmid">11108481</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Margolin1">
<label>18</label>
<mixed-citation publication-type="journal">
<name>
<surname>Margolin</surname>
<given-names>AA</given-names>
</name>
,
<name>
<surname>Nemenman</surname>
<given-names>I</given-names>
</name>
,
<name>
<surname>Basso</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Wiggins</surname>
<given-names>C</given-names>
</name>
,
<name>
<surname>Stolovitzky</surname>
<given-names>G</given-names>
</name>
,
<etal>et al</etal>
(
<year>2006</year>
)
<article-title>ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context</article-title>
.
<source>BMC Bioinformatics</source>
<volume>7</volume>
<issue>Suppl 1</issue>
:
<fpage>S7</fpage>
.
<pub-id pub-id-type="pmid">16723010</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Yeung1">
<label>19</label>
<mixed-citation publication-type="journal">
<name>
<surname>Yeung</surname>
<given-names>MK</given-names>
</name>
,
<name>
<surname>Tegner</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Collins</surname>
<given-names>JJ</given-names>
</name>
(
<year>2002</year>
)
<article-title>Reverse engineering gene networks using singular value decomposition and robust regression</article-title>
.
<source>Proc Natl Acad Sci U S A</source>
<volume>99</volume>
:
<fpage>6163</fpage>
<lpage>6168</lpage>
.
<pub-id pub-id-type="pmid">11983907</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Ahmed1">
<label>20</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ahmed</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Xing</surname>
<given-names>EP</given-names>
</name>
(
<year>2009</year>
)
<article-title>Recovering time-varying networks of dependencies in social and biological studies</article-title>
.
<source>Proc Natl Acad Sci U S A</source>
<volume>106</volume>
:
<fpage>11878</fpage>
<lpage>11883</lpage>
.
<pub-id pub-id-type="pmid">19570995</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Song1">
<label>21</label>
<mixed-citation publication-type="journal">
<name>
<surname>Song</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Kolar</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Xing</surname>
<given-names>EP</given-names>
</name>
(
<year>2009</year>
)
<article-title>KELLER: estimating time-varying interactions between genes</article-title>
.
<source>Bioinformatics</source>
<volume>25</volume>
:
<fpage>i128</fpage>
<lpage>136</lpage>
.
<pub-id pub-id-type="pmid">19477978</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Song2">
<label>22</label>
<mixed-citation publication-type="book">Song L, Kolar M, Xing EP (2009) Time-Varying Dynamic Bayesian Networks. In: Bengio Y, Schuurmans D, Lafferty J, Williams CKI, Culotta A, editors. Advances in Neural Information Processing Systems 22. pp. 1732–1740.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Grzegorczyk1">
<label>23</label>
<mixed-citation publication-type="journal">
<name>
<surname>Grzegorczyk</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Husmeier</surname>
<given-names>D</given-names>
</name>
(
<year>2011</year>
)
<article-title>Improvements in the reconstruction of time-varying gene regulatory networks: dynamic programming and regularization by information sharing among genes</article-title>
.
<source>Bioinformatics</source>
<volume>27</volume>
:
<fpage>693</fpage>
<lpage>699</lpage>
.
<pub-id pub-id-type="pmid">21177328</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Parikh1">
<label>24</label>
<mixed-citation publication-type="journal">
<name>
<surname>Parikh</surname>
<given-names>AP</given-names>
</name>
,
<name>
<surname>Wu</surname>
<given-names>W</given-names>
</name>
,
<name>
<surname>Curtis</surname>
<given-names>RE</given-names>
</name>
,
<name>
<surname>Xing</surname>
<given-names>EP</given-names>
</name>
(
<year>2011</year>
)
<article-title>TREEGL: reverse engineering tree-evolving gene networks underlying developing biological lineages</article-title>
.
<source>Bioinformatics</source>
<volume>27</volume>
:
<fpage>i196</fpage>
<lpage>204</lpage>
.
<pub-id pub-id-type="pmid">21685070</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Koller1">
<label>25</label>
<mixed-citation publication-type="book">Koller D, Friedman N (2009) Probabilistic graphical models: principles and techniques: MIT press.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Meinshausen1">
<label>26</label>
<mixed-citation publication-type="journal">
<name>
<surname>Meinshausen</surname>
<given-names>N</given-names>
</name>
,
<name>
<surname>Buhlmann</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Zurich</surname>
<given-names>E</given-names>
</name>
(
<year>2006</year>
)
<article-title>High-dimensional graphs and variable selection with the lasso</article-title>
.
<source>Annals of Statistics</source>
<volume>34</volume>
:
<fpage>1436</fpage>
<lpage>1462</lpage>
.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Ravikumar1">
<label>27</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ravikumar</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Wainwright</surname>
<given-names>MJ</given-names>
</name>
,
<name>
<surname>Lafferty</surname>
<given-names>J</given-names>
</name>
(
<year>2010</year>
)
<article-title>High-dimensional ising model selection using ℓ1-regularized logistic regression</article-title>
.
<source>Annals of Statistics</source>
<volume>38</volume>
:
<fpage>1287</fpage>
<lpage>1319</lpage>
.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Kolar1">
<label>28</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kolar</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Xing</surname>
<given-names>EP</given-names>
</name>
(
<year>2012</year>
)
<article-title>Estimating Networks With Jumps</article-title>
.
<source>Electronic Journal of Statistics</source>
<volume>6</volume>
:
<fpage>38</fpage>
.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Kolar2">
<label>29</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kolar</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Song</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Xing</surname>
<given-names>EP</given-names>
</name>
(
<year>2009</year>
)
<article-title>Sparsistent Learning of Varying-coefficient Models with Structural Changes</article-title>
.
<source>Advances in Neural Information Processing Systems</source>
<volume>21</volume>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Segal1">
<label>30</label>
<mixed-citation publication-type="journal">
<name>
<surname>Segal</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>Shapira</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Regev</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Pe'er</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Botstein</surname>
<given-names>D</given-names>
</name>
,
<etal>et al</etal>
(
<year>2003</year>
)
<article-title>Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data</article-title>
.
<source>Nat Genet</source>
<volume>34</volume>
:
<fpage>166</fpage>
<lpage>176</lpage>
.
<pub-id pub-id-type="pmid">12740579</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Davidson1">
<label>31</label>
<mixed-citation publication-type="book">Davidson EH (2001) Genomic regulatory systems. San Diego: Academic Press.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Baselga1">
<label>32</label>
<mixed-citation publication-type="journal">
<name>
<surname>Baselga</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Albanell</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Ruiz</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Lluch</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Gascon</surname>
<given-names>P</given-names>
</name>
,
<etal>et al</etal>
(
<year>2005</year>
)
<article-title>Phase II and tumor pharmacodynamic study of gefitinib in patients with advanced breast cancer</article-title>
.
<source>J Clin Oncol</source>
<volume>23</volume>
:
<fpage>5323</fpage>
<lpage>5333</lpage>
.
<pub-id pub-id-type="pmid">15939921</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Nahta1">
<label>33</label>
<mixed-citation publication-type="journal">
<name>
<surname>Nahta</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Yu</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Hung</surname>
<given-names>MC</given-names>
</name>
,
<name>
<surname>Hortobagyi</surname>
<given-names>GN</given-names>
</name>
,
<name>
<surname>Esteva</surname>
<given-names>FJ</given-names>
</name>
(
<year>2006</year>
)
<article-title>Mechanisms of disease: understanding resistance to HER2-targeted therapy in human breast cancer</article-title>
.
<source>Nat Clin Pract Oncol</source>
<volume>3</volume>
:
<fpage>269</fpage>
<lpage>280</lpage>
.
<pub-id pub-id-type="pmid">16683005</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-DiCosimo1">
<label>34</label>
<mixed-citation publication-type="journal">
<name>
<surname>Di Cosimo</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Baselga</surname>
<given-names>J</given-names>
</name>
(
<year>2010</year>
)
<article-title>Management of breast cancer with targeted agents: importance of heterogeneity. [corrected]</article-title>
.
<source>Nat Rev Clin Oncol</source>
<volume>7</volume>
:
<fpage>139</fpage>
<lpage>147</lpage>
.
<pub-id pub-id-type="pmid">20125090</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Huang1">
<label>35</label>
<mixed-citation publication-type="journal">
<name>
<surname>Huang</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Bjornsti</surname>
<given-names>MA</given-names>
</name>
,
<name>
<surname>Houghton</surname>
<given-names>PJ</given-names>
</name>
(
<year>2003</year>
)
<article-title>Rapamycins: mechanism of action and cellular resistance</article-title>
.
<source>Cancer Biol Ther</source>
<volume>2</volume>
:
<fpage>222</fpage>
<lpage>232</lpage>
.
<pub-id pub-id-type="pmid">12878853</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Nakayama1">
<label>36</label>
<mixed-citation publication-type="journal">
<name>
<surname>Nakayama</surname>
<given-names>KI</given-names>
</name>
,
<name>
<surname>Nakayama</surname>
<given-names>K</given-names>
</name>
(
<year>2006</year>
)
<article-title>Ubiquitin ligases: cell-cycle control and cancer</article-title>
.
<source>Nat Rev Cancer</source>
<volume>6</volume>
:
<fpage>369</fpage>
<lpage>381</lpage>
.
<pub-id pub-id-type="pmid">16633365</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Coussens1">
<label>37</label>
<mixed-citation publication-type="journal">
<name>
<surname>Coussens</surname>
<given-names>LM</given-names>
</name>
,
<name>
<surname>Fingleton</surname>
<given-names>B</given-names>
</name>
,
<name>
<surname>Matrisian</surname>
<given-names>LM</given-names>
</name>
(
<year>2002</year>
)
<article-title>Matrix metalloproteinase inhibitors and cancer: trials and tribulations</article-title>
.
<source>Science</source>
<volume>295</volume>
:
<fpage>2387</fpage>
<lpage>2392</lpage>
.
<pub-id pub-id-type="pmid">11923519</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Kessenbrock1">
<label>38</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kessenbrock</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Plaks</surname>
<given-names>V</given-names>
</name>
,
<name>
<surname>Werb</surname>
<given-names>Z</given-names>
</name>
(
<year>2010</year>
)
<article-title>Matrix metalloproteinases: regulators of the tumor microenvironment</article-title>
.
<source>Cell</source>
<volume>141</volume>
:
<fpage>52</fpage>
<lpage>67</lpage>
.
<pub-id pub-id-type="pmid">20371345</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-vandeVijver1">
<label>39</label>
<mixed-citation publication-type="journal">
<name>
<surname>van de Vijver</surname>
<given-names>MJ</given-names>
</name>
,
<name>
<surname>He</surname>
<given-names>YD</given-names>
</name>
,
<name>
<surname>van't Veer</surname>
<given-names>LJ</given-names>
</name>
,
<name>
<surname>Dai</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Hart</surname>
<given-names>AA</given-names>
</name>
,
<etal>et al</etal>
(
<year>2002</year>
)
<article-title>A gene-expression signature as a predictor of survival in breast cancer</article-title>
.
<source>N Engl J Med</source>
<volume>347</volume>
:
<fpage>1999</fpage>
<lpage>2009</lpage>
.
<pub-id pub-id-type="pmid">12490681</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Han1">
<label>40</label>
<mixed-citation publication-type="journal">
<name>
<surname>Han</surname>
<given-names>JD</given-names>
</name>
,
<name>
<surname>Bertin</surname>
<given-names>N</given-names>
</name>
,
<name>
<surname>Hao</surname>
<given-names>T</given-names>
</name>
,
<name>
<surname>Goldberg</surname>
<given-names>DS</given-names>
</name>
,
<name>
<surname>Berriz</surname>
<given-names>GF</given-names>
</name>
,
<etal>et al</etal>
(
<year>2004</year>
)
<article-title>Evidence for dynamically organized modularity in the yeast protein-protein interaction network</article-title>
.
<source>Nature</source>
<volume>430</volume>
:
<fpage>88</fpage>
<lpage>93</lpage>
.
<pub-id pub-id-type="pmid">15190252</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Deng1">
<label>41</label>
<mixed-citation publication-type="journal">
<name>
<surname>Deng</surname>
<given-names>XA</given-names>
</name>
,
<name>
<surname>Norris</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Panaviene</surname>
<given-names>Z</given-names>
</name>
,
<name>
<surname>Moncman</surname>
<given-names>CL</given-names>
</name>
(
<year>2008</year>
)
<article-title>Ectopic expression of LIM-nebulette (LASP2) reveals roles in cell migration and spreading</article-title>
.
<source>Cell Motil Cytoskeleton</source>
<volume>65</volume>
:
<fpage>827</fpage>
<lpage>840</lpage>
.
<pub-id pub-id-type="pmid">18677772</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Citri1">
<label>42</label>
<mixed-citation publication-type="journal">
<name>
<surname>Citri</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Yarden</surname>
<given-names>Y</given-names>
</name>
(
<year>2006</year>
)
<article-title>EGF-ERBB signalling: towards the systems level</article-title>
.
<source>Nat Rev Mol Cell Biol</source>
<volume>7</volume>
:
<fpage>505</fpage>
<lpage>516</lpage>
.
<pub-id pub-id-type="pmid">16829981</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Revillion1">
<label>43</label>
<mixed-citation publication-type="journal">
<name>
<surname>Revillion</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Lhotellier</surname>
<given-names>V</given-names>
</name>
,
<name>
<surname>Hornez</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Bonneterre</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Peyrat</surname>
<given-names>JP</given-names>
</name>
(
<year>2008</year>
)
<article-title>ErbB/HER ligands in human breast cancer, and relationships with their receptors, the bio-pathological features and prognosis</article-title>
.
<source>Ann Oncol</source>
<volume>19</volume>
:
<fpage>73</fpage>
<lpage>80</lpage>
.
<pub-id pub-id-type="pmid">17962208</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Miyamoto1">
<label>44</label>
<mixed-citation publication-type="journal">
<name>
<surname>Miyamoto</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Yagi</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Yotsumoto</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Horiuchi</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Yoshizato</surname>
<given-names>T</given-names>
</name>
,
<etal>et al</etal>
(
<year>2007</year>
)
<article-title>New approach to cancer therapy: heparin binding-epidermal growth factor-like growth factor as a novel targeting molecule</article-title>
.
<source>Anticancer Res</source>
<volume>27</volume>
:
<fpage>3713</fpage>
<lpage>3721</lpage>
.
<pub-id pub-id-type="pmid">17970033</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Vanacova1">
<label>45</label>
<mixed-citation publication-type="journal">
<name>
<surname>Vanacova</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Stefl</surname>
<given-names>R</given-names>
</name>
(
<year>2007</year>
)
<article-title>The exosome and RNA quality control in the nucleus</article-title>
.
<source>EMBO Rep</source>
<volume>8</volume>
:
<fpage>651</fpage>
<lpage>657</lpage>
.
<pub-id pub-id-type="pmid">17603538</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Jeong1">
<label>46</label>
<mixed-citation publication-type="journal">
<name>
<surname>Jeong</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Mason</surname>
<given-names>SP</given-names>
</name>
,
<name>
<surname>Barabasi</surname>
<given-names>AL</given-names>
</name>
,
<name>
<surname>Oltvai</surname>
<given-names>ZN</given-names>
</name>
(
<year>2001</year>
)
<article-title>Lethality and centrality in protein networks</article-title>
.
<source>Nature</source>
<volume>411</volume>
:
<fpage>41</fpage>
<lpage>42</lpage>
.
<pub-id pub-id-type="pmid">11333967</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Yu1">
<label>47</label>
<mixed-citation publication-type="journal">
<name>
<surname>Yu</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Greenbaum</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Xin Lu</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Zhu</surname>
<given-names>X</given-names>
</name>
,
<name>
<surname>Gerstein</surname>
<given-names>M</given-names>
</name>
(
<year>2004</year>
)
<article-title>Genomic analysis of essentiality within protein networks</article-title>
.
<source>Trends Genet</source>
<volume>20</volume>
:
<fpage>227</fpage>
<lpage>231</lpage>
.
<pub-id pub-id-type="pmid">15145574</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Petersen1">
<label>48</label>
<mixed-citation publication-type="journal">
<name>
<surname>Petersen</surname>
<given-names>OW</given-names>
</name>
,
<name>
<surname>Ronnov-Jessen</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Howlett</surname>
<given-names>AR</given-names>
</name>
,
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
(
<year>1992</year>
)
<article-title>Interaction with basement membrane serves to rapidly distinguish growth and differentiation pattern of normal and malignant human breast epithelial cells</article-title>
.
<source>Proc Natl Acad Sci U S A</source>
<volume>89</volume>
:
<fpage>9064</fpage>
<lpage>9068</lpage>
.
<pub-id pub-id-type="pmid">1384042</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Beliveau1">
<label>49</label>
<mixed-citation publication-type="journal">
<name>
<surname>Beliveau</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Mott</surname>
<given-names>JD</given-names>
</name>
,
<name>
<surname>Lo</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Chen</surname>
<given-names>EI</given-names>
</name>
,
<name>
<surname>Koller</surname>
<given-names>AA</given-names>
</name>
,
<etal>et al</etal>
(
<year>2010</year>
)
<article-title>Raf-induced MMP9 disrupts tissue architecture of human breast cells in three-dimensional culture and is necessary for tumor growth in vivo</article-title>
.
<source>Genes Dev</source>
<volume>24</volume>
:
<fpage>2800</fpage>
<lpage>2811</lpage>
.
<pub-id pub-id-type="pmid">21159820</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Lee1">
<label>50</label>
<mixed-citation publication-type="journal">
<name>
<surname>Lee</surname>
<given-names>GY</given-names>
</name>
,
<name>
<surname>Kenny</surname>
<given-names>PA</given-names>
</name>
,
<name>
<surname>Lee</surname>
<given-names>EH</given-names>
</name>
,
<name>
<surname>Bissell</surname>
<given-names>MJ</given-names>
</name>
(
<year>2007</year>
)
<article-title>Three-dimensional culture models of normal and malignant breast epithelial cells</article-title>
.
<source>Nat Methods</source>
<volume>4</volume>
:
<fpage>359</fpage>
<lpage>365</lpage>
.
<pub-id pub-id-type="pmid">17396127</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Fournier1">
<label>51</label>
<mixed-citation publication-type="journal">
<name>
<surname>Fournier</surname>
<given-names>MV</given-names>
</name>
,
<name>
<surname>Martin</surname>
<given-names>KJ</given-names>
</name>
,
<name>
<surname>Kenny</surname>
<given-names>PA</given-names>
</name>
,
<name>
<surname>Xhaja</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Bosch</surname>
<given-names>I</given-names>
</name>
,
<etal>et al</etal>
(
<year>2006</year>
)
<article-title>Gene expression signature in organized and growth-arrested mammary acini predicts good outcome in breast cancer</article-title>
.
<source>Cancer Res</source>
<volume>66</volume>
:
<fpage>7095</fpage>
<lpage>7102</lpage>
.
<pub-id pub-id-type="pmid">16849555</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Bolstad1">
<label>52</label>
<mixed-citation publication-type="journal">
<name>
<surname>Bolstad</surname>
<given-names>BM</given-names>
</name>
,
<name>
<surname>Irizarry</surname>
<given-names>RA</given-names>
</name>
,
<name>
<surname>Astrand</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Speed</surname>
<given-names>TP</given-names>
</name>
(
<year>2003</year>
)
<article-title>A comparison of normalization methods for high density oligonucleotide array data based on variance and bias</article-title>
.
<source>Bioinformatics</source>
<volume>19</volume>
:
<fpage>185</fpage>
<lpage>193</lpage>
.
<pub-id pub-id-type="pmid">12538238</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Irizarry1">
<label>53</label>
<mixed-citation publication-type="journal">
<name>
<surname>Irizarry</surname>
<given-names>RA</given-names>
</name>
,
<name>
<surname>Hobbs</surname>
<given-names>B</given-names>
</name>
,
<name>
<surname>Collin</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Beazer-Barclay</surname>
<given-names>YD</given-names>
</name>
,
<name>
<surname>Antonellis</surname>
<given-names>KJ</given-names>
</name>
,
<etal>et al</etal>
(
<year>2003</year>
)
<article-title>Exploration, normalization, and summaries of high density oligonucleotide array probe level data</article-title>
.
<source>Biostatistics</source>
<volume>4</volume>
:
<fpage>249</fpage>
<lpage>264</lpage>
.
<pub-id pub-id-type="pmid">12925520</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Holder1">
<label>54</label>
<mixed-citation publication-type="book">Holder D, Raubertas RF, Pikounis VB, Svetnik V, Soper K (2001) Statistical analysis of high density oligonucleotide arrays: a SAFER approach. In: Proceedings of the ASA Annual Meeting; Atlanta, Georgia.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Zhou1">
<label>55</label>
<mixed-citation publication-type="journal">
<name>
<surname>Zhou</surname>
<given-names>X</given-names>
</name>
,
<name>
<surname>Wu</surname>
<given-names>W</given-names>
</name>
,
<name>
<surname>Hu</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Milosevic</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Konishi</surname>
<given-names>K</given-names>
</name>
,
<etal>et al</etal>
(
<year>2011</year>
)
<article-title>Genomic differences distinguish the myofibroblast phenotype of distal lung fibroblasts from airway fibroblasts</article-title>
.
<source>Am J Respir Cell Mol Biol</source>
<volume>45</volume>
:
<fpage>1256</fpage>
<lpage>1262</lpage>
.
<pub-id pub-id-type="pmid">21757679</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Lauritzen1">
<label>56</label>
<mixed-citation publication-type="book">Lauritzen SL (1996) Graphical models: Oxford University Press.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Tibshirani1">
<label>57</label>
<mixed-citation publication-type="journal">
<name>
<surname>Tibshirani</surname>
<given-names>R</given-names>
</name>
(
<year>1996</year>
)
<article-title>Regression shrinkage and selection via the lasso</article-title>
.
<source>Journal of the Royal Statistical Society Series B (Methodological)</source>
<volume>58</volume>
:
<fpage>267</fpage>
<lpage>288</lpage>
.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Grant1">
<label>58</label>
<mixed-citation publication-type="other">Grant M, Boyd S, Y. Y (2008) CVX: Matlab software for disciplined convex programming.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Kolar3">
<label>59</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kolar</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Xing</surname>
<given-names>EP</given-names>
</name>
(
<year>2012</year>
)
<article-title>Estimating networks with jumps</article-title>
.
<source>Electronic Journal of Statistics</source>
<volume>6</volume>
:
<fpage>2069</fpage>
<lpage>2106</lpage>
.
<pub-id pub-id-type="pmid">25013533</pub-id>
</mixed-citation>
</ref>
<ref id="pcbi.1003713-Liu2">
<label>60</label>
<mixed-citation publication-type="journal">
<name>
<surname>Liu</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Roeder</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Wasserman</surname>
<given-names>L</given-names>
</name>
(
<year>2010</year>
)
<article-title>Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models</article-title>
.
<source>Advances in Neural Information Processing Systems</source>
<volume>23</volume>
.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Benjamini1">
<label>61</label>
<mixed-citation publication-type="journal">
<name>
<surname>Benjamini</surname>
<given-names>Y</given-names>
</name>
,
<name>
<surname>Hochberg</surname>
<given-names>Y</given-names>
</name>
(
<year>1995</year>
)
<article-title>Controlling the false discovery rate: A practical and powerful approach to multiple testing</article-title>
.
<source>J Roy Statist Soc Ser B</source>
<volume>57</volume>
:
<fpage>289</fpage>
<lpage>300</lpage>
.</mixed-citation>
</ref>
<ref id="pcbi.1003713-Beissbarth1">
<label>62</label>
<mixed-citation publication-type="journal">
<name>
<surname>Beissbarth</surname>
<given-names>T</given-names>
</name>
,
<name>
<surname>Speed</surname>
<given-names>TP</given-names>
</name>
(
<year>2004</year>
)
<article-title>GOstat: find statistically overrepresented Gene Ontologies within a group of genes</article-title>
.
<source>Bioinformatics</source>
<volume>20</volume>
:
<fpage>1464</fpage>
<lpage>1465</lpage>
.
<pub-id pub-id-type="pmid">14962934</pub-id>
</mixed-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Amérique/explor/PittsburghV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000260 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000260 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Amérique
   |area=    PittsburghV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:4109850
   |texte=   Network Analysis of Breast Cancer Progression and Reversal Using a Tree-Evolving Network Algorithm
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:25057922" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a PittsburghV1 

Wicri

This area was generated with Dilib version V0.6.38.
Data generation: Fri Jun 18 17:37:45 2021. Site generation: Fri Jun 18 18:15:47 2021