Predicting proteolytic sites in extracellular proteins: only halfway there
Identifieur interne : 000848 ( Istex/Curation ); précédent : 000847; suivant : 000849Predicting proteolytic sites in extracellular proteins: only halfway there
Auteurs : Yossef Kliger [Israël] ; Eyal Gofer [Israël] ; Assaf Wool [Israël] ; Amir Toporik [Israël] ; Avihay Apatoff [Israël] ; Moshe Olshansky [Israël]Source :
- Bioinformatics [ 1367-4803 ] ; 2008.
Abstract
Motivation: Many secretory proteins are synthesized as inactive precursors that must undergo post-translational proteolysis in order to mature and become active. In the current study, we address the challenge of sequence-based discovery of proteolytic sites in secreted proteins using machine learning. Results: The results revealed that only half of the extracellular proteolytic sites are currently annotated, leaving over 3600 unannotated ones. Furthermore, we have found that only 6% of the unannotated sites are similar to known proteolytic sites, whereas the remaining 94% do not share significant similarity with any annotated proteolytic site. The computational challenges in these two cases are very different. While the precision in detecting the former group is close to perfect, only a mere 22% of the latter group were detected with a precision of 80%. The applicability of the classifier is demonstrated through members of the FGF family, in which we verified the conservation of physiologically-relevant proteolytic sites in homologous proteins. Contact: kliger@compugen.co.il; yossef.kliger@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.
Url:
DOI: 10.1093/bioinformatics/btn084
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000848
Links to Exploration step
ISTEX:B0C4B4C1EC355D0EB86F236BEA4E417647565B77Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Predicting proteolytic sites in extracellular proteins: only halfway there</title>
<author><name sortKey="Kliger, Yossef" sort="Kliger, Yossef" uniqKey="Kliger Y" first="Yossef" last="Kliger">Yossef Kliger</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
<affiliation><mods:affiliation>*To whom correspondence should be addressed.</mods:affiliation>
<wicri:noCountry code="no comma">*To whom correspondence should be addressed.</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Gofer, Eyal" sort="Gofer, Eyal" uniqKey="Gofer E" first="Eyal" last="Gofer">Eyal Gofer</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Wool, Assaf" sort="Wool, Assaf" uniqKey="Wool A" first="Assaf" last="Wool">Assaf Wool</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Toporik, Amir" sort="Toporik, Amir" uniqKey="Toporik A" first="Amir" last="Toporik">Amir Toporik</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Apatoff, Avihay" sort="Apatoff, Avihay" uniqKey="Apatoff A" first="Avihay" last="Apatoff">Avihay Apatoff</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Olshansky, Moshe" sort="Olshansky, Moshe" uniqKey="Olshansky M" first="Moshe" last="Olshansky">Moshe Olshansky</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:B0C4B4C1EC355D0EB86F236BEA4E417647565B77</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1093/bioinformatics/btn084</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HXZ-RQ5WHZPQ-4/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000848</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000848</idno>
<idno type="wicri:Area/Istex/Curation">000848</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main">Predicting proteolytic sites in extracellular proteins: only halfway there</title>
<author><name sortKey="Kliger, Yossef" sort="Kliger, Yossef" uniqKey="Kliger Y" first="Yossef" last="Kliger">Yossef Kliger</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
<affiliation><mods:affiliation>*To whom correspondence should be addressed.</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Gofer, Eyal" sort="Gofer, Eyal" uniqKey="Gofer E" first="Eyal" last="Gofer">Eyal Gofer</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Wool, Assaf" sort="Wool, Assaf" uniqKey="Wool A" first="Assaf" last="Wool">Assaf Wool</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Toporik, Amir" sort="Toporik, Amir" uniqKey="Toporik A" first="Amir" last="Toporik">Amir Toporik</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Apatoff, Avihay" sort="Apatoff, Avihay" uniqKey="Apatoff A" first="Avihay" last="Apatoff">Avihay Apatoff</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Olshansky, Moshe" sort="Olshansky, Moshe" uniqKey="Olshansky M" first="Moshe" last="Olshansky">Moshe Olshansky</name>
<affiliation wicri:level="1"><mods:affiliation>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel</mods:affiliation>
<country xml:lang="fr">Israël</country>
<wicri:regionArea>Compugen Ltd, 72 Pinchas Rosen, Tel Aviv 69512 and The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan</wicri:regionArea>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j" type="main">Bioinformatics</title>
<idno type="ISSN">1367-4803</idno>
<idno type="eISSN">1460-2059</idno>
<imprint><publisher>Oxford University Press</publisher>
<date type="published">2008</date>
<date type="e-published">2008</date>
<biblScope unit="vol">24</biblScope>
<biblScope unit="issue">8</biblScope>
<biblScope unit="page" from="1049">1049</biblScope>
<biblScope unit="page" to="1055">1055</biblScope>
</imprint>
<idno type="ISSN">1367-4803</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">1367-4803</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract">Motivation: Many secretory proteins are synthesized as inactive precursors that must undergo post-translational proteolysis in order to mature and become active. In the current study, we address the challenge of sequence-based discovery of proteolytic sites in secreted proteins using machine learning. Results: The results revealed that only half of the extracellular proteolytic sites are currently annotated, leaving over 3600 unannotated ones. Furthermore, we have found that only 6% of the unannotated sites are similar to known proteolytic sites, whereas the remaining 94% do not share significant similarity with any annotated proteolytic site. The computational challenges in these two cases are very different. While the precision in detecting the former group is close to perfect, only a mere 22% of the latter group were detected with a precision of 80%. The applicability of the classifier is demonstrated through members of the FGF family, in which we verified the conservation of physiologically-relevant proteolytic sites in homologous proteins. Contact: kliger@compugen.co.il; yossef.kliger@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.</div>
</front>
</TEI>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Istex/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000848 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Istex/Curation/biblio.hfd -nk 000848 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= MersV1 |flux= Istex |étape= Curation |type= RBID |clé= ISTEX:B0C4B4C1EC355D0EB86F236BEA4E417647565B77 |texte= Predicting proteolytic sites in extracellular proteins: only halfway there }}
![]() | This area was generated with Dilib version V0.6.33. | ![]() |