SgmlV1, Ncbi, Merge, bibRecord, 000052

A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes

Identifieur interne : 000052 ( Ncbi/Merge ); précédent : 000051; suivant : 000053

A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes

Auteurs : Stéphanie Le Bouder-Langevin ; Isabelle Capron-Montaland ; Renaud De Rosa ; Bernard Labedan

Source :

Genome Research [ 1088-9051 ] ; 2002.

RBID : PMC:187581

Abstract

Protein homology is often limited to long structural segments that we have previously called modules. We describe here a suite of programs used to catalog the whole set of modules present in microbial proteomes. First, the Darwin AllAll program detects homologous segments using thresholds for evolutionary distance and alignment length, and another program classifies these modules. After assembling these homologous modules in families, we further group families which are related by a chain of neighboring unrelated homologous modules. With the automatic analysis of these groups of families sharing homologous modules in independent multimodular proteins, one can split into their component parts many fused modules and/or deduce by logic more distant modules. All detected and inferred modules are reassembled in refined families. These two last steps are made by a unique program. Eventually, the soundness of the data obtained by this experimental approach is checked using independent tests. To illustrate this modular approach, we compared four proteobacterial proteomes (Campylobacter jejuni, Escherichia coli, Haemophilus influenzae, and Helicobacter pylori). It appears that this method might retrieve from present-day proteins many of the modules which can help to trace back ancient events of gene duplication and/or fusion.

Url:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC187581

DOI: 10.1101/gr.393902
PubMed: 12466301
PubMed Central: 187581

Links toward previous steps (curation, corpus...)

to stream Pmc, to step Corpus: 000011
to stream Pmc, to step Curation: 000011
to stream Pmc, to step Checkpoint: 000073

Links to Exploration step

PMC:187581

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes</title>
<author><name sortKey="Le Bouder Langevin, Stephanie" sort="Le Bouder Langevin, Stephanie" uniqKey="Le Bouder Langevin S" first="Stéphanie" last="Le Bouder-Langevin">Stéphanie Le Bouder-Langevin</name>
</author>
<author><name sortKey="Capron Montaland, Isabelle" sort="Capron Montaland, Isabelle" uniqKey="Capron Montaland I" first="Isabelle" last="Capron-Montaland">Isabelle Capron-Montaland</name>
</author>
<author><name sortKey="De Rosa, Renaud" sort="De Rosa, Renaud" uniqKey="De Rosa R" first="Renaud" last="De Rosa">Renaud De Rosa</name>
</author>
<author><name sortKey="Labedan, Bernard" sort="Labedan, Bernard" uniqKey="Labedan B" first="Bernard" last="Labedan">Bernard Labedan</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">12466301</idno>
<idno type="pmc">187581</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC187581</idno>
<idno type="RBID">PMC:187581</idno>
<idno type="doi">10.1101/gr.393902</idno>
<date when="2002">2002</date>
<idno type="wicri:Area/Pmc/Corpus">000011</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000011</idno>
<idno type="wicri:Area/Pmc/Curation">000011</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000011</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000073</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Checkpoint">000073</idno>
<idno type="wicri:Area/Ncbi/Merge">000052</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes</title>
<author><name sortKey="Le Bouder Langevin, Stephanie" sort="Le Bouder Langevin, Stephanie" uniqKey="Le Bouder Langevin S" first="Stéphanie" last="Le Bouder-Langevin">Stéphanie Le Bouder-Langevin</name>
</author>
<author><name sortKey="Capron Montaland, Isabelle" sort="Capron Montaland, Isabelle" uniqKey="Capron Montaland I" first="Isabelle" last="Capron-Montaland">Isabelle Capron-Montaland</name>
</author>
<author><name sortKey="De Rosa, Renaud" sort="De Rosa, Renaud" uniqKey="De Rosa R" first="Renaud" last="De Rosa">Renaud De Rosa</name>
</author>
<author><name sortKey="Labedan, Bernard" sort="Labedan, Bernard" uniqKey="Labedan B" first="Bernard" last="Labedan">Bernard Labedan</name>
</author>
</analytic>
<series><title level="j">Genome Research</title>
<idno type="ISSN">1088-9051</idno>
<imprint><date when="2002">2002</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p>Protein homology is often limited to long structural segments that we have previously called modules. We describe here a suite of programs used to catalog the whole set of modules present in microbial proteomes. First, the Darwin AllAll program detects homologous segments using thresholds for evolutionary distance and alignment length, and another program classifies these modules. After assembling these homologous modules in families, we further group families which are related by a chain of neighboring unrelated homologous modules. With the automatic analysis of these groups of families sharing homologous modules in independent multimodular proteins, one can split into their component parts many fused modules and/or deduce by logic more distant modules. All detected and inferred modules are reassembled in refined families. These two last steps are made by a unique program. Eventually, the soundness of the data obtained by this experimental approach is checked using independent tests. To illustrate this modular approach, we compared four proteobacterial proteomes (<italic>Campylobacter jejuni, Escherichia coli</italic>
, <italic>Haemophilus influenzae</italic>
, and <italic>Helicobacter pylori</italic>
). It appears that this method might retrieve from present-day proteins many of the modules which can help to trace back ancient events of gene duplication and/or fusion.</p>
</div>
</front>
</TEI>
<pmc article-type="research-article"><pmc-comment>The publisher of this article does not allow downloading of the full text in XML form.</pmc-comment>
  <front><journal-meta><journal-id journal-id-type="nlm-ta">Genome Res</journal-id>
<journal-id journal-id-type="publisher-id">GENOME RES</journal-id>
<journal-title>Genome Research</journal-title>
<issn pub-type="ppub">1088-9051</issn>
<publisher><publisher-name>Cold Spring Harbor Laboratory Press</publisher-name>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">12466301</article-id>
<article-id pub-id-type="pmc">187581</article-id>
<article-id pub-id-type="publisher-id">GR-3939R</article-id>
<article-id pub-id-type="doi">10.1101/gr.393902</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Methods</subject>
</subj-group>
</article-categories>
<title-group><article-title>A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes</article-title>
</title-group>
<contrib-group><contrib contrib-type="author"><name><surname>Le Bouder-Langevin</surname>
<given-names>Stéphanie</given-names>
</name>
<xref ref-type="author-notes" rid="FN1">1</xref>
</contrib>
<contrib contrib-type="author"><name><surname>Capron-Montaland</surname>
<given-names>Isabelle</given-names>
</name>
<xref ref-type="author-notes" rid="FN2">2</xref>
</contrib>
<contrib contrib-type="author"><name><surname>De Rosa</surname>
<given-names>Renaud</given-names>
</name>
<xref ref-type="author-notes" rid="FN3">3</xref>
</contrib>
<contrib contrib-type="author"><name><surname>Labedan</surname>
<given-names>Bernard</given-names>
</name>
</contrib>
</contrib-group>
<aff id="N0x8c00d38.0x9db4308">Évolution Moléculaire et Génomique, Institut de Génétique et Microbiologie, Université Paris-Sud, 91405 Orsay Cedex, France</aff>
<author-notes><fn id="FN1"><label>1</label>
<p>Present address: ValiGen, Tour Neptune, 92086 Paris-La-Défense, France</p>
</fn>
<fn id="FN2"><label>2</label>
<p>Present address: UMR 144 CNRS/Institut Curie, Bâtiment Lhomond, 26, rue d'Ulm, 75248 Paris Cedex 05, France</p>
</fn>
<fn id="FN3"><label>3</label>
<p>Present address: Centre de Génétique Moléculaire, CNRS - Bâtiment 26, 91110 Gif-sur-Yvette, France</p>
</fn>
</author-notes>
<pub-date pub-type="ppub"><month>12</month>
<year>2002</year>
</pub-date>
<volume>12</volume>
<issue>12</issue>
<fpage>1961</fpage>
<lpage>1973</lpage>
<history><date date-type="received"><day>5</day>
<month>5</month>
<year>2002</year>
</date>
<date date-type="accepted"><day>30</day>
<month>9</month>
<year>2002</year>
</date>
</history>
<copyright-statement>Copyright © 2002, Cold Spring Harbor Laboratory Press</copyright-statement>
<copyright-year>2002</copyright-year>
<abstract><p>Protein homology is often limited to long structural segments that we have previously called modules. We describe here a suite of programs used to catalog the whole set of modules present in microbial proteomes. First, the Darwin AllAll program detects homologous segments using thresholds for evolutionary distance and alignment length, and another program classifies these modules. After assembling these homologous modules in families, we further group families which are related by a chain of neighboring unrelated homologous modules. With the automatic analysis of these groups of families sharing homologous modules in independent multimodular proteins, one can split into their component parts many fused modules and/or deduce by logic more distant modules. All detected and inferred modules are reassembled in refined families. These two last steps are made by a unique program. Eventually, the soundness of the data obtained by this experimental approach is checked using independent tests. To illustrate this modular approach, we compared four proteobacterial proteomes (<italic>Campylobacter jejuni, Escherichia coli</italic>
, <italic>Haemophilus influenzae</italic>
, and <italic>Helicobacter pylori</italic>
). It appears that this method might retrieve from present-day proteins many of the modules which can help to trace back ancient events of gene duplication and/or fusion.</p>
</abstract>
</article-meta>
</front>
</pmc>
<affiliations><list></list>
<tree><noCountry><name sortKey="Capron Montaland, Isabelle" sort="Capron Montaland, Isabelle" uniqKey="Capron Montaland I" first="Isabelle" last="Capron-Montaland">Isabelle Capron-Montaland</name>
<name sortKey="De Rosa, Renaud" sort="De Rosa, Renaud" uniqKey="De Rosa R" first="Renaud" last="De Rosa">Renaud De Rosa</name>
<name sortKey="Labedan, Bernard" sort="Labedan, Bernard" uniqKey="Labedan B" first="Bernard" last="Labedan">Bernard Labedan</name>
<name sortKey="Le Bouder Langevin, Stephanie" sort="Le Bouder Langevin, Stephanie" uniqKey="Le Bouder Langevin S" first="Stéphanie" last="Le Bouder-Langevin">Stéphanie Le Bouder-Langevin</name>
</noCountry>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Informatique/explor/SgmlV1/Data/Ncbi/Merge

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000052 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd -nk 000052 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Informatique
   |area=    SgmlV1
   |flux=    Ncbi
   |étape=   Merge
   |type=    RBID
   |clé=     PMC:187581
   |texte=   A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/RBID.i   -Sk "pubmed:12466301" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd   \
       | NlmPubMed2Wicri -a SgmlV1

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jul 1 14:26:08 2019. Site generation: Wed Apr 28 21:40:44 2021

	Serveur d'exploration sur SGML
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur SGML

A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes

A Strategy to Retrieve the Whole Set of Protein Modules in Microbial Proteomes

Source :

Abstract

Links toward previous steps (curation, corpus...)

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri

Pour générer des pages wiki