Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution.
Internal identifier: 002601 (Main/Curation); previous: 002600; next: 002602
Authors: Se-Ran Jun [United States]; Gregory E. Sims; Guohong A. Wu; Sung-Hou Kim
Source:
- Proceedings of the National Academy of Sciences of the United States of America [ 1091-6490 ] ; 2010.
French descriptors
- KwdFr :
- MESH :
- génétique : Protéome.
- physiologie : Cellules procaryotes.
- Alignement de séquences, Analyse de séquence de protéine, Cellules procaryotes, Génome, Phylogénie, Protéomique.
English descriptors
- KwdEn :
- MESH :
- chemical , genetics : Proteome.
- classification : Prokaryotic Cells.
- methods : Proteomics, Sequence Alignment, Sequence Analysis, Protein.
- physiology : Prokaryotic Cells.
- Genome, Phylogeny.
Abstract
We present a whole-proteome phylogeny of prokaryotes constructed by comparing feature frequency profiles (FFPs) of whole proteomes. Features are l-mers of amino acids, and each organism is represented by a profile of frequencies of all features. The selection of feature length is critical in the FFP method, and we have developed a procedure for identifying the optimal feature lengths for inferring the phylogeny of prokaryotes, strictly speaking, a proteome phylogeny. Our FFP trees are constructed with whole proteomes of 884 prokaryotes, 16 unicellular eukaryotes, and 2 random sequences. To highlight the branching order of major groups, we present a simplified proteome FFP tree of monophyletic class or phylum with branch support. In our whole-proteome FFP trees (i) Archaea, Bacteria, Eukaryota, and a random sequence outgroup are clearly separated; (ii) Archaea and Bacteria form a sister group when rooted with random sequences; (iii) Planctomycetes, which possesses an intracellular membrane compartment, is placed at the basal position of the Bacteria domain; (iv) almost all groups are monophyletic in prokaryotes at most taxonomic levels, but many differences in the branching order of major groups are observed between our proteome FFP tree and trees built with other methods; and (v) previously "unclassified" genomes may be assigned to the most likely taxa. We describe notable similarities and differences between our FFP trees and those based on other methods in grouping and phylogeny of prokaryotes.
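The profile construction described in the abstract can be illustrated with a minimal sketch. It counts every overlapping l-mer across the proteins of a proteome and normalizes the counts to frequencies. The function names `ffp` and `js_divergence` and the toy sequences are hypothetical. The abstract does not say how profiles are compared, so the Jensen-Shannon divergence used here is an assumption made for illustration, and the procedure for choosing the optimal feature length is not reproduced.

```python
from collections import Counter
from math import log2


def ffp(proteome, l):
    """Feature frequency profile: relative frequency of every l-mer,
    counted with a sliding window over each protein sequence."""
    counts = Counter()
    for protein in proteome:
        for i in range(len(protein) - l + 1):
            counts[protein[i:i + l]] += 1
    total = sum(counts.values())
    return {feature: n / total for feature, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two profiles.
    ASSUMPTION: the abstract does not name the distance measure;
    this choice is illustrative only."""
    def kl_to_mean(a, b):
        # KL divergence of profile a from the mean profile (a + b) / 2
        return sum(pa * log2(pa / (0.5 * (pa + b.get(f, 0.0))))
                   for f, pa in a.items())
    return 0.5 * kl_to_mean(p, q) + 0.5 * kl_to_mean(q, p)


# Toy "proteomes" (hypothetical sequences, not real data)
organism_a = ["MKVLAAGIV", "MKVLSAGIV"]
organism_b = ["MKTLAAGLV", "MRVLAAGIV"]

profile_a = ffp(organism_a, 3)
profile_b = ffp(organism_b, 3)
print(js_divergence(profile_a, profile_b))
```

A pairwise matrix of such divergences over all organisms could then be passed to a distance-based tree-building method. Which method the authors used is not stated in this record.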
DOI: 10.1073/pnas.0913033107
PubMed: 20018669
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: 001F80
- to stream PubMed, to step Curation: 001F80
- to stream PubMed, to step Checkpoint: 001E12
- to stream Ncbi, to step Merge: 000726
- to stream Ncbi, to step Curation: 000726
- to stream Ncbi, to step Checkpoint: 000726
- to stream Main, to step Merge: 002626
Links to Exploration step
pubmed:20018669
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution.</title>
<author><name sortKey="Jun, Se Ran" sort="Jun, Se Ran" uniqKey="Jun S" first="Se-Ran" last="Jun">Se-Ran Jun</name>
<affiliation wicri:level="2"><nlm:affiliation>Department of Chemistry, University of California, Berkeley, CA 94720, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Chemistry, University of California, Berkeley, CA 94720</wicri:regionArea>
<placeName><region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Sims, Gregory E" sort="Sims, Gregory E" uniqKey="Sims G" first="Gregory E" last="Sims">Gregory E. Sims</name>
</author>
<author><name sortKey="Wu, Guohong A" sort="Wu, Guohong A" uniqKey="Wu G" first="Guohong A" last="Wu">Guohong A. Wu</name>
</author>
<author><name sortKey="Kim, Sung Hou" sort="Kim, Sung Hou" uniqKey="Kim S" first="Sung-Hou" last="Kim">Sung-Hou Kim</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2010">2010</date>
<idno type="RBID">pubmed:20018669</idno>
<idno type="pmid">20018669</idno>
<idno type="doi">10.1073/pnas.0913033107</idno>
<idno type="wicri:Area/PubMed/Corpus">001F80</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001F80</idno>
<idno type="wicri:Area/PubMed/Curation">001F80</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001F80</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001E12</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001E12</idno>
<idno type="wicri:Area/Ncbi/Merge">000726</idno>
<idno type="wicri:Area/Ncbi/Curation">000726</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000726</idno>
<idno type="wicri:Area/Main/Merge">002626</idno>
<idno type="wicri:Area/Main/Curation">002601</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution.</title>
<author><name sortKey="Jun, Se Ran" sort="Jun, Se Ran" uniqKey="Jun S" first="Se-Ran" last="Jun">Se-Ran Jun</name>
<affiliation wicri:level="2"><nlm:affiliation>Department of Chemistry, University of California, Berkeley, CA 94720, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Chemistry, University of California, Berkeley, CA 94720</wicri:regionArea>
<placeName><region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Sims, Gregory E" sort="Sims, Gregory E" uniqKey="Sims G" first="Gregory E" last="Sims">Gregory E. Sims</name>
</author>
<author><name sortKey="Wu, Guohong A" sort="Wu, Guohong A" uniqKey="Wu G" first="Guohong A" last="Wu">Guohong A. Wu</name>
</author>
<author><name sortKey="Kim, Sung Hou" sort="Kim, Sung Hou" uniqKey="Kim S" first="Sung-Hou" last="Kim">Sung-Hou Kim</name>
</author>
</analytic>
<series><title level="j">Proceedings of the National Academy of Sciences of the United States of America</title>
<idno type="eISSN">1091-6490</idno>
<imprint><date when="2010" type="published">2010</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Genome</term>
<term>Phylogeny</term>
<term>Prokaryotic Cells (classification)</term>
<term>Prokaryotic Cells (physiology)</term>
<term>Proteome (genetics)</term>
<term>Proteomics (methods)</term>
<term>Sequence Alignment (methods)</term>
<term>Sequence Analysis, Protein (methods)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Alignement de séquences ()</term>
<term>Analyse de séquence de protéine ()</term>
<term>Cellules procaryotes ()</term>
<term>Cellules procaryotes (physiologie)</term>
<term>Génome</term>
<term>Phylogénie</term>
<term>Protéome (génétique)</term>
<term>Protéomique ()</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="genetics" xml:lang="en"><term>Proteome</term>
</keywords>
<keywords scheme="MESH" qualifier="classification" xml:lang="en"><term>Prokaryotic Cells</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr"><term>Protéome</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en"><term>Proteomics</term>
<term>Sequence Alignment</term>
<term>Sequence Analysis, Protein</term>
</keywords>
<keywords scheme="MESH" qualifier="physiologie" xml:lang="fr"><term>Cellules procaryotes</term>
</keywords>
<keywords scheme="MESH" qualifier="physiology" xml:lang="en"><term>Prokaryotic Cells</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Genome</term>
<term>Phylogeny</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Alignement de séquences</term>
<term>Analyse de séquence de protéine</term>
<term>Cellules procaryotes</term>
<term>Génome</term>
<term>Phylogénie</term>
<term>Protéomique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">We present a whole-proteome phylogeny of prokaryotes constructed by comparing feature frequency profiles (FFPs) of whole proteomes. Features are l-mers of amino acids, and each organism is represented by a profile of frequencies of all features. The selection of feature length is critical in the FFP method, and we have developed a procedure for identifying the optimal feature lengths for inferring the phylogeny of prokaryotes, strictly speaking, a proteome phylogeny. Our FFP trees are constructed with whole proteomes of 884 prokaryotes, 16 unicellular eukaryotes, and 2 random sequences. To highlight the branching order of major groups, we present a simplified proteome FFP tree of monophyletic class or phylum with branch support. In our whole-proteome FFP trees (i) Archaea, Bacteria, Eukaryota, and a random sequence outgroup are clearly separated; (ii) Archaea and Bacteria form a sister group when rooted with random sequences; (iii) Planctomycetes, which possesses an intracellular membrane compartment, is placed at the basal position of the Bacteria domain; (iv) almost all groups are monophyletic in prokaryotes at most taxonomic levels, but many differences in the branching order of major groups are observed between our proteome FFP tree and trees built with other methods; and (v) previously "unclassified" genomes may be assigned to the most likely taxa. We describe notable similarities and differences between our FFP trees and those based on other methods in grouping and phylogeny of prokaryotes.</div>
</front>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002601 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Curation/biblio.hfd -nk 002601 | SxmlIndent | more
To add a link to this page within the Wicri network
{{Explor lien |wiki= Sante |area= MersV1 |flux= Main |étape= Curation |type= RBID |clé= pubmed:20018669 |texte= Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution. }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Curation/RBID.i -Sk "pubmed:20018669" \
  | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Curation/biblio.hfd \
  | NlmPubMed2Wicri -a MersV1
This area was generated with Dilib version V0.6.33.