Magnus representation of genome sequences.
Identifieur interne : 000457 ( PubMed/Corpus ); précédent : 000456; suivant : 000458Magnus representation of genome sequences.
Auteurs : Chengyuan Wu ; Shiquan Ren ; Jie Wu ; Kelin XiaSource :
- Journal of theoretical biology [ 1095-8541 ] ; 2019.
Abstract
We introduce an alignment-free method, the Magnus Representation, to analyze genome sequences. The Magnus Representation captures higher-order information in genome sequences. We combine our approach with the idea of k-mers to define an effectively computable Mean Magnus Vector. We perform phylogenetic analysis on three datasets: mosquito-borne viruses, filoviruses, and bacterial genomes. Our results on ebolaviruses are consistent with previous phylogenetic analyses, and confirm the modern viewpoint that the 2014 West African Ebola outbreak likely originated from Central Africa. Our analysis also confirms the close relationship between Bundibugyo ebolavirus and Taï Forest ebolavirus. For bacterial genomes, our method is able to classify relatively well at the family and genus level, as well as at higher levels such as phylum level. The bacterial genomes are also separated well into Gram-positive and Gram-negative subgroups.
DOI: 10.1016/j.jtbi.2019.08.004
PubMed: 31398316
Links to Exploration step
pubmed:31398316Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Magnus representation of genome sequences.</title>
<author><name sortKey="Wu, Chengyuan" sort="Wu, Chengyuan" uniqKey="Wu C" first="Chengyuan" last="Wu">Chengyuan Wu</name>
<affiliation><nlm:affiliation>Department of Mathematics, National University of Singapore, Singapore 119076, Singapore. Electronic address: wuchengyuan@u.nus.edu.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Ren, Shiquan" sort="Ren, Shiquan" uniqKey="Ren S" first="Shiquan" last="Ren">Shiquan Ren</name>
<affiliation><nlm:affiliation>Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, China. Electronic address: sren@u.nus.edu.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Wu, Jie" sort="Wu, Jie" uniqKey="Wu J" first="Jie" last="Wu">Jie Wu</name>
<affiliation><nlm:affiliation>School of Mathematics and Information Science, Hebei Normal University, Hebei 050024, China; Department of Mathematics, National University of Singapore, Singapore 119076, Singapore. Electronic address: wujie@hebtu.edu.cn.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Xia, Kelin" sort="Xia, Kelin" uniqKey="Xia K" first="Kelin" last="Xia">Kelin Xia</name>
<affiliation><nlm:affiliation>Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371, Singapore; School of Biological Sciences, Nanyang Technological University, Singapore 637371, Singapore. Electronic address: xiakelin@ntu.edu.sg.</nlm:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2019">2019</date>
<idno type="RBID">pubmed:31398316</idno>
<idno type="pmid">31398316</idno>
<idno type="doi">10.1016/j.jtbi.2019.08.004</idno>
<idno type="wicri:Area/PubMed/Corpus">000457</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000457</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Magnus representation of genome sequences.</title>
<author><name sortKey="Wu, Chengyuan" sort="Wu, Chengyuan" uniqKey="Wu C" first="Chengyuan" last="Wu">Chengyuan Wu</name>
<affiliation><nlm:affiliation>Department of Mathematics, National University of Singapore, Singapore 119076, Singapore. Electronic address: wuchengyuan@u.nus.edu.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Ren, Shiquan" sort="Ren, Shiquan" uniqKey="Ren S" first="Shiquan" last="Ren">Shiquan Ren</name>
<affiliation><nlm:affiliation>Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, China. Electronic address: sren@u.nus.edu.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Wu, Jie" sort="Wu, Jie" uniqKey="Wu J" first="Jie" last="Wu">Jie Wu</name>
<affiliation><nlm:affiliation>School of Mathematics and Information Science, Hebei Normal University, Hebei 050024, China; Department of Mathematics, National University of Singapore, Singapore 119076, Singapore. Electronic address: wujie@hebtu.edu.cn.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Xia, Kelin" sort="Xia, Kelin" uniqKey="Xia K" first="Kelin" last="Xia">Kelin Xia</name>
<affiliation><nlm:affiliation>Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371, Singapore; School of Biological Sciences, Nanyang Technological University, Singapore 637371, Singapore. Electronic address: xiakelin@ntu.edu.sg.</nlm:affiliation>
</affiliation>
</author>
</analytic>
<series><title level="j">Journal of theoretical biology</title>
<idno type="eISSN">1095-8541</idno>
<imprint><date when="2019" type="published">2019</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">We introduce an alignment-free method, the Magnus Representation, to analyze genome sequences. The Magnus Representation captures higher-order information in genome sequences. We combine our approach with the idea of k-mers to define an effectively computable Mean Magnus Vector. We perform phylogenetic analysis on three datasets: mosquito-borne viruses, filoviruses, and bacterial genomes. Our results on ebolaviruses are consistent with previous phylogenetic analyses, and confirm the modern viewpoint that the 2014 West African Ebola outbreak likely originated from Central Africa. Our analysis also confirms the close relationship between Bundibugyo ebolavirus and Taï Forest ebolavirus. For bacterial genomes, our method is able to classify relatively well at the family and genus level, as well as at higher levels such as phylum level. The bacterial genomes are also separated well into Gram-positive and Gram-negative subgroups.</div>
</front>
</TEI>
<pubmed><MedlineCitation Status="In-Data-Review" Owner="NLM"><PMID Version="1">31398316</PMID>
<DateRevised><Year>2019</Year>
<Month>09</Month>
<Day>09</Day>
</DateRevised>
<Article PubModel="Print-Electronic"><Journal><ISSN IssnType="Electronic">1095-8541</ISSN>
<JournalIssue CitedMedium="Internet"><Volume>480</Volume>
<PubDate><Year>2019</Year>
<Month>Nov</Month>
<Day>07</Day>
</PubDate>
</JournalIssue>
<Title>Journal of theoretical biology</Title>
<ISOAbbreviation>J. Theor. Biol.</ISOAbbreviation>
</Journal>
<ArticleTitle>Magnus representation of genome sequences.</ArticleTitle>
<Pagination><MedlinePgn>104-111</MedlinePgn>
</Pagination>
<ELocationID EIdType="pii" ValidYN="Y">S0022-5193(19)30316-9</ELocationID>
<ELocationID EIdType="doi" ValidYN="Y">10.1016/j.jtbi.2019.08.004</ELocationID>
<Abstract><AbstractText>We introduce an alignment-free method, the Magnus Representation, to analyze genome sequences. The Magnus Representation captures higher-order information in genome sequences. We combine our approach with the idea of k-mers to define an effectively computable Mean Magnus Vector. We perform phylogenetic analysis on three datasets: mosquito-borne viruses, filoviruses, and bacterial genomes. Our results on ebolaviruses are consistent with previous phylogenetic analyses, and confirm the modern viewpoint that the 2014 West African Ebola outbreak likely originated from Central Africa. Our analysis also confirms the close relationship between Bundibugyo ebolavirus and Taï Forest ebolavirus. For bacterial genomes, our method is able to classify relatively well at the family and genus level, as well as at higher levels such as phylum level. The bacterial genomes are also separated well into Gram-positive and Gram-negative subgroups.</AbstractText>
<CopyrightInformation>Copyright © 2019 Elsevier Ltd. All rights reserved.</CopyrightInformation>
</Abstract>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Wu</LastName>
<ForeName>Chengyuan</ForeName>
<Initials>C</Initials>
<AffiliationInfo><Affiliation>Department of Mathematics, National University of Singapore, Singapore 119076, Singapore. Electronic address: wuchengyuan@u.nus.edu.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Ren</LastName>
<ForeName>Shiquan</ForeName>
<Initials>S</Initials>
<AffiliationInfo><Affiliation>Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, China. Electronic address: sren@u.nus.edu.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Wu</LastName>
<ForeName>Jie</ForeName>
<Initials>J</Initials>
<AffiliationInfo><Affiliation>School of Mathematics and Information Science, Hebei Normal University, Hebei 050024, China; Department of Mathematics, National University of Singapore, Singapore 119076, Singapore. Electronic address: wujie@hebtu.edu.cn.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Xia</LastName>
<ForeName>Kelin</ForeName>
<Initials>K</Initials>
<AffiliationInfo><Affiliation>Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371, Singapore; School of Biological Sciences, Nanyang Technological University, Singapore 637371, Singapore. Electronic address: xiakelin@ntu.edu.sg.</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList><PublicationType UI="D016428">Journal Article</PublicationType>
</PublicationTypeList>
<ArticleDate DateType="Electronic"><Year>2019</Year>
<Month>08</Month>
<Day>06</Day>
</ArticleDate>
</Article>
<MedlineJournalInfo><Country>England</Country>
<MedlineTA>J Theor Biol</MedlineTA>
<NlmUniqueID>0376342</NlmUniqueID>
<ISSNLinking>0022-5193</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<KeywordList Owner="NOTNLM"><Keyword MajorTopicYN="N">Alignment-free</Keyword>
<Keyword MajorTopicYN="N">Ebolavirus</Keyword>
<Keyword MajorTopicYN="N">Genome sequence</Keyword>
<Keyword MajorTopicYN="N">Sequence analysis</Keyword>
</KeywordList>
</MedlineCitation>
<PubmedData><History><PubMedPubDate PubStatus="received"><Year>2019</Year>
<Month>03</Month>
<Day>26</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="revised"><Year>2019</Year>
<Month>07</Month>
<Day>30</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="accepted"><Year>2019</Year>
<Month>08</Month>
<Day>05</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed"><Year>2019</Year>
<Month>8</Month>
<Day>10</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline"><Year>2019</Year>
<Month>8</Month>
<Day>10</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez"><Year>2019</Year>
<Month>8</Month>
<Day>10</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList><ArticleId IdType="pubmed">31398316</ArticleId>
<ArticleId IdType="pii">S0022-5193(19)30316-9</ArticleId>
<ArticleId IdType="doi">10.1016/j.jtbi.2019.08.004</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/PubMed/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000457 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd -nk 000457 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= MersV1 |flux= PubMed |étape= Corpus |type= RBID |clé= pubmed:31398316 |texte= Magnus representation of genome sequences. }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/RBID.i -Sk "pubmed:31398316" \ | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd \ | NlmPubMed2Wicri -a MersV1
This area was generated with Dilib version V0.6.33. |