Alignment using genetic programming with causal trees for identification of protein functions.
Identifieur interne : 002108 ( PubMed/Curation ); précédent : 002107; suivant : 002109Alignment using genetic programming with causal trees for identification of protein functions.
Auteurs : Chun-Min Hung [République populaire de Chine] ; Yueh-Min Huang [République populaire de Chine] ; Ming-Shi Chang [République populaire de Chine]Source :
- Nonlinear analysis, theory, methods & applications [ 0362-546X ] ; 2006.
Abstract
A hybrid evolutionary model is used to propose a hierarchical homology of protein sequences to identify protein functions systematically. The proposed model offers considerable potentials, considering the inconsistency of existing methods for predicting novel proteins. Because some novel proteins might align without meaningful conserved domains, maximizing the score of sequence alignment is not the best criterion for predicting protein functions. This work presents a decision model that can minimize the cost of making a decision for predicting protein functions using the hierarchical homologies. Particularly, the model has three characteristics: (i) it is a hybrid evolutionary model with multiple fitness functions that uses genetic programming to predict protein functions on a distantly related protein family, (ii) it incorporates modified robust point matching to accurately compare all feature points using the moment invariant and thin-plate spline theorems, and (iii) the hierarchical homologies holding up a novel protein sequence in the form of a causal tree can effectively demonstrate the relationship between proteins. This work describes the comparisons of nucleocapsid proteins from the putative polyprotein SARS virus and other coronaviruses in other hosts using the model.
DOI: 10.1016/j.na.2005.09.048
PubMed: 32288048
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: Pour aller vers cette notice dans l'étape Curation :002108
Links to Exploration step
pubmed:32288048Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Alignment using genetic programming with causal trees for identification of protein functions.</title>
<author><name sortKey="Hung, Chun Min" sort="Hung, Chun Min" uniqKey="Hung C" first="Chun-Min" last="Hung">Chun-Min Hung</name>
<affiliation wicri:level="1"><nlm:affiliation>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</nlm:affiliation>
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Huang, Yueh Min" sort="Huang, Yueh Min" uniqKey="Huang Y" first="Yueh-Min" last="Huang">Yueh-Min Huang</name>
<affiliation wicri:level="1"><nlm:affiliation>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</nlm:affiliation>
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Chang, Ming Shi" sort="Chang, Ming Shi" uniqKey="Chang M" first="Ming-Shi" last="Chang">Ming-Shi Chang</name>
<affiliation wicri:level="1"><nlm:affiliation>Department of Biochemistry, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</nlm:affiliation>
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Biochemistry, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2006">2006</date>
<idno type="RBID">pubmed:32288048</idno>
<idno type="pmid">32288048</idno>
<idno type="doi">10.1016/j.na.2005.09.048</idno>
<idno type="wicri:Area/PubMed/Corpus">002108</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">002108</idno>
<idno type="wicri:Area/PubMed/Curation">002108</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">002108</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Alignment using genetic programming with causal trees for identification of protein functions.</title>
<author><name sortKey="Hung, Chun Min" sort="Hung, Chun Min" uniqKey="Hung C" first="Chun-Min" last="Hung">Chun-Min Hung</name>
<affiliation wicri:level="1"><nlm:affiliation>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</nlm:affiliation>
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Huang, Yueh Min" sort="Huang, Yueh Min" uniqKey="Huang Y" first="Yueh-Min" last="Huang">Yueh-Min Huang</name>
<affiliation wicri:level="1"><nlm:affiliation>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</nlm:affiliation>
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Chang, Ming Shi" sort="Chang, Ming Shi" uniqKey="Chang M" first="Ming-Shi" last="Chang">Ming-Shi Chang</name>
<affiliation wicri:level="1"><nlm:affiliation>Department of Biochemistry, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</nlm:affiliation>
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Biochemistry, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series><title level="j">Nonlinear analysis, theory, methods & applications</title>
<idno type="ISSN">0362-546X</idno>
<imprint><date when="2006" type="published">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">A hybrid evolutionary model is used to propose a hierarchical homology of protein sequences to identify protein functions systematically. The proposed model offers considerable potentials, considering the inconsistency of existing methods for predicting novel proteins. Because some novel proteins might align without meaningful conserved domains, maximizing the score of sequence alignment is not the best criterion for predicting protein functions. This work presents a decision model that can minimize the cost of making a decision for predicting protein functions using the hierarchical homologies. Particularly, the model has three characteristics: (i) it is a hybrid evolutionary model with multiple fitness functions that uses genetic programming to predict protein functions on a distantly related protein family, (ii) it incorporates modified robust point matching to accurately compare all feature points using the moment invariant and thin-plate spline theorems, and (iii) the hierarchical homologies holding up a novel protein sequence in the form of a causal tree can effectively demonstrate the relationship between proteins. This work describes the comparisons of nucleocapsid proteins from the putative polyprotein SARS virus and other coronaviruses in other hosts using the model.</div>
</front>
</TEI>
<pubmed><MedlineCitation Status="PubMed-not-MEDLINE" Owner="NLM"><PMID Version="1">32288048</PMID>
<DateRevised><Year>2020</Year>
<Month>04</Month>
<Day>18</Day>
</DateRevised>
<Article PubModel="Print-Electronic"><Journal><ISSN IssnType="Print">0362-546X</ISSN>
<JournalIssue CitedMedium="Print"><Volume>65</Volume>
<Issue>5</Issue>
<PubDate><Year>2006</Year>
<Month>Sep</Month>
<Day>01</Day>
</PubDate>
</JournalIssue>
<Title>Nonlinear analysis, theory, methods & applications</Title>
<ISOAbbreviation>Nonlinear Anal Theory Methods Appl</ISOAbbreviation>
</Journal>
<ArticleTitle>Alignment using genetic programming with causal trees for identification of protein functions.</ArticleTitle>
<Pagination><MedlinePgn>1070-1093</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1016/j.na.2005.09.048</ELocationID>
<Abstract><AbstractText>A hybrid evolutionary model is used to propose a hierarchical homology of protein sequences to identify protein functions systematically. The proposed model offers considerable potentials, considering the inconsistency of existing methods for predicting novel proteins. Because some novel proteins might align without meaningful conserved domains, maximizing the score of sequence alignment is not the best criterion for predicting protein functions. This work presents a decision model that can minimize the cost of making a decision for predicting protein functions using the hierarchical homologies. Particularly, the model has three characteristics: (i) it is a hybrid evolutionary model with multiple fitness functions that uses genetic programming to predict protein functions on a distantly related protein family, (ii) it incorporates modified robust point matching to accurately compare all feature points using the moment invariant and thin-plate spline theorems, and (iii) the hierarchical homologies holding up a novel protein sequence in the form of a causal tree can effectively demonstrate the relationship between proteins. This work describes the comparisons of nucleocapsid proteins from the putative polyprotein SARS virus and other coronaviruses in other hosts using the model.</AbstractText>
<CopyrightInformation>Copyright © 2005 Elsevier Ltd. All rights reserved.</CopyrightInformation>
</Abstract>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Hung</LastName>
<ForeName>Chun-Min</ForeName>
<Initials>CM</Initials>
<AffiliationInfo><Affiliation>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Huang</LastName>
<ForeName>Yueh-Min</ForeName>
<Initials>YM</Initials>
<AffiliationInfo><Affiliation>Department of Engineering Science, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Chang</LastName>
<ForeName>Ming-Shi</ForeName>
<Initials>MS</Initials>
<AffiliationInfo><Affiliation>Department of Biochemistry, National Cheng Kung University, No.1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC.</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList><PublicationType UI="D016428">Journal Article</PublicationType>
</PublicationTypeList>
<ArticleDate DateType="Electronic"><Year>2005</Year>
<Month>11</Month>
<Day>28</Day>
</ArticleDate>
</Article>
<MedlineJournalInfo><Country>Netherlands</Country>
<MedlineTA>Nonlinear Anal Theory Methods Appl</MedlineTA>
<NlmUniqueID>101491193</NlmUniqueID>
<ISSNLinking>0362-546X</ISSNLinking>
</MedlineJournalInfo>
<KeywordList Owner="NOTNLM"><Keyword MajorTopicYN="N">Bioinformatics protein databases</Keyword>
<Keyword MajorTopicYN="N">Evolutionary computing and genetic algorithms</Keyword>
<Keyword MajorTopicYN="N">Invariants</Keyword>
<Keyword MajorTopicYN="N">Moments</Keyword>
<Keyword MajorTopicYN="N">Splines</Keyword>
</KeywordList>
</MedlineCitation>
<PubmedData><History><PubMedPubDate PubStatus="entrez"><Year>2020</Year>
<Month>4</Month>
<Day>15</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed"><Year>2006</Year>
<Month>9</Month>
<Day>1</Day>
<Hour>0</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline"><Year>2006</Year>
<Month>9</Month>
<Day>1</Day>
<Hour>0</Hour>
<Minute>1</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList><ArticleId IdType="pubmed">32288048</ArticleId>
<ArticleId IdType="doi">10.1016/j.na.2005.09.048</ArticleId>
<ArticleId IdType="pii">S0362-546X(05)00902-8</ArticleId>
<ArticleId IdType="pmc">PMC7117053</ArticleId>
</ArticleIdList>
<pmc-dir>pmcsd</pmc-dir>
<ReferenceList><Reference><Citation>Physiol Genomics. 2003 Apr 16;13(2):107-17</Citation>
<ArticleIdList><ArticleId IdType="pubmed">12595578</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Bioinformatics. 2003 Aug 12;19(12):1531-9</Citation>
<ArticleIdList><ArticleId IdType="pubmed">12912834</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Am J Respir Crit Care Med. 2003 Apr 1;167(7):948-52</Citation>
<ArticleIdList><ArticleId IdType="pubmed">12663335</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Biosystems. 2003 Nov;72(1-2):159-76</Citation>
<ArticleIdList><ArticleId IdType="pubmed">14642665</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Rev Neurosci. 2003;14(1-2):181-93</Citation>
<ArticleIdList><ArticleId IdType="pubmed">12929925</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Bioinformatics. 2003 Apr 12;19(6):784-5</Citation>
<ArticleIdList><ArticleId IdType="pubmed">12691996</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>J Mol Biol. 1994 Feb 4;235(5):1501-31</Citation>
<ArticleIdList><ArticleId IdType="pubmed">8107089</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>IEEE Trans Image Process. 1993;2(3):311-26</Citation>
<ArticleIdList><ArticleId IdType="pubmed">18296220</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>J Mol Biol. 2003 Feb 7;326(1):317-36</Citation>
<ArticleIdList><ArticleId IdType="pubmed">12547212</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Bioinformatics. 2000 Feb;16(2):152-8</Citation>
<ArticleIdList><ArticleId IdType="pubmed">10842737</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Adv Enzymol Relat Areas Mol Biol. 1978;47:45-148</Citation>
<ArticleIdList><ArticleId IdType="pubmed">364941</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>J Biomed Inform. 2002 Oct-Dec;35(5-6):352-9</Citation>
<ArticleIdList><ArticleId IdType="pubmed">12968784</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Nucleic Acids Res. 1994 Nov 11;22(22):4768-78</Citation>
<ArticleIdList><ArticleId IdType="pubmed">7984429</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>J Mol Biol. 2002 Feb 1;315(5):1257-75</Citation>
<ArticleIdList><ArticleId IdType="pubmed">11827492</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>IEEE Eng Med Biol Mag. 2000 Jul-Aug;19(4):45-55</Citation>
<ArticleIdList><ArticleId IdType="pubmed">10916732</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Nucleic Acids Res. 1997 Sep 1;25(17):3389-402</Citation>
<ArticleIdList><ArticleId IdType="pubmed">9254694</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>J Mol Biol. 1981 Mar 25;147(1):195-7</Citation>
<ArticleIdList><ArticleId IdType="pubmed">7265238</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>IEEE Eng Med Biol Mag. 2000 Jul-Aug;19(4):38-44</Citation>
<ArticleIdList><ArticleId IdType="pubmed">10916731</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Trends Biotechnol. 1995 Dec;13(12):516-21</Citation>
<ArticleIdList><ArticleId IdType="pubmed">8595137</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Nucleic Acids Res. 1994 Nov 11;22(22):4673-80</Citation>
<ArticleIdList><ArticleId IdType="pubmed">7984417</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>J Mol Biol. 1990 Oct 5;215(3):403-10</Citation>
<ArticleIdList><ArticleId IdType="pubmed">2231712</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
<ReferenceList><Reference><Citation>Protein Sci. 2000 Feb;9(2):232-41</Citation>
<ArticleIdList><ArticleId IdType="pubmed">10716175</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
</PubmedData>
</pubmed>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/SrasV1/Data/PubMed/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002108 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd -nk 002108 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= SrasV1 |flux= PubMed |étape= Curation |type= RBID |clé= pubmed:32288048 |texte= Alignment using genetic programming with causal trees for identification of protein functions. }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Curation/RBID.i -Sk "pubmed:32288048" \ | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd \ | NlmPubMed2Wicri -a SrasV1
This area was generated with Dilib version V0.6.33. |