Serveur d'exploration sur les pandémies grippales

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Identifying mutation positions in all segments of influenza genome enables better differentiation between pandemic and seasonal strains.

Identifieur interne : 000332 ( Main/Exploration ); précédent : 000331; suivant : 000333

Identifying mutation positions in all segments of influenza genome enables better differentiation between pandemic and seasonal strains.

Auteurs : Fatemeh Kargarfard [Iran] ; Ashkan Sami [Iran] ; Farhid Hemmatzadeh [Australie] ; Esmaeil Ebrahimie [Australie]

Source :

RBID : pubmed:30769139

Descripteurs français

English descriptors

Abstract

Influenza has a negative sense, single-stranded, and segmented RNA. In the context of pandemic influenza research, most studies have focused on variations in the surface proteins (Hemagglutinin and Neuraminidase). However, new findings suggest that all internal and external proteins of influenza viruses can contribute in pandemic emergence, pathogenicity and increasing host range. The occurrence of the 2009 influenza pandemic and the availability of many external and internal segments of pandemic and non-pandemic sequences offer a unique opportunity to evaluate the performance of machine learning models in discrimination of pandemic from seasonal sequences using mutation positions in all segments. In this study, we hypothesized that identifying mutation positions in all segments (proteins) encoded by the influenza genome would enable pandemic and seasonal strains to be more reliably distinguished. In a large scale study, we applied a range of data mining techniques to all segments of influenza for rule discovery and discrimination of pandemic from seasonal strains. CBA (classification based on association rule mining), Ripper and Decision tree algorithms were utilized to extract association rules among mutations. CBA outperformed the other models. Our approach could discriminate pandemic sequences from seasonal ones with more than 95% accuracy for PA and NP, 99.33% accuracy for NA and 100% accuracy, precision, specificity and sensitivity (recall) for M1, M2, PB1, NS1, and NS2. The values of precision, specificity, and sensitivity were more than 90% for other segments except PB2. If sequences of all segments of one strain were available, the accuracy of discrimination of pandemic strains was 100%. General rules extracted by rule base classification approaches, such as M1-V147I, NP-N334H, NS1-V112I, and PB1-L364I, were able to detect pandemic sequences with high accuracy. We observed that mutations on internal proteins of influenza can contribute in distinguishing the pandemic viruses, similar to the external ones.

DOI: 10.1016/j.gene.2019.01.014
PubMed: 30769139


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Identifying mutation positions in all segments of influenza genome enables better differentiation between pandemic and seasonal strains.</title>
<author>
<name sortKey="Kargarfard, Fatemeh" sort="Kargarfard, Fatemeh" uniqKey="Kargarfard F" first="Fatemeh" last="Kargarfard">Fatemeh Kargarfard</name>
<affiliation wicri:level="1">
<nlm:affiliation>Faculty of Engineering and IT, University of Technology Sydney, New South Wales, Australia; Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz, Iran.</nlm:affiliation>
<country xml:lang="fr">Iran</country>
<wicri:regionArea>Faculty of Engineering and IT, University of Technology Sydney, New South Wales, Australia; Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz</wicri:regionArea>
<wicri:noRegion>Shiraz</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Sami, Ashkan" sort="Sami, Ashkan" uniqKey="Sami A" first="Ashkan" last="Sami">Ashkan Sami</name>
<affiliation wicri:level="1">
<nlm:affiliation>Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz, Iran.</nlm:affiliation>
<country xml:lang="fr">Iran</country>
<wicri:regionArea>Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz</wicri:regionArea>
<wicri:noRegion>Shiraz</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Hemmatzadeh, Farhid" sort="Hemmatzadeh, Farhid" uniqKey="Hemmatzadeh F" first="Farhid" last="Hemmatzadeh">Farhid Hemmatzadeh</name>
<affiliation wicri:level="1">
<nlm:affiliation>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, Australia.</nlm:affiliation>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide</wicri:regionArea>
<wicri:noRegion>Adelaide</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Ebrahimie, Esmaeil" sort="Ebrahimie, Esmaeil" uniqKey="Ebrahimie E" first="Esmaeil" last="Ebrahimie">Esmaeil Ebrahimie</name>
<affiliation wicri:level="1">
<nlm:affiliation>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, Australia; Genomics Research Platform, La Trobe University, Melbourne, Victoria 3086, Australia; School of Information Technology and Mathematical Sciences, Division of Information Technology Engineering & Environment, University of South Australia, Adelaide, Australia; School of Biological Sciences, Faculty of Science and Engineering, Flinders University, Adelaide, Australia. Electronic address: esmaeil.ebrahimie@adelaide.edu.au.</nlm:affiliation>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, Australia; Genomics Research Platform, La Trobe University, Melbourne, Victoria 3086, Australia; School of Information Technology and Mathematical Sciences, Division of Information Technology Engineering & Environment, University of South Australia, Adelaide, Australia; School of Biological Sciences, Faculty of Science and Engineering, Flinders University, Adelaide</wicri:regionArea>
<wicri:noRegion>Adelaide</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2019">2019</date>
<idno type="RBID">pubmed:30769139</idno>
<idno type="pmid">30769139</idno>
<idno type="doi">10.1016/j.gene.2019.01.014</idno>
<idno type="wicri:Area/PubMed/Corpus">000169</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000169</idno>
<idno type="wicri:Area/PubMed/Curation">000169</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">000169</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000136</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">000136</idno>
<idno type="wicri:Area/Ncbi/Merge">001F25</idno>
<idno type="wicri:Area/Ncbi/Curation">001F25</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">001F25</idno>
<idno type="wicri:Area/Main/Merge">000331</idno>
<idno type="wicri:Area/Main/Curation">000332</idno>
<idno type="wicri:Area/Main/Exploration">000332</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Identifying mutation positions in all segments of influenza genome enables better differentiation between pandemic and seasonal strains.</title>
<author>
<name sortKey="Kargarfard, Fatemeh" sort="Kargarfard, Fatemeh" uniqKey="Kargarfard F" first="Fatemeh" last="Kargarfard">Fatemeh Kargarfard</name>
<affiliation wicri:level="1">
<nlm:affiliation>Faculty of Engineering and IT, University of Technology Sydney, New South Wales, Australia; Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz, Iran.</nlm:affiliation>
<country xml:lang="fr">Iran</country>
<wicri:regionArea>Faculty of Engineering and IT, University of Technology Sydney, New South Wales, Australia; Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz</wicri:regionArea>
<wicri:noRegion>Shiraz</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Sami, Ashkan" sort="Sami, Ashkan" uniqKey="Sami A" first="Ashkan" last="Sami">Ashkan Sami</name>
<affiliation wicri:level="1">
<nlm:affiliation>Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz, Iran.</nlm:affiliation>
<country xml:lang="fr">Iran</country>
<wicri:regionArea>Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz</wicri:regionArea>
<wicri:noRegion>Shiraz</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Hemmatzadeh, Farhid" sort="Hemmatzadeh, Farhid" uniqKey="Hemmatzadeh F" first="Farhid" last="Hemmatzadeh">Farhid Hemmatzadeh</name>
<affiliation wicri:level="1">
<nlm:affiliation>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, Australia.</nlm:affiliation>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide</wicri:regionArea>
<wicri:noRegion>Adelaide</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Ebrahimie, Esmaeil" sort="Ebrahimie, Esmaeil" uniqKey="Ebrahimie E" first="Esmaeil" last="Ebrahimie">Esmaeil Ebrahimie</name>
<affiliation wicri:level="1">
<nlm:affiliation>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, Australia; Genomics Research Platform, La Trobe University, Melbourne, Victoria 3086, Australia; School of Information Technology and Mathematical Sciences, Division of Information Technology Engineering & Environment, University of South Australia, Adelaide, Australia; School of Biological Sciences, Faculty of Science and Engineering, Flinders University, Adelaide, Australia. Electronic address: esmaeil.ebrahimie@adelaide.edu.au.</nlm:affiliation>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, Australia; Genomics Research Platform, La Trobe University, Melbourne, Victoria 3086, Australia; School of Information Technology and Mathematical Sciences, Division of Information Technology Engineering & Environment, University of South Australia, Adelaide, Australia; School of Biological Sciences, Faculty of Science and Engineering, Flinders University, Adelaide</wicri:regionArea>
<wicri:noRegion>Adelaide</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Gene</title>
<idno type="eISSN">1879-0038</idno>
<imprint>
<date when="2019" type="published">2019</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Computational Biology (methods)</term>
<term>Host Specificity</term>
<term>Humans</term>
<term>Influenza A Virus, H1N1 Subtype (genetics)</term>
<term>Influenza, Human (epidemiology)</term>
<term>Influenza, Human (genetics)</term>
<term>Mutation</term>
<term>Pandemics</term>
<term>Seasons</term>
<term>Sequence Analysis, DNA (methods)</term>
<term>Supervised Machine Learning</term>
<term>Viral Proteins</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN ()</term>
<term>Apprentissage machine supervisé</term>
<term>Biologie informatique ()</term>
<term>Grippe humaine (génétique)</term>
<term>Grippe humaine (épidémiologie)</term>
<term>Humains</term>
<term>Mutation</term>
<term>Pandémies</term>
<term>Protéines virales</term>
<term>Saisons</term>
<term>Sous-type H1N1 du virus de la grippe A (génétique)</term>
<term>Spécificité d'hôte</term>
</keywords>
<keywords scheme="MESH" type="chemical" xml:lang="en">
<term>Viral Proteins</term>
</keywords>
<keywords scheme="MESH" qualifier="epidemiology" xml:lang="en">
<term>Influenza, Human</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en">
<term>Influenza A Virus, H1N1 Subtype</term>
<term>Influenza, Human</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr">
<term>Grippe humaine</term>
<term>Sous-type H1N1 du virus de la grippe A</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Computational Biology</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" qualifier="épidémiologie" xml:lang="fr">
<term>Grippe humaine</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Host Specificity</term>
<term>Humans</term>
<term>Mutation</term>
<term>Pandemics</term>
<term>Seasons</term>
<term>Supervised Machine Learning</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN</term>
<term>Apprentissage machine supervisé</term>
<term>Biologie informatique</term>
<term>Humains</term>
<term>Mutation</term>
<term>Pandémies</term>
<term>Protéines virales</term>
<term>Saisons</term>
<term>Spécificité d'hôte</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Influenza has a negative sense, single-stranded, and segmented RNA. In the context of pandemic influenza research, most studies have focused on variations in the surface proteins (Hemagglutinin and Neuraminidase). However, new findings suggest that all internal and external proteins of influenza viruses can contribute in pandemic emergence, pathogenicity and increasing host range. The occurrence of the 2009 influenza pandemic and the availability of many external and internal segments of pandemic and non-pandemic sequences offer a unique opportunity to evaluate the performance of machine learning models in discrimination of pandemic from seasonal sequences using mutation positions in all segments. In this study, we hypothesized that identifying mutation positions in all segments (proteins) encoded by the influenza genome would enable pandemic and seasonal strains to be more reliably distinguished. In a large scale study, we applied a range of data mining techniques to all segments of influenza for rule discovery and discrimination of pandemic from seasonal strains. CBA (classification based on association rule mining), Ripper and Decision tree algorithms were utilized to extract association rules among mutations. CBA outperformed the other models. Our approach could discriminate pandemic sequences from seasonal ones with more than 95% accuracy for PA and NP, 99.33% accuracy for NA and 100% accuracy, precision, specificity and sensitivity (recall) for M1, M2, PB1, NS1, and NS2. The values of precision, specificity, and sensitivity were more than 90% for other segments except PB2. If sequences of all segments of one strain were available, the accuracy of discrimination of pandemic strains was 100%. General rules extracted by rule base classification approaches, such as M1-V147I, NP-N334H, NS1-V112I, and PB1-L364I, were able to detect pandemic sequences with high accuracy. We observed that mutations on internal proteins of influenza can contribute in distinguishing the pandemic viruses, similar to the external ones.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Australie</li>
<li>Iran</li>
</country>
</list>
<tree>
<country name="Iran">
<noRegion>
<name sortKey="Kargarfard, Fatemeh" sort="Kargarfard, Fatemeh" uniqKey="Kargarfard F" first="Fatemeh" last="Kargarfard">Fatemeh Kargarfard</name>
</noRegion>
<name sortKey="Sami, Ashkan" sort="Sami, Ashkan" uniqKey="Sami A" first="Ashkan" last="Sami">Ashkan Sami</name>
</country>
<country name="Australie">
<noRegion>
<name sortKey="Hemmatzadeh, Farhid" sort="Hemmatzadeh, Farhid" uniqKey="Hemmatzadeh F" first="Farhid" last="Hemmatzadeh">Farhid Hemmatzadeh</name>
</noRegion>
<name sortKey="Ebrahimie, Esmaeil" sort="Ebrahimie, Esmaeil" uniqKey="Ebrahimie E" first="Esmaeil" last="Ebrahimie">Esmaeil Ebrahimie</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/PandemieGrippaleV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000332 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000332 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    PandemieGrippaleV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     pubmed:30769139
   |texte=   Identifying mutation positions in all segments of influenza genome enables better differentiation between pandemic and seasonal strains.
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:30769139" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a PandemieGrippaleV1 

Wicri

This area was generated with Dilib version V0.6.34.
Data generation: Wed Jun 10 11:04:28 2020. Site generation: Sun Mar 28 09:10:28 2021