Cyberinfrastructure exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change

Internal identifier: 001037 (Main/Merge); previous: 001036; next: 001038

Authors: Andrew V. Uzilov [United States]; Joshua M. Keegan [United States]; David H. Mathews [United States]

RBID : PMC:1570369

Abstract

Background

Non-coding RNAs (ncRNAs) have a multitude of roles in the cell, many of which remain to be discovered. However, it is difficult to detect novel ncRNAs in biochemical screens. To advance biological knowledge, computational methods that can accurately detect ncRNAs in sequenced genomes are therefore desirable. The increasing number of genomic sequences provides a rich dataset for computational comparative sequence analysis and detection of novel ncRNAs.

Results

Here, Dynalign, a program for predicting secondary structures common to two RNA sequences on the basis of minimizing folding free energy change, is used as a computational ncRNA detection tool. The Dynalign-computed optimal total free energy change, which scores the structural alignment and the free energy change of folding into a common structure for two RNA sequences, is shown to be an effective measure for distinguishing ncRNA from randomized sequences. To classify an input sequence pair as ncRNA, its total free energy change can either be compared with the total free energy changes of a set of control sequence pairs, or be used in combination with sequence length and nucleotide frequencies as input to a classification support vector machine. The latter method is much faster, but slightly less sensitive at a given specificity. Additionally, the classification support vector machine method is shown to be sensitive and specific on genomic ncRNA screens of two different Escherichia coli and Salmonella typhi genome alignments, in which many ncRNAs are known. The Dynalign computational experiments are also compared with two other ncRNA detection programs, RNAz and QRNA.
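
The two classification routes described above can be illustrated with a minimal sketch. It is not the authors' implementation: the use of a z-score, the cutoff value, the function names, and the exact layout of the feature vector are all assumptions made for illustration, and the free energy values in the usage section are invented placeholders rather than Dynalign output.

```python
from collections import Counter
from statistics import mean, stdev


def z_score(input_dg, control_dgs):
    """Standardize an input pair's total free energy change against controls.

    Assumption: a z-score is used for the comparison; the abstract only says
    the input value is compared with a set of control values.
    """
    if len(control_dgs) < 2:
        raise ValueError("need at least two control values")
    spread = stdev(control_dgs)
    if spread == 0:
        raise ValueError("control values have zero spread")
    return (input_dg - mean(control_dgs)) / spread


def looks_like_ncrna(input_dg, control_dgs, z_cutoff=-2.0):
    """Flag the pair when it is markedly more stable (more negative) than controls.

    The cutoff of -2.0 is a hypothetical value, not one taken from the paper.
    """
    return z_score(input_dg, control_dgs) <= z_cutoff


def nucleotide_frequencies(seq):
    """Return the fractions of A, C, G and U in a sequence (T is read as U)."""
    if not seq:
        raise ValueError("empty sequence")
    counts = Counter(seq.upper().replace("T", "U"))
    return [counts[base] / len(seq) for base in "ACGU"]


def svm_features(seq1, seq2, total_dg):
    """Build an input vector for a classifier.

    Assumed layout: free energy change, both lengths, then both frequency sets.
    """
    return (
        [total_dg, len(seq1), len(seq2)]
        + nucleotide_frequencies(seq1)
        + nucleotide_frequencies(seq2)
    )


if __name__ == "__main__":
    # Placeholder control values standing in for randomized sequence pairs.
    controls = [-30.0, -32.0, -28.0, -31.0, -29.0]
    print(looks_like_ncrna(-45.0, controls))
    print(svm_features("ACGU", "GGCC", -12.3))
```

The control-based route needs one Dynalign calculation per control pair, which is consistent with the abstract's remark that the support vector machine route is much faster: the feature vector requires only a single free energy calculation for the input pair.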

Conclusion

The Dynalign-based support vector machine method is more sensitive for known ncRNAs in the test genomic screens than RNAz and QRNA. Additionally, both Dynalign-based methods are more sensitive than RNAz and QRNA at low sequence pair identities. Dynalign can be used as a tool that is comparable to or more accurate than RNAz or QRNA in genomic screens, especially for low-identity regions. Dynalign provides a method for discovering ncRNAs in sequenced genomes that other methods may not identify. Significant improvements in Dynalign runtime have also been achieved.


DOI: 10.1186/1471-2105-7-173
PubMed: 16566836
PubMed Central: 1570369

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change</title>
<author>
<name sortKey="Uzilov, Andrew V" sort="Uzilov, Andrew V" uniqKey="Uzilov A" first="Andrew V" last="Uzilov">Andrew V. Uzilov</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I2">Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Keegan, Joshua M" sort="Keegan, Joshua M" uniqKey="Keegan J" first="Joshua M" last="Keegan">Joshua M. Keegan</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I2">Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Mathews, David H" sort="Mathews, David H" uniqKey="Mathews D" first="David H" last="Mathews">David H. Mathews</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I2">Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">16566836</idno>
<idno type="pmc">1570369</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1570369</idno>
<idno type="RBID">PMC:1570369</idno>
<idno type="doi">10.1186/1471-2105-7-173</idno>
<date when="2006">2006</date>
<idno type="wicri:Area/Pmc/Corpus">000212</idno>
<idno type="wicri:Area/Pmc/Curation">000212</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000716</idno>
<idno type="wicri:Area/Ncbi/Merge">000009</idno>
<idno type="wicri:Area/Ncbi/Curation">000009</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000009</idno>
<idno type="wicri:Area/Main/Merge">001037</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change</title>
<author>
<name sortKey="Uzilov, Andrew V" sort="Uzilov, Andrew V" uniqKey="Uzilov A" first="Andrew V" last="Uzilov">Andrew V. Uzilov</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I2">Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Keegan, Joshua M" sort="Keegan, Joshua M" uniqKey="Keegan J" first="Joshua M" last="Keegan">Joshua M. Keegan</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I2">Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Mathews, David H" sort="Mathews, David H" uniqKey="Mathews D" first="David H" last="Mathews">David H. Mathews</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biochemistry &amp; Biophysics, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I2">Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biostatistics &amp; Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Center for Pediatric Biomedical Research, University of Rochester Medical Center, 601 Elmwood Avenue, Box 712, Rochester, New York 14642</wicri:regionArea>
<wicri:noRegion>New York 14642</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Non-coding RNAs (ncRNAs) have a multitude of roles in the cell, many of which remain to be discovered. However, it is difficult to detect novel ncRNAs in biochemical screens. To advance biological knowledge, computational methods that can accurately detect ncRNAs in sequenced genomes are therefore desirable. The increasing number of genomic sequences provides a rich dataset for computational comparative sequence analysis and detection of novel ncRNAs.</p>
</sec>
<sec>
<title>Results</title>
<p>Here, Dynalign, a program for predicting secondary structures common to two RNA sequences on the basis of minimizing folding free energy change, is utilized as a computational ncRNA detection tool. The Dynalign-computed optimal total free energy change, which scores the structural alignment and the free energy change of folding into a common structure for two RNA sequences, is shown to be an effective measure for distinguishing ncRNA from randomized sequences. To make the classification as a ncRNA, the total free energy change of an input sequence pair can either be compared with the total free energy changes of a set of control sequence pairs, or be used in combination with sequence length and nucleotide frequencies as input to a classification support vector machine. The latter method is much faster, but slightly less sensitive at a given specificity. Additionally, the classification support vector machine method is shown to be sensitive and specific on genomic ncRNA screens of two different
<italic>Escherichia coli </italic>
and
<italic>Salmonella typhi </italic>
genome alignments, in which many ncRNAs are known. The Dynalign computational experiments are also compared with two other ncRNA detection programs, RNAz and QRNA.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>The Dynalign-based support vector machine method is more sensitive for known ncRNAs in the test genomic screens than RNAz and QRNA. Additionally, both Dynalign-based methods are more sensitive than RNAz and QRNA at low sequence pair identities. Dynalign can be used as a comparable or more accurate tool than RNAz or QRNA in genomic screens, especially for low-identity regions. Dynalign provides a method for discovering ncRNAs in sequenced genomes that other methods may not identify. Significant improvements in Dynalign runtime have also been achieved.</p>
</sec>
</div>
</front>
</TEI>
</record>
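
As a rough illustration of how the header fields of such a record might be read programmatically, the following sketch uses Python's standard `xml.etree.ElementTree`. The `SAMPLE` string is a trimmed copy of the record above and `summarize` is a hypothetical helper name. The trimming is deliberate: the full record uses the `wicri:` and `nlm:` prefixes without visible namespace declarations, so a strict XML parser would reject it unless those prefixes were declared first.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the record above, keeping only unprefixed elements.
SAMPLE = """<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change</title>
<author><name sortKey="Uzilov, Andrew V">Andrew V. Uzilov</name></author>
<author><name sortKey="Keegan, Joshua M">Joshua M. Keegan</name></author>
<author><name sortKey="Mathews, David H">David H. Mathews</name></author>
</titleStmt>
<publicationStmt>
<idno type="pmid">16566836</idno>
<idno type="pmc">1570369</idno>
<idno type="doi">10.1186/1471-2105-7-173</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""


def summarize(record_xml):
    """Collect the title, author names and identifiers from a record string."""
    root = ET.fromstring(record_xml)
    title = root.findtext(".//titleStmt/title")
    authors = [name.text for name in root.findall(".//titleStmt/author/name")]
    # Map each identifier type (pmid, pmc, doi, ...) to its value.
    ids = {idno.get("type"): idno.text
           for idno in root.findall(".//publicationStmt/idno")}
    return {"title": title, "authors": authors, "ids": ids}


if __name__ == "__main__":
    info = summarize(SAMPLE)
    print(info["title"])
    print("; ".join(info["authors"]))
    print(info["ids"])
```

Restricting the search to `titleStmt` avoids counting each author twice, since the same names are repeated under `sourceDesc` in the full record.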

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001037 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001037 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     PMC:1570369
   |texte=   Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Merge/RBID.i   -Sk "pubmed:16566836" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Merge/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024