TreeRipper web application: towards a fully automated optical tree recognition software
Internal identifier: 000362 (Main/Merge); previous: 000361; next: 000363
Authors: Joseph Hughes [United Kingdom]
Source:
- BMC Bioinformatics [ 1471-2105 ] ; 2011.
English descriptors
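- KwdEn: Animals, Biological Evolution, Classification, Genome, Humans, Internet, Phylogeny, Software.
- MESH: Animals, Biological Evolution, Classification, Genome, Humans, Internet, Phylogeny, Software.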
Abstract
Relationships between species, genes and genomes have been printed as trees for over a century. Whilst this may have been the best format for exchanging and sharing phylogenetic hypotheses during the 20th century, the worldwide web now provides faster and automated ways of transferring and sharing phylogenetic knowledge. However, novel software is needed to defrost these published phylogenies for the 21st century.
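TreeRipper is a simple website for the fully-automated recognition of multifurcating phylogenetic trees (http://linnaeus.zoology.gla.ac.uk/~jhughes/treeripper/). The program accepts a range of input image formats (PNG, JPG/JPEG or GIF). The underlying command line C++ program follows a number of cleaning steps to detect lines, remove node labels, patch up broken lines and corners and detect line edges. The edge contour is then determined to detect the branch length, tip label positions and the topology of the tree. Optical Character Recognition (OCR) is used to convert the tip labels into text with the freely available tesseract-ocr software. 32% of images meeting the prerequisites for TreeRipper were successfully recognised; the largest tree had 115 leaves.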
Despite the diversity of ways phylogenies have been illustrated making the design of a fully automated tree recognition software difficult, TreeRipper is a step towards automating the digitization of past phylogenies. We also provide a dataset of 100 tree images and associated tree files for training and/or benchmarking future software. TreeRipper is an open source project licensed under the GNU General Public Licence v3.
Url: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111373
DOI: 10.1186/1471-2105-12-178
PubMed: 21599881
PubMed Central: 3111373
Links toward previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: 000084
- to stream Pmc, to step Curation: 000084
- to stream Pmc, to step Checkpoint: 000117
- to stream PubMed, to step Corpus: 000036
- to stream PubMed, to step Curation: 000036
- to stream PubMed, to step Checkpoint: 000036
- to stream Ncbi, to step Merge: 000101
- to stream Ncbi, to step Curation: 000101
- to stream Ncbi, to step Checkpoint: 000101
Links to Exploration step
PMC:3111373
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">TreeRipper web application: towards a fully automated optical tree recognition software</title>
<author><name sortKey="Hughes, Joseph" sort="Hughes, Joseph" uniqKey="Hughes J" first="Joseph" last="Hughes">Joseph Hughes</name>
<affiliation wicri:level="4"><nlm:aff id="I1">IBAHCM, College of Medical, Veterinary and Life Sciences, University of Glasgow, Graham Kerr Building, University Avenue, Glasgow, G12 8QQ, UK</nlm:aff>
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>IBAHCM, College of Medical, Veterinary and Life Sciences, University of Glasgow, Graham Kerr Building, University Avenue, Glasgow, G12 8QQ</wicri:regionArea>
<orgName type="university">Université de Glasgow</orgName>
<placeName><settlement type="city">Glasgow</settlement>
<region type="country">Écosse</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">21599881</idno>
<idno type="pmc">3111373</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111373</idno>
<idno type="RBID">PMC:3111373</idno>
<idno type="doi">10.1186/1471-2105-12-178</idno>
<date when="2011">2011</date>
<idno type="wicri:Area/Pmc/Corpus">000084</idno>
<idno type="wicri:Area/Pmc/Curation">000084</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000117</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="wicri:Area/PubMed/Corpus">000036</idno>
<idno type="wicri:Area/PubMed/Curation">000036</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000036</idno>
<idno type="wicri:Area/Ncbi/Merge">000101</idno>
<idno type="wicri:Area/Ncbi/Curation">000101</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000101</idno>
<idno type="wicri:Area/Main/Merge">000362</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">TreeRipper web application: towards a fully automated optical tree recognition software</title>
<author><name sortKey="Hughes, Joseph" sort="Hughes, Joseph" uniqKey="Hughes J" first="Joseph" last="Hughes">Joseph Hughes</name>
<affiliation wicri:level="4"><nlm:aff id="I1">IBAHCM, College of Medical, Veterinary and Life Sciences, University of Glasgow, Graham Kerr Building, University Avenue, Glasgow, G12 8QQ, UK</nlm:aff>
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>IBAHCM, College of Medical, Veterinary and Life Sciences, University of Glasgow, Graham Kerr Building, University Avenue, Glasgow, G12 8QQ</wicri:regionArea>
<orgName type="university">Université de Glasgow</orgName>
<placeName><settlement type="city">Glasgow</settlement>
<region type="country">Écosse</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint><date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Animals</term>
<term>Biological Evolution</term>
<term>Classification</term>
<term>Genome</term>
<term>Humans</term>
<term>Internet</term>
<term>Phylogeny</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Animals</term>
<term>Biological Evolution</term>
<term>Classification</term>
<term>Genome</term>
<term>Humans</term>
<term>Internet</term>
<term>Phylogeny</term>
<term>Software</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><sec><title>Background</title>
<p>Relationships between species, genes and genomes have been printed as trees for over a century. Whilst this may have been the best format for exchanging and sharing phylogenetic hypotheses during the 20<sup>th </sup>
century, the worldwide web now provides faster and automated ways of transferring and sharing phylogenetic knowledge. However, novel software is needed to defrost these published phylogenies for the 21<sup>st </sup>
century.</p>
</sec>
<sec><title>Results</title>
<p>TreeRipper is a simple website for the fully-automated recognition of multifurcating phylogenetic trees (<ext-link ext-link-type="uri" xlink:href="http://linnaeus.zoology.gla.ac.uk/~jhughes/treeripper/">http://linnaeus.zoology.gla.ac.uk/~jhughes/treeripper/</ext-link>
). The program accepts a range of input image formats (PNG, JPG/JPEG or GIF). The underlying command line c++ program follows a number of cleaning steps to detect lines, remove node labels, patch-up broken lines and corners and detect line edges. The edge contour is then determined to detect the branch length, tip label positions and the topology of the tree. Optical Character Recognition (OCR) is used to convert the tip labels into text with the freely available tesseract-ocr software. 32% of images meeting the prerequisites for TreeRipper were successfully recognised, the largest tree had 115 leaves.</p>
</sec>
<sec><title>Conclusions</title>
<p>Despite the diversity of ways phylogenies have been illustrated making the design of a fully automated tree recognition software difficult, TreeRipper is a step towards automating the digitization of past phylogenies. We also provide a dataset of 100 tree images and associated tree files for training and/or benchmarking future software. TreeRipper is an open source project licensed under the GNU General Public Licence v3.</p>
</sec>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Darwin, Cr" uniqKey="Darwin C">CR Darwin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Smith, Sa" uniqKey="Smith S">SA Smith</name>
</author>
<author><name sortKey="Beaulieu, Jm" uniqKey="Beaulieu J">JM Beaulieu</name>
</author>
<author><name sortKey="Donoghue, Mj" uniqKey="Donoghue M">MJ Donoghue</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Mcmahon, Mm" uniqKey="Mcmahon M">MM McMahon</name>
</author>
<author><name sortKey="Sanderson, Mj" uniqKey="Sanderson M">MJ Sanderson</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Page, Rdm" uniqKey="Page R">RDM Page</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Laubach, T" uniqKey="Laubach T">T Laubach</name>
</author>
<author><name sortKey="Von Haeseler, A" uniqKey="Von Haeseler A">A von Haeseler</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Smith, R" uniqKey="Smith R">R Smith</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Han, Mv" uniqKey="Han M">MV Han</name>
</author>
<author><name sortKey="Zmasek, Cm" uniqKey="Zmasek C">CM Zmasek</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000362 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000362 | SxmlIndent | more
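The record printed by the commands above is XML that, as shown on this page, uses the `wicri:` and `nlm:` prefixes without declaring them, so a strict XML parser may reject it as it stands. As a minimal sketch, and not part of Dilib, the following Python pulls the `idno` identifiers out of such a record with a regular expression. The function name `extract_idnos` and the inline sample string are assumptions made for illustration only.

```python
import re

# Matches <idno type="...">value</idno>. A regex is used here (an assumption of
# this sketch) because the record carries undeclared prefixes such as wicri:
# and nlm:, which a namespace-aware XML parser may refuse to parse.
IDNO_RE = re.compile(r'<idno type="([^"]+)">([^<]*)</idno>')


def extract_idnos(record_xml):
    """Map each idno type to the list of its values, in document order."""
    result = {}
    for kind, value in IDNO_RE.findall(record_xml):
        result.setdefault(kind, []).append(value.strip())
    return result


# Hypothetical excerpt built from identifiers that appear in the record above.
sample = (
    '<publicationStmt><idno type="wicri:source">PMC</idno>'
    '<idno type="pmid">21599881</idno>'
    '<idno type="doi">10.1186/1471-2105-12-178</idno>'
    '<idno type="wicri:source">PubMed</idno></publicationStmt>'
)

ids = extract_idnos(sample)
print(ids["doi"][0])
```

To apply the same function to the output of the `HfdSelect` pipeline, one option would be to pipe that output into a script that passes `sys.stdin.read()` to `extract_idnos`. Values are kept as lists because some types, such as `wicri:source`, occur more than once in a record.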
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Merge |type= RBID |clé= PMC:3111373 |texte= TreeRipper web application: towards a fully automated optical tree recognition software }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Merge/RBID.i -Sk "pubmed:21599881" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Merge/biblio.hfd \
       | NlmPubMed2Wicri -a OcrV1
This area was generated with Dilib version V0.6.32.