OCR Exploration Server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Ground Truth for Layout Analysis Performance Evaluation

Internal identifier: 002490 (Istex/Corpus); previous: 002489; next: 002491

Authors: A. Antonacopoulos; D. Karatzas; D. Bridson

Source: Document Analysis Systems VII, Lecture Notes in Computer Science, vol. 3872, pp. 302-311, 2006

RBID : ISTEX:3833F85C0434283A3271B3B19B95DC29B2FDF83F

Abstract

Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).

URL: https://api.istex.fr/document/3833F85C0434283A3271B3B19B95DC29B2FDF83F/fulltext/pdf
DOI: 10.1007/11669487_27

Links to Exploration step

ISTEX:3833F85C0434283A3271B3B19B95DC29B2FDF83F

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Ground Truth for Layout Analysis Performance Evaluation</title>
<author>
<name sortKey="Antonacopoulos, A" sort="Antonacopoulos, A" uniqKey="Antonacopoulos A" first="A." last="Antonacopoulos">A. Antonacopoulos</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Karatzas, D" sort="Karatzas, D" uniqKey="Karatzas D" first="D." last="Karatzas">D. Karatzas</name>
<affiliation>
<mods:affiliation>School of Electronics and Computer Science, University of Southampton, SO16 1BJ, Southampton, United Kingdom</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Bridson, D" sort="Bridson, D" uniqKey="Bridson D" first="D." last="Bridson">D. Bridson</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:3833F85C0434283A3271B3B19B95DC29B2FDF83F</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/11669487_27</idno>
<idno type="url">https://api.istex.fr/document/3833F85C0434283A3271B3B19B95DC29B2FDF83F/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002490</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Ground Truth for Layout Analysis Performance Evaluation</title>
<author>
<name sortKey="Antonacopoulos, A" sort="Antonacopoulos, A" uniqKey="Antonacopoulos A" first="A." last="Antonacopoulos">A. Antonacopoulos</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Karatzas, D" sort="Karatzas, D" uniqKey="Karatzas D" first="D." last="Karatzas">D. Karatzas</name>
<affiliation>
<mods:affiliation>School of Electronics and Computer Science, University of Southampton, SO16 1BJ, Southampton, United Kingdom</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Bridson, D" sort="Bridson, D" uniqKey="Bridson D" first="D." last="Bridson">D. Bridson</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2006</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
<idno type="istex">3833F85C0434283A3271B3B19B95DC29B2FDF83F</idno>
<idno type="DOI">10.1007/11669487_27</idno>
<idno type="ChapterID">27</idno>
<idno type="ChapterID">Chap27</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>A. Antonacopoulos</name>
<affiliations>
<json:string>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</json:string>
</affiliations>
</json:item>
<json:item>
<name>D. Karatzas</name>
<affiliations>
<json:string>School of Electronics and Computer Science, University of Southampton, SO16 1BJ, Southampton, United Kingdom</json:string>
</affiliations>
</json:item>
<json:item>
<name>D. Bridson</name>
<affiliations>
<json:string>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).</abstract>
<qualityIndicators>
<score>5.595</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>1046</abstractCharCount>
<pdfWordCount>3771</pdfWordCount>
<pdfCharCount>23659</pdfCharCount>
<pdfPageCount>10</pdfPageCount>
<abstractWordCount>152</abstractWordCount>
</qualityIndicators>
<title>Ground Truth for Layout Analysis Performance Evaluation</title>
<genre.original>
<json:string>OriginalPaper</json:string>
</genre.original>
<chapterId>
<json:string>27</json:string>
<json:string>Chap27</json:string>
</chapterId>
<genre>
<json:string>conference [eBooks]</json:string>
</genre>
<serie>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>New York University, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbruecken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2006</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>Horst Bunke</name>
<affiliations>
<json:string>Institute of Computer Science and Applied Mathematics, University of Bern, Neubrückstrasse 10, CH-3012, Bern, Switzerland</json:string>
<json:string>E-mail: bunke@iam.unibe.ch</json:string>
</affiliations>
</json:item>
<json:item>
<name>A. Lawrence Spitz</name>
<affiliations>
<json:string>DocRec Ltd, 34 Strathaven Place, 7001, Atawhai, Nelson, New Zealand</json:string>
<json:string>E-mail: spitz@docrec.com</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Pattern Recognition</value>
</json:item>
<json:item>
<value>Information Storage and Retrieval</value>
</json:item>
<json:item>
<value>Image Processing and Computer Vision</value>
</json:item>
<json:item>
<value>Simulation and Modeling</value>
</json:item>
<json:item>
<value>Computer Appl. in Administrative Data Processing</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-540-32140-8</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Document Analysis Systems VII</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>978-3-540-32157-6</json:string>
</bookId>
<volume>3872</volume>
<pages>
<last>311</last>
<first>302</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-540-32157-6</json:string>
</eisbn>
<copyrightDate>2006</copyrightDate>
<doi>
<json:string>10.1007/11669487</json:string>
</doi>
</host>
<publicationDate>2006</publicationDate>
<copyrightDate>2006</copyrightDate>
<doi>
<json:string>10.1007/11669487_27</json:string>
</doi>
<id>3833F85C0434283A3271B3B19B95DC29B2FDF83F</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/3833F85C0434283A3271B3B19B95DC29B2FDF83F/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/3833F85C0434283A3271B3B19B95DC29B2FDF83F/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/3833F85C0434283A3271B3B19B95DC29B2FDF83F/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Ground Truth for Layout Analysis Performance Evaluation</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2006</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Ground Truth for Layout Analysis Performance Evaluation</title>
<author>
<persName>
<forename type="first">A.</forename>
<surname>Antonacopoulos</surname>
</persName>
<affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</affiliation>
</author>
<author>
<persName>
<forename type="first">D.</forename>
<surname>Karatzas</surname>
</persName>
<affiliation>School of Electronics and Computer Science, University of Southampton, SO16 1BJ, Southampton, United Kingdom</affiliation>
</author>
<author>
<persName>
<forename type="first">D.</forename>
<surname>Bridson</surname>
</persName>
<affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</affiliation>
</author>
</analytic>
<monogr>
<title level="m">Document Analysis Systems VII</title>
<title level="m" type="sub">7th International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006. Proceedings</title>
<idno type="pISBN">978-3-540-32140-8</idno>
<idno type="eISBN">978-3-540-32157-6</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/11669487</idno>
<idno type="BookID">978-3-540-32157-6</idno>
<idno type="BookTitleID">133456</idno>
<idno type="BookSequenceNumber">3872</idno>
<idno type="BookVolumeNumber">3872</idno>
<idno type="BookChapterCount">55</idno>
<editor>
<persName>
<forename type="first">Horst</forename>
<surname>Bunke</surname>
</persName>
<email>bunke@iam.unibe.ch</email>
<affiliation>Institute of Computer Science and Applied Mathematics, University of Bern, Neubrückstrasse 10, CH-3012, Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">A.</forename>
<forename type="first">Lawrence</forename>
<surname>Spitz</surname>
</persName>
<email>spitz@docrec.com</email>
<affiliation>DocRec Ltd, 34 Strathaven Place, 7001, Atawhai, Nelson, New Zealand</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2006"></date>
<biblScope unit="volume">3872</biblScope>
<biblScope unit="page" from="302">302</biblScope>
<biblScope unit="page" to="311">311</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>New York University, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
</editor>
<biblScope>
<date>2006</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">3833F85C0434283A3271B3B19B95DC29B2FDF83F</idno>
<idno type="DOI">10.1007/11669487_27</idno>
<idno type="ChapterID">27</idno>
<idno type="ChapterID">Chap27</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2006</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I2203X</label>
<label>I18032</label>
<label>I22021</label>
<label>I21025</label>
<label>I2301X</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Pattern Recognition</term>
</item>
<item>
<term>Information Storage and Retrieval</term>
</item>
<item>
<term>Image Processing and Computer Vision</term>
</item>
<item>
<term>Simulation and Modeling</term>
</item>
<item>
<term>Computer Appl. in Administrative Data Processing</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2006">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-20">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/3833F85C0434283A3271B3B19B95DC29B2FDF83F/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgName>Stanford University</OrgName>
<OrgAddress>
<City>CA</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgName>University of Bern</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<City>MA</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgName>New York University</OrgName>
<OrgAddress>
<City>NY</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbruecken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0">
<BookID>978-3-540-32157-6</BookID>
<BookTitle>Document Analysis Systems VII</BookTitle>
<BookSubTitle>7th International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006. Proceedings</BookSubTitle>
<BookVolumeNumber>3872</BookVolumeNumber>
<BookSequenceNumber>3872</BookSequenceNumber>
<BookDOI>10.1007/11669487</BookDOI>
<BookTitleID>133456</BookTitleID>
<BookPrintISBN>978-3-540-32140-8</BookPrintISBN>
<BookElectronicISBN>978-3-540-32157-6</BookElectronicISBN>
<BookChapterCount>55</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2006</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I2203X" Priority="1" Type="Secondary">Pattern Recognition</BookSubject>
<BookSubject Code="I18032" Priority="2" Type="Secondary">Information Storage and Retrieval</BookSubject>
<BookSubject Code="I22021" Priority="3" Type="Secondary">Image Processing and Computer Vision</BookSubject>
<BookSubject Code="I21025" Priority="4" Type="Secondary">Simulation and Modeling</BookSubject>
<BookSubject Code="I2301X" Priority="5" Type="Secondary">Computer Appl. in Administrative Data Processing</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Horst</GivenName>
<FamilyName>Bunke</FamilyName>
</EditorName>
<Contact>
<Email>bunke@iam.unibe.ch</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>A.</GivenName>
<GivenName>Lawrence</GivenName>
<FamilyName>Spitz</FamilyName>
</EditorName>
<Contact>
<Email>spitz@docrec.com</Email>
</Contact>
</Editor>
<Affiliation ID="Aff16">
<OrgDivision>Institute of Computer Science and Applied Mathematics</OrgDivision>
<OrgName>University of Bern</OrgName>
<OrgAddress>
<Street>Neubrückstrasse 10</Street>
<Postcode>CH-3012</Postcode>
<City>Bern</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff17">
<OrgName>DocRec Ltd</OrgName>
<OrgAddress>
<Street>34 Strathaven Place</Street>
<Postcode>7001</Postcode>
<City>Atawhai, Nelson</City>
<Country>New Zealand</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part8">
<PartInfo TocLevels="0">
<PartID>8</PartID>
<PartSequenceNumber>8</PartSequenceNumber>
<PartTitle>Session 9: Systems and Performance Evaluation</PartTitle>
<PartChapterCount>5</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>Document Analysis Systems VII</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap27" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0">
<ChapterID>27</ChapterID>
<ChapterDOI>10.1007/11669487_27</ChapterDOI>
<ChapterSequenceNumber>27</ChapterSequenceNumber>
<ChapterTitle Language="En">Ground Truth for Layout Analysis Performance Evaluation</ChapterTitle>
<ChapterFirstPage>302</ChapterFirstPage>
<ChapterLastPage>311</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2006</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>8</PartID>
<BookID>978-3-540-32157-6</BookID>
<BookTitle>Document Analysis Systems VII</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>A.</GivenName>
<FamilyName>Antonacopoulos</FamilyName>
</AuthorName>
<Contact>
<URL>http://www.primaresearch.org</URL>
</Contact>
</Author>
<Author AffiliationIDS="Aff19">
<AuthorName DisplayOrder="Western">
<GivenName>D.</GivenName>
<FamilyName>Karatzas</FamilyName>
</AuthorName>
<Contact>
<URL>http://www.ecs.soton.ac.uk/~dk3</URL>
</Contact>
</Author>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>D.</GivenName>
<FamilyName>Bridson</FamilyName>
</AuthorName>
<Contact>
<URL>http://www.primaresearch.org</URL>
</Contact>
</Author>
<Affiliation ID="Aff18">
<OrgDivision>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering</OrgDivision>
<OrgName>University of Salford</OrgName>
<OrgAddress>
<City>Manchester</City>
<Postcode>M5 4WT</Postcode>
<Country>United Kingdom</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff19">
<OrgDivision>School of Electronics and Computer Science</OrgDivision>
<OrgName>University of Southampton</OrgName>
<OrgAddress>
<City>Southampton</City>
<Postcode>SO16 1BJ</Postcode>
<Country>United Kingdom</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).</Para>
</Abstract>
<ArticleNote Type="Misc">
<SimplePara>This work was supported by GCHQ (UK Government Communications Headquarters) and the EPSRC (UK Engineering and Physical Sciences Research Council).</SimplePara>
</ArticleNote>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Ground Truth for Layout Analysis Performance Evaluation</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Ground Truth for Layout Analysis Performance Evaluation</title>
</titleInfo>
<name type="personal">
<namePart type="given">A.</namePart>
<namePart type="family">Antonacopoulos</namePart>
<affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">D.</namePart>
<namePart type="family">Karatzas</namePart>
<affiliation>School of Electronics and Computer Science, University of Southampton, SO16 1BJ, Southampton, United Kingdom</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">D.</namePart>
<namePart type="family">Bridson</namePart>
<affiliation>Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering, University of Salford, M5 4WT, Manchester, United Kingdom</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2006</dateIssued>
<copyrightDate encoding="w3cdtf">2006</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).</abstract>
<relatedItem type="host">
<titleInfo>
<title>Document Analysis Systems VII</title>
<subTitle>7th International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006. Proceedings</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Horst</namePart>
<namePart type="family">Bunke</namePart>
<affiliation>Institute of Computer Science and Applied Mathematics, University of Bern, Neubrückstrasse 10, CH-3012, Bern, Switzerland</affiliation>
<affiliation>E-mail: bunke@iam.unibe.ch</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">A.</namePart>
<namePart type="given">Lawrence</namePart>
<namePart type="family">Spitz</namePart>
<affiliation>DocRec Ltd, 34 Strathaven Place, 7001, Atawhai, Nelson, New Zealand</affiliation>
<affiliation>E-mail: spitz@docrec.com</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2006</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I2203X">Pattern Recognition</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18032">Information Storage and Retrieval</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22021">Image Processing and Computer Vision</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21025">Simulation and Modeling</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I2301X">Computer Appl. in Administrative Data Processing</topic>
</subject>
<identifier type="DOI">10.1007/11669487</identifier>
<identifier type="ISBN">978-3-540-32140-8</identifier>
<identifier type="eISBN">978-3-540-32157-6</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">133456</identifier>
<identifier type="BookID">978-3-540-32157-6</identifier>
<identifier type="BookChapterCount">55</identifier>
<identifier type="BookVolumeNumber">3872</identifier>
<identifier type="BookSequenceNumber">3872</identifier>
<identifier type="PartChapterCount">5</identifier>
<part>
<date>2006</date>
<detail type="part">
<title>Session 9: Systems and Performance Evaluation</title>
</detail>
<detail type="volume">
<number>3872</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>302</start>
<end>311</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2006</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>New York University, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2006</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2006</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">3833F85C0434283A3271B3B19B95DC29B2FDF83F</identifier>
<identifier type="DOI">10.1007/11669487_27</identifier>
<identifier type="ChapterID">27</identifier>
<identifier type="ChapterID">Chap27</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2006</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2006</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/3833F85C0434283A3271B3B19B95DC29B2FDF83F/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">English Document Database Design and Implementation Methodology</title>
<author>
<persName>
<forename type="first">I</forename>
<forename type="middle">T</forename>
<surname>Phillips</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Chen</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Ha</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<forename type="middle">M</forename>
<surname>Haralick</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 2nd Annual Symposium on Document Analysis and Retrieval</title>
<meeting>the 2nd Annual Symposium on Document Analysis and Retrieval
<address>
<addrLine>UNLV, USA</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="1993"></date>
<biblScope unit="page" from="65" to="104"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<analytic>
<title level="a" type="main">Methodology for Flexible and Efficient Analysis of the Performance of Page Segmentation Algorithms</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Antonacopoulos</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">B</forename>
<surname>Brough</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 5th International Conference on Document Analysis and Recognition (ICDAR'99)</title>
<meeting>the 5th International Conference on Document Analysis and Recognition (ICDAR'99)
<address>
<addrLine>Bangalore, India</addrLine>
</address>
</meeting>
<imprint>
<publisher>IEEE-CS Press</publisher>
<date type="published" when="1999"></date>
<biblScope unit="page" from="451" to="454"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<analytic>
<title level="a" type="main">Page Segmentation Using the Description of the Background</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Antonacopoulos</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Computer Vision and Image Understanding</title>
<imprint>
<biblScope unit="volume">70</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="350" to="369"></biblScope>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">Representation and Classification of Complex-Shaped Printed Regions Using White Tiles</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Antonacopoulos</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<forename type="middle">T</forename>
<surname>Ritchings</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 3rd International Conference on Document Analysis and Recognition (ICDAR'95)</title>
<meeting>the 3rd International Conference on Document Analysis and Recognition (ICDAR'95)
<address>
<addrLine>Montreal, Canada</addrLine>
</address>
</meeting>
<imprint>
<publisher>IEEE-CS Press</publisher>
<date type="published" when="1995"></date>
<biblScope unit="page" from="1132" to="1135"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<analytic>
<title level="a" type="main">A Ground-Truthing Tool for Layout Analysis Performance Evaluation</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Antonacopoulos</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Meng</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Document Analysis Systems V, Springer Lecture Notes in Computer Science</title>
<editor>D. Lopresti, J. Hu and R. Kashi</editor>
<imprint>
<date type="published" when="2002"></date>
<biblScope unit="page" from="236" to="244"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title level="a" type="main">A Ground-Truthing Engine for Proofsetting, Publishing, Re-Purposing and Quality Assurance</title>
<author>
<persName>
<forename type="first">S</forename>
<forename type="middle">J</forename>
<surname>Simske</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Sturgill</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 2003 ACM Symposium on Document Engineering (DocEng'03)</title>
<meeting>the 2003 ACM Symposium on Document Engineering (DocEng'03)
<address>
<addrLine>Grenoble, France</addrLine>
</address>
</meeting>
<imprint>
<publisher>ACM Press</publisher>
<date type="published" when="2003"></date>
<biblScope unit="page" from="150" to="152"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title level="a" type="main">ICDAR2003 Page Segmentation Competition</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Antonacopoulos</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">B</forename>
<surname>Gatos</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Karatzas</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003)</title>
<meeting>the 7th International Conference on Document Analysis and Recognition (ICDAR2003)
<address>
<addrLine>Edinburgh, UK</addrLine>
</address>
</meeting>
<imprint>
<publisher>IEEE-CS Press</publisher>
<date type="published" when="2003-08"></date>
<biblScope unit="page" from="688" to="692"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">ICDAR2005 Page Segmentation Competition</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Antonacopoulos</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">B</forename>
<surname>Gatos</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Bridson</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR2005)</title>
<meeting>the 8th International Conference on Document Analysis and Recognition (ICDAR2005)
<address>
<addrLine>Seoul, South Korea</addrLine>
</address>
</meeting>
<imprint>
<publisher>IEEE-CS Press</publisher>
<date type="published" when="2005-08"></date>
<biblScope unit="page" from="75" to="79"></biblScope>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

To work with this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002490 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 002490 | SxmlIndent | more
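
Once a record has been extracted with one of the commands above, its bibliographic fields can be read with any XML parser. The following is a minimal sketch in Python, not part of Dilib: it assumes the `<mods>` element carries no XML namespace (as it is displayed in this record) and works on a short excerpt copied from it. The function name `summarize_mods` and the constant `MODS_EXCERPT` are hypothetical names chosen for illustration.

```python
import xml.etree.ElementTree as ET

# Excerpt copied from the <mods> element of the record above.
# Assumption: no XML namespace is declared on <mods>.
MODS_EXCERPT = """
<mods version="3.6">
  <titleInfo lang="en">
    <title>Ground Truth for Layout Analysis Performance Evaluation</title>
  </titleInfo>
  <name type="personal">
    <namePart type="given">A.</namePart>
    <namePart type="family">Antonacopoulos</namePart>
    <role><roleTerm type="text">author</roleTerm></role>
  </name>
  <name type="personal">
    <namePart type="given">D.</namePart>
    <namePart type="family">Karatzas</namePart>
    <role><roleTerm type="text">author</roleTerm></role>
  </name>
  <name type="personal">
    <namePart type="given">D.</namePart>
    <namePart type="family">Bridson</namePart>
    <role><roleTerm type="text">author</roleTerm></role>
  </name>
  <identifier type="DOI">10.1007/11669487_27</identifier>
</mods>
"""


def summarize_mods(xml_text):
    """Return title, author names and DOI from a namespace-free MODS fragment."""
    root = ET.fromstring(xml_text)
    title = root.findtext("titleInfo/title")
    authors = []
    # Only direct children of <mods> are inspected: in the full record,
    # <name> elements nested inside <relatedItem> describe editors.
    for name in root.findall("name"):
        if name.findtext("role/roleTerm") != "author":
            continue
        given = " ".join(p.text for p in name.findall("namePart[@type='given']"))
        family = name.findtext("namePart[@type='family']")
        authors.append(f"{given} {family}")
    doi = root.findtext("identifier[@type='DOI']")
    return {"title": title, "authors": authors, "doi": doi}


summary = summarize_mods(MODS_EXCERPT)
print(summary)
```

The full record also uses prefixed elements such as `istex:document`, so parsing the complete output rather than an excerpt would probably require the matching namespace declarations to be present.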

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:3833F85C0434283A3271B3B19B95DC29B2FDF83F
   |texte=   Ground Truth for Layout Analysis Performance Evaluation
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024