OCR exploration server

A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives

Internal identifier: 000182 (Istex/Corpus); previous: 000181; next: 000183

Authors: Apostolos Antonacopoulos; Dimosthenis Karatzas

RBID : ISTEX:2B98E21C53FF822F406CF03A235F4E9A4B172523

Abstract

This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives.

DOI: 10.1007/978-3-540-28640-0_9

Links to Exploration step

ISTEX:2B98E21C53FF822F406CF03A235F4E9A4B172523

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</title>
<author>
<name sortKey="Antonacopoulos, Apostolos" sort="Antonacopoulos, Apostolos" uniqKey="Antonacopoulos A" first="Apostolos" last="Antonacopoulos">Apostolos Antonacopoulos</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Karatzas, Dimosthenis" sort="Karatzas, Dimosthenis" uniqKey="Karatzas D" first="Dimosthenis" last="Karatzas">Dimosthenis Karatzas</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:2B98E21C53FF822F406CF03A235F4E9A4B172523</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-28640-0_9</idno>
<idno type="url">https://api.istex.fr/document/2B98E21C53FF822F406CF03A235F4E9A4B172523/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000182</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</title>
<author>
<name sortKey="Antonacopoulos, Apostolos" sort="Antonacopoulos, Apostolos" uniqKey="Antonacopoulos A" first="Apostolos" last="Antonacopoulos">Apostolos Antonacopoulos</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Karatzas, Dimosthenis" sort="Karatzas, Dimosthenis" uniqKey="Karatzas D" first="Dimosthenis" last="Karatzas">Dimosthenis Karatzas</name>
<affiliation>
<mods:affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2004</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
<idno type="istex">2B98E21C53FF822F406CF03A235F4E9A4B172523</idno>
<idno type="DOI">10.1007/978-3-540-28640-0_9</idno>
<idno type="ChapterID">9</idno>
<idno type="ChapterID">Chap9</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Apostolos Antonacopoulos</name>
<affiliations>
<json:string>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Dimosthenis Karatzas</name>
<affiliations>
<json:string>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives.</abstract>
<qualityIndicators>
<score>6.259</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>882</abstractCharCount>
<pdfWordCount>4831</pdfWordCount>
<pdfCharCount>29035</pdfCharCount>
<pdfPageCount>12</pdfPageCount>
<abstractWordCount>119</abstractWordCount>
</qualityIndicators>
<title>A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</title>
<genre.original>
<json:string>OriginalPaper</json:string>
</genre.original>
<chapterId>
<json:string>9</json:string>
<json:string>Chap9</json:string>
</chapterId>
<genre>
<json:string>conference [eBooks]</json:string>
</genre>
<serie>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>New York University, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbruecken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2004</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>Simone Marinai</name>
<affiliations>
<json:string>Dipartimento di Sistemi e Informatica, Università di Firenze, Via di Santa Marta 3, 50139, Firenze, Italy</json:string>
<json:string>E-mail: marinai@dsi.unifi.it</json:string>
</affiliations>
</json:item>
<json:item>
<name>Andreas R. Dengel</name>
<affiliations>
<json:string>Knowledge Management Department, German Research Center for Artificial Intelligence (DFKI) GmbH, Kaiserslautern, Germany</json:string>
<json:string>E-mail: Andreas.Dengel@dfki.de</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Pattern Recognition</value>
</json:item>
<json:item>
<value>Information Storage and Retrieval</value>
</json:item>
<json:item>
<value>Image Processing and Computer Vision</value>
</json:item>
<json:item>
<value>Simulation and Modeling</value>
</json:item>
<json:item>
<value>Computer Appl. in Administrative Data Processing</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-540-23060-1</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Document Analysis Systems VI</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>978-3-540-28640-0</json:string>
</bookId>
<volume>3163</volume>
<pages>
<last>101</last>
<first>90</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-540-28640-0</json:string>
</eisbn>
<copyrightDate>2004</copyrightDate>
<doi>
<json:string>10.1007/b100557</json:string>
</doi>
</host>
<publicationDate>2004</publicationDate>
<copyrightDate>2004</copyrightDate>
<doi>
<json:string>10.1007/978-3-540-28640-0_9</json:string>
</doi>
<id>2B98E21C53FF822F406CF03A235F4E9A4B172523</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/2B98E21C53FF822F406CF03A235F4E9A4B172523/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/2B98E21C53FF822F406CF03A235F4E9A4B172523/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/2B98E21C53FF822F406CF03A235F4E9A4B172523/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2004</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</title>
<author>
<persName>
<forename type="first">Apostolos</forename>
<surname>Antonacopoulos</surname>
</persName>
<affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</affiliation>
</author>
<author>
<persName>
<forename type="first">Dimosthenis</forename>
<surname>Karatzas</surname>
</persName>
<affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</affiliation>
</author>
</analytic>
<monogr>
<title level="m">Document Analysis Systems VI</title>
<title level="m" type="sub">6th International Workshop, DAS 2004, Florence, Italy, September 8 - 10, 2004. Proceedings</title>
<idno type="pISBN">978-3-540-23060-1</idno>
<idno type="eISBN">978-3-540-28640-0</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/b100557</idno>
<idno type="BookID">978-3-540-28640-0</idno>
<idno type="BookTitleID">112492</idno>
<idno type="BookSequenceNumber">3163</idno>
<idno type="BookVolumeNumber">3163</idno>
<idno type="BookChapterCount">53</idno>
<editor>
<persName>
<forename type="first">Simone</forename>
<surname>Marinai</surname>
</persName>
<email>marinai@dsi.unifi.it</email>
<affiliation>Dipartimento di Sistemi e Informatica, Università di Firenze, Via di Santa Marta 3, 50139, Firenze, Italy</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Andreas</forename>
<forename type="first">R.</forename>
<surname>Dengel</surname>
</persName>
<email>Andreas.Dengel@dfki.de</email>
<affiliation>Knowledge Management Department, German Research Center for Artificial Intelligence (DFKI) GmbH, Kaiserslautern, Germany</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2004"></date>
<biblScope unit="volume">3163</biblScope>
<biblScope unit="page" from="90">90</biblScope>
<biblScope unit="page" to="101">101</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>New York University, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
</editor>
<biblScope>
<date>2004</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">2B98E21C53FF822F406CF03A235F4E9A4B172523</idno>
<idno type="DOI">10.1007/978-3-540-28640-0_9</idno>
<idno type="ChapterID">9</idno>
<idno type="ChapterID">Chap9</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2004</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I2203X</label>
<label>I18032</label>
<label>I22021</label>
<label>I21025</label>
<label>I2301X</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Pattern Recognition</term>
</item>
<item>
<term>Information Storage and Retrieval</term>
</item>
<item>
<term>Image Processing and Computer Vision</term>
</item>
<item>
<term>Simulation and Modeling</term>
</item>
<item>
<term>Computer Appl. in Administrative Data Processing</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2004">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-19">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/2B98E21C53FF822F406CF03A235F4E9A4B172523/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgName>Stanford University</OrgName>
<OrgAddress>
<City>CA</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgName>University of Bern</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<City>MA</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgName>New York University</OrgName>
<OrgAddress>
<City>NY</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbruecken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0">
<BookID>978-3-540-28640-0</BookID>
<BookTitle>Document Analysis Systems VI</BookTitle>
<BookSubTitle>6th International Workshop, DAS 2004, Florence, Italy, September 8 - 10, 2004. Proceedings</BookSubTitle>
<BookVolumeNumber>3163</BookVolumeNumber>
<BookSequenceNumber>3163</BookSequenceNumber>
<BookDOI>10.1007/b100557</BookDOI>
<BookTitleID>112492</BookTitleID>
<BookPrintISBN>978-3-540-23060-1</BookPrintISBN>
<BookElectronicISBN>978-3-540-28640-0</BookElectronicISBN>
<BookChapterCount>53</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2004</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I2203X" Priority="1" Type="Secondary">Pattern Recognition</BookSubject>
<BookSubject Code="I18032" Priority="2" Type="Secondary">Information Storage and Retrieval</BookSubject>
<BookSubject Code="I22021" Priority="3" Type="Secondary">Image Processing and Computer Vision</BookSubject>
<BookSubject Code="I21025" Priority="4" Type="Secondary">Simulation and Modeling</BookSubject>
<BookSubject Code="I2301X" Priority="5" Type="Secondary">Computer Appl. in Administrative Data Processing</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Simone</GivenName>
<FamilyName>Marinai</FamilyName>
</EditorName>
<Contact>
<Email>marinai@dsi.unifi.it</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Andreas</GivenName>
<GivenName>R.</GivenName>
<FamilyName>Dengel</FamilyName>
</EditorName>
<Contact>
<Email>Andreas.Dengel@dfki.de</Email>
</Contact>
</Editor>
<Affiliation ID="Aff16">
<OrgDivision>Dipartimento di Sistemi e Informatica</OrgDivision>
<OrgName>Università di Firenze</OrgName>
<OrgAddress>
<Street>Via di Santa Marta 3</Street>
<Postcode>50139</Postcode>
<City>Firenze</City>
<Country>Italy</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff17">
<OrgName>Knowledge Management Department, German Research Center for Artificial Intelligence (DFKI) GmbH</OrgName>
<OrgAddress>
<City>Kaiserslautern</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part2">
<PartInfo TocLevels="0">
<PartID>2</PartID>
<PartSequenceNumber>2</PartSequenceNumber>
<PartTitle>Historical Documents</PartTitle>
<PartChapterCount>8</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>Document Analysis Systems VI</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap9" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0">
<ChapterID>9</ChapterID>
<ChapterDOI>10.1007/978-3-540-28640-0_9</ChapterDOI>
<ChapterSequenceNumber>9</ChapterSequenceNumber>
<ChapterTitle Language="En">A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</ChapterTitle>
<ChapterFirstPage>90</ChapterFirstPage>
<ChapterLastPage>101</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2004</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>2</PartID>
<BookID>978-3-540-28640-0</BookID>
<BookTitle>Document Analysis Systems VI</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>Apostolos</GivenName>
<FamilyName>Antonacopoulos</FamilyName>
</AuthorName>
<Contact>
<URL>http://www.csc.liv.ac.uk/~prima</URL>
</Contact>
</Author>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>Dimosthenis</GivenName>
<FamilyName>Karatzas</FamilyName>
</AuthorName>
<Contact>
<URL>http://www.csc.liv.ac.uk/~prima</URL>
</Contact>
</Author>
<Affiliation ID="Aff18">
<OrgDivision>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science</OrgDivision>
<OrgName>University of Liverpool</OrgName>
<OrgAddress>
<City>Liverpool</City>
<Postcode>L69 3BX</Postcode>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives.</Para>
</Abstract>
<ArticleNote Type="Misc">
<SimplePara>This work is supported by the European Union grant IST-2001-33441.</SimplePara>
</ArticleNote>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives</title>
</titleInfo>
<name type="personal">
<namePart type="given">Apostolos</namePart>
<namePart type="family">Antonacopoulos</namePart>
<affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Dimosthenis</namePart>
<namePart type="family">Karatzas</namePart>
<affiliation>Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science, University of Liverpool, L69 3BX, Liverpool, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2004</dateIssued>
<copyrightDate encoding="w3cdtf">2004</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Document Analysis Systems VI</title>
<subTitle>6th International Workshop, DAS 2004, Florence, Italy, September 8 - 10, 2004. Proceedings</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Simone</namePart>
<namePart type="family">Marinai</namePart>
<affiliation>Dipartimento di Sistemi e Informatica, Università di Firenze, Via di Santa Marta 3, 50139, Firenze, Italy</affiliation>
<affiliation>E-mail: marinai@dsi.unifi.it</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Andreas</namePart>
<namePart type="given">R.</namePart>
<namePart type="family">Dengel</namePart>
<affiliation>Knowledge Management Department, German Research Center for Artificial Intelligence (DFKI) GmbH, Kaiserslautern, Germany</affiliation>
<affiliation>E-mail: Andreas.Dengel@dfki.de</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2004</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I2203X">Pattern Recognition</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18032">Information Storage and Retrieval</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22021">Image Processing and Computer Vision</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21025">Simulation and Modeling</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I2301X">Computer Appl. in Administrative Data Processing</topic>
</subject>
<identifier type="DOI">10.1007/b100557</identifier>
<identifier type="ISBN">978-3-540-23060-1</identifier>
<identifier type="eISBN">978-3-540-28640-0</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">112492</identifier>
<identifier type="BookID">978-3-540-28640-0</identifier>
<identifier type="BookChapterCount">53</identifier>
<identifier type="BookVolumeNumber">3163</identifier>
<identifier type="BookSequenceNumber">3163</identifier>
<identifier type="PartChapterCount">8</identifier>
<part>
<date>2004</date>
<detail type="part">
<title>Historical Documents</title>
</detail>
<detail type="volume">
<number>3163</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>90</start>
<end>101</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2004</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>New York University, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2004</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2004</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">2B98E21C53FF822F406CF03A235F4E9A4B172523</identifier>
<identifier type="DOI">10.1007/978-3-540-28640-0_9</identifier>
<identifier type="ChapterID">9</identifier>
<identifier type="ChapterID">Chap9</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2004</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2004</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/2B98E21C53FF822F406CF03A235F4E9A4B172523/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">Digital Mountain: From Granite Archive to Global Access</title>
<author>
<persName>
<forename type="first">W</forename>
<surname>Barrett</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Hutchinson</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Quass</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Nielson</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Kennard</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the International Workshop on Document Image Analysis for Libraries (DIAL2004)</title>
<meeting>the International Workshop on Document Image Analysis for Libraries (DIAL2004)
<address>
<addrLine>Palo Alto, USA</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2004"></date>
<biblScope unit="page" from="104" to="121"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<analytic>
<title level="a" type="main">Computerising Natural History Card Archives</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Downton</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Lucas</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Patoulas</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Beccaloni</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Scoble</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Robinson</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003)</title>
<meeting>the 7th International Conference on Document Analysis and Recognition (ICDAR2003)
<address>
<addrLine>Edinburgh, UK</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2003"></date>
<biblScope unit="page" from="354" to="358"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<monogr>
<title level="m" type="main">IsyReADeT project, IST-1999-57462, www.isyreadet.net</title>
<imprint></imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">A General System for the Retrieval of Document Images from Digital Libraries MEMORIAL Consortium.: Specification of a Personal Record Paper Document Layout</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Marinai</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">E</forename>
<surname>Marino</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">F</forename>
<surname>Cesarini</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Soda</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the International Workshop on Document Image Analysis for Libraries (DIAL2004)</title>
<meeting>the International Workshop on Document Image Analysis for Libraries (DIAL2004)
<address>
<addrLine>Palo Alto, USA</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2002"></date>
<biblScope unit="page" from="150" to="173"></biblScope>
</imprint>
</monogr>
<note>Report. D2</note>
</biblStruct>
<biblStruct xml:id="b4">
<monogr>
<title level="m" type="main">An Introduction To Digital Image Processing</title>
<author>
<persName>
<forename type="first">W</forename>
<surname>Niblack</surname>
</persName>
</author>
<imprint>
<date type="published" when="1986"></date>
<publisher>Prentice-Hall</publisher>
<pubPlace>London</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title level="a" type="main">A threshold selection method from gray-level histograms</title>
<author>
<persName>
<forename type="first">N</forename>
<surname>Otsu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Transactions on Systems, Man and Cybernetics</title>
<imprint>
<biblScope unit="volume">9</biblScope>
<biblScope unit="page" from="62" to="66"></biblScope>
<date type="published" when="1979"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title></title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Sonka</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">V</forename>
<surname>Hlavac</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Boyle</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Image Processing, Analysis and Machine Vision</title>
<imprint>
<publisher>PWS Publishing</publisher>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">Threshold Evaluation Techniques</title>
<author>
<persName>
<forename type="first">J</forename>
<forename type="middle">S</forename>
<surname>Weszka</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Rosenfeld</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Transactions on Systems, Man and Cybernetics</title>
<imprint>
<biblScope unit="volume">8</biblScope>
<biblScope unit="page" from="622" to="629"></biblScope>
<date type="published" when="1978"></date>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000182 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000182 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:2B98E21C53FF822F406CF03A235F4E9A4B172523
   |texte=   A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024