Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Goal-oriented rectification of camera-based document images.

Identifieur interne : 000040 ( PubMed/Curation ); précédent : 000039; suivant : 000041

Goal-oriented rectification of camera-based document images.

Auteurs : Nikolaos Stamatopoulos [Grèce] ; Basilis Gatos ; Ioannis Pratikakis ; Stavros J. Perantonis

Source :

RBID : pubmed:20876019

English descriptors

Abstract

Document digitization with either flatbed scanners or camera-based systems results in document images which often suffer from warping and perspective distortions that deteriorate the performance of current OCR approaches. In this paper, we present a goal-oriented rectification methodology to compensate for undesirable document image distortions aiming to improve the OCR result. Our approach relies upon a coarse-to-fine strategy. First, a coarse rectification is accomplished with the aid of a computationally low cost transformation which addresses the projection of a curved surface to a 2-D rectangular area. The projection of the curved surface on the plane is guided only by the textual content's appearance in the document image while incorporating a transformation which does not depend on specific model primitives or camera setup parameters. Second, pose normalization is applied on the word level aiming to restore all the local distortions of the document image. Experimental results on various document images with a variety of distortions demonstrate the robustness and effectiveness of the proposed rectification methodology using a consistent evaluation methodology that encounters OCR accuracy and a newly introduced measure using a semi-automatic procedure.

DOI: 10.1109/TIP.2010.2080280
PubMed: 20876019

Links toward previous steps (curation, corpus...)


Links to Exploration step

pubmed:20876019

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Goal-oriented rectification of camera-based document images.</title>
<author>
<name sortKey="Stamatopoulos, Nikolaos" sort="Stamatopoulos, Nikolaos" uniqKey="Stamatopoulos N" first="Nikolaos" last="Stamatopoulos">Nikolaos Stamatopoulos</name>
<affiliation wicri:level="1">
<nlm:affiliation>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece. nstam@iit.demokritos.gr</nlm:affiliation>
<country xml:lang="fr">Grèce</country>
<wicri:regionArea>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Gatos, Basilis" sort="Gatos, Basilis" uniqKey="Gatos B" first="Basilis" last="Gatos">Basilis Gatos</name>
</author>
<author>
<name sortKey="Pratikakis, Ioannis" sort="Pratikakis, Ioannis" uniqKey="Pratikakis I" first="Ioannis" last="Pratikakis">Ioannis Pratikakis</name>
</author>
<author>
<name sortKey="Perantonis, Stavros J" sort="Perantonis, Stavros J" uniqKey="Perantonis S" first="Stavros J" last="Perantonis">Stavros J. Perantonis</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2011">2011</date>
<idno type="doi">10.1109/TIP.2010.2080280</idno>
<idno type="RBID">pubmed:20876019</idno>
<idno type="pmid">20876019</idno>
<idno type="wicri:Area/PubMed/Corpus">000040</idno>
<idno type="wicri:Area/PubMed/Curation">000040</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Goal-oriented rectification of camera-based document images.</title>
<author>
<name sortKey="Stamatopoulos, Nikolaos" sort="Stamatopoulos, Nikolaos" uniqKey="Stamatopoulos N" first="Nikolaos" last="Stamatopoulos">Nikolaos Stamatopoulos</name>
<affiliation wicri:level="1">
<nlm:affiliation>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece. nstam@iit.demokritos.gr</nlm:affiliation>
<country xml:lang="fr">Grèce</country>
<wicri:regionArea>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Gatos, Basilis" sort="Gatos, Basilis" uniqKey="Gatos B" first="Basilis" last="Gatos">Basilis Gatos</name>
</author>
<author>
<name sortKey="Pratikakis, Ioannis" sort="Pratikakis, Ioannis" uniqKey="Pratikakis I" first="Ioannis" last="Pratikakis">Ioannis Pratikakis</name>
</author>
<author>
<name sortKey="Perantonis, Stavros J" sort="Perantonis, Stavros J" uniqKey="Perantonis S" first="Stavros J" last="Perantonis">Stavros J. Perantonis</name>
</author>
</analytic>
<series>
<title level="j">IEEE transactions on image processing : a publication of the IEEE Signal Processing Society</title>
<idno type="eISSN">1941-0042</idno>
<imprint>
<date when="2011" type="published">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Artificial Intelligence</term>
<term>Automatic Data Processing (methods)</term>
<term>Documentation (methods)</term>
<term>Image Enhancement (methods)</term>
<term>Image Interpretation, Computer-Assisted (methods)</term>
<term>Pattern Recognition, Automated (methods)</term>
<term>Photography (methods)</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Automatic Data Processing</term>
<term>Documentation</term>
<term>Image Enhancement</term>
<term>Image Interpretation, Computer-Assisted</term>
<term>Pattern Recognition, Automated</term>
<term>Photography</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Artificial Intelligence</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Document digitization with either flatbed scanners or camera-based systems results in document images which often suffer from warping and perspective distortions that deteriorate the performance of current OCR approaches. In this paper, we present a goal-oriented rectification methodology to compensate for undesirable document image distortions aiming to improve the OCR result. Our approach relies upon a coarse-to-fine strategy. First, a coarse rectification is accomplished with the aid of a computationally low cost transformation which addresses the projection of a curved surface to a 2-D rectangular area. The projection of the curved surface on the plane is guided only by the textual content's appearance in the document image while incorporating a transformation which does not depend on specific model primitives or camera setup parameters. Second, pose normalization is applied on the word level aiming to restore all the local distortions of the document image. Experimental results on various document images with a variety of distortions demonstrate the robustness and effectiveness of the proposed rectification methodology using a consistent evaluation methodology that encounters OCR accuracy and a newly introduced measure using a semi-automatic procedure.</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Owner="NLM" Status="MEDLINE">
<PMID Version="1">20876019</PMID>
<DateCreated>
<Year>2011</Year>
<Month>03</Month>
<Day>22</Day>
</DateCreated>
<DateCompleted>
<Year>2011</Year>
<Month>08</Month>
<Day>16</Day>
</DateCompleted>
<Article PubModel="Print-Electronic">
<Journal>
<ISSN IssnType="Electronic">1941-0042</ISSN>
<JournalIssue CitedMedium="Internet">
<Volume>20</Volume>
<Issue>4</Issue>
<PubDate>
<Year>2011</Year>
<Month>Apr</Month>
</PubDate>
</JournalIssue>
<Title>IEEE transactions on image processing : a publication of the IEEE Signal Processing Society</Title>
<ISOAbbreviation>IEEE Trans Image Process</ISOAbbreviation>
</Journal>
<ArticleTitle>Goal-oriented rectification of camera-based document images.</ArticleTitle>
<Pagination>
<MedlinePgn>910-20</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1109/TIP.2010.2080280</ELocationID>
<Abstract>
<AbstractText>Document digitization with either flatbed scanners or camera-based systems results in document images which often suffer from warping and perspective distortions that deteriorate the performance of current OCR approaches. In this paper, we present a goal-oriented rectification methodology to compensate for undesirable document image distortions aiming to improve the OCR result. Our approach relies upon a coarse-to-fine strategy. First, a coarse rectification is accomplished with the aid of a computationally low cost transformation which addresses the projection of a curved surface to a 2-D rectangular area. The projection of the curved surface on the plane is guided only by the textual content's appearance in the document image while incorporating a transformation which does not depend on specific model primitives or camera setup parameters. Second, pose normalization is applied on the word level aiming to restore all the local distortions of the document image. Experimental results on various document images with a variety of distortions demonstrate the robustness and effectiveness of the proposed rectification methodology using a consistent evaluation methodology that encounters OCR accuracy and a newly introduced measure using a semi-automatic procedure.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Stamatopoulos</LastName>
<ForeName>Nikolaos</ForeName>
<Initials>N</Initials>
<AffiliationInfo>
<Affiliation>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece. nstam@iit.demokritos.gr</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Gatos</LastName>
<ForeName>Basilis</ForeName>
<Initials>B</Initials>
</Author>
<Author ValidYN="Y">
<LastName>Pratikakis</LastName>
<ForeName>Ioannis</ForeName>
<Initials>I</Initials>
</Author>
<Author ValidYN="Y">
<LastName>Perantonis</LastName>
<ForeName>Stavros J</ForeName>
<Initials>SJ</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList>
<PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
</PublicationTypeList>
<ArticleDate DateType="Electronic">
<Year>2010</Year>
<Month>09</Month>
<Day>27</Day>
</ArticleDate>
</Article>
<MedlineJournalInfo>
<Country>United States</Country>
<MedlineTA>IEEE Trans Image Process</MedlineTA>
<NlmUniqueID>9886191</NlmUniqueID>
<ISSNLinking>1057-7149</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D000465">Algorithms</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="Y" UI="D001185">Artificial Intelligence</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D001330">Automatic Data Processing</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D004282">Documentation</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D007089">Image Enhancement</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D007090">Image Interpretation, Computer-Assisted</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D010363">Pattern Recognition, Automated</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D010781">Photography</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D015203">Reproducibility of Results</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D012680">Sensitivity and Specificity</DescriptorName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="aheadofprint">
<Year>2010</Year>
<Month>9</Month>
<Day>27</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez">
<Year>2010</Year>
<Month>9</Month>
<Day>30</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed">
<Year>2010</Year>
<Month>9</Month>
<Day>30</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2011</Year>
<Month>8</Month>
<Day>17</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="doi">10.1109/TIP.2010.2080280</ArticleId>
<ArticleId IdType="pubmed">20876019</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PubMed/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000040 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd -nk 000040 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PubMed
   |étape=   Curation
   |type=    RBID
   |clé=     pubmed:20876019
   |texte=   Goal-oriented rectification of camera-based document images.
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Curation/RBID.i   -Sk "pubmed:20876019" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024