An improved physically-based method for geometric restoration of distorted document images.
Internal identifier: 000B35 (Main/Merge); previous: 000B34; next: 000B36
Authors: Li Zhang [Singapore]; Yu Zhang; Chew Tan
Source:
- IEEE transactions on pattern analysis and machine intelligence [ 0162-8828 ] ; 2008.
English descriptors
- KwdEn :
- Algorithms, Artifacts, Artificial Intelligence, Automatic Data Processing (methods), Documentation (methods), Image Enhancement (methods), Image Interpretation, Computer-Assisted (methods), Imaging, Three-Dimensional (methods), Pattern Recognition, Automated (methods), Reproducibility of Results, Sensitivity and Specificity.
- MESH :
Abstract
In document digitization through camera-based systems, simple imaging setups often produce geometric distortions in the resultant 2D images because of the non-planar geometric shapes of certain documents, such as thick bound books and rolled, folded, or crumpled materials. Previous works have demonstrated that arbitrarily warped documents can be successfully restored by flattening a 3D scan of the document. These approaches use physically-based or relaxation-based techniques in their flattening process. While these techniques have been shown to be effective in rectifying the image content and improving OCR, they have several limitations in terms of speed and stability. In this paper, we propose a distance-based penalty metric to replace the mass-spring model and introduce additional bending resistance and drag forces to improve the efficiency of the existing approaches. The use of Verlet integration and special plane collision handling schemes also helps to achieve better stability without sacrificing efficiency. Experiments on various document images captured from books, brochures, and historical documents with arbitrary warpings have demonstrated large improvements over the existing approaches in terms of stability and efficiency.
DOI: 10.1109/TPAMI.2007.70831
PubMed: 18276976
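The abstract names Verlet integration, distance-based constraints, drag forces, and plane collision handling without giving details. The following is a minimal sketch of how those ingredients can fit together, not the authors' implementation. It makes several assumptions that are not in the record: the surface is reduced to a chain of particles, a constant downward acceleration serves as the flattening force, distance preservation is done by iterative position projection (a common stand-in, not necessarily the paper's penalty metric), bending resistance is omitted, and the names `make_arc_chain` and `flatten_step` as well as all parameter values are hypothetical.

```python
import math

def make_arc_chain(n=5, radius=1.0):
    """Hypothetical test shape: n particles on a half circle in the x-z plane."""
    pos = [[radius * math.cos(math.pi * k / (n - 1)), 0.0,
            radius * math.sin(math.pi * k / (n - 1))] for k in range(n)]
    edges = [(k, k + 1) for k in range(n - 1)]
    # Rest lengths are the distances measured on the warped shape.
    rest = [math.dist(pos[i], pos[j]) for i, j in edges]
    return pos, edges, rest

def flatten_step(pos, prev, edges, rest, dt=0.01, drag=0.02, gravity=9.8, iters=10):
    """One Verlet step with drag, distance constraints, and a plane at z = 0."""
    new = []
    for p, q in zip(pos, prev):
        # Verlet: velocity is implicit in (p - q); drag scales it down.
        x = [p[a] + (1.0 - drag) * (p[a] - q[a]) for a in range(3)]
        x[2] -= gravity * dt * dt  # assumed flattening force toward the plane
        new.append(x)
    for _ in range(iters):
        # Distance constraints: move both endpoints to restore the rest length.
        for (i, j), r in zip(edges, rest):
            d = [new[j][a] - new[i][a] for a in range(3)]
            length = math.sqrt(sum(c * c for c in d))
            if length > 1e-12:
                s = 0.5 * (length - r) / length
                for a in range(3):
                    new[i][a] += s * d[a]
                    new[j][a] -= s * d[a]
        # Plane collision: no particle may pass below z = 0.
        for x in new:
            x[2] = max(x[2], 0.0)
    # The current positions become the "previous" positions of the next step.
    return new, pos

if __name__ == "__main__":
    pos, edges, rest = make_arc_chain()
    prev = pos  # identical positions mean zero initial velocity
    for _ in range(2000):
        pos, prev = flatten_step(pos, prev, edges, rest)
    print("max height:", max(p[2] for p in pos))
```

In this sketch the drag term and the clamp at z = 0 are what remove energy from the system; the projection loop keeps neighbouring particles at their measured distances while the chain settles onto the plane.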
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: 000051
- to stream PubMed, to step Curation: 000051
- to stream PubMed, to step Checkpoint: 000051
- to stream Ncbi, to step Merge: 000049
- to stream Ncbi, to step Curation: 000049
- to stream Ncbi, to step Checkpoint: 000049
Links to Exploration step
pubmed:18276976
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">An improved physically-based method for geometric restoration of distorted document images.</title>
<author><name sortKey="Zhang, Li" sort="Zhang, Li" uniqKey="Zhang L" first="Li" last="Zhang">Li Zhang</name>
<affiliation wicri:level="4"><nlm:affiliation>School of Computing, National University of Singapore, 3 Science Drive #2, Singapore. zhangli@comp.nus.edu.sg</nlm:affiliation>
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive #2</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author><name sortKey="Zhang, Yu" sort="Zhang, Yu" uniqKey="Zhang Y" first="Yu" last="Zhang">Yu Zhang</name>
</author>
<author><name sortKey="Tan, Chew" sort="Tan, Chew" uniqKey="Tan C" first="Chew" last="Tan">Chew Tan</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2008">2008</date>
<idno type="doi">10.1109/TPAMI.2007.70831</idno>
<idno type="RBID">pubmed:18276976</idno>
<idno type="pmid">18276976</idno>
<idno type="wicri:Area/PubMed/Corpus">000051</idno>
<idno type="wicri:Area/PubMed/Curation">000051</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000051</idno>
<idno type="wicri:Area/Ncbi/Merge">000049</idno>
<idno type="wicri:Area/Ncbi/Curation">000049</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000049</idno>
<idno type="wicri:doubleKey">0162-8828:2008:Zhang L:an:improved:physically</idno>
<idno type="wicri:Area/Main/Merge">000B35</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">An improved physically-based method for geometric restoration of distorted document images.</title>
<author><name sortKey="Zhang, Li" sort="Zhang, Li" uniqKey="Zhang L" first="Li" last="Zhang">Li Zhang</name>
<affiliation wicri:level="4"><nlm:affiliation>School of Computing, National University of Singapore, 3 Science Drive #2, Singapore. zhangli@comp.nus.edu.sg</nlm:affiliation>
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive #2</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author><name sortKey="Zhang, Yu" sort="Zhang, Yu" uniqKey="Zhang Y" first="Yu" last="Zhang">Yu Zhang</name>
</author>
<author><name sortKey="Tan, Chew" sort="Tan, Chew" uniqKey="Tan C" first="Chew" last="Tan">Chew Tan</name>
</author>
</analytic>
<series><title level="j">IEEE transactions on pattern analysis and machine intelligence</title>
<idno type="ISSN">0162-8828</idno>
<imprint><date when="2008" type="published">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithms</term>
<term>Artifacts</term>
<term>Artificial Intelligence</term>
<term>Automatic Data Processing (methods)</term>
<term>Documentation (methods)</term>
<term>Image Enhancement (methods)</term>
<term>Image Interpretation, Computer-Assisted (methods)</term>
<term>Imaging, Three-Dimensional (methods)</term>
<term>Pattern Recognition, Automated (methods)</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en"><term>Automatic Data Processing</term>
<term>Documentation</term>
<term>Image Enhancement</term>
<term>Image Interpretation, Computer-Assisted</term>
<term>Imaging, Three-Dimensional</term>
<term>Pattern Recognition, Automated</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Algorithms</term>
<term>Artifacts</term>
<term>Artificial Intelligence</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">In document digitization through camera-based systems, simple imaging setups often produce geometric distortions in the resultant 2D images because of the non-planar geometric shapes of certain documents, such as thick bound books and rolled, folded, or crumpled materials. Previous works have demonstrated that arbitrarily warped documents can be successfully restored by flattening a 3D scan of the document. These approaches use physically-based or relaxation-based techniques in their flattening process. While these techniques have been shown to be effective in rectifying the image content and improving OCR, they have several limitations in terms of speed and stability. In this paper, we propose a distance-based penalty metric to replace the mass-spring model and introduce additional bending resistance and drag forces to improve the efficiency of the existing approaches. The use of Verlet integration and special plane collision handling schemes also helps to achieve better stability without sacrificing efficiency. Experiments on various document images captured from books, brochures, and historical documents with arbitrary warpings have demonstrated large improvements over the existing approaches in terms of stability and efficiency.</div>
</front>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000B35 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000B35 | SxmlIndent | more
To add a link to this page in the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Merge |type= RBID |clé= pubmed:18276976 |texte= An improved physically-based method for geometric restoration of distorted document images. }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Merge/RBID.i -Sk "pubmed:18276976" \
| HfdSelect -Kh $EXPLOR_AREA/Data/Main/Merge/biblio.hfd \
| NlmPubMed2Wicri -a OcrV1
This area was generated with Dilib version V0.6.32.