Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Correcting Bound Document Images Based on Automatic and Robust Curved Text Lines Estimation

Identifieur interne : 000F90 ( Main/Merge ); précédent : 000F89; suivant : 000F91

Correcting Bound Document Images Based on Automatic and Robust Curved Text Lines Estimation

Auteurs : Yichao Ma [République populaire de Chine] ; Chunheng Wang [République populaire de Chine] ; Ruwei Dai [République populaire de Chine]

Source :

RBID : ISTEX:12EA9A149B9CEC59E4428289F1F833DAFAEDF7B1

Abstract

Abstract: Geometric distortion often occurs when taking images of bound documents. This phenomenon greatly impairs recognition accuracy. In this paper, a new one-image based method is proposed to correct geometric distortion in bound document images. According to this method, the document image is binarized first. Next, curved text-line features are extracted. Thirdly, locally optimized text curves are detected using a graph model. Finally, the technique of texture warping is applied to correct the image. Experimental results show that images restored by our proposed method can achieve good perception and recognition results.

Url:
DOI: 10.1007/11940098_21

Links toward previous steps (curation, corpus...)


Links to Exploration step

ISTEX:12EA9A149B9CEC59E4428289F1F833DAFAEDF7B1

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Correcting Bound Document Images Based on Automatic and Robust Curved Text Lines Estimation</title>
<author>
<name sortKey="Ma, Yichao" sort="Ma, Yichao" uniqKey="Ma Y" first="Yichao" last="Ma">Yichao Ma</name>
</author>
<author>
<name sortKey="Wang, Chunheng" sort="Wang, Chunheng" uniqKey="Wang C" first="Chunheng" last="Wang">Chunheng Wang</name>
</author>
<author>
<name sortKey="Dai, Ruwei" sort="Dai, Ruwei" uniqKey="Dai R" first="Ruwei" last="Dai">Ruwei Dai</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:12EA9A149B9CEC59E4428289F1F833DAFAEDF7B1</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/11940098_21</idno>
<idno type="url">https://api.istex.fr/document/12EA9A149B9CEC59E4428289F1F833DAFAEDF7B1/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000A94</idno>
<idno type="wicri:Area/Istex/Curation">000A81</idno>
<idno type="wicri:Area/Istex/Checkpoint">000946</idno>
<idno type="wicri:doubleKey">0302-9743:2006:Ma Y:correcting:bound:document</idno>
<idno type="wicri:Area/Main/Merge">000F90</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Correcting Bound Document Images Based on Automatic and Robust Curved Text Lines Estimation</title>
<author>
<name sortKey="Ma, Yichao" sort="Ma, Yichao" uniqKey="Ma Y" first="Yichao" last="Ma">Yichao Ma</name>
<affiliation wicri:level="3">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Complex System and Intelligent Science, Institute of Automation, Chinese Academy of Science, Zhongguancun East Rd, No.95, Haidian Dist, 100080, Beijing</wicri:regionArea>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Wang, Chunheng" sort="Wang, Chunheng" uniqKey="Wang C" first="Chunheng" last="Wang">Chunheng Wang</name>
<affiliation wicri:level="3">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Complex System and Intelligent Science, Institute of Automation, Chinese Academy of Science, Zhongguancun East Rd, No.95, Haidian Dist, 100080, Beijing</wicri:regionArea>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Dai, Ruwei" sort="Dai, Ruwei" uniqKey="Dai R" first="Ruwei" last="Dai">Ruwei Dai</name>
<affiliation wicri:level="3">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Complex System and Intelligent Science, Institute of Automation, Chinese Academy of Science, Zhongguancun East Rd, No.95, Haidian Dist, 100080, Beijing</wicri:regionArea>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2006</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">12EA9A149B9CEC59E4428289F1F833DAFAEDF7B1</idno>
<idno type="DOI">10.1007/11940098_21</idno>
<idno type="ChapterID">21</idno>
<idno type="ChapterID">Chap21</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Geometric distortion often occurs when taking images of bound documents. This phenomenon greatly impairs recognition accuracy. In this paper, a new one-image based method is proposed to correct geometric distortion in bound document images. According to this method, the document image is binarized first. Next, curved text-line features are extracted. Thirdly, locally optimized text curves are detected using a graph model. Finally, the technique of texture warping is applied to correct the image. Experimental results show that images restored by our proposed method can achieve good perception and recognition results.</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000F90 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000F90 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     ISTEX:12EA9A149B9CEC59E4428289F1F833DAFAEDF7B1
   |texte=   Correcting Bound Document Images Based on Automatic and Robust Curved Text Lines Estimation
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024