Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Foreground Text Extraction in Color Document Images for Enhanced Readability

Identifieur interne : 000493 ( Istex/Checkpoint ); précédent : 000492; suivant : 000494

Foreground Text Extraction in Color Document Images for Enhanced Readability

Auteurs : S. Nirmala [Inde] ; P. Nagabhushan [Inde]

Source :

RBID : ISTEX:CE1EB4781754E663A4A979A2CDE1213BC6B2DBF9

Abstract

Abstract: Quite often it is observed that text information in documents is printed on colorful complex background. Smooth reading of text content in such documents is difficult due to background patterns and mix up of foreground text color with background color. Further the character recognition rate when such documents are OCRed, is low. In this paper we are presenting a novel approach for extraction of text information in complex color document images. The proposed approach is a three stage process. In the first stage the edge map is obtained utilizing the Canny edge operator. The edge map is split into blocks of uniform size and image blocks are classified as text or non-text. In each text block the possible text regions are identified and enclosed in tight bounding boxes using x-y cut on edge pixels. Further the text regions that are immediate adjacent to each other in vertical direction in which the character(s) are split horizontally are merged so as to enclose the character(s) fully in one text region. In the second stage certain amount of false text regions are eliminated based on a property of printed text. In the last stage the foreground text in each text region is extracted by unsupervised thresholding using the data of refined text regions. We conducted exhaustive experiments on documents having variety of background complexities with printed foreground text in any color, font and tilt. The experimental evaluations show that on an average 98.03% of text is identified. The processed document images showed better performance when OCRed compared with the corresponding unprocessed source document images.

Url:
DOI: 10.1007/978-3-642-11164-8_63


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

ISTEX:CE1EB4781754E663A4A979A2CDE1213BC6B2DBF9

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Foreground Text Extraction in Color Document Images for Enhanced Readability</title>
<author>
<name sortKey="Nirmala, S" sort="Nirmala, S" uniqKey="Nirmala S" first="S." last="Nirmala">S. Nirmala</name>
</author>
<author>
<name sortKey="Nagabhushan, P" sort="Nagabhushan, P" uniqKey="Nagabhushan P" first="P." last="Nagabhushan">P. Nagabhushan</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:CE1EB4781754E663A4A979A2CDE1213BC6B2DBF9</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/978-3-642-11164-8_63</idno>
<idno type="url">https://api.istex.fr/document/CE1EB4781754E663A4A979A2CDE1213BC6B2DBF9/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001189</idno>
<idno type="wicri:Area/Istex/Curation">001118</idno>
<idno type="wicri:Area/Istex/Checkpoint">000493</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Foreground Text Extraction in Color Document Images for Enhanced Readability</title>
<author>
<name sortKey="Nirmala, S" sort="Nirmala, S" uniqKey="Nirmala S" first="S." last="Nirmala">S. Nirmala</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Inde</country>
<wicri:regionArea>Dept of Studies in Computer Science, University of Mysore, 570 006, Mysore</wicri:regionArea>
<wicri:noRegion>Mysore</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Inde</country>
</affiliation>
</author>
<author>
<name sortKey="Nagabhushan, P" sort="Nagabhushan, P" uniqKey="Nagabhushan P" first="P." last="Nagabhushan">P. Nagabhushan</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Inde</country>
<wicri:regionArea>Dept of Studies in Computer Science, University of Mysore, 570 006, Mysore</wicri:regionArea>
<wicri:noRegion>Mysore</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Inde</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2009</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">CE1EB4781754E663A4A979A2CDE1213BC6B2DBF9</idno>
<idno type="DOI">10.1007/978-3-642-11164-8_63</idno>
<idno type="ChapterID">63</idno>
<idno type="ChapterID">Chap63</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Quite often it is observed that text information in documents is printed on colorful complex background. Smooth reading of text content in such documents is difficult due to background patterns and mix up of foreground text color with background color. Further the character recognition rate when such documents are OCRed, is low. In this paper we are presenting a novel approach for extraction of text information in complex color document images. The proposed approach is a three stage process. In the first stage the edge map is obtained utilizing the Canny edge operator. The edge map is split into blocks of uniform size and image blocks are classified as text or non-text. In each text block the possible text regions are identified and enclosed in tight bounding boxes using x-y cut on edge pixels. Further the text regions that are immediate adjacent to each other in vertical direction in which the character(s) are split horizontally are merged so as to enclose the character(s) fully in one text region. In the second stage certain amount of false text regions are eliminated based on a property of printed text. In the last stage the foreground text in each text region is extracted by unsupervised thresholding using the data of refined text regions. We conducted exhaustive experiments on documents having variety of background complexities with printed foreground text in any color, font and tilt. The experimental evaluations show that on an average 98.03% of text is identified. The processed document images showed better performance when OCRed compared with the corresponding unprocessed source document images.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Inde</li>
</country>
</list>
<tree>
<country name="Inde">
<noRegion>
<name sortKey="Nirmala, S" sort="Nirmala, S" uniqKey="Nirmala S" first="S." last="Nirmala">S. Nirmala</name>
</noRegion>
<name sortKey="Nagabhushan, P" sort="Nagabhushan, P" uniqKey="Nagabhushan P" first="P." last="Nagabhushan">P. Nagabhushan</name>
<name sortKey="Nagabhushan, P" sort="Nagabhushan, P" uniqKey="Nagabhushan P" first="P." last="Nagabhushan">P. Nagabhushan</name>
<name sortKey="Nirmala, S" sort="Nirmala, S" uniqKey="Nirmala S" first="S." last="Nirmala">S. Nirmala</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000493 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Checkpoint/biblio.hfd -nk 000493 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Checkpoint
   |type=    RBID
   |clé=     ISTEX:CE1EB4781754E663A4A979A2CDE1213BC6B2DBF9
   |texte=   Foreground Text Extraction in Color Document Images for Enhanced Readability
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024