Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Unsupervised Text Segmentation Using Color and Wavelet Features

Identifieur interne : 001518 ( Main/Merge ); précédent : 001517; suivant : 001519

Unsupervised Text Segmentation Using Color and Wavelet Features

Auteurs : Julinda Gllavata [Allemagne] ; Ralph Ewerth [Allemagne] ; Teuta Stefi [Allemagne] ; Bernd Freisleben [Allemagne]

Source :

RBID : ISTEX:47D1959E86954EDEC94F7C98BB54A6DDC84D687F

Abstract

Abstract: Since the number of digital multimedia libraries is growing rapidly, the need to efficiently index, browse and retrieve this information is also increased. In this context, text appearing in images represents an important entity for indexing and retrieval purposes. Often, text is superimposed over complex image background and its recognition by a commercial optical character recognition (OCR) engine is difficult. Thus, there is the need for a text segmentation process, including background removal and binarization, in order to achieve a satisfactory recognition rate by OCR. In this paper, an unsupervised learning method for text segmentation in images with complex backgrounds is presented. First, the color of the text and background is determined based on a color quantizer. Then, the pixel color and the standard deviation of the wavelet transformed image are used to distinguish between text and non-text pixels. To classify pixels into text and background, a slightly modified k-means algorithm is applied which is used to produce a binarized text image. The segmentation result is fed into a commercial OCR software to investigate the segmentation quality. The performance of our approach is demonstrated by presenting experimental results for a set of video frames.

Url:
DOI: 10.1007/978-3-540-27814-6_28

Links toward previous steps (curation, corpus...)


Links to Exploration step

ISTEX:47D1959E86954EDEC94F7C98BB54A6DDC84D687F

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Unsupervised Text Segmentation Using Color and Wavelet Features</title>
<author>
<name sortKey="Gllavata, Julinda" sort="Gllavata, Julinda" uniqKey="Gllavata J" first="Julinda" last="Gllavata">Julinda Gllavata</name>
</author>
<author>
<name sortKey="Ewerth, Ralph" sort="Ewerth, Ralph" uniqKey="Ewerth R" first="Ralph" last="Ewerth">Ralph Ewerth</name>
</author>
<author>
<name sortKey="Stefi, Teuta" sort="Stefi, Teuta" uniqKey="Stefi T" first="Teuta" last="Stefi">Teuta Stefi</name>
</author>
<author>
<name sortKey="Freisleben, Bernd" sort="Freisleben, Bernd" uniqKey="Freisleben B" first="Bernd" last="Freisleben">Bernd Freisleben</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:47D1959E86954EDEC94F7C98BB54A6DDC84D687F</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-27814-6_28</idno>
<idno type="url">https://api.istex.fr/document/47D1959E86954EDEC94F7C98BB54A6DDC84D687F/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000056</idno>
<idno type="wicri:Area/Istex/Curation">000055</idno>
<idno type="wicri:Area/Istex/Checkpoint">000D02</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Gllavata J:unsupervised:text:segmentation</idno>
<idno type="wicri:Area/Main/Merge">001518</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Unsupervised Text Segmentation Using Color and Wavelet Features</title>
<author>
<name sortKey="Gllavata, Julinda" sort="Gllavata, Julinda" uniqKey="Gllavata J" first="Julinda" last="Gllavata">Julinda Gllavata</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>1SFB/FK 615, University of Siegen, D-57068, Siegen</wicri:regionArea>
<wicri:noRegion>57068, Siegen</wicri:noRegion>
<wicri:noRegion>Siegen</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
<author>
<name sortKey="Ewerth, Ralph" sort="Ewerth, Ralph" uniqKey="Ewerth R" first="Ralph" last="Ewerth">Ralph Ewerth</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>1SFB/FK 615, University of Siegen, D-57068, Siegen</wicri:regionArea>
<wicri:noRegion>57068, Siegen</wicri:noRegion>
<wicri:noRegion>Siegen</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
<author>
<name sortKey="Stefi, Teuta" sort="Stefi, Teuta" uniqKey="Stefi T" first="Teuta" last="Stefi">Teuta Stefi</name>
<affiliation wicri:level="3">
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Dept. of Math. and Computer Science, University of Marburg, D-35032, Marburg</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Hesse (Land)</region>
<region type="district" nuts="2">District de Giessen</region>
<settlement type="city">Marbourg</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
<author>
<name sortKey="Freisleben, Bernd" sort="Freisleben, Bernd" uniqKey="Freisleben B" first="Bernd" last="Freisleben">Bernd Freisleben</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>1SFB/FK 615, University of Siegen, D-57068, Siegen</wicri:regionArea>
<wicri:noRegion>57068, Siegen</wicri:noRegion>
<wicri:noRegion>Siegen</wicri:noRegion>
</affiliation>
<affiliation wicri:level="3">
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Dept. of Math. and Computer Science, University of Marburg, D-35032, Marburg</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Hesse (Land)</region>
<region type="district" nuts="2">District de Giessen</region>
<settlement type="city">Marbourg</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2004</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">47D1959E86954EDEC94F7C98BB54A6DDC84D687F</idno>
<idno type="DOI">10.1007/978-3-540-27814-6_28</idno>
<idno type="ChapterID">28</idno>
<idno type="ChapterID">Chap28</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Since the number of digital multimedia libraries is growing rapidly, the need to efficiently index, browse and retrieve this information is also increased. In this context, text appearing in images represents an important entity for indexing and retrieval purposes. Often, text is superimposed over complex image background and its recognition by a commercial optical character recognition (OCR) engine is difficult. Thus, there is the need for a text segmentation process, including background removal and binarization, in order to achieve a satisfactory recognition rate by OCR. In this paper, an unsupervised learning method for text segmentation in images with complex backgrounds is presented. First, the color of the text and background is determined based on a color quantizer. Then, the pixel color and the standard deviation of the wavelet transformed image are used to distinguish between text and non-text pixels. To classify pixels into text and background, a slightly modified k-means algorithm is applied which is used to produce a binarized text image. The segmentation result is fed into a commercial OCR software to investigate the segmentation quality. The performance of our approach is demonstrated by presenting experimental results for a set of video frames.</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001518 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001518 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     ISTEX:47D1959E86954EDEC94F7C98BB54A6DDC84D687F
   |texte=   Unsupervised Text Segmentation Using Color and Wavelet Features
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024