Unsupervised Text Segmentation Using Color and Wavelet Features
Internal identifier: 001518 (Main/Merge); previous: 001517; next: 001519
Authors: Julinda Gllavata [Germany]; Ralph Ewerth [Germany]; Teuta Stefi [Germany]; Bernd Freisleben [Germany]
Source:
- Lecture Notes in Computer Science [0302-9743]; 2004.
Abstract
Since the number of digital multimedia libraries is growing rapidly, the need to efficiently index, browse and retrieve this information has also increased. In this context, text appearing in images represents an important entity for indexing and retrieval purposes. Often, text is superimposed over a complex image background, and its recognition by a commercial optical character recognition (OCR) engine is difficult. Thus, a text segmentation process, including background removal and binarization, is needed to achieve a satisfactory recognition rate by OCR. In this paper, an unsupervised learning method for text segmentation in images with complex backgrounds is presented. First, the color of the text and background is determined based on a color quantizer. Then, the pixel color and the standard deviation of the wavelet transformed image are used to distinguish between text and non-text pixels. To classify pixels into text and background, a slightly modified k-means algorithm is applied, producing a binarized text image. The segmentation result is fed into commercial OCR software to assess the segmentation quality. The performance of the approach is demonstrated by presenting experimental results for a set of video frames.
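The classification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how their k-means variant is modified, so the code below is a plain two-cluster k-means. Seeding the two centroids from estimated text and background colors is an assumption made here for illustration, and the function name `two_means`, the feature layout `(r, g, b, texture)`, and the sample values are all hypothetical. The `texture` component stands in for the standard deviation of the wavelet-transformed image, which is assumed to be precomputed.

```python
from math import dist


def two_means(features, init_text, init_background, max_iter=50):
    """Plain 2-cluster k-means; returns 1 for text-like pixels, 0 for background.

    features: list of (r, g, b, texture) tuples, one per pixel (hypothetical layout).
    init_text / init_background: assumed starting centroids, e.g. taken from
    a prior color estimate of text and background.
    """
    centroids = [tuple(init_background), tuple(init_text)]
    labels = [0] * len(features)
    for _ in range(max_iter):
        # Assignment step: each pixel joins the nearer of the two centroids.
        new_labels = [
            1 if dist(f, centroids[1]) < dist(f, centroids[0]) else 0
            for f in features
        ]
        # Update step: each centroid becomes the mean of its current members.
        for k in (0, 1):
            members = [f for f, lab in zip(features, new_labels) if lab == k]
            if members:
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
        if new_labels == labels:
            break  # assignments are stable, so the clustering has converged
        labels = new_labels
    return labels


# Illustrative values only: bright, high-texture pixels versus dark, flat ones.
pixels = [
    (250, 250, 245, 30.0),
    (240, 245, 250, 28.0),
    (20, 40, 60, 3.0),
    (35, 30, 50, 5.0),
    (245, 240, 240, 25.0),
]
labels = two_means(
    pixels,
    init_text=(255, 255, 255, 20.0),
    init_background=(0, 0, 0, 0.0),
)
# Binarize: text pixels become black (0), background pixels white (255).
binary = [0 if lab == 1 else 255 for lab in labels]
```

In a real pipeline the color and texture components would likely need to be scaled to comparable ranges before clustering, since Euclidean distance otherwise lets the larger-valued component dominate.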
DOI: 10.1007/978-3-540-27814-6_28
Links to previous steps (curation, corpus, ...)
- to stream Istex, to step Corpus: 000056
- to stream Istex, to step Curation: 000055
- to stream Istex, to step Checkpoint: 000D02
Links to Exploration step
ISTEX:47D1959E86954EDEC94F7C98BB54A6DDC84D687F
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Unsupervised Text Segmentation Using Color and Wavelet Features</title>
<author><name sortKey="Gllavata, Julinda" sort="Gllavata, Julinda" uniqKey="Gllavata J" first="Julinda" last="Gllavata">Julinda Gllavata</name>
</author>
<author><name sortKey="Ewerth, Ralph" sort="Ewerth, Ralph" uniqKey="Ewerth R" first="Ralph" last="Ewerth">Ralph Ewerth</name>
</author>
<author><name sortKey="Stefi, Teuta" sort="Stefi, Teuta" uniqKey="Stefi T" first="Teuta" last="Stefi">Teuta Stefi</name>
</author>
<author><name sortKey="Freisleben, Bernd" sort="Freisleben, Bernd" uniqKey="Freisleben B" first="Bernd" last="Freisleben">Bernd Freisleben</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:47D1959E86954EDEC94F7C98BB54A6DDC84D687F</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-27814-6_28</idno>
<idno type="url">https://api.istex.fr/document/47D1959E86954EDEC94F7C98BB54A6DDC84D687F/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000056</idno>
<idno type="wicri:Area/Istex/Curation">000055</idno>
<idno type="wicri:Area/Istex/Checkpoint">000D02</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Gllavata J:unsupervised:text:segmentation</idno>
<idno type="wicri:Area/Main/Merge">001518</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Unsupervised Text Segmentation Using Color and Wavelet Features</title>
<author><name sortKey="Gllavata, Julinda" sort="Gllavata, Julinda" uniqKey="Gllavata J" first="Julinda" last="Gllavata">Julinda Gllavata</name>
<affiliation wicri:level="1"><country xml:lang="fr">Allemagne</country>
<wicri:regionArea>1SFB/FK 615, University of Siegen, D-57068, Siegen</wicri:regionArea>
<wicri:noRegion>57068, Siegen</wicri:noRegion>
<wicri:noRegion>Siegen</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Ewerth, Ralph" sort="Ewerth, Ralph" uniqKey="Ewerth R" first="Ralph" last="Ewerth">Ralph Ewerth</name>
<affiliation wicri:level="1"><country xml:lang="fr">Allemagne</country>
<wicri:regionArea>1SFB/FK 615, University of Siegen, D-57068, Siegen</wicri:regionArea>
<wicri:noRegion>57068, Siegen</wicri:noRegion>
<wicri:noRegion>Siegen</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Stefi, Teuta" sort="Stefi, Teuta" uniqKey="Stefi T" first="Teuta" last="Stefi">Teuta Stefi</name>
<affiliation wicri:level="3"><country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Dept. of Math. and Computer Science, University of Marburg, D-35032, Marburg</wicri:regionArea>
<placeName><region type="land" nuts="1">Hesse (Land)</region>
<region type="district" nuts="2">District de Giessen</region>
<settlement type="city">Marbourg</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Freisleben, Bernd" sort="Freisleben, Bernd" uniqKey="Freisleben B" first="Bernd" last="Freisleben">Bernd Freisleben</name>
<affiliation wicri:level="1"><country xml:lang="fr">Allemagne</country>
<wicri:regionArea>1SFB/FK 615, University of Siegen, D-57068, Siegen</wicri:regionArea>
<wicri:noRegion>57068, Siegen</wicri:noRegion>
<wicri:noRegion>Siegen</wicri:noRegion>
</affiliation>
<affiliation wicri:level="3"><country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Dept. of Math. and Computer Science, University of Marburg, D-35032, Marburg</wicri:regionArea>
<placeName><region type="land" nuts="1">Hesse (Land)</region>
<region type="district" nuts="2">District de Giessen</region>
<settlement type="city">Marbourg</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2004</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">47D1959E86954EDEC94F7C98BB54A6DDC84D687F</idno>
<idno type="DOI">10.1007/978-3-540-27814-6_28</idno>
<idno type="ChapterID">28</idno>
<idno type="ChapterID">Chap28</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Since the number of digital multimedia libraries is growing rapidly, the need to efficiently index, browse and retrieve this information is also increased. In this context, text appearing in images represents an important entity for indexing and retrieval purposes. Often, text is superimposed over complex image background and its recognition by a commercial optical character recognition (OCR) engine is difficult. Thus, there is the need for a text segmentation process, including background removal and binarization, in order to achieve a satisfactory recognition rate by OCR. In this paper, an unsupervised learning method for text segmentation in images with complex backgrounds is presented. First, the color of the text and background is determined based on a color quantizer. Then, the pixel color and the standard deviation of the wavelet transformed image are used to distinguish between text and non-text pixels. To classify pixels into text and background, a slightly modified k-means algorithm is applied which is used to produce a binarized text image. The segmentation result is fed into a commercial OCR software to investigate the segmentation quality. The performance of our approach is demonstrated by presenting experimental results for a set of video frames.</div>
</front>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001518 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001518 | SxmlIndent | more
To add a link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Merge |type= RBID |clé= ISTEX:47D1959E86954EDEC94F7C98BB54A6DDC84D687F |texte= Unsupervised Text Segmentation Using Color and Wavelet Features }}
This area was generated with Dilib version V0.6.32.