OCR exploration server

Warning: this site is under development.
It was generated automatically from raw corpora,
so the information has not been validated.

Text Detection in Natural Scene Images by Stroke Gabor Words

Internal identifier: 000140 (Pmc/Curation); previous: 000139; next: 000141

Authors: Chucai Yi; Yingli Tian

Source :

RBID : PMC:4180654

Abstract

In this paper, we propose a novel algorithm, based on stroke components and descriptive Gabor filters, to detect text regions in natural scene images. Text characters and strings are composed of stroke components as basic units. Gabor filters are used to describe and analyze the stroke components in text characters or strings. We define a suitability measurement to analyze the confidence of Gabor filters in describing stroke components and the suitability of Gabor filters on an image window. From the training set, we compute a set of Gabor filters that can describe the principal stroke components of text by their parameters. Then a K-means algorithm is applied to cluster the descriptive Gabor filters. The cluster centers are defined as Stroke Gabor Words (SGWs) to provide a universal description of stroke components. By suitability evaluation on positive and negative training samples respectively, each SGW generates a pair of characteristic distributions of suitability measurements. On a test natural scene image, heuristic layout analysis is applied first to extract candidate image windows. Then we compute the principal SGWs for each image window to describe its principal stroke components. Characteristic distributions generated by the principal SGWs are used to classify windows as text or nontext. Experimental results on benchmark datasets demonstrate that our algorithm can handle complex backgrounds and varied text patterns (font, color, scale, etc.).
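
The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which Gabor parameters are clustered, so the three-value tuples below (orientation, wavelength, Gaussian width) and the sample values are assumptions chosen only for illustration, and the K-means routine is a plain Lloyd iteration that ignores the circular nature of orientation.

```python
import random

def kmeans(points, k, iterations=50, seed=0):
    """Plain Lloyd's K-means on equal-length tuples; returns the k cluster centers."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # pick k distinct points as initial centers
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to the center with the smallest squared distance.
            nearest = min(
                range(k),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centers[i])),
            )
            clusters[nearest].append(p)
        # Recompute each center as the mean of its cluster (keep it if the cluster is empty).
        new_centers = [
            tuple(sum(dim) / len(cluster) for dim in zip(*cluster)) if cluster else centers[i]
            for i, cluster in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers

# Hypothetical descriptive Gabor filters: (orientation in radians, wavelength, Gaussian width).
filters = [
    (0.0, 4.0, 2.0), (0.1, 4.2, 2.1), (0.05, 3.9, 1.9),
    (1.5, 8.0, 4.0), (1.6, 8.2, 4.1), (1.55, 7.9, 3.9),
]

# In the abstract's terminology, the cluster centers play the role of Stroke Gabor Words.
stroke_gabor_words = sorted(kmeans(filters, k=2))
print(stroke_gabor_words)
```

With real data, the parameter tuples would come from the filters selected on the training set, and the number of clusters would be a tuning choice that the abstract does not specify.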


Url:
DOI: 10.1109/ICDAR.2011.44
PubMed: 25285042
PubMed Central: 4180654


Links to Exploration step

PMC:4180654

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Text Detection in Natural Scene Images by Stroke Gabor Words</title>
<author>
<name sortKey="Yi, Chucai" sort="Yi, Chucai" uniqKey="Yi C" first="Chucai" last="Yi">Chucai Yi</name>
</author>
<author>
<name sortKey="Tian, Yingli" sort="Tian, Yingli" uniqKey="Tian Y" first="Yingli" last="Tian">Yingli Tian</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25285042</idno>
<idno type="pmc">4180654</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180654</idno>
<idno type="RBID">PMC:4180654</idno>
<idno type="doi">10.1109/ICDAR.2011.44</idno>
<date when="2011">2011</date>
<idno type="wicri:Area/Pmc/Corpus">000140</idno>
<idno type="wicri:Area/Pmc/Curation">000140</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Text Detection in Natural Scene Images by Stroke Gabor Words</title>
<author>
<name sortKey="Yi, Chucai" sort="Yi, Chucai" uniqKey="Yi C" first="Chucai" last="Yi">Chucai Yi</name>
</author>
<author>
<name sortKey="Tian, Yingli" sort="Tian, Yingli" uniqKey="Tian Y" first="Yingli" last="Tian">Yingli Tian</name>
</author>
</analytic>
<series>
<title level="j">Proceedings of the ... International Conference on Document Analysis and Recognition / sponsored by the IAPR TC-11 and TC-10, in cooperation with the IEEE Computer Society and IGS. International Conference on Document Analysis and Recog...</title>
<idno type="ISSN">1520-5363</idno>
<imprint>
<date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p id="P1">In this paper, we propose a novel algorithm, based on stroke components and descriptive Gabor filters, to detect text regions in natural scene images. Text characters and strings are constructed by stroke components as basic units. Gabor filters are used to describe and analyze the stroke components in text characters or strings. We define a suitability measurement to analyze the confidence of Gabor filters in describing stroke component and the suitability of Gabor filters on an image window. From the training set, we compute a set of Gabor filters that can describe principle stroke components of text by their parameters. Then a
<italic>K</italic>
-means algorithm is applied to cluster the descriptive Gabor filters. The clustering centers are defined as Stroke Gabor Words (SGWs) to provide a universal description of stroke components. By suitability evaluation on positive and negative training samples respectively, each SGW generates a pair of characteristic distributions of suitability measurements. On a testing natural scene image, heuristic layout analysis is applied first to extract candidate image windows. Then we compute the principle SGWs for each image window to describe its principle stroke components. Characteristic distributions generated by principle SGWs are used to classify text or nontext windows. Experimental results on benchmark datasets demonstrate that our algorithm can handle complex backgrounds and variant text patterns (font, color, scale, etc.).</p>
</div>
</front>
</TEI>
<pmc article-type="research-article">
<pmc-comment>The publisher of this article does not allow downloading of the full text in XML form.</pmc-comment>
<pmc-dir>properties manuscript</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-journal-id">101582795</journal-id>
<journal-id journal-id-type="pubmed-jr-id">40118</journal-id>
<journal-id journal-id-type="nlm-ta">Proc Int Conf Doc Anal Recognit</journal-id>
<journal-id journal-id-type="iso-abbrev">Proc Int Conf Doc Anal Recognit</journal-id>
<journal-title-group>
<journal-title>Proceedings of the ... International Conference on Document Analysis and Recognition / sponsored by the IAPR TC-11 and TC-10, in cooperation with the IEEE Computer Society and IGS. International Conference on Document Analysis and Recog...</journal-title>
</journal-title-group>
<issn pub-type="ppub">1520-5363</issn>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">25285042</article-id>
<article-id pub-id-type="pmc">4180654</article-id>
<article-id pub-id-type="doi">10.1109/ICDAR.2011.44</article-id>
<article-id pub-id-type="manuscript">NIHMS369675</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Text Detection in Natural Scene Images by Stroke Gabor Words</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Yi</surname>
<given-names>Chucai</given-names>
</name>
<aff id="A1">Dept. of Computer Science, The Graduate Center, City Univ. of New York, New York, U.S.A.,
<email>CYi@gc.cuny.edu</email>
</aff>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Tian</surname>
<given-names>Yingli</given-names>
</name>
<aff id="A2">Dept. of Electrical Engineering, The City College and Graduate Center, City Univ. of New York, New York, U.S.A.,
<email>ytian@ccny.cuny.edu</email>
</aff>
</contrib>
</contrib-group>
<pub-date pub-type="nihms-submitted">
<day>12</day>
<month>2</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="ppub">
<year>2011</year>
</pub-date>
<pub-date pub-type="pmc-release">
<day>01</day>
<month>10</month>
<year>2014</year>
</pub-date>
<volume>2011</volume>
<fpage>177</fpage>
<lpage>181</lpage>
<pmc-comment>elocation-id from pubmed: 10.1109/ICDAR.2011.44</pmc-comment>
<abstract>
<p id="P1">In this paper, we propose a novel algorithm, based on stroke components and descriptive Gabor filters, to detect text regions in natural scene images. Text characters and strings are constructed by stroke components as basic units. Gabor filters are used to describe and analyze the stroke components in text characters or strings. We define a suitability measurement to analyze the confidence of Gabor filters in describing stroke component and the suitability of Gabor filters on an image window. From the training set, we compute a set of Gabor filters that can describe principle stroke components of text by their parameters. Then a
<italic>K</italic>
-means algorithm is applied to cluster the descriptive Gabor filters. The clustering centers are defined as Stroke Gabor Words (SGWs) to provide a universal description of stroke components. By suitability evaluation on positive and negative training samples respectively, each SGW generates a pair of characteristic distributions of suitability measurements. On a testing natural scene image, heuristic layout analysis is applied first to extract candidate image windows. Then we compute the principle SGWs for each image window to describe its principle stroke components. Characteristic distributions generated by principle SGWs are used to classify text or nontext windows. Experimental results on benchmark datasets demonstrate that our algorithm can handle complex backgrounds and variant text patterns (font, color, scale, etc.).</p>
</abstract>
<kwd-group>
<kwd>Gabor Filter</kwd>
<kwd>Stroke Component</kwd>
<kwd>Suitability Measurement</kwd>
<kwd>Stroke Gabor Words</kwd>
<kwd>SGW Characteristic Distributions</kwd>
</kwd-group>
</article-meta>
</front>
</pmc>
</record>
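
As a hedged illustration of how the bibliographic fields of the record above could be read programmatically, the sketch below parses a trimmed copy of its TEI header with Python's standard xml.etree.ElementTree module. The function name summarize and the shape of the returned dictionary are assumptions made for this example; they are not part of Dilib or Wicri.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the TEI header from the record above (some attributes omitted).
RECORD = """<record><TEI><teiHeader><fileDesc>
<titleStmt>
<title xml:lang="en">Text Detection in Natural Scene Images by Stroke Gabor Words</title>
<author><name sortKey="Yi, Chucai" first="Chucai" last="Yi">Chucai Yi</name></author>
<author><name sortKey="Tian, Yingli" first="Yingli" last="Tian">Yingli Tian</name></author>
</titleStmt>
<publicationStmt>
<idno type="pmid">25285042</idno>
<idno type="pmc">4180654</idno>
<idno type="doi">10.1109/ICDAR.2011.44</idno>
<date when="2011">2011</date>
</publicationStmt>
</fileDesc></teiHeader></TEI></record>"""

def summarize(record_xml):
    """Return title, authors, identifiers and year from a record's TEI header."""
    root = ET.fromstring(record_xml)
    title_stmt = root.find("./TEI/teiHeader/fileDesc/titleStmt")
    pub_stmt = root.find("./TEI/teiHeader/fileDesc/publicationStmt")
    return {
        "title": title_stmt.findtext("title"),
        "authors": [name.text for name in title_stmt.findall("author/name")],
        # Map each identifier type (pmid, pmc, doi) to its value.
        "ids": {idno.get("type"): idno.text for idno in pub_stmt.findall("idno")},
        "year": pub_stmt.find("date").get("when"),
    }

summary = summarize(RECORD)
print(summary)
```

The same paths should apply to the full record, since the trimmed fragment keeps the element names and nesting of the original header.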

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Pmc/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000140 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 000140 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Pmc
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:4180654
   |texte=   Text Detection in Natural Scene Images by Stroke Gabor Words
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i   -Sk "pubmed:25285042" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024