OcrV1, PascalFrancis, Corpus, bibRecord, 000608

Automated system for text detection in individual video images

Identifieur interne : 000608 ( PascalFrancis/Corpus ); précédent : 000607; suivant : 000609

Automated system for text detection in individual video images

Auteurs : Y. Du ; C. I. Chang ; P. D. Thouin

Source :

Journal of Electronic Imaging [ 1017-9909 ] ; 2003.

RBID : Pascal:03-0353866

Descripteurs français

Pascal (Inist)
- Théorie, Analyse image, Traitement texte, Extraction caractéristique, Couleur, Segmentation image, Reconnaissance optique caractère, Recherche information, Reconnaissance automatique cible.

English descriptors

KwdEn :
- Automatic target recognition, Color, Color video images, Feature extraction, Image analysis, Image segmentation, Information retrieval, Multistage pulse code modulation, Optical character recognition, Text detection, Text processing, Theory.

Abstract

Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.

Notice en format standard (ISO 2709)

Pour connaître la documentation sur le format Inist Standard.

A01	`01`	`1`		`@0 1017-9909`
A02	`01`			`@0 JEIME5`
A03		`1`		`@0 J Electron Imaging`
A05				`@2 12`
A06				`@2 3`
A08	`01`	`1`	`ENG`	`@1 Automated system for text detection in individual video images`
A11	`01`	`1`		`@1 DU (Y.)`
A11	`02`	`1`		`@1 CHANG (C. I.)`
A11	`03`	`1`		`@1 THOUIN (P. D.)`
A14	`01`			`@1 Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng. @2 Baltimore, MD 21250 @3 USA @Z 1 aut.`
A20				`@1 410-422`
A21				`@1 2003`
A23	`01`			`@0 ENG`
A43	`01`			`@1 INIST @2 XXXX`
A44				`@0 A100`
A45				`@0 17 Refs.`
A47	`01`	`1`		`@0 03-0353866`
A60				`@1 P`
A61				`@0 A`
A64	`01`	`1`		`@0 Journal of Electronic Imaging`
A66	`01`			`@0 INT`
C01	`01`		`ENG`	@0 Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
C02	`01`	`3`		`@0 001B40B`
C02	`02`	`X`		`@0 001D02B12`
C02	`03`	`X`		`@0 001A01F03`
C02	`04`	`X`		`@0 205`
C03	`01`	`1`	`ENG`	`@0 Text detection @4 INC`
C03	`02`	`1`	`ENG`	`@0 Color video images @4 INC`
C03	`03`	`1`	`ENG`	`@0 Multistage pulse code modulation @4 INC`
C03	`04`	`1`	`FRE`	`@0 Théorie`
C03	`04`	`1`	`ENG`	`@0 Theory`
C03	`05`	`1`	`FRE`	`@0 Analyse image`
C03	`05`	`1`	`ENG`	`@0 Image analysis`
C03	`06`	`1`	`FRE`	`@0 Traitement texte`
C03	`06`	`1`	`ENG`	`@0 Text processing`
C03	`07`	`1`	`FRE`	`@0 Extraction caractéristique`
C03	`07`	`1`	`ENG`	`@0 Feature extraction`
C03	`08`	`1`	`FRE`	`@0 Couleur`
C03	`08`	`1`	`ENG`	`@0 Color`
C03	`09`	`1`	`FRE`	`@0 Segmentation image`
C03	`09`	`1`	`ENG`	`@0 Image segmentation`
C03	`10`	`1`	`FRE`	`@0 Reconnaissance optique caractère`
C03	`10`	`1`	`ENG`	`@0 Optical character recognition`
C03	`11`	`1`	`FRE`	`@0 Recherche information`
C03	`11`	`1`	`ENG`	`@0 Information retrieval`
C03	`12`	`1`	`FRE`	`@0 Reconnaissance automatique cible @3 P`
C03	`12`	`1`	`ENG`	`@0 Automatic target recognition @3 P`
N21				`@1 251`

Format Inist (serveur)

NO :	PASCAL 03-0353866 EI
ET :	Automated system for text detection in individual video images
AU :	DU (Y.); CHANG (C. I.); THOUIN (P. D.)
AF :	Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng./Baltimore, MD 21250/Etats-Unis (1 aut.)
DT :	Publication en série; Niveau analytique
SO :	Journal of Electronic Imaging; ISSN 1017-9909; Coden JEIME5; International; Da. 2003; Vol. 12; No. 3; Pp. 410-422; Bibl. 17 Refs.
LA :	Anglais
EA :	Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
CC :	001B40B; 001D02B12; 001A01F03; 205
FD :	Théorie; Analyse image; Traitement texte; Extraction caractéristique; Couleur; Segmentation image; Reconnaissance optique caractère; Recherche information; Reconnaissance automatique cible
ED :	Text detection; Color video images; Multistage pulse code modulation; Theory; Image analysis; Text processing; Feature extraction; Color; Image segmentation; Optical character recognition; Information retrieval; Automatic target recognition
LO :	INIST-XXXX
ID :	03-0353866

Links to Exploration step

Pascal:03-0353866

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author><name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation><inist:fA14 i1="01"><s1>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc  Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author><name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">03-0353866</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 03-0353866 EI</idno>
<idno type="RBID">Pascal:03-0353866</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000608</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author><name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation><inist:fA14 i1="01"><s1>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc  Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author><name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</analytic>
<series><title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
<imprint><date when="2003">2003</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Automatic target recognition</term>
<term>Color</term>
<term>Color video images</term>
<term>Feature extraction</term>
<term>Image analysis</term>
<term>Image segmentation</term>
<term>Information retrieval</term>
<term>Multistage pulse code modulation</term>
<term>Optical character recognition</term>
<term>Text detection</term>
<term>Text processing</term>
<term>Theory</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Théorie</term>
<term>Analyse image</term>
<term>Traitement texte</term>
<term>Extraction caractéristique</term>
<term>Couleur</term>
<term>Segmentation image</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche information</term>
<term>Reconnaissance automatique cible</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Text detection in video images is a challenging research problem  because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>1017-9909</s0>
</fA01>
<fA02 i1="01"><s0>JEIME5</s0>
</fA02>
<fA03 i2="1"><s0>J Electron Imaging</s0>
</fA03>
<fA05><s2>12</s2>
</fA05>
<fA06><s2>3</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG"><s1>Automated system for text detection in individual video images</s1>
</fA08>
<fA11 i1="01" i2="1"><s1>DU (Y.)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>CHANG (C. I.)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>THOUIN (P. D.)</s1>
</fA11>
<fA14 i1="01"><s1>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc  Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA20><s1>410-422</s1>
</fA20>
<fA21><s1>2003</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA43 i1="01"><s1>INIST</s1>
<s2>XXXX</s2>
</fA43>
<fA44><s0>A100</s0>
</fA44>
<fA45><s0>17 Refs.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>03-0353866</s0>
</fA47>
<fA60><s1>P</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>Journal of Electronic Imaging</s0>
</fA64>
<fA66 i1="01"><s0>INT</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>Text detection in video images is a challenging research problem  because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</s0>
</fC01>
<fC02 i1="01" i2="3"><s0>001B40B</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D02B12</s0>
</fC02>
<fC02 i1="03" i2="X"><s0>001A01F03</s0>
</fC02>
<fC02 i1="04" i2="X"><s0>205</s0>
</fC02>
<fC03 i1="01" i2="1" l="ENG"><s0>Text detection</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="02" i2="1" l="ENG"><s0>Color video images</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="03" i2="1" l="ENG"><s0>Multistage pulse code modulation</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="04" i2="1" l="FRE"><s0>Théorie</s0>
</fC03>
<fC03 i1="04" i2="1" l="ENG"><s0>Theory</s0>
</fC03>
<fC03 i1="05" i2="1" l="FRE"><s0>Analyse image</s0>
</fC03>
<fC03 i1="05" i2="1" l="ENG"><s0>Image analysis</s0>
</fC03>
<fC03 i1="06" i2="1" l="FRE"><s0>Traitement texte</s0>
</fC03>
<fC03 i1="06" i2="1" l="ENG"><s0>Text processing</s0>
</fC03>
<fC03 i1="07" i2="1" l="FRE"><s0>Extraction caractéristique</s0>
</fC03>
<fC03 i1="07" i2="1" l="ENG"><s0>Feature extraction</s0>
</fC03>
<fC03 i1="08" i2="1" l="FRE"><s0>Couleur</s0>
</fC03>
<fC03 i1="08" i2="1" l="ENG"><s0>Color</s0>
</fC03>
<fC03 i1="09" i2="1" l="FRE"><s0>Segmentation image</s0>
</fC03>
<fC03 i1="09" i2="1" l="ENG"><s0>Image segmentation</s0>
</fC03>
<fC03 i1="10" i2="1" l="FRE"><s0>Reconnaissance optique caractère</s0>
</fC03>
<fC03 i1="10" i2="1" l="ENG"><s0>Optical character recognition</s0>
</fC03>
<fC03 i1="11" i2="1" l="FRE"><s0>Recherche information</s0>
</fC03>
<fC03 i1="11" i2="1" l="ENG"><s0>Information retrieval</s0>
</fC03>
<fC03 i1="12" i2="1" l="FRE"><s0>Reconnaissance automatique cible</s0>
<s3>P</s3>
</fC03>
<fC03 i1="12" i2="1" l="ENG"><s0>Automatic target recognition</s0>
<s3>P</s3>
</fC03>
<fN21><s1>251</s1>
</fN21>
</pA>
</standard>
<server><NO>PASCAL 03-0353866 EI</NO>
<ET>Automated system for text detection in individual video images</ET>
<AU>DU (Y.); CHANG (C. I.); THOUIN (P. D.)</AU>
<AF>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng./Baltimore, MD 21250/Etats-Unis (1 aut.)</AF>
<DT>Publication en série; Niveau analytique</DT>
<SO>Journal of Electronic Imaging; ISSN 1017-9909; Coden JEIME5; International; Da. 2003; Vol. 12; No. 3; Pp. 410-422; Bibl. 17 Refs.</SO>
<LA>Anglais</LA>
<EA>Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</EA>
<CC>001B40B; 001D02B12; 001A01F03; 205</CC>
<FD>Théorie; Analyse image; Traitement texte; Extraction caractéristique; Couleur; Segmentation image; Reconnaissance optique caractère; Recherche information; Reconnaissance automatique cible</FD>
<ED>Text detection; Color video images; Multistage pulse code modulation; Theory; Image analysis; Text processing; Feature extraction; Color; Image segmentation; Optical character recognition; Information retrieval; Automatic target recognition</ED>
<LO>INIST-XXXX</LO>
<ID>03-0353866</ID>
</server>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000608 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000608 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:03-0353866
   |texte=   Automated system for text detection in individual video images
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

	Serveur d'exploration sur l'OCR
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur l'OCR

Automated system for text detection in individual video images

Automated system for text detection in individual video images

Source :

Descripteurs français

English descriptors

Abstract

Notice en format standard (ISO 2709)

Format Inist (serveur)

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri