Automated system for text detection in individual video images
Identifieur interne : 000608 ( PascalFrancis/Corpus ); précédent : 000607; suivant : 000609Automated system for text detection in individual video images
Auteurs : Y. Du ; C. I. Chang ; P. D. ThouinSource :
- Journal of Electronic Imaging [ 1017-9909 ] ; 2003.
Descripteurs français
- Pascal (Inist)
English descriptors
- KwdEn :
Abstract
Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
Notice en format standard (ISO 2709)
Pour connaître la documentation sur le format Inist Standard.
pA |
|
---|
Format Inist (serveur)
NO : | PASCAL 03-0353866 EI |
---|---|
ET : | Automated system for text detection in individual video images |
AU : | DU (Y.); CHANG (C. I.); THOUIN (P. D.) |
AF : | Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng./Baltimore, MD 21250/Etats-Unis (1 aut.) |
DT : | Publication en série; Niveau analytique |
SO : | Journal of Electronic Imaging; ISSN 1017-9909; Coden JEIME5; International; Da. 2003; Vol. 12; No. 3; Pp. 410-422; Bibl. 17 Refs. |
LA : | Anglais |
EA : | Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images. |
CC : | 001B40B; 001D02B12; 001A01F03; 205 |
FD : | Théorie; Analyse image; Traitement texte; Extraction caractéristique; Couleur; Segmentation image; Reconnaissance optique caractère; Recherche information; Reconnaissance automatique cible |
ED : | Text detection; Color video images; Multistage pulse code modulation; Theory; Image analysis; Text processing; Feature extraction; Color; Image segmentation; Optical character recognition; Information retrieval; Automatic target recognition |
LO : | INIST-XXXX |
ID : | 03-0353866 |
Links to Exploration step
Pascal:03-0353866Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author><name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation><inist:fA14 i1="01"><s1>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author><name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">03-0353866</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 03-0353866 EI</idno>
<idno type="RBID">Pascal:03-0353866</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000608</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author><name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation><inist:fA14 i1="01"><s1>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author><name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</analytic>
<series><title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
<imprint><date when="2003">2003</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Automatic target recognition</term>
<term>Color</term>
<term>Color video images</term>
<term>Feature extraction</term>
<term>Image analysis</term>
<term>Image segmentation</term>
<term>Information retrieval</term>
<term>Multistage pulse code modulation</term>
<term>Optical character recognition</term>
<term>Text detection</term>
<term>Text processing</term>
<term>Theory</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Théorie</term>
<term>Analyse image</term>
<term>Traitement texte</term>
<term>Extraction caractéristique</term>
<term>Couleur</term>
<term>Segmentation image</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche information</term>
<term>Reconnaissance automatique cible</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>1017-9909</s0>
</fA01>
<fA02 i1="01"><s0>JEIME5</s0>
</fA02>
<fA03 i2="1"><s0>J Electron Imaging</s0>
</fA03>
<fA05><s2>12</s2>
</fA05>
<fA06><s2>3</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG"><s1>Automated system for text detection in individual video images</s1>
</fA08>
<fA11 i1="01" i2="1"><s1>DU (Y.)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>CHANG (C. I.)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>THOUIN (P. D.)</s1>
</fA11>
<fA14 i1="01"><s1>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA20><s1>410-422</s1>
</fA20>
<fA21><s1>2003</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA43 i1="01"><s1>INIST</s1>
<s2>XXXX</s2>
</fA43>
<fA44><s0>A100</s0>
</fA44>
<fA45><s0>17 Refs.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>03-0353866</s0>
</fA47>
<fA60><s1>P</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>Journal of Electronic Imaging</s0>
</fA64>
<fA66 i1="01"><s0>INT</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</s0>
</fC01>
<fC02 i1="01" i2="3"><s0>001B40B</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D02B12</s0>
</fC02>
<fC02 i1="03" i2="X"><s0>001A01F03</s0>
</fC02>
<fC02 i1="04" i2="X"><s0>205</s0>
</fC02>
<fC03 i1="01" i2="1" l="ENG"><s0>Text detection</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="02" i2="1" l="ENG"><s0>Color video images</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="03" i2="1" l="ENG"><s0>Multistage pulse code modulation</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="04" i2="1" l="FRE"><s0>Théorie</s0>
</fC03>
<fC03 i1="04" i2="1" l="ENG"><s0>Theory</s0>
</fC03>
<fC03 i1="05" i2="1" l="FRE"><s0>Analyse image</s0>
</fC03>
<fC03 i1="05" i2="1" l="ENG"><s0>Image analysis</s0>
</fC03>
<fC03 i1="06" i2="1" l="FRE"><s0>Traitement texte</s0>
</fC03>
<fC03 i1="06" i2="1" l="ENG"><s0>Text processing</s0>
</fC03>
<fC03 i1="07" i2="1" l="FRE"><s0>Extraction caractéristique</s0>
</fC03>
<fC03 i1="07" i2="1" l="ENG"><s0>Feature extraction</s0>
</fC03>
<fC03 i1="08" i2="1" l="FRE"><s0>Couleur</s0>
</fC03>
<fC03 i1="08" i2="1" l="ENG"><s0>Color</s0>
</fC03>
<fC03 i1="09" i2="1" l="FRE"><s0>Segmentation image</s0>
</fC03>
<fC03 i1="09" i2="1" l="ENG"><s0>Image segmentation</s0>
</fC03>
<fC03 i1="10" i2="1" l="FRE"><s0>Reconnaissance optique caractère</s0>
</fC03>
<fC03 i1="10" i2="1" l="ENG"><s0>Optical character recognition</s0>
</fC03>
<fC03 i1="11" i2="1" l="FRE"><s0>Recherche information</s0>
</fC03>
<fC03 i1="11" i2="1" l="ENG"><s0>Information retrieval</s0>
</fC03>
<fC03 i1="12" i2="1" l="FRE"><s0>Reconnaissance automatique cible</s0>
<s3>P</s3>
</fC03>
<fC03 i1="12" i2="1" l="ENG"><s0>Automatic target recognition</s0>
<s3>P</s3>
</fC03>
<fN21><s1>251</s1>
</fN21>
</pA>
</standard>
<server><NO>PASCAL 03-0353866 EI</NO>
<ET>Automated system for text detection in individual video images</ET>
<AU>DU (Y.); CHANG (C. I.); THOUIN (P. D.)</AU>
<AF>Univ. of Maryland Baltimore Country Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng./Baltimore, MD 21250/Etats-Unis (1 aut.)</AF>
<DT>Publication en série; Niveau analytique</DT>
<SO>Journal of Electronic Imaging; ISSN 1017-9909; Coden JEIME5; International; Da. 2003; Vol. 12; No. 3; Pp. 410-422; Bibl. 17 Refs.</SO>
<LA>Anglais</LA>
<EA>Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</EA>
<CC>001B40B; 001D02B12; 001A01F03; 205</CC>
<FD>Théorie; Analyse image; Traitement texte; Extraction caractéristique; Couleur; Segmentation image; Reconnaissance optique caractère; Recherche information; Reconnaissance automatique cible</FD>
<ED>Text detection; Color video images; Multistage pulse code modulation; Theory; Image analysis; Text processing; Feature extraction; Color; Image segmentation; Optical character recognition; Information retrieval; Automatic target recognition</ED>
<LO>INIST-XXXX</LO>
<ID>03-0353866</ID>
</server>
</inist>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000608 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000608 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= PascalFrancis |étape= Corpus |type= RBID |clé= Pascal:03-0353866 |texte= Automated system for text detection in individual video images }}
This area was generated with Dilib version V0.6.32. |