OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Automated system for text detection in individual video images

Internal identifier: 000608 (PascalFrancis/Corpus); previous: 000607; next: 000609

Authors: Y. Du; C. I. Chang; P. D. Thouin

Source: Journal of Electronic Imaging (ISSN 1017-9909); 2003, Vol. 12, No. 3, pp. 410-422

RBID : Pascal:03-0353866

French descriptors

English descriptors

Abstract

Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
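
The four-stage flow described in the abstract can be sketched roughly as follows. This is a minimal illustration only, not the authors' implementation: the abstract does not say how MPCM assigns priority codes, which spatial filters are applied, or how boxes are merged, so every function name, parameter and rule below is an assumption. Stage 1 is replaced by a plain luminance quantization to codes 0..7, stage 2 by a minimum region size, stage 3 by a greedy horizontal merge, and the OCR engine is a hypothetical callable supplied by the caller.

```python
from collections import deque

def priority_codes(image):
    """Stage 1 stand-in (NOT the paper's MPCM): quantize luminance to a code 0..7."""
    return [[(r * 299 + g * 587 + b * 114) // 1000 // 32 for (r, g, b) in row]
            for row in image]

def threshold(codes, min_code=6):
    """Binary image: True where the code reaches the assumed cut-off."""
    return [[code >= min_code for code in row] for row in codes]

def regions(mask):
    """Bounding boxes (top, left, bottom, right, pixel_count) of 4-connected regions."""
    height, width = len(mask), len(mask[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if not mask[y][x] or seen[y][x]:
                continue
            queue = deque([(y, x)])
            seen[y][x] = True
            top, left, bottom, right, count = y, x, y, x, 0
            while queue:  # breadth-first flood fill of one region
                cy, cx = queue.popleft()
                count += 1
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right, count))
    return boxes

def drop_noise(boxes, min_pixels=2):
    """Stage 2 stand-in: discard regions smaller than an assumed minimum size."""
    return [box[:4] for box in boxes if box[4] >= min_pixels]

def merge_boxes(boxes, max_gap=1):
    """Stage 3 stand-in: merge left-to-right neighbours that overlap vertically."""
    merged = []
    for box in sorted(boxes, key=lambda b: b[1]):
        if merged:
            top, left, bottom, right = merged[-1]
            close = box[1] - right - 1 <= max_gap        # empty columns between boxes
            overlap = box[0] <= bottom and box[2] >= top  # shared rows
            if close and overlap:
                merged[-1] = (min(top, box[0]), left,
                              max(bottom, box[2]), max(right, box[3]))
                continue
        merged.append(box)
    return merged

def keep_readable(boxes, ocr):
    """Stage 4: keep only boxes for which the (hypothetical) OCR callable returns text."""
    return [box for box in boxes if ocr(box).strip()]

def detect_text(image, ocr, min_code=6, min_pixels=2, max_gap=1):
    """Run the four stand-in stages on an image given as rows of (r, g, b) tuples."""
    mask = threshold(priority_codes(image), min_code)
    candidates = drop_noise(regions(mask), min_pixels)
    return keep_readable(merge_boxes(candidates, max_gap), ocr)
```

Calling `detect_text(image, ocr)` with an image given as a list of rows of `(r, g, b)` tuples returns `(top, left, bottom, right)` boxes; a real OCR engine would replace the `ocr` callable.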

Record in standard format (ISO 2709)

See the documentation on the Inist Standard format.

pA  
A01 01  1    @0 1017-9909
A02 01      @0 JEIME5
A03   1    @0 J Electron Imaging
A05       @2 12
A06       @2 3
A08 01  1  ENG  @1 Automated system for text detection in individual video images
A11 01  1    @1 DU (Y.)
A11 02  1    @1 CHANG (C. I.)
A11 03  1    @1 THOUIN (P. D.)
A14 01      @1 Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng. @2 Baltimore, MD 21250 @3 USA @Z 1 aut.
A20       @1 410-422
A21       @1 2003
A23 01      @0 ENG
A43 01      @1 INIST @2 XXXX
A44       @0 A100
A45       @0 17 Refs.
A47 01  1    @0 03-0353866
A60       @1 P
A61       @0 A
A64 01  1    @0 Journal of Electronic Imaging
A66 01      @0 INT
C01 01    ENG  @0 Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
C02 01  3    @0 001B40B
C02 02  X    @0 001D02B12
C02 03  X    @0 001A01F03
C02 04  X    @0 205
C03 01  1  ENG  @0 Text detection @4 INC
C03 02  1  ENG  @0 Color video images @4 INC
C03 03  1  ENG  @0 Multistage pulse code modulation @4 INC
C03 04  1  FRE  @0 Théorie
C03 04  1  ENG  @0 Theory
C03 05  1  FRE  @0 Analyse image
C03 05  1  ENG  @0 Image analysis
C03 06  1  FRE  @0 Traitement texte
C03 06  1  ENG  @0 Text processing
C03 07  1  FRE  @0 Extraction caractéristique
C03 07  1  ENG  @0 Feature extraction
C03 08  1  FRE  @0 Couleur
C03 08  1  ENG  @0 Color
C03 09  1  FRE  @0 Segmentation image
C03 09  1  ENG  @0 Image segmentation
C03 10  1  FRE  @0 Reconnaissance optique caractère
C03 10  1  ENG  @0 Optical character recognition
C03 11  1  FRE  @0 Recherche information
C03 11  1  ENG  @0 Information retrieval
C03 12  1  FRE  @0 Reconnaissance automatique cible @3 P
C03 12  1  ENG  @0 Automatic target recognition @3 P
N21       @1 251
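
The flat listing above can be read line by line: a field tag, optional indicators (and sometimes a language code), then subfields introduced by `@` followed by a one-character code. A minimal parsing sketch, assuming that `@` never occurs inside a subfield value (true for this record, not guaranteed by the format); the function name `parse_field_line` is hypothetical.

```python
def parse_field_line(line):
    """Split one listing line into (tag, indicators, [(subfield_code, value), ...])."""
    head, *chunks = line.split("@")
    tokens = head.split()
    if not tokens:
        return None  # blank line
    tag, indicators = tokens[0], tokens[1:]
    # Each chunk starts with the one-character subfield code, then its value.
    subfields = [(chunk[0], chunk[1:].strip()) for chunk in chunks if chunk]
    return tag, indicators, subfields


def parse_record(text):
    """Parse every non-blank line of a listing into a list of fields."""
    parsed = (parse_field_line(line) for line in text.splitlines())
    return [field for field in parsed if field is not None]
```

Repeated tags such as A11 or C03 are kept as separate entries, in order, rather than collapsed into a dictionary, because the listing relies on repetition to carry multiple authors and descriptors.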

Inist format (server)

NO : PASCAL 03-0353866 EI
ET : Automated system for text detection in individual video images
AU : DU (Y.); CHANG (C. I.); THOUIN (P. D.)
AF : Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng./Baltimore, MD 21250/Etats-Unis (1 aut.)
DT : Publication en série; Niveau analytique
SO : Journal of Electronic Imaging; ISSN 1017-9909; Coden JEIME5; International; Da. 2003; Vol. 12; No. 3; Pp. 410-422; Bibl. 17 Refs.
LA : Anglais
EA : Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
CC : 001B40B; 001D02B12; 001A01F03; 205
FD : Théorie; Analyse image; Traitement texte; Extraction caractéristique; Couleur; Segmentation image; Reconnaissance optique caractère; Recherche information; Reconnaissance automatique cible
ED : Text detection; Color video images; Multistage pulse code modulation; Theory; Image analysis; Text processing; Feature extraction; Color; Image segmentation; Optical character recognition; Information retrieval; Automatic target recognition
LO : INIST-XXXX
ID : 03-0353866

Links to Exploration step

Pascal:03-0353866

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author>
<name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author>
<name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">03-0353866</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 03-0353866 EI</idno>
<idno type="RBID">Pascal:03-0353866</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000608</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author>
<name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author>
<name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</analytic>
<series>
<title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
<imprint>
<date when="2003">2003</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Automatic target recognition</term>
<term>Color</term>
<term>Color video images</term>
<term>Feature extraction</term>
<term>Image analysis</term>
<term>Image segmentation</term>
<term>Information retrieval</term>
<term>Multistage pulse code modulation</term>
<term>Optical character recognition</term>
<term>Text detection</term>
<term>Text processing</term>
<term>Theory</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Théorie</term>
<term>Analyse image</term>
<term>Traitement texte</term>
<term>Extraction caractéristique</term>
<term>Couleur</term>
<term>Segmentation image</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche information</term>
<term>Reconnaissance automatique cible</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>1017-9909</s0>
</fA01>
<fA02 i1="01">
<s0>JEIME5</s0>
</fA02>
<fA03 i2="1">
<s0>J Electron Imaging</s0>
</fA03>
<fA05>
<s2>12</s2>
</fA05>
<fA06>
<s2>3</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>Automated system for text detection in individual video images</s1>
</fA08>
<fA11 i1="01" i2="1">
<s1>DU (Y.)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>CHANG (C. I.)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>THOUIN (P. D.)</s1>
</fA11>
<fA14 i1="01">
<s1>Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA20>
<s1>410-422</s1>
</fA20>
<fA21>
<s1>2003</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA43 i1="01">
<s1>INIST</s1>
<s2>XXXX</s2>
</fA43>
<fA44>
<s0>A100</s0>
</fA44>
<fA45>
<s0>17 Refs.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>03-0353866</s0>
</fA47>
<fA60>
<s1>P</s1>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Journal of Electronic Imaging</s0>
</fA64>
<fA66 i1="01">
<s0>INT</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</s0>
</fC01>
<fC02 i1="01" i2="3">
<s0>001B40B</s0>
</fC02>
<fC02 i1="02" i2="X">
<s0>001D02B12</s0>
</fC02>
<fC02 i1="03" i2="X">
<s0>001A01F03</s0>
</fC02>
<fC02 i1="04" i2="X">
<s0>205</s0>
</fC02>
<fC03 i1="01" i2="1" l="ENG">
<s0>Text detection</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="02" i2="1" l="ENG">
<s0>Color video images</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="03" i2="1" l="ENG">
<s0>Multistage pulse code modulation</s0>
<s4>INC</s4>
</fC03>
<fC03 i1="04" i2="1" l="FRE">
<s0>Théorie</s0>
</fC03>
<fC03 i1="04" i2="1" l="ENG">
<s0>Theory</s0>
</fC03>
<fC03 i1="05" i2="1" l="FRE">
<s0>Analyse image</s0>
</fC03>
<fC03 i1="05" i2="1" l="ENG">
<s0>Image analysis</s0>
</fC03>
<fC03 i1="06" i2="1" l="FRE">
<s0>Traitement texte</s0>
</fC03>
<fC03 i1="06" i2="1" l="ENG">
<s0>Text processing</s0>
</fC03>
<fC03 i1="07" i2="1" l="FRE">
<s0>Extraction caractéristique</s0>
</fC03>
<fC03 i1="07" i2="1" l="ENG">
<s0>Feature extraction</s0>
</fC03>
<fC03 i1="08" i2="1" l="FRE">
<s0>Couleur</s0>
</fC03>
<fC03 i1="08" i2="1" l="ENG">
<s0>Color</s0>
</fC03>
<fC03 i1="09" i2="1" l="FRE">
<s0>Segmentation image</s0>
</fC03>
<fC03 i1="09" i2="1" l="ENG">
<s0>Image segmentation</s0>
</fC03>
<fC03 i1="10" i2="1" l="FRE">
<s0>Reconnaissance optique caractère</s0>
</fC03>
<fC03 i1="10" i2="1" l="ENG">
<s0>Optical character recognition</s0>
</fC03>
<fC03 i1="11" i2="1" l="FRE">
<s0>Recherche information</s0>
</fC03>
<fC03 i1="11" i2="1" l="ENG">
<s0>Information retrieval</s0>
</fC03>
<fC03 i1="12" i2="1" l="FRE">
<s0>Reconnaissance automatique cible</s0>
<s3>P</s3>
</fC03>
<fC03 i1="12" i2="1" l="ENG">
<s0>Automatic target recognition</s0>
<s3>P</s3>
</fC03>
<fN21>
<s1>251</s1>
</fN21>
</pA>
</standard>
<server>
<NO>PASCAL 03-0353866 EI</NO>
<ET>Automated system for text detection in individual video images</ET>
<AU>DU (Y.); CHANG (C. I.); THOUIN (P. D.)</AU>
<AF>Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng./Baltimore, MD 21250/Etats-Unis (1 aut.)</AF>
<DT>Publication en série; Niveau analytique</DT>
<SO>Journal of Electronic Imaging; ISSN 1017-9909; Coden JEIME5; International; Da. 2003; Vol. 12; No. 3; Pp. 410-422; Bibl. 17 Refs.</SO>
<LA>Anglais</LA>
<EA>Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</EA>
<CC>001B40B; 001D02B12; 001A01F03; 205</CC>
<FD>Théorie; Analyse image; Traitement texte; Extraction caractéristique; Couleur; Segmentation image; Reconnaissance optique caractère; Recherche information; Reconnaissance automatique cible</FD>
<ED>Text detection; Color video images; Multistage pulse code modulation; Theory; Image analysis; Text processing; Feature extraction; Color; Image segmentation; Optical character recognition; Information retrieval; Automatic target recognition</ED>
<LO>INIST-XXXX</LO>
<ID>03-0353866</ID>
</server>
</inist>
</record>
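
As an illustration of how the `<server>` block of the record above might be read programmatically, here is a minimal sketch using Python's standard `xml.etree.ElementTree`. It parses only an abridged copy of `<server>`: the full record uses an `inist:` prefix whose namespace declaration is not shown here, so a strict XML parser would likely reject it as-is. The function name and the choice of which fields to split on semicolons are assumptions.

```python
import xml.etree.ElementTree as ET

# Abridged copy of the <server> block from the record (ED list shortened).
SERVER_XML = """<server>
<NO>PASCAL 03-0353866 EI</NO>
<ET>Automated system for text detection in individual video images</ET>
<AU>DU (Y.); CHANG (C. I.); THOUIN (P. D.)</AU>
<LA>Anglais</LA>
<ED>Text detection; Color video images; Multistage pulse code modulation</ED>
<ID>03-0353866</ID>
</server>"""

# Fields whose values are semicolon-separated lists in this record (assumed set).
LIST_FIELDS = ("AU", "CC", "FD", "ED")


def read_server_block(xml_text):
    """Return a dict mapping each child tag of <server> to its text or list of items."""
    root = ET.fromstring(xml_text)
    fields = {child.tag: (child.text or "").strip() for child in root}
    for key in LIST_FIELDS:
        if key in fields:
            fields[key] = [item.strip() for item in fields[key].split(";")]
    return fields


fields = read_server_block(SERVER_XML)
```

After this runs, `fields["AU"]` holds the three author names as a list and `fields["ET"]` holds the title as a single string.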

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000608 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000608 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:03-0353866
   |texte=   Automated system for text detection in individual video images
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024