Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Text detection and recognition for video indexing

Identifieur interne : 000392 ( PascalFrancis/Curation ); précédent : 000391; suivant : 000393

Text detection and recognition for video indexing

Auteurs : ARISH ASIF QAZI [Allemagne] ; Imran A. Siddiqi [Pakistan] ; M. Sarmad Hussain [Pakistan]

Source :

RBID : Pascal:06-0213101

Descripteurs français

English descriptors

Abstract

Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.
pA  
A08 01  1  ENG  @1 Text detection and recognition for video indexing
A09 01  1  ENG  @1 Signals and electronic systems : proceedings : Poznan, Poland, 13-15 September 2004
A11 01  1    @1 ARISH ASIF QAZI
A11 02  1    @1 SIDDIQI (Imran A.)
A11 03  1    @1 SARMAD HUSSAIN (M.)
A14 01      @1 Room No 617 SpannsKamp26 @2 22527 Hamburg @3 DEU @Z 1 aut.
A14 02      @1 InterActive Communications Private Limited* 9 - Street 29, F-7/1 @2 Islamabad @3 PAK @Z 2 aut. @Z 3 aut.
A20       @1 529-532
A21       @1 2005
A23 01      @0 ENG
A25 01      @1 PTETiS @2 Poznan
A26 01      @0 83-906074-7-6
A30 01  1  ENG  @1 International conference on signals and electronic systems @3 Poznan POL @4 2004-09-13
A43 01      @1 INIST @2 Y 38366 @5 354000138746651320
A44       @0 0000 @1 © 2006 INIST-CNRS. All rights reserved.
A45       @0 12 ref.
A47 01  1    @0 06-0213101
A60       @1 C
A61       @0 A
A66 01      @0 POL
C01 01    ENG  @0 Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.
C02 01  X    @0 001D04A05A
C02 02  X    @0 001D04A03
C02 03  X    @0 001D04A05C
C02 04  X    @0 001D03F06A
C03 01  X  FRE  @0 Reconnaissance caractère @5 01
C03 01  X  ENG  @0 Character recognition @5 01
C03 01  X  SPA  @0 Reconocimiento carácter @5 01
C03 02  X  FRE  @0 Indexation @5 02
C03 02  X  ENG  @0 Indexing @5 02
C03 02  X  SPA  @0 Indización @5 02
C03 03  X  FRE  @0 Recherche information @5 03
C03 03  X  ENG  @0 Information retrieval @5 03
C03 03  X  SPA  @0 Búsqueda información @5 03
C03 04  X  FRE  @0 Signal numérique @5 04
C03 04  X  ENG  @0 Digital signal @5 04
C03 04  X  SPA  @0 Señal numérica @5 04
C03 05  X  FRE  @0 Signal vidéo @5 05
C03 05  X  ENG  @0 Video signal @5 05
C03 05  X  SPA  @0 Señal video @5 05
C03 06  X  FRE  @0 Analyse sémantique @5 06
C03 06  X  ENG  @0 Semantic analysis @5 06
C03 06  X  SPA  @0 Análisis semántico @5 06
C03 07  X  FRE  @0 Base connaissance @5 07
C03 07  X  ENG  @0 Knowledge base @5 07
C03 07  X  SPA  @0 Base conocimiento @5 07
C03 08  3  FRE  @0 Recherche image @5 08
C03 08  3  ENG  @0 Image retrieval @5 08
C03 09  X  FRE  @0 Localisation @5 09
C03 09  X  ENG  @0 Localization @5 09
C03 09  X  SPA  @0 Localización @5 09
C03 10  X  FRE  @0 Reconnaissance optique caractère @5 10
C03 10  X  ENG  @0 Optical character recognition @5 10
C03 10  X  SPA  @0 Reconocimento óptico de caracteres @5 10
C03 11  X  FRE  @0 Packaging électronique @5 11
C03 11  X  ENG  @0 Electronic packaging @5 11
C03 11  X  SPA  @0 Packaging electrónico @5 11
C03 12  X  FRE  @0 Reconnaissance forme @5 31
C03 12  X  ENG  @0 Pattern recognition @5 31
C03 12  X  SPA  @0 Reconocimiento patrón @5 31
C03 13  3  FRE  @0 Extraction caractéristique @5 32
C03 13  3  ENG  @0 Feature extraction @5 32
C03 14  X  FRE  @0 Traitement signal @5 33
C03 14  X  ENG  @0 Signal processing @5 33
C03 14  X  SPA  @0 Procesamiento señal @5 33
N21       @1 135
N44 01      @1 OTO
N82       @1 OTO

Links toward previous steps (curation, corpus...)


Links to Exploration step

Pascal:06-0213101

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Text detection and recognition for video indexing</title>
<author>
<name sortKey="Arish Asif Qazi" sort="Arish Asif Qazi" uniqKey="Arish Asif Qazi" last="Arish Asif Qazi">ARISH ASIF QAZI</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author>
<name sortKey="Siddiqi, Imran A" sort="Siddiqi, Imran A" uniqKey="Siddiqi I" first="Imran A." last="Siddiqi">Imran A. Siddiqi</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
<author>
<name sortKey="Sarmad Hussain, M" sort="Sarmad Hussain, M" uniqKey="Sarmad Hussain M" first="M." last="Sarmad Hussain">M. Sarmad Hussain</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">06-0213101</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 06-0213101 INIST</idno>
<idno type="RBID">Pascal:06-0213101</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000394</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000392</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Text detection and recognition for video indexing</title>
<author>
<name sortKey="Arish Asif Qazi" sort="Arish Asif Qazi" uniqKey="Arish Asif Qazi" last="Arish Asif Qazi">ARISH ASIF QAZI</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author>
<name sortKey="Siddiqi, Imran A" sort="Siddiqi, Imran A" uniqKey="Siddiqi I" first="Imran A." last="Siddiqi">Imran A. Siddiqi</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
<author>
<name sortKey="Sarmad Hussain, M" sort="Sarmad Hussain, M" uniqKey="Sarmad Hussain M" first="M." last="Sarmad Hussain">M. Sarmad Hussain</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Digital signal</term>
<term>Electronic packaging</term>
<term>Feature extraction</term>
<term>Image retrieval</term>
<term>Indexing</term>
<term>Information retrieval</term>
<term>Knowledge base</term>
<term>Localization</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Semantic analysis</term>
<term>Signal processing</term>
<term>Video signal</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance caractère</term>
<term>Indexation</term>
<term>Recherche information</term>
<term>Signal numérique</term>
<term>Signal vidéo</term>
<term>Analyse sémantique</term>
<term>Base connaissance</term>
<term>Recherche image</term>
<term>Localisation</term>
<term>Reconnaissance optique caractère</term>
<term>Packaging électronique</term>
<term>Reconnaissance forme</term>
<term>Extraction caractéristique</term>
<term>Traitement signal</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA08 i1="01" i2="1" l="ENG">
<s1>Text detection and recognition for video indexing</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Signals and electronic systems : proceedings : Poznan, Poland, 13-15 September 2004</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>ARISH ASIF QAZI</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>SIDDIQI (Imran A.)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>SARMAD HUSSAIN (M.)</s1>
</fA11>
<fA14 i1="01">
<s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA14 i1="02">
<s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA20>
<s1>529-532</s1>
</fA20>
<fA21>
<s1>2005</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA25 i1="01">
<s1>PTETiS</s1>
<s2>Poznan</s2>
</fA25>
<fA26 i1="01">
<s0>83-906074-7-6</s0>
</fA26>
<fA30 i1="01" i2="1" l="ENG">
<s1>International conference on signals and electronic systems</s1>
<s3>Poznan POL</s3>
<s4>2004-09-13</s4>
</fA30>
<fA43 i1="01">
<s1>INIST</s1>
<s2>Y 38366</s2>
<s5>354000138746651320</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2006 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>12 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>06-0213101</s0>
</fA47>
<fA60>
<s1>C</s1>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA66 i1="01">
<s0>POL</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001D04A05A</s0>
</fC02>
<fC02 i1="02" i2="X">
<s0>001D04A03</s0>
</fC02>
<fC02 i1="03" i2="X">
<s0>001D04A05C</s0>
</fC02>
<fC02 i1="04" i2="X">
<s0>001D03F06A</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Reconnaissance caractère</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Character recognition</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Reconocimiento carácter</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Indexation</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Indexing</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Indización</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Recherche information</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Information retrieval</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Búsqueda información</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Signal numérique</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Digital signal</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Señal numérica</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Signal vidéo</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Video signal</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Señal video</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Analyse sémantique</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Semantic analysis</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Análisis semántico</s0>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Base connaissance</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG">
<s0>Knowledge base</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA">
<s0>Base conocimiento</s0>
<s5>07</s5>
</fC03>
<fC03 i1="08" i2="3" l="FRE">
<s0>Recherche image</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="3" l="ENG">
<s0>Image retrieval</s0>
<s5>08</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>Localisation</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG">
<s0>Localization</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA">
<s0>Localización</s0>
<s5>09</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE">
<s0>Reconnaissance optique caractère</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG">
<s0>Optical character recognition</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA">
<s0>Reconocimento óptico de caracteres</s0>
<s5>10</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE">
<s0>Packaging électronique</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG">
<s0>Electronic packaging</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="SPA">
<s0>Packaging electrónico</s0>
<s5>11</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE">
<s0>Reconnaissance forme</s0>
<s5>31</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG">
<s0>Pattern recognition</s0>
<s5>31</s5>
</fC03>
<fC03 i1="12" i2="X" l="SPA">
<s0>Reconocimiento patrón</s0>
<s5>31</s5>
</fC03>
<fC03 i1="13" i2="3" l="FRE">
<s0>Extraction caractéristique</s0>
<s5>32</s5>
</fC03>
<fC03 i1="13" i2="3" l="ENG">
<s0>Feature extraction</s0>
<s5>32</s5>
</fC03>
<fC03 i1="14" i2="X" l="FRE">
<s0>Traitement signal</s0>
<s5>33</s5>
</fC03>
<fC03 i1="14" i2="X" l="ENG">
<s0>Signal processing</s0>
<s5>33</s5>
</fC03>
<fC03 i1="14" i2="X" l="SPA">
<s0>Procesamiento señal</s0>
<s5>33</s5>
</fC03>
<fN21>
<s1>135</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
</standard>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000392 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Curation/biblio.hfd -nk 000392 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Curation
   |type=    RBID
   |clé=     Pascal:06-0213101
   |texte=   Text detection and recognition for video indexing
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024