Text detection and recognition for video indexing
Identifieur interne : 000394 ( PascalFrancis/Corpus ); précédent : 000393; suivant : 000395Text detection and recognition for video indexing
Auteurs : ARISH ASIF QAZI ; Imran A. Siddiqi ; M. Sarmad HussainSource :
Descripteurs français
- Pascal (Inist)
English descriptors
- KwdEn :
Abstract
Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.
Notice en format standard (ISO 2709)
Pour connaître la documentation sur le format Inist Standard.
pA |
|
---|
Format Inist (serveur)
NO : | PASCAL 06-0213101 INIST |
---|---|
ET : | Text detection and recognition for video indexing |
AU : | ARISH ASIF QAZI; SIDDIQI (Imran A.); SARMAD HUSSAIN (M.) |
AF : | Room No 617 SpannsKamp26/22527 Hamburg/Allemagne (1 aut.); InterActive Communications Private Limited* 9 - Street 29, F-7/1/Islamabad/Pakistan (2 aut., 3 aut.) |
DT : | Congrès; Niveau analytique |
SO : | International conference on signals and electronic systems/2004-09-13/Poznan POL; Pologne; Poznan: PTETiS; Da. 2005; Pp. 529-532; ISBN 83-906074-7-6 |
LA : | Anglais |
EA : | Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging. |
CC : | 001D04A05A; 001D04A03; 001D04A05C; 001D03F06A |
FD : | Reconnaissance caractère; Indexation; Recherche information; Signal numérique; Signal vidéo; Analyse sémantique; Base connaissance; Recherche image; Localisation; Reconnaissance optique caractère; Packaging électronique; Reconnaissance forme; Extraction caractéristique; Traitement signal |
ED : | Character recognition; Indexing; Information retrieval; Digital signal; Video signal; Semantic analysis; Knowledge base; Image retrieval; Localization; Optical character recognition; Electronic packaging; Pattern recognition; Feature extraction; Signal processing |
SD : | Reconocimiento carácter; Indización; Búsqueda información; Señal numérica; Señal video; Análisis semántico; Base conocimiento; Localización; Reconocimento óptico de caracteres; Packaging electrónico; Reconocimiento patrón; Procesamiento señal |
LO : | INIST-Y 38366.354000138746651320 |
ID : | 06-0213101 |
Links to Exploration step
Pascal:06-0213101Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Text detection and recognition for video indexing</title>
<author><name sortKey="Arish Asif Qazi" sort="Arish Asif Qazi" uniqKey="Arish Asif Qazi" last="Arish Asif Qazi">ARISH ASIF QAZI</name>
<affiliation><inist:fA14 i1="01"><s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Siddiqi, Imran A" sort="Siddiqi, Imran A" uniqKey="Siddiqi I" first="Imran A." last="Siddiqi">Imran A. Siddiqi</name>
<affiliation><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Sarmad Hussain, M" sort="Sarmad Hussain, M" uniqKey="Sarmad Hussain M" first="M." last="Sarmad Hussain">M. Sarmad Hussain</name>
<affiliation><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">06-0213101</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 06-0213101 INIST</idno>
<idno type="RBID">Pascal:06-0213101</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000394</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Text detection and recognition for video indexing</title>
<author><name sortKey="Arish Asif Qazi" sort="Arish Asif Qazi" uniqKey="Arish Asif Qazi" last="Arish Asif Qazi">ARISH ASIF QAZI</name>
<affiliation><inist:fA14 i1="01"><s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Siddiqi, Imran A" sort="Siddiqi, Imran A" uniqKey="Siddiqi I" first="Imran A." last="Siddiqi">Imran A. Siddiqi</name>
<affiliation><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Sarmad Hussain, M" sort="Sarmad Hussain, M" uniqKey="Sarmad Hussain M" first="M." last="Sarmad Hussain">M. Sarmad Hussain</name>
<affiliation><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Character recognition</term>
<term>Digital signal</term>
<term>Electronic packaging</term>
<term>Feature extraction</term>
<term>Image retrieval</term>
<term>Indexing</term>
<term>Information retrieval</term>
<term>Knowledge base</term>
<term>Localization</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Semantic analysis</term>
<term>Signal processing</term>
<term>Video signal</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance caractère</term>
<term>Indexation</term>
<term>Recherche information</term>
<term>Signal numérique</term>
<term>Signal vidéo</term>
<term>Analyse sémantique</term>
<term>Base connaissance</term>
<term>Recherche image</term>
<term>Localisation</term>
<term>Reconnaissance optique caractère</term>
<term>Packaging électronique</term>
<term>Reconnaissance forme</term>
<term>Extraction caractéristique</term>
<term>Traitement signal</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA08 i1="01" i2="1" l="ENG"><s1>Text detection and recognition for video indexing</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG"><s1>Signals and electronic systems : proceedings : Poznan, Poland, 13-15 September 2004</s1>
</fA09>
<fA11 i1="01" i2="1"><s1>ARISH ASIF QAZI</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>SIDDIQI (Imran A.)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>SARMAD HUSSAIN (M.)</s1>
</fA11>
<fA14 i1="01"><s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA20><s1>529-532</s1>
</fA20>
<fA21><s1>2005</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA25 i1="01"><s1>PTETiS</s1>
<s2>Poznan</s2>
</fA25>
<fA26 i1="01"><s0>83-906074-7-6</s0>
</fA26>
<fA30 i1="01" i2="1" l="ENG"><s1>International conference on signals and electronic systems</s1>
<s3>Poznan POL</s3>
<s4>2004-09-13</s4>
</fA30>
<fA43 i1="01"><s1>INIST</s1>
<s2>Y 38366</s2>
<s5>354000138746651320</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2006 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>12 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>06-0213101</s0>
</fA47>
<fA60><s1>C</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA66 i1="01"><s0>POL</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001D04A05A</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D04A03</s0>
</fC02>
<fC02 i1="03" i2="X"><s0>001D04A05C</s0>
</fC02>
<fC02 i1="04" i2="X"><s0>001D03F06A</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Reconnaissance caractère</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Character recognition</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Reconocimiento carácter</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Indexation</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Indexing</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Indización</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Recherche information</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Information retrieval</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Búsqueda información</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE"><s0>Signal numérique</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG"><s0>Digital signal</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA"><s0>Señal numérica</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Signal vidéo</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Video signal</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Señal video</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE"><s0>Analyse sémantique</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG"><s0>Semantic analysis</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA"><s0>Análisis semántico</s0>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE"><s0>Base connaissance</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG"><s0>Knowledge base</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA"><s0>Base conocimiento</s0>
<s5>07</s5>
</fC03>
<fC03 i1="08" i2="3" l="FRE"><s0>Recherche image</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="3" l="ENG"><s0>Image retrieval</s0>
<s5>08</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE"><s0>Localisation</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG"><s0>Localization</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA"><s0>Localización</s0>
<s5>09</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE"><s0>Reconnaissance optique caractère</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG"><s0>Optical character recognition</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA"><s0>Reconocimento óptico de caracteres</s0>
<s5>10</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE"><s0>Packaging électronique</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG"><s0>Electronic packaging</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="SPA"><s0>Packaging electrónico</s0>
<s5>11</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE"><s0>Reconnaissance forme</s0>
<s5>31</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG"><s0>Pattern recognition</s0>
<s5>31</s5>
</fC03>
<fC03 i1="12" i2="X" l="SPA"><s0>Reconocimiento patrón</s0>
<s5>31</s5>
</fC03>
<fC03 i1="13" i2="3" l="FRE"><s0>Extraction caractéristique</s0>
<s5>32</s5>
</fC03>
<fC03 i1="13" i2="3" l="ENG"><s0>Feature extraction</s0>
<s5>32</s5>
</fC03>
<fC03 i1="14" i2="X" l="FRE"><s0>Traitement signal</s0>
<s5>33</s5>
</fC03>
<fC03 i1="14" i2="X" l="ENG"><s0>Signal processing</s0>
<s5>33</s5>
</fC03>
<fC03 i1="14" i2="X" l="SPA"><s0>Procesamiento señal</s0>
<s5>33</s5>
</fC03>
<fN21><s1>135</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
</standard>
<server><NO>PASCAL 06-0213101 INIST</NO>
<ET>Text detection and recognition for video indexing</ET>
<AU>ARISH ASIF QAZI; SIDDIQI (Imran A.); SARMAD HUSSAIN (M.)</AU>
<AF>Room No 617 SpannsKamp26/22527 Hamburg/Allemagne (1 aut.); InterActive Communications Private Limited* 9 - Street 29, F-7/1/Islamabad/Pakistan (2 aut., 3 aut.)</AF>
<DT>Congrès; Niveau analytique</DT>
<SO>International conference on signals and electronic systems/2004-09-13/Poznan POL; Pologne; Poznan: PTETiS; Da. 2005; Pp. 529-532; ISBN 83-906074-7-6</SO>
<LA>Anglais</LA>
<EA>Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.</EA>
<CC>001D04A05A; 001D04A03; 001D04A05C; 001D03F06A</CC>
<FD>Reconnaissance caractère; Indexation; Recherche information; Signal numérique; Signal vidéo; Analyse sémantique; Base connaissance; Recherche image; Localisation; Reconnaissance optique caractère; Packaging électronique; Reconnaissance forme; Extraction caractéristique; Traitement signal</FD>
<ED>Character recognition; Indexing; Information retrieval; Digital signal; Video signal; Semantic analysis; Knowledge base; Image retrieval; Localization; Optical character recognition; Electronic packaging; Pattern recognition; Feature extraction; Signal processing</ED>
<SD>Reconocimiento carácter; Indización; Búsqueda información; Señal numérica; Señal video; Análisis semántico; Base conocimiento; Localización; Reconocimento óptico de caracteres; Packaging electrónico; Reconocimiento patrón; Procesamiento señal</SD>
<LO>INIST-Y 38366.354000138746651320</LO>
<ID>06-0213101</ID>
</server>
</inist>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000394 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000394 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= PascalFrancis |étape= Corpus |type= RBID |clé= Pascal:06-0213101 |texte= Text detection and recognition for video indexing }}
This area was generated with Dilib version V0.6.32. |