Text detection and recognition for video indexing
Identifieur interne : 000392 ( PascalFrancis/Curation ); précédent : 000391; suivant : 000393Text detection and recognition for video indexing
Auteurs : ARISH ASIF QAZI [Allemagne] ; Imran A. Siddiqi [Pakistan] ; M. Sarmad Hussain [Pakistan]Source :
Descripteurs français
- Pascal (Inist)
English descriptors
- KwdEn :
Abstract
Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.
pA |
|
---|
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000394
Links to Exploration step
Pascal:06-0213101Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Text detection and recognition for video indexing</title>
<author><name sortKey="Arish Asif Qazi" sort="Arish Asif Qazi" uniqKey="Arish Asif Qazi" last="Arish Asif Qazi">ARISH ASIF QAZI</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Siddiqi, Imran A" sort="Siddiqi, Imran A" uniqKey="Siddiqi I" first="Imran A." last="Siddiqi">Imran A. Siddiqi</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
<author><name sortKey="Sarmad Hussain, M" sort="Sarmad Hussain, M" uniqKey="Sarmad Hussain M" first="M." last="Sarmad Hussain">M. Sarmad Hussain</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">06-0213101</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 06-0213101 INIST</idno>
<idno type="RBID">Pascal:06-0213101</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000394</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000392</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Text detection and recognition for video indexing</title>
<author><name sortKey="Arish Asif Qazi" sort="Arish Asif Qazi" uniqKey="Arish Asif Qazi" last="Arish Asif Qazi">ARISH ASIF QAZI</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Siddiqi, Imran A" sort="Siddiqi, Imran A" uniqKey="Siddiqi I" first="Imran A." last="Siddiqi">Imran A. Siddiqi</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
<author><name sortKey="Sarmad Hussain, M" sort="Sarmad Hussain, M" uniqKey="Sarmad Hussain M" first="M." last="Sarmad Hussain">M. Sarmad Hussain</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Pakistan</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Character recognition</term>
<term>Digital signal</term>
<term>Electronic packaging</term>
<term>Feature extraction</term>
<term>Image retrieval</term>
<term>Indexing</term>
<term>Information retrieval</term>
<term>Knowledge base</term>
<term>Localization</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Semantic analysis</term>
<term>Signal processing</term>
<term>Video signal</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance caractère</term>
<term>Indexation</term>
<term>Recherche information</term>
<term>Signal numérique</term>
<term>Signal vidéo</term>
<term>Analyse sémantique</term>
<term>Base connaissance</term>
<term>Recherche image</term>
<term>Localisation</term>
<term>Reconnaissance optique caractère</term>
<term>Packaging électronique</term>
<term>Reconnaissance forme</term>
<term>Extraction caractéristique</term>
<term>Traitement signal</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA08 i1="01" i2="1" l="ENG"><s1>Text detection and recognition for video indexing</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG"><s1>Signals and electronic systems : proceedings : Poznan, Poland, 13-15 September 2004</s1>
</fA09>
<fA11 i1="01" i2="1"><s1>ARISH ASIF QAZI</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>SIDDIQI (Imran A.)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>SARMAD HUSSAIN (M.)</s1>
</fA11>
<fA14 i1="01"><s1>Room No 617 SpannsKamp26</s1>
<s2>22527 Hamburg</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA14 i1="02"><s1>InterActive Communications Private Limited* 9 - Street 29, F-7/1</s1>
<s2>Islamabad</s2>
<s3>PAK</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA20><s1>529-532</s1>
</fA20>
<fA21><s1>2005</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA25 i1="01"><s1>PTETiS</s1>
<s2>Poznan</s2>
</fA25>
<fA26 i1="01"><s0>83-906074-7-6</s0>
</fA26>
<fA30 i1="01" i2="1" l="ENG"><s1>International conference on signals and electronic systems</s1>
<s3>Poznan POL</s3>
<s4>2004-09-13</s4>
</fA30>
<fA43 i1="01"><s1>INIST</s1>
<s2>Y 38366</s2>
<s5>354000138746651320</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2006 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>12 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>06-0213101</s0>
</fA47>
<fA60><s1>C</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA66 i1="01"><s0>POL</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>Efficient indexing and retrieval of digital video has been an area of active research in the recent years. Many of the existing retrieval systems, however, work without semantic knowledge. We propose a knowledge-based approach for video indexing, based on the fact that videos are rich in semantic contents. The most powerful index for semantic retrieval of video images is the Text appearing in them. In this paper we present a system for localization of horizontally aligned artificial text in video images. The detected text is binarized and is passed to a standard OCR package. The system has been thoroughly tested and the experimental results are found to be quite encouraging.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001D04A05A</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D04A03</s0>
</fC02>
<fC02 i1="03" i2="X"><s0>001D04A05C</s0>
</fC02>
<fC02 i1="04" i2="X"><s0>001D03F06A</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Reconnaissance caractère</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Character recognition</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Reconocimiento carácter</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Indexation</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Indexing</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Indización</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Recherche information</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Information retrieval</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Búsqueda información</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE"><s0>Signal numérique</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG"><s0>Digital signal</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA"><s0>Señal numérica</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Signal vidéo</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Video signal</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Señal video</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE"><s0>Analyse sémantique</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG"><s0>Semantic analysis</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA"><s0>Análisis semántico</s0>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE"><s0>Base connaissance</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG"><s0>Knowledge base</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA"><s0>Base conocimiento</s0>
<s5>07</s5>
</fC03>
<fC03 i1="08" i2="3" l="FRE"><s0>Recherche image</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="3" l="ENG"><s0>Image retrieval</s0>
<s5>08</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE"><s0>Localisation</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG"><s0>Localization</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA"><s0>Localización</s0>
<s5>09</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE"><s0>Reconnaissance optique caractère</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG"><s0>Optical character recognition</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA"><s0>Reconocimento óptico de caracteres</s0>
<s5>10</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE"><s0>Packaging électronique</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG"><s0>Electronic packaging</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="SPA"><s0>Packaging electrónico</s0>
<s5>11</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE"><s0>Reconnaissance forme</s0>
<s5>31</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG"><s0>Pattern recognition</s0>
<s5>31</s5>
</fC03>
<fC03 i1="12" i2="X" l="SPA"><s0>Reconocimiento patrón</s0>
<s5>31</s5>
</fC03>
<fC03 i1="13" i2="3" l="FRE"><s0>Extraction caractéristique</s0>
<s5>32</s5>
</fC03>
<fC03 i1="13" i2="3" l="ENG"><s0>Feature extraction</s0>
<s5>32</s5>
</fC03>
<fC03 i1="14" i2="X" l="FRE"><s0>Traitement signal</s0>
<s5>33</s5>
</fC03>
<fC03 i1="14" i2="X" l="ENG"><s0>Signal processing</s0>
<s5>33</s5>
</fC03>
<fC03 i1="14" i2="X" l="SPA"><s0>Procesamiento señal</s0>
<s5>33</s5>
</fC03>
<fN21><s1>135</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
</standard>
</inist>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000392 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Curation/biblio.hfd -nk 000392 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= PascalFrancis |étape= Curation |type= RBID |clé= Pascal:06-0213101 |texte= Text detection and recognition for video indexing }}
This area was generated with Dilib version V0.6.32. |