Evaluation framework for video OCR

Internal identifier: 000312 (PascalFrancis/Corpus); previous: 000311; next: 000313

Authors: Padmanabhan Soundararajan; Matthew Boonstra; Vasant Manohar; Valentina Korzhova; Dmitry Goldgof; Rangachar Kasturi; Shubha Prasad; Harish Raju; Rachel Bowers; John Garofolo

Source:

Lecture notes in computer science [ 0302-9743 ] ; 2006.

RBID : Pascal:07-0525709

French descriptors

Pascal (Inist)
- Vision ordinateur, Signal vidéo, Reconnaissance caractère, Reconnaissance optique caractère, Texte, Recherche information, Poursuite cible, Traitement image, Pistage, Métrique.

English descriptors

KwdEn :
- Character recognition, Computer vision, Image processing, Information retrieval, Metric, Optical character recognition, Target tracking, Text, Tracking, Video signal.

Abstract

In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text but readily generalizable to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the automatic speech recognition (ASR) metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.
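The abstract does not name the ASR metrics that were ported. The most widely used one in speech recognition is the word error rate (WER), an edit distance between a reference transcript and a system output, normalized by the reference length. The sketch below is an illustration under that assumption only, not the authors' scoring tool; the function names `edit_distance` and `error_rate` and the sample strings are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences of tokens."""
    # prev[j] holds the distance between the reference prefix seen so far
    # and the first j tokens of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(reference, hypothesis, unit="word"):
    """WER-style error rate: edit distance divided by reference length.

    unit="word" compares whitespace-separated words (word error rate);
    unit="char" compares individual characters (character error rate).
    """
    if unit == "word":
        ref, hyp = reference.split(), hypothesis.split()
    else:
        ref, hyp = list(reference), list(hypothesis)
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Hypothetical ground-truth caption and OCR output for one text object.
    print(error_rate("breaking news tonight", "breaking mews tonight"))
    print(error_rate("news", "mews", unit="char"))
```

Because the distance is normalized by the reference length rather than bounded, the rate can exceed 1.0 when the system output contains many insertions.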

Record in standard format (ISO 2709)

For details, see the documentation on the Inist Standard format.

A01	`01`	`1`		`@0 0302-9743`
A05				`@2 4338`
A08	`01`	`1`	`ENG`	`@1 Evaluation framework for video OCR`
A09	`01`	`1`	`ENG`	`@1 Computer vision, graphics and image processing : 5th Indian conference, ICVGIP 2006, Madurai, India, December 13-16, 2006 : proceedings`
A11	`01`	`1`		`@1 SOUNDARARAJAN (Padmanabhan)`
A11	`02`	`1`		`@1 BOONSTRA (Matthew)`
A11	`03`	`1`		`@1 MANOHAR (Vasant)`
A11	`04`	`1`		`@1 KORZHOVA (Valentina)`
A11	`05`	`1`		`@1 GOLDGOF (Dmitry)`
A11	`06`	`1`		`@1 KASTURI (Rangachar)`
A11	`07`	`1`		`@1 PRASAD (Shubha)`
A11	`08`	`1`		`@1 RAJU (Harish)`
A11	`09`	`1`		`@1 BOWERS (Rachel)`
A11	`10`	`1`		`@1 GAROFOLO (John)`
A12	`01`	`1`		`@1 KALRA (Prem K.) @9 ed.`
A12	`02`	`1`		`@1 PELEG (Shmuel) @9 ed.`
A14	`01`			`@1 Computer Science and Engineering, University of South Florida @2 Tampa, FL @3 USA @Z 1 aut. @Z 2 aut. @Z 3 aut. @Z 4 aut. @Z 5 aut. @Z 6 aut.`
A14	`02`			`@1 VideoMining Corporation @2 State College, PA @3 USA @Z 7 aut. @Z 8 aut.`
A14	`03`			`@1 National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group @3 USA @Z 9 aut. @Z 10 aut.`
A20				`@1 829-836`
A21				`@1 2006`
A23	`01`			`@0 ENG`
A26	`01`			`@0 3-540-68301-1`
A43	`01`			`@1 INIST @2 16343 @5 354000153627190740`
A44				`@0 0000 @1 © 2007 INIST-CNRS. All rights reserved.`
A45				`@0 6 ref.`
A47	`01`	`1`		`@0 07-0525709`
A60				`@1 P @2 C`
A61				`@0 A`
A64	`01`	`1`		`@0 Lecture notes in computer science`
A66	`01`			`@0 DEU`
A66	`02`			`@0 USA`
C01	`01`		`ENG`	@0 In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text but readily generalizable to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the automatic speech recognition (ASR) metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.
C02	`01`	`X`		`@0 001D02C04`
C02	`02`	`X`		`@0 001D02C03`
C03	`01`	`X`	`FRE`	`@0 Vision ordinateur @5 01`
C03	`01`	`X`	`ENG`	`@0 Computer vision @5 01`
C03	`01`	`X`	`SPA`	`@0 Visión ordenador @5 01`
C03	`02`	`X`	`FRE`	`@0 Signal vidéo @5 06`
C03	`02`	`X`	`ENG`	`@0 Video signal @5 06`
C03	`02`	`X`	`SPA`	`@0 Señal video @5 06`
C03	`03`	`X`	`FRE`	`@0 Reconnaissance caractère @5 07`
C03	`03`	`X`	`ENG`	`@0 Character recognition @5 07`
C03	`03`	`X`	`SPA`	`@0 Reconocimiento carácter @5 07`
C03	`04`	`X`	`FRE`	`@0 Reconnaissance optique caractère @5 08`
C03	`04`	`X`	`ENG`	`@0 Optical character recognition @5 08`
C03	`04`	`X`	`SPA`	`@0 Reconocimiento óptico de caracteres @5 08`
C03	`05`	`X`	`FRE`	`@0 Texte @5 09`
C03	`05`	`X`	`ENG`	`@0 Text @5 09`
C03	`05`	`X`	`SPA`	`@0 Texto @5 09`
C03	`06`	`X`	`FRE`	`@0 Recherche information @5 10`
C03	`06`	`X`	`ENG`	`@0 Information retrieval @5 10`
C03	`06`	`X`	`SPA`	`@0 Búsqueda información @5 10`
C03	`07`	`3`	`FRE`	`@0 Poursuite cible @5 11`
C03	`07`	`3`	`ENG`	`@0 Target tracking @5 11`
C03	`08`	`X`	`FRE`	`@0 Traitement image @5 12`
C03	`08`	`X`	`ENG`	`@0 Image processing @5 12`
C03	`08`	`X`	`SPA`	`@0 Procesamiento imagen @5 12`
C03	`09`	`X`	`FRE`	`@0 Pistage @5 13`
C03	`09`	`X`	`ENG`	`@0 Tracking @5 13`
C03	`09`	`X`	`SPA`	`@0 Rastreo @5 13`
C03	`10`	`X`	`FRE`	`@0 Métrique @5 18`
C03	`10`	`X`	`ENG`	`@0 Metric @5 18`
C03	`10`	`X`	`SPA`	`@0 Métrico @5 18`
N21				`@1 344`
N44	`01`			`@1 OTO`
N82				`@1 OTO`

A30	`01`	`1`	`ENG`	`@1 Indian Conference on Computer Vision Graphics and Image Processing @2 5 @3 Madurai IND @4 2006`

Inist format (server)

NO :	PASCAL 07-0525709 INIST
ET :	Evaluation framework for video OCR
AU :	SOUNDARARAJAN (Padmanabhan); BOONSTRA (Matthew); MANOHAR (Vasant); KORZHOVA (Valentina); GOLDGOF (Dmitry); KASTURI (Rangachar); PRASAD (Shubha); RAJU (Harish); BOWERS (Rachel); GAROFOLO (John); KALRA (Prem K.); PELEG (Shmuel)
AF :	Computer Science and Engineering, University of South Florida/Tampa, FL/Etats-Unis (1 aut., 2 aut., 3 aut., 4 aut., 5 aut., 6 aut.); VideoMining Corporation/State College, PA/Etats-Unis (7 aut., 8 aut.); National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group/Etats-Unis (9 aut., 10 aut.)
DT :	Publication en série; Congrès; Niveau analytique
SO :	Lecture notes in computer science; ISSN 0302-9743; Allemagne; Da. 2006; Vol. 4338; Pp. 829-836; Bibl. 6 ref.
LA :	Anglais
EA :	In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text but readily generalizable to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the automatic speech recognition (ASR) metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.
CC :	001D02C04; 001D02C03
FD :	Vision ordinateur; Signal vidéo; Reconnaissance caractère; Reconnaissance optique caractère; Texte; Recherche information; Poursuite cible; Traitement image; Pistage; Métrique
ED :	Computer vision; Video signal; Character recognition; Optical character recognition; Text; Information retrieval; Target tracking; Image processing; Tracking; Metric
SD :	Visión ordenador; Señal video; Reconocimiento carácter; Reconocimiento óptico de caracteres; Texto; Búsqueda información; Procesamiento imagen; Rastreo; Métrico
LO :	INIST-16343.354000153627190740
ID :	07-0525709

Links to Exploration step

Pascal:07-0525709

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Evaluation framework for video OCR</title>
<author><name sortKey="Soundararajan, Padmanabhan" sort="Soundararajan, Padmanabhan" uniqKey="Soundararajan P" first="Padmanabhan" last="Soundararajan">Padmanabhan Soundararajan</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Boonstra, Matthew" sort="Boonstra, Matthew" uniqKey="Boonstra M" first="Matthew" last="Boonstra">Matthew Boonstra</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Manohar, Vasant" sort="Manohar, Vasant" uniqKey="Manohar V" first="Vasant" last="Manohar">Vasant Manohar</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Korzhova, Valentina" sort="Korzhova, Valentina" uniqKey="Korzhova V" first="Valentina" last="Korzhova">Valentina Korzhova</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Goldgof, Dmitry" sort="Goldgof, Dmitry" uniqKey="Goldgof D" first="Dmitry" last="Goldgof">Dmitry Goldgof</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Kasturi, Rangachar" sort="Kasturi, Rangachar" uniqKey="Kasturi R" first="Rangachar" last="Kasturi">Rangachar Kasturi</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Prasad, Shubha" sort="Prasad, Shubha" uniqKey="Prasad S" first="Shubha" last="Prasad">Shubha Prasad</name>
<affiliation><inist:fA14 i1="02"><s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Raju, Harish" sort="Raju, Harish" uniqKey="Raju H" first="Harish" last="Raju">Harish Raju</name>
<affiliation><inist:fA14 i1="02"><s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Bowers, Rachel" sort="Bowers, Rachel" uniqKey="Bowers R" first="Rachel" last="Bowers">Rachel Bowers</name>
<affiliation><inist:fA14 i1="03"><s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Garofolo, John" sort="Garofolo, John" uniqKey="Garofolo J" first="John" last="Garofolo">John Garofolo</name>
<affiliation><inist:fA14 i1="03"><s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">07-0525709</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 07-0525709 INIST</idno>
<idno type="RBID">Pascal:07-0525709</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000312</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Evaluation framework for video OCR</title>
<author><name sortKey="Soundararajan, Padmanabhan" sort="Soundararajan, Padmanabhan" uniqKey="Soundararajan P" first="Padmanabhan" last="Soundararajan">Padmanabhan Soundararajan</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Boonstra, Matthew" sort="Boonstra, Matthew" uniqKey="Boonstra M" first="Matthew" last="Boonstra">Matthew Boonstra</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Manohar, Vasant" sort="Manohar, Vasant" uniqKey="Manohar V" first="Vasant" last="Manohar">Vasant Manohar</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Korzhova, Valentina" sort="Korzhova, Valentina" uniqKey="Korzhova V" first="Valentina" last="Korzhova">Valentina Korzhova</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Goldgof, Dmitry" sort="Goldgof, Dmitry" uniqKey="Goldgof D" first="Dmitry" last="Goldgof">Dmitry Goldgof</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Kasturi, Rangachar" sort="Kasturi, Rangachar" uniqKey="Kasturi R" first="Rangachar" last="Kasturi">Rangachar Kasturi</name>
<affiliation><inist:fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Prasad, Shubha" sort="Prasad, Shubha" uniqKey="Prasad S" first="Shubha" last="Prasad">Shubha Prasad</name>
<affiliation><inist:fA14 i1="02"><s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Raju, Harish" sort="Raju, Harish" uniqKey="Raju H" first="Harish" last="Raju">Harish Raju</name>
<affiliation><inist:fA14 i1="02"><s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Bowers, Rachel" sort="Bowers, Rachel" uniqKey="Bowers R" first="Rachel" last="Bowers">Rachel Bowers</name>
<affiliation><inist:fA14 i1="03"><s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Garofolo, John" sort="Garofolo, John" uniqKey="Garofolo J" first="John" last="Garofolo">John Garofolo</name>
<affiliation><inist:fA14 i1="03"><s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
<imprint><date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Character recognition</term>
<term>Computer vision</term>
<term>Image processing</term>
<term>Information retrieval</term>
<term>Metric</term>
<term>Optical character recognition</term>
<term>Target tracking</term>
<term>Text</term>
<term>Tracking</term>
<term>Video signal</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Vision ordinateur</term>
<term>Signal vidéo</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Texte</term>
<term>Recherche information</term>
<term>Poursuite cible</term>
<term>Traitement image</term>
<term>Pistage</term>
<term>Métrique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text but readily generalizable to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the automatic speech recognition (ASR) metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>0302-9743</s0>
</fA01>
<fA05><s2>4338</s2>
</fA05>
<fA08 i1="01" i2="1" l="ENG"><s1>Evaluation framework for video OCR</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG"><s1>Computer vision, graphics and image processing : 5th Indian conference, ICVGIP 2006, Madurai, India, December 13-16, 2006 : proceedings</s1>
</fA09>
<fA11 i1="01" i2="1"><s1>SOUNDARARAJAN (Padmanabhan)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>BOONSTRA (Matthew)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>MANOHAR (Vasant)</s1>
</fA11>
<fA11 i1="04" i2="1"><s1>KORZHOVA (Valentina)</s1>
</fA11>
<fA11 i1="05" i2="1"><s1>GOLDGOF (Dmitry)</s1>
</fA11>
<fA11 i1="06" i2="1"><s1>KASTURI (Rangachar)</s1>
</fA11>
<fA11 i1="07" i2="1"><s1>PRASAD (Shubha)</s1>
</fA11>
<fA11 i1="08" i2="1"><s1>RAJU (Harish)</s1>
</fA11>
<fA11 i1="09" i2="1"><s1>BOWERS (Rachel)</s1>
</fA11>
<fA11 i1="10" i2="1"><s1>GAROFOLO (John)</s1>
</fA11>
<fA12 i1="01" i2="1"><s1>KALRA (Prem K.)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1"><s1>PELEG (Shmuel)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01"><s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</fA14>
<fA14 i1="02"><s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</fA14>
<fA14 i1="03"><s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</fA14>
<fA20><s1>829-836</s1>
</fA20>
<fA21><s1>2006</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA26 i1="01"><s0>3-540-68301-1</s0>
</fA26>
<fA43 i1="01"><s1>INIST</s1>
<s2>16343</s2>
<s5>354000153627190740</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2007 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>6 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>07-0525709</s0>
</fA47>
<fA60><s1>P</s1>
<s2>C</s2>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>Lecture notes in computer science</s0>
</fA64>
<fA66 i1="01"><s0>DEU</s0>
</fA66>
<fA66 i1="02"><s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text but readily generalizable to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the automatic speech recognition (ASR) metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001D02C04</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D02C03</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Vision ordinateur</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Computer vision</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Visión ordenador</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Signal vidéo</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Video signal</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Señal video</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Reconnaissance caractère</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Character recognition</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Reconocimiento carácter</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE"><s0>Reconnaissance optique caractère</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG"><s0>Optical character recognition</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA"><s0>Reconocimiento óptico de caracteres</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Texte</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Text</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Texto</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE"><s0>Recherche information</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG"><s0>Information retrieval</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA"><s0>Búsqueda información</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="3" l="FRE"><s0>Poursuite cible</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="3" l="ENG"><s0>Target tracking</s0>
<s5>11</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE"><s0>Traitement image</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG"><s0>Image processing</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA"><s0>Procesamiento imagen</s0>
<s5>12</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE"><s0>Pistage</s0>
<s5>13</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG"><s0>Tracking</s0>
<s5>13</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA"><s0>Rastreo</s0>
<s5>13</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE"><s0>Métrique</s0>
<s5>18</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG"><s0>Metric</s0>
<s5>18</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA"><s0>Métrico</s0>
<s5>18</s5>
</fC03>
<fN21><s1>344</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
<pR><fA30 i1="01" i2="1" l="ENG"><s1>Indian Conference on Computer Vision Graphics and Image Processing</s1>
<s2>5</s2>
<s3>Madurai IND</s3>
<s4>2006</s4>
</fA30>
</pR>
</standard>
<server><NO>PASCAL 07-0525709 INIST</NO>
<ET>Evaluation framework for video OCR</ET>
<AU>SOUNDARARAJAN (Padmanabhan); BOONSTRA (Matthew); MANOHAR (Vasant); KORZHOVA (Valentina); GOLDGOF (Dmitry); KASTURI (Rangachar); PRASAD (Shubha); RAJU (Harish); BOWERS (Rachel); GAROFOLO (John); KALRA (Prem K.); PELEG (Shmuel)</AU>
<AF>Computer Science and Engineering, University of South Florida/Tampa, FL/Etats-Unis (1 aut., 2 aut., 3 aut., 4 aut., 5 aut., 6 aut.); VideoMining Corporation/State College, PA/Etats-Unis (7 aut., 8 aut.); National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group/Etats-Unis (9 aut., 10 aut.)</AF>
<DT>Publication en série; Congrès; Niveau analytique</DT>
<SO>Lecture notes in computer science; ISSN 0302-9743; Allemagne; Da. 2006; Vol. 4338; Pp. 829-836; Bibl. 6 ref.</SO>
<LA>Anglais</LA>
<EA>In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text but readily generalizable to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the automatic speech recognition (ASR) metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.</EA>
<CC>001D02C04; 001D02C03</CC>
<FD>Vision ordinateur; Signal vidéo; Reconnaissance caractère; Reconnaissance optique caractère; Texte; Recherche information; Poursuite cible; Traitement image; Pistage; Métrique</FD>
<ED>Computer vision; Video signal; Character recognition; Optical character recognition; Text; Information retrieval; Target tracking; Image processing; Tracking; Metric</ED>
<SD>Visión ordenador; Señal video; Reconocimiento carácter; Reconocimiento óptico de caracteres; Texto; Búsqueda información; Procesamiento imagen; Rastreo; Métrico</SD>
<LO>INIST-16343.354000153627190740</LO>
<ID>07-0525709</ID>
</server>
</inist>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000312 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000312 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:07-0525709
   |texte=   Evaluation framework for video OCR
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

	OCR exploration server
	Warning: this site is under development. It was generated by automated means from raw corpora, so the information has not been validated.
