OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Evaluation framework for video OCR

Internal identifier: 000312 (PascalFrancis/Corpus); previous: 000311; next: 000313

Authors: Padmanabhan Soundararajan; Matthew Boonstra; Vasant Manohar; Valentina Korzhova; Dmitry Goldgof; Rangachar Kasturi; Shubha Prasad; Harish Raju; Rachel Bowers; John Garofolo

RBID : Pascal:07-0525709

Abstract

In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text, but it could well be generalized to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the ASR metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results obtained are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.
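The abstract does not name the ASR metrics that were ported. As an illustration only, the sketch below assumes that word error rate (WER), the most widely used ASR metric, is among them, and applies it to a recognized text string against a reference transcription. The function name `word_error_rate` and the sample strings are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: whitespace tokenization and unit cost for every
    substitution, deletion, and insertion (the usual ASR convention).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("news" -> "new") and one deletion ("tampa")
# over four reference words.
print(word_error_rate("breaking news from tampa", "breaking new from"))  # 0.5
```

Because insertions are counted, the value can exceed 1.0 when the recognized text is much longer than the reference.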

Record in standard format (ISO 2709)

See the documentation on the Inist Standard format.

pA  
A01 01  1    @0 0302-9743
A05       @2 4338
A08 01  1  ENG  @1 Evaluation framework for video OCR
A09 01  1  ENG  @1 Computer vision, graphics and image processing : 5th Indian conference, ICVGIP 2006, Madurai, India, December 13-16, 2006 : proceedings
A11 01  1    @1 SOUNDARARAJAN (Padmanabhan)
A11 02  1    @1 BOONSTRA (Matthew)
A11 03  1    @1 MANOHAR (Vasant)
A11 04  1    @1 KORZHOVA (Valentina)
A11 05  1    @1 GOLDGOF (Dmitry)
A11 06  1    @1 KASTURI (Rangachar)
A11 07  1    @1 PRASAD (Shubha)
A11 08  1    @1 RAJU (Harish)
A11 09  1    @1 BOWERS (Rachel)
A11 10  1    @1 GAROFOLO (John)
A12 01  1    @1 KALRA (Prem K.) @9 ed.
A12 02  1    @1 PELEG (Shmuel) @9 ed.
A14 01      @1 Computer Science and Engineering, University of South Florida @2 Tampa, FL @3 USA @Z 1 aut. @Z 2 aut. @Z 3 aut. @Z 4 aut. @Z 5 aut. @Z 6 aut.
A14 02      @1 VideoMining Corporation @2 State College, PA @3 USA @Z 7 aut. @Z 8 aut.
A14 03      @1 National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group @3 USA @Z 9 aut. @Z 10 aut.
A20       @1 829-836
A21       @1 2006
A23 01      @0 ENG
A26 01      @0 3-540-68301-1
A43 01      @1 INIST @2 16343 @5 354000153627190740
A44       @0 0000 @1 © 2007 INIST-CNRS. All rights reserved.
A45       @0 6 ref.
A47 01  1    @0 07-0525709
A60       @1 P @2 C
A61       @0 A
A64 01  1    @0 Lecture notes in computer science
A66 01      @0 DEU
A66 02      @0 USA
C01 01    ENG  @0 In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text, but it could well be generalized to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the ASR metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results obtained are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.
C02 01  X    @0 001D02C04
C02 02  X    @0 001D02C03
C03 01  X  FRE  @0 Vision ordinateur @5 01
C03 01  X  ENG  @0 Computer vision @5 01
C03 01  X  SPA  @0 Visión ordenador @5 01
C03 02  X  FRE  @0 Signal vidéo @5 06
C03 02  X  ENG  @0 Video signal @5 06
C03 02  X  SPA  @0 Señal video @5 06
C03 03  X  FRE  @0 Reconnaissance caractère @5 07
C03 03  X  ENG  @0 Character recognition @5 07
C03 03  X  SPA  @0 Reconocimiento carácter @5 07
C03 04  X  FRE  @0 Reconnaissance optique caractère @5 08
C03 04  X  ENG  @0 Optical character recognition @5 08
C03 04  X  SPA  @0 Reconocimiento óptico de caracteres @5 08
C03 05  X  FRE  @0 Texte @5 09
C03 05  X  ENG  @0 Text @5 09
C03 05  X  SPA  @0 Texto @5 09
C03 06  X  FRE  @0 Recherche information @5 10
C03 06  X  ENG  @0 Information retrieval @5 10
C03 06  X  SPA  @0 Búsqueda información @5 10
C03 07  3  FRE  @0 Poursuite cible @5 11
C03 07  3  ENG  @0 Target tracking @5 11
C03 08  X  FRE  @0 Traitement image @5 12
C03 08  X  ENG  @0 Image processing @5 12
C03 08  X  SPA  @0 Procesamiento imagen @5 12
C03 09  X  FRE  @0 Pistage @5 13
C03 09  X  ENG  @0 Tracking @5 13
C03 09  X  SPA  @0 Rastreo @5 13
C03 10  X  FRE  @0 Métrique @5 18
C03 10  X  ENG  @0 Metric @5 18
C03 10  X  SPA  @0 Métrico @5 18
N21       @1 344
N44 01      @1 OTO
N82       @1 OTO
pR  
A30 01  1  ENG  @1 Indian Conference on Computer Vision Graphics and Image Processing @2 5 @3 Madurai IND @4 2006
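
The listing above appears to follow a simple line layout: a field tag (such as `A11`), optional indicators, then subfields introduced by `@` and a one-character code. The following is a minimal sketch of a parser for that layout, inferred from this listing alone rather than from the Inist format documentation. The function name `parse_field` is hypothetical, and the sketch assumes that `@` never occurs inside a subfield value.

```python
def parse_field(line: str):
    """Split one record line into (tag, indicators, subfields).

    Assumed layout, inferred from the listing:
        TAG [indicator ...] @code value [@code value ...]
    """
    parts = line.split("@")
    header = parts[0].split()
    tag = header[0] if header else ""
    indicators = header[1:]

    subfields = []
    for chunk in parts[1:]:
        # The first token after "@" is the subfield code; the rest is its value.
        code, _, value = chunk.partition(" ")
        subfields.append((code, value.strip()))
    return tag, indicators, subfields


line = "A14 02      @1 VideoMining Corporation @2 State College, PA @3 USA @Z 7 aut. @Z 8 aut."
tag, indicators, subfields = parse_field(line)
print(tag, indicators)
for code, value in subfields:
    print(code, value)
```

Repeated codes, such as the two `@Z` subfields above, are kept in order as separate pairs instead of being merged into a dictionary, so no value is lost.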

Inist format (server)

NO : PASCAL 07-0525709 INIST
ET : Evaluation framework for video OCR
AU : SOUNDARARAJAN (Padmanabhan); BOONSTRA (Matthew); MANOHAR (Vasant); KORZHOVA (Valentina); GOLDGOF (Dmitry); KASTURI (Rangachar); PRASAD (Shubha); RAJU (Harish); BOWERS (Rachel); GAROFOLO (John); KALRA (Prem K.); PELEG (Shmuel)
AF : Computer Science and Engineering, University of South Florida/Tampa, FL/United States (1 aut., 2 aut., 3 aut., 4 aut., 5 aut., 6 aut.); VideoMining Corporation/State College, PA/United States (7 aut., 8 aut.); National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group/United States (9 aut., 10 aut.)
DT : Serial publication; Conference; Analytic level
SO : Lecture notes in computer science; ISSN 0302-9743; Germany; Da. 2006; Vol. 4338; Pp. 829-836; Bibl. 6 ref.
LA : English
EA : In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text, but it could well be generalized to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the ASR metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results obtained are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.
CC : 001D02C04; 001D02C03
FD : Vision ordinateur; Signal vidéo; Reconnaissance caractère; Reconnaissance optique caractère; Texte; Recherche information; Poursuite cible; Traitement image; Pistage; Métrique
ED : Computer vision; Video signal; Character recognition; Optical character recognition; Text; Information retrieval; Target tracking; Image processing; Tracking; Metric
SD : Visión ordenador; Señal video; Reconocimiento carácter; Reconocimiento óptico de caracteres; Texto; Búsqueda información; Procesamiento imagen; Rastreo; Métrico
LO : INIST-16343.354000153627190740
ID : 07-0525709

Links to Exploration step

Pascal:07-0525709

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Evaluation framework for video OCR</title>
<author>
<name sortKey="Soundararajan, Padmanabhan" sort="Soundararajan, Padmanabhan" uniqKey="Soundararajan P" first="Padmanabhan" last="Soundararajan">Padmanabhan Soundararajan</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Boonstra, Matthew" sort="Boonstra, Matthew" uniqKey="Boonstra M" first="Matthew" last="Boonstra">Matthew Boonstra</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Manohar, Vasant" sort="Manohar, Vasant" uniqKey="Manohar V" first="Vasant" last="Manohar">Vasant Manohar</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Korzhova, Valentina" sort="Korzhova, Valentina" uniqKey="Korzhova V" first="Valentina" last="Korzhova">Valentina Korzhova</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Goldgof, Dmitry" sort="Goldgof, Dmitry" uniqKey="Goldgof D" first="Dmitry" last="Goldgof">Dmitry Goldgof</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Kasturi, Rangachar" sort="Kasturi, Rangachar" uniqKey="Kasturi R" first="Rangachar" last="Kasturi">Rangachar Kasturi</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Prasad, Shubha" sort="Prasad, Shubha" uniqKey="Prasad S" first="Shubha" last="Prasad">Shubha Prasad</name>
<affiliation>
<inist:fA14 i1="02">
<s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Raju, Harish" sort="Raju, Harish" uniqKey="Raju H" first="Harish" last="Raju">Harish Raju</name>
<affiliation>
<inist:fA14 i1="02">
<s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Bowers, Rachel" sort="Bowers, Rachel" uniqKey="Bowers R" first="Rachel" last="Bowers">Rachel Bowers</name>
<affiliation>
<inist:fA14 i1="03">
<s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Garofolo, John" sort="Garofolo, John" uniqKey="Garofolo J" first="John" last="Garofolo">John Garofolo</name>
<affiliation>
<inist:fA14 i1="03">
<s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">07-0525709</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 07-0525709 INIST</idno>
<idno type="RBID">Pascal:07-0525709</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000312</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Evaluation framework for video OCR</title>
<author>
<name sortKey="Soundararajan, Padmanabhan" sort="Soundararajan, Padmanabhan" uniqKey="Soundararajan P" first="Padmanabhan" last="Soundararajan">Padmanabhan Soundararajan</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Boonstra, Matthew" sort="Boonstra, Matthew" uniqKey="Boonstra M" first="Matthew" last="Boonstra">Matthew Boonstra</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Manohar, Vasant" sort="Manohar, Vasant" uniqKey="Manohar V" first="Vasant" last="Manohar">Vasant Manohar</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Korzhova, Valentina" sort="Korzhova, Valentina" uniqKey="Korzhova V" first="Valentina" last="Korzhova">Valentina Korzhova</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Goldgof, Dmitry" sort="Goldgof, Dmitry" uniqKey="Goldgof D" first="Dmitry" last="Goldgof">Dmitry Goldgof</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Kasturi, Rangachar" sort="Kasturi, Rangachar" uniqKey="Kasturi R" first="Rangachar" last="Kasturi">Rangachar Kasturi</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Prasad, Shubha" sort="Prasad, Shubha" uniqKey="Prasad S" first="Shubha" last="Prasad">Shubha Prasad</name>
<affiliation>
<inist:fA14 i1="02">
<s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Raju, Harish" sort="Raju, Harish" uniqKey="Raju H" first="Harish" last="Raju">Harish Raju</name>
<affiliation>
<inist:fA14 i1="02">
<s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Bowers, Rachel" sort="Bowers, Rachel" uniqKey="Bowers R" first="Rachel" last="Bowers">Rachel Bowers</name>
<affiliation>
<inist:fA14 i1="03">
<s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Garofolo, John" sort="Garofolo, John" uniqKey="Garofolo J" first="John" last="Garofolo">John Garofolo</name>
<affiliation>
<inist:fA14 i1="03">
<s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
<imprint>
<date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Computer vision</term>
<term>Image processing</term>
<term>Information retrieval</term>
<term>Metric</term>
<term>Optical character recognition</term>
<term>Target tracking</term>
<term>Text</term>
<term>Tracking</term>
<term>Video signal</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Vision ordinateur</term>
<term>Signal vidéo</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Texte</term>
<term>Recherche information</term>
<term>Poursuite cible</term>
<term>Traitement image</term>
<term>Pistage</term>
<term>Métrique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text, but it could well be generalized to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the ASR metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results obtained are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0302-9743</s0>
</fA01>
<fA05>
<s2>4338</s2>
</fA05>
<fA08 i1="01" i2="1" l="ENG">
<s1>Evaluation framework for video OCR</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Computer vision, graphics and image processing : 5th Indian conference, ICVGIP 2006, Madurai, India, December 13-16, 2006 : proceedings</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>SOUNDARARAJAN (Padmanabhan)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>BOONSTRA (Matthew)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>MANOHAR (Vasant)</s1>
</fA11>
<fA11 i1="04" i2="1">
<s1>KORZHOVA (Valentina)</s1>
</fA11>
<fA11 i1="05" i2="1">
<s1>GOLDGOF (Dmitry)</s1>
</fA11>
<fA11 i1="06" i2="1">
<s1>KASTURI (Rangachar)</s1>
</fA11>
<fA11 i1="07" i2="1">
<s1>PRASAD (Shubha)</s1>
</fA11>
<fA11 i1="08" i2="1">
<s1>RAJU (Harish)</s1>
</fA11>
<fA11 i1="09" i2="1">
<s1>BOWERS (Rachel)</s1>
</fA11>
<fA11 i1="10" i2="1">
<s1>GAROFOLO (John)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>KALRA (Prem K.)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>PELEG (Shmuel)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Computer Science and Engineering, University of South Florida</s1>
<s2>Tampa, FL</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</fA14>
<fA14 i1="02">
<s1>VideoMining Corporation</s1>
<s2>State College, PA</s2>
<s3>USA</s3>
<sZ>7 aut.</sZ>
<sZ>8 aut.</sZ>
</fA14>
<fA14 i1="03">
<s1>National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group</s1>
<s3>USA</s3>
<sZ>9 aut.</sZ>
<sZ>10 aut.</sZ>
</fA14>
<fA20>
<s1>829-836</s1>
</fA20>
<fA21>
<s1>2006</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA26 i1="01">
<s0>3-540-68301-1</s0>
</fA26>
<fA43 i1="01">
<s1>INIST</s1>
<s2>16343</s2>
<s5>354000153627190740</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2007 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>6 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>07-0525709</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Lecture notes in computer science</s0>
</fA64>
<fA66 i1="01">
<s0>DEU</s0>
</fA66>
<fA66 i1="02">
<s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text, but it could well be generalized to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the ASR metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results obtained are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001D02C04</s0>
</fC02>
<fC02 i1="02" i2="X">
<s0>001D02C03</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Vision ordinateur</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Computer vision</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Visión ordenador</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Signal vidéo</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Video signal</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Señal video</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Reconnaissance caractère</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Character recognition</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Reconocimiento carácter</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Reconnaissance optique caractère</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Optical character recognition</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Reconocimiento óptico de caracteres</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Texte</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Text</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Texto</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Recherche information</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Information retrieval</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Búsqueda información</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="3" l="FRE">
<s0>Poursuite cible</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="3" l="ENG">
<s0>Target tracking</s0>
<s5>11</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE">
<s0>Traitement image</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG">
<s0>Image processing</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA">
<s0>Procesamiento imagen</s0>
<s5>12</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>Pistage</s0>
<s5>13</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG">
<s0>Tracking</s0>
<s5>13</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA">
<s0>Rastreo</s0>
<s5>13</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE">
<s0>Métrique</s0>
<s5>18</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG">
<s0>Metric</s0>
<s5>18</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA">
<s0>Métrico</s0>
<s5>18</s5>
</fC03>
<fN21>
<s1>344</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>Indian Conference on Computer Vision Graphics and Image Processing</s1>
<s2>5</s2>
<s3>Madurai IND</s3>
<s4>2006</s4>
</fA30>
</pR>
</standard>
<server>
<NO>PASCAL 07-0525709 INIST</NO>
<ET>Evaluation framework for video OCR</ET>
<AU>SOUNDARARAJAN (Padmanabhan); BOONSTRA (Matthew); MANOHAR (Vasant); KORZHOVA (Valentina); GOLDGOF (Dmitry); KASTURI (Rangachar); PRASAD (Shubha); RAJU (Harish); BOWERS (Rachel); GAROFOLO (John); KALRA (Prem K.); PELEG (Shmuel)</AU>
<AF>Computer Science and Engineering, University of South Florida/Tampa, FL/United States (1 aut., 2 aut., 3 aut., 4 aut., 5 aut., 6 aut.); VideoMining Corporation/State College, PA/United States (7 aut., 8 aut.); National Institute of Standards and Technology (NIST), Information Technology Lab -Information Access Division, Speech Group/United States (9 aut., 10 aut.)</AF>
<DT>Serial publication; Conference; Analytic level</DT>
<SO>Lecture notes in computer science; ISSN 0302-9743; Germany; Da. 2006; Vol. 4338; Pp. 829-836; Bibl. 6 ref.</SO>
<LA>English</LA>
<EA>In this work, we present a recently developed evaluation framework for video OCR, designed specifically for English text, but it could well be generalized to other languages. Earlier work includes the development of an evaluation strategy for text detection and tracking in video; this work is a natural extension. We successfully port the ASR metrics used in the speech community to the video domain. We also show results on a small pilot corpus of 25 clips. The results obtained are promising, and we believe that this is a good baseline that will encourage future participation in such evaluations.</EA>
<CC>001D02C04; 001D02C03</CC>
<FD>Vision ordinateur; Signal vidéo; Reconnaissance caractère; Reconnaissance optique caractère; Texte; Recherche information; Poursuite cible; Traitement image; Pistage; Métrique</FD>
<ED>Computer vision; Video signal; Character recognition; Optical character recognition; Text; Information retrieval; Target tracking; Image processing; Tracking; Metric</ED>
<SD>Visión ordenador; Señal video; Reconocimiento carácter; Reconocimiento óptico de caracteres; Texto; Búsqueda información; Procesamiento imagen; Rastreo; Métrico</SD>
<LO>INIST-16343.354000153627190740</LO>
<ID>07-0525709</ID>
</server>
</inist>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000312 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000312 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:07-0525709
   |texte=   Evaluation framework for video OCR
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024