OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Automated evaluation of OCR zoning

Internal identifier: 000A75 (PascalFrancis/Corpus); previous: 000A74; next: 000A76

Authors: J. Kanai; S. V. Rice; T. A. Nartker; G. Nagy

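Source: IEEE transactions on pattern analysis and machine intelligence; ISSN 0162-8828; 1995; Vol. 17; No. 1; pp. 86-90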

RBID : Pascal:95-0128706

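French descriptors: Reconnaissance caractère; Système optique; Analyse documentaire; Segmentation; Evaluation performance; Page segmentation; Layout analysis

English descriptors: Character recognition; Optical system; Document analysis; Segmentation; Performance evaluation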

Abstract

Many current optical character recognition (OCR) systems attempt to decompose printed pages into a set of zones, each confining a single column of text, before converting the characters into coded form. We present a methodology for automatically assessing the accuracy of such decompositions, and demonstrate its use in evaluating six OCR systems.

Record in standard format (ISO 2709)

See the documentation on the Inist Standard format.

pA  
A01 01  1    @0 0162-8828
A02 01      @0 ITPIDJ
A03   1    @0 IEEE trans. pattern anal. mach. intell.
A05       @2 17
A06       @2 1
A08 01  1  ENG  @1 Automated evaluation of OCR zoning
A11 01  1    @1 KANAI (J.)
A11 02  1    @1 RICE (S. V.)
A11 03  1    @1 NARTKER (T. A.)
A11 04  1    @1 NAGY (G.)
A14 01      @1 UNLV, information sci. res. inst. @2 Las Vegas NV 89154-4021 @3 USA @Z 1 aut. @Z 2 aut. @Z 3 aut.
A20       @1 86-90
A21       @1 1995
A23 01      @0 ENG
A43 01      @1 INIST @2 222T @5 354000058415650110
A44       @0 0000
A45       @0 7 ref.
A47 01  1    @0 95-0128706
A60       @1 P @3 CR
A61       @0 A
A64 01  1    @0 IEEE transactions on pattern analysis and machine intelligence
A66 01      @0 USA
C01 01    ENG  @0 Many current optical character recognition (OCR) systems attempt to decompose printed pages into a set of zones, each confining a single column of text, before converting the characters into coded form. We present a methodology for automatically assessing the accuracy of such decompositions, and demonstrate its use in evaluating six OCR systems
C02 01  X    @0 001D02C03
C03 01  X  FRE  @0 Reconnaissance caractère @5 67
C03 01  X  ENG  @0 Character recognition @5 67
C03 01  X  SPA  @0 Reconocimiento carácter @5 67
C03 02  X  FRE  @0 Système optique @5 68
C03 02  X  ENG  @0 Optical system @5 68
C03 02  X  GER  @0 Optisches System @5 68
C03 02  X  SPA  @0 Sistema óptico @5 68
C03 03  X  FRE  @0 Analyse documentaire @5 69
C03 03  X  ENG  @0 Document analysis @5 69
C03 03  X  SPA  @0 Análisis documental @5 69
C03 04  X  FRE  @0 Segmentation @5 70
C03 04  X  ENG  @0 Segmentation @5 70
C03 04  X  SPA  @0 Segmentación @5 70
C03 05  X  FRE  @0 Evaluation performance @5 71
C03 05  X  ENG  @0 Performance evaluation @5 71
C03 05  X  SPA  @0 Evaluación prestación @5 71
C03 06  X  FRE  @0 Page segmentation @4 INC @5 90
C03 07  X  FRE  @0 Layout analysis @4 INC @5 91
N21       @1 081
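
The flattened record above can be read field by field. The following is a minimal sketch, assuming each line has the shape shown here: a three-character tag, optional indicator tokens, then subfields introduced by `@` plus a one-character code. The function name `parse_field` and this reading of the layout are illustrative assumptions, not part of the Inist documentation. Because column positions are lost in this rendering, the indicators are kept as a plain list rather than being mapped to `i1`, `i2`, or `l`.

```python
import re

# One subfield: "@" + one-character code, whitespace, then the value up to
# the next " @x " marker or the end of the line.
SUBFIELD = re.compile(r"@(\w)\s+(.*?)(?=\s+@\w\s|$)")


def parse_field(line):
    """Split one flattened notice line into (tag, indicators, subfields).

    Hypothetical helper: the layout is inferred from the lines shown above.
    """
    head, sep, tail = line.partition("@")
    tokens = head.split()
    tag, indicators = tokens[0], tokens[1:]
    subfields = [
        (match.group(1), match.group(2).strip())
        for match in SUBFIELD.finditer(sep + tail)
    ]
    return tag, indicators, subfields


affiliation = parse_field(
    "A14 01      @1 UNLV, information sci. res. inst. "
    "@2 Las Vegas NV 89154-4021 @3 USA @Z 1 aut. @Z 2 aut. @Z 3 aut."
)
descriptor = parse_field("C03 06  X  FRE  @0 Page segmentation @4 INC @5 90")
```

Repeated subfield codes (the three `@Z` entries) are why the subfields are returned as a list of pairs rather than a dictionary.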

Inist format (server)

NO : PASCAL 95-0128706 INIST
ET : Automated evaluation of OCR zoning
AU : KANAI (J.); RICE (S. V.); NARTKER (T. A.); NAGY (G.)
AF : UNLV, information sci. res. inst./Las Vegas NV 89154-4021/Etats-Unis (1 aut., 2 aut., 3 aut.)
DT : Publication en série; Correspondance, lettre; Niveau analytique
SO : IEEE transactions on pattern analysis and machine intelligence; ISSN 0162-8828; Coden ITPIDJ; Etats-Unis; Da. 1995; Vol. 17; No. 1; Pp. 86-90; Bibl. 7 ref.
LA : Anglais
EA : Many current optical character recognition (OCR) systems attempt to decompose printed pages into a set of zones, each confining a single column of text, before converting the characters into coded form. We present a methodology for automatically assessing the accuracy of such decompositions, and demonstrate its use in evaluating six OCR systems
CC : 001D02C03
FD : Reconnaissance caractère; Système optique; Analyse documentaire; Segmentation; Evaluation performance; Page segmentation; Layout analysis
ED : Character recognition; Optical system; Document analysis; Segmentation; Performance evaluation
GD : Optisches System
SD : Reconocimiento carácter; Sistema óptico; Análisis documental; Segmentación; Evaluación prestación
LO : INIST-222T.354000058415650110
ID : 95-0128706
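
The server format above is a list of `KEY : value` lines, so it maps naturally onto a dictionary. The sketch below assumes two-letter uppercase keys separated from the value by ` : `, as in the lines shown; `parse_server_record` and `split_values` are hypothetical names introduced for illustration. Multi-valued fields such as `AU` use `;` as a separator in this record, but so does the free-form `SO` field, so splitting is left to the caller.

```python
def parse_server_record(text):
    """Map each two-letter key to its raw string value."""
    record = {}
    for line in text.splitlines():
        key, sep, value = line.partition(" : ")
        # Keep only lines that look like "XX : value".
        if sep and len(key) == 2 and key.isupper():
            record[key] = value.strip()
    return record


def split_values(value):
    """Split a semicolon-separated field into a list of trimmed items."""
    return [part.strip() for part in value.split(";") if part.strip()]


sample = """NO : PASCAL 95-0128706 INIST
ET : Automated evaluation of OCR zoning
AU : KANAI (J.); RICE (S. V.); NARTKER (T. A.); NAGY (G.)
LA : Anglais
ID : 95-0128706"""

record = parse_server_record(sample)
authors = split_values(record["AU"])
```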

Links to Exploration step

Pascal:95-0128706

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Automated evaluation of OCR zoning</title>
<author>
<name sortKey="Kanai, J" sort="Kanai, J" uniqKey="Kanai J" first="J." last="Kanai">J. Kanai</name>
<affiliation>
<inist:fA14 i1="01">
<s1>UNLV, information sci. res. inst.</s1>
<s2>Las Vegas NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Rice, S V" sort="Rice, S V" uniqKey="Rice S" first="S. V." last="Rice">S. V. Rice</name>
<affiliation>
<inist:fA14 i1="01">
<s1>UNLV, information sci. res. inst.</s1>
<s2>Las Vegas NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Nartker, T A" sort="Nartker, T A" uniqKey="Nartker T" first="T. A." last="Nartker">T. A. Nartker</name>
<affiliation>
<inist:fA14 i1="01">
<s1>UNLV, information sci. res. inst.</s1>
<s2>Las Vegas NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Nagy, G" sort="Nagy, G" uniqKey="Nagy G" first="G." last="Nagy">G. Nagy</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">95-0128706</idno>
<date when="1995">1995</date>
<idno type="stanalyst">PASCAL 95-0128706 INIST</idno>
<idno type="RBID">Pascal:95-0128706</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000A75</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Automated evaluation of OCR zoning</title>
<author>
<name sortKey="Kanai, J" sort="Kanai, J" uniqKey="Kanai J" first="J." last="Kanai">J. Kanai</name>
<affiliation>
<inist:fA14 i1="01">
<s1>UNLV, information sci. res. inst.</s1>
<s2>Las Vegas NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Rice, S V" sort="Rice, S V" uniqKey="Rice S" first="S. V." last="Rice">S. V. Rice</name>
<affiliation>
<inist:fA14 i1="01">
<s1>UNLV, information sci. res. inst.</s1>
<s2>Las Vegas NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Nartker, T A" sort="Nartker, T A" uniqKey="Nartker T" first="T. A." last="Nartker">T. A. Nartker</name>
<affiliation>
<inist:fA14 i1="01">
<s1>UNLV, information sci. res. inst.</s1>
<s2>Las Vegas NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Nagy, G" sort="Nagy, G" uniqKey="Nagy G" first="G." last="Nagy">G. Nagy</name>
</author>
</analytic>
<series>
<title level="j" type="main">IEEE transactions on pattern analysis and machine intelligence</title>
<title level="j" type="abbreviated">IEEE trans. pattern anal. mach. intell.</title>
<idno type="ISSN">0162-8828</idno>
<imprint>
<date when="1995">1995</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">IEEE transactions on pattern analysis and machine intelligence</title>
<title level="j" type="abbreviated">IEEE trans. pattern anal. mach. intell.</title>
<idno type="ISSN">0162-8828</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Document analysis</term>
<term>Optical system</term>
<term>Performance evaluation</term>
<term>Segmentation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance caractère</term>
<term>Système optique</term>
<term>Analyse documentaire</term>
<term>Segmentation</term>
<term>Evaluation performance</term>
<term>Page segmentation</term>
<term>Layout analysis</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Many current optical character recognition (OCR) systems attempt to decompose printed pages into a set of zones, each confining a single column of text, before converting the characters into coded form. We present a methodology for automatically assessing the accuracy of such decompositions, and demonstrate its use in evaluating six OCR systems</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0162-8828</s0>
</fA01>
<fA02 i1="01">
<s0>ITPIDJ</s0>
</fA02>
<fA03 i2="1">
<s0>IEEE trans. pattern anal. mach. intell.</s0>
</fA03>
<fA05>
<s2>17</s2>
</fA05>
<fA06>
<s2>1</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>Automated evaluation of OCR zoning</s1>
</fA08>
<fA11 i1="01" i2="1">
<s1>KANAI (J.)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>RICE (S. V.)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>NARTKER (T. A.)</s1>
</fA11>
<fA11 i1="04" i2="1">
<s1>NAGY (G.)</s1>
</fA11>
<fA14 i1="01">
<s1>UNLV, information sci. res. inst.</s1>
<s2>Las Vegas NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA20>
<s1>86-90</s1>
</fA20>
<fA21>
<s1>1995</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA43 i1="01">
<s1>INIST</s1>
<s2>222T</s2>
<s5>354000058415650110</s5>
</fA43>
<fA44>
<s0>0000</s0>
</fA44>
<fA45>
<s0>7 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>95-0128706</s0>
</fA47>
<fA60>
<s1>P</s1>
<s3>CR</s3>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>IEEE transactions on pattern analysis and machine intelligence</s0>
</fA64>
<fA66 i1="01">
<s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Many current optical character recognition (OCR) systems attempt to decompose printed pages into a set of zones, each confining a single column of text, before converting the characters into coded form. We present a methodology for automatically assessing the accuracy of such decompositions, and demonstrate its use in evaluating six OCR systems</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001D02C03</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Reconnaissance caractère</s0>
<s5>67</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Character recognition</s0>
<s5>67</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Reconocimiento carácter</s0>
<s5>67</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Système optique</s0>
<s5>68</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Optical system</s0>
<s5>68</s5>
</fC03>
<fC03 i1="02" i2="X" l="GER">
<s0>Optisches System</s0>
<s5>68</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Sistema óptico</s0>
<s5>68</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Analyse documentaire</s0>
<s5>69</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Document analysis</s0>
<s5>69</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Análisis documental</s0>
<s5>69</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Segmentation</s0>
<s5>70</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Segmentation</s0>
<s5>70</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Segmentación</s0>
<s5>70</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Evaluation performance</s0>
<s5>71</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Performance evaluation</s0>
<s5>71</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Evaluación prestación</s0>
<s5>71</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Page segmentation</s0>
<s4>INC</s4>
<s5>90</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Layout analysis</s0>
<s4>INC</s4>
<s5>91</s5>
</fC03>
<fN21>
<s1>081</s1>
</fN21>
</pA>
</standard>
<server>
<NO>PASCAL 95-0128706 INIST</NO>
<ET>Automated evaluation of OCR zoning</ET>
<AU>KANAI (J.); RICE (S. V.); NARTKER (T. A.); NAGY (G.)</AU>
<AF>UNLV, information sci. res. inst./Las Vegas NV 89154-4021/Etats-Unis (1 aut., 2 aut., 3 aut.)</AF>
<DT>Publication en série; Correspondance, lettre; Niveau analytique</DT>
<SO>IEEE transactions on pattern analysis and machine intelligence; ISSN 0162-8828; Coden ITPIDJ; Etats-Unis; Da. 1995; Vol. 17; No. 1; Pp. 86-90; Bibl. 7 ref.</SO>
<LA>Anglais</LA>
<EA>Many current optical character recognition (OCR) systems attempt to decompose printed pages into a set of zones, each confining a single column of text, before converting the characters into coded form. We present a methodology for automatically assessing the accuracy of such decompositions, and demonstrate its use in evaluating six OCR systems</EA>
<CC>001D02C03</CC>
<FD>Reconnaissance caractère; Système optique; Analyse documentaire; Segmentation; Evaluation performance; Page segmentation; Layout analysis</FD>
<ED>Character recognition; Optical system; Document analysis; Segmentation; Performance evaluation</ED>
<GD>Optisches System</GD>
<SD>Reconocimiento carácter; Sistema óptico; Análisis documental; Segmentación; Evaluación prestación</SD>
<LO>INIST-222T.354000058415650110</LO>
<ID>95-0128706</ID>
</server>
</inist>
</record>
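
One practical point when reading the record above with a namespace-aware parser: the `inist:` prefix on `inist:fA14` is not declared anywhere in the record as shown, and Python's `xml.etree.ElementTree` rejects undeclared prefixes. The sketch below works around this by injecting a declaration into the root tag before parsing. The URI `urn:example:inist` is a placeholder invented for this example, not an official Inist namespace, and the sample is a shortened excerpt of the record.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URI (an assumption made for this sketch only).
PLACEHOLDER_NS = "urn:example:inist"


def parse_record(xml_text):
    """Bind the undeclared inist: prefix on the root tag, then parse."""
    patched = xml_text.replace(
        "<record>", f'<record xmlns:inist="{PLACEHOLDER_NS}">', 1
    )
    return ET.fromstring(patched)


sample = """<record>
<TEI><teiHeader><fileDesc><titleStmt>
<title xml:lang="en" level="a">Automated evaluation of OCR zoning</title>
<author><name first="J." last="Kanai">J. Kanai</name>
<affiliation><inist:fA14 i1="01"><s1>UNLV, information sci. res. inst.</s1></inist:fA14></affiliation>
</author>
</titleStmt></fileDesc></teiHeader></TEI>
<inist><server><ID>95-0128706</ID></server></inist>
</record>"""

root = parse_record(sample)
title = root.findtext(".//titleStmt/title")
last_names = [name.get("last") for name in root.iter("name")]
# Prefixed elements are addressed with the {namespace}tag form.
affiliation = root.findtext(f".//{{{PLACEHOLDER_NS}}}fA14/s1")
record_id = root.findtext("inist/server/ID")
```

The unprefixed `<inist>` element is unaffected by the injected declaration; only the prefixed `inist:fA14` elements move into the placeholder namespace.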

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000A75 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000A75 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:95-0128706
   |texte=   Automated evaluation of OCR zoning
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024