Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Automatic reading of the white pages in a telephone directory

Identifieur interne : 002748 ( Main/Merge ); précédent : 002747; suivant : 002749

Automatic reading of the white pages in a telephone directory

Auteurs : J. Hu [Australie] ; H. Yan [Australie]

Source :

RBID : Pascal:97-0111568

Descripteurs français

English descriptors

Abstract

An optical character recognition (OCR) system for the reading of the white pages of Sydney′s telephone directories is described. The system is used to process each scanned page automatically. First, column segmentation, special symbol segmentation, text line segmentation, and character segmentation are performed. Second, a new structural method is developed to recognize each segmented character based on its skeleton decomposition and coding. Third, a postprocessing module is used to verify the confusing letters based on the text layout information and the contextual information. Experiments with test pages show an average recognition rate of 99.5% with a reliability rate of 99.85%. © 1996 Society of Photo-Optical Instrumentation Engineers.

Links toward previous steps (curation, corpus...)


Links to Exploration step

Pascal:97-0111568

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Automatic reading of the white pages in a telephone directory</title>
<author>
<name sortKey="Hu, J" sort="Hu, J" uniqKey="Hu J" first="J." last="Hu">J. Hu</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Yan, H" sort="Yan, H" uniqKey="Yan H" first="H." last="Yan">H. Yan</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">97-0111568</idno>
<date when="1996-11">1996-11</date>
<idno type="stanalyst">PASCAL 97-0111568 AIP</idno>
<idno type="RBID">Pascal:97-0111568</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000921</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000A33</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000915</idno>
<idno type="wicri:doubleKey">0091-3286:1996:Hu J:automatic:reading:of</idno>
<idno type="wicri:Area/Main/Merge">002748</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Automatic reading of the white pages in a telephone directory</title>
<author>
<name sortKey="Hu, J" sort="Hu, J" uniqKey="Hu J" first="J." last="Hu">J. Hu</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Yan, H" sort="Yan, H" uniqKey="Yan H" first="H." last="Yan">H. Yan</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Optical engineering</title>
<title level="j" type="abbreviated">Opt. eng.</title>
<idno type="ISSN">0091-3286</idno>
<imprint>
<date when="1996-11">1996-11</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Optical engineering</title>
<title level="j" type="abbreviated">Opt. eng.</title>
<idno type="ISSN">0091-3286</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Measuring methods</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>4230S</term>
<term>Méthode mesure</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">An optical character recognition (OCR) system for the reading of the white pages of Sydney′s telephone directories is described. The system is used to process each scanned page automatically. First, column segmentation, special symbol segmentation, text line segmentation, and character segmentation are performed. Second, a new structural method is developed to recognize each segmented character based on its skeleton decomposition and coding. Third, a postprocessing module is used to verify the confusing letters based on the text layout information and the contextual information. Experiments with test pages show an average recognition rate of 99.5% with a reliability rate of 99.85%. © 1996 Society of Photo-Optical Instrumentation Engineers.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Australie</li>
</country>
</list>
<tree>
<country name="Australie">
<noRegion>
<name sortKey="Hu, J" sort="Hu, J" uniqKey="Hu J" first="J." last="Hu">J. Hu</name>
</noRegion>
<name sortKey="Yan, H" sort="Yan, H" uniqKey="Yan H" first="H." last="Yan">H. Yan</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002748 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 002748 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     Pascal:97-0111568
   |texte=   Automatic reading of the white pages in a telephone directory
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024