Automatic reading of the white pages in a telephone directory
Identifieur interne : 002748 ( Main/Merge ); précédent : 002747; suivant : 002749Automatic reading of the white pages in a telephone directory
Auteurs : J. Hu [Australie] ; H. Yan [Australie]Source :
- Optical engineering [ 0091-3286 ] ; 1996-11.
Descripteurs français
- Pascal (Inist)
English descriptors
- KwdEn :
Abstract
An optical character recognition (OCR) system for the reading of the white pages of Sydney′s telephone directories is described. The system is used to process each scanned page automatically. First, column segmentation, special symbol segmentation, text line segmentation, and character segmentation are performed. Second, a new structural method is developed to recognize each segmented character based on its skeleton decomposition and coding. Third, a postprocessing module is used to verify the confusing letters based on the text layout information and the contextual information. Experiments with test pages show an average recognition rate of 99.5% with a reliability rate of 99.85%. © 1996 Society of Photo-Optical Instrumentation Engineers.
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000921
- to stream PascalFrancis, to step Curation: 000A33
- to stream PascalFrancis, to step Checkpoint: 000915
Links to Exploration step
Pascal:97-0111568Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Automatic reading of the white pages in a telephone directory</title>
<author><name sortKey="Hu, J" sort="Hu, J" uniqKey="Hu J" first="J." last="Hu">J. Hu</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Yan, H" sort="Yan, H" uniqKey="Yan H" first="H." last="Yan">H. Yan</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">97-0111568</idno>
<date when="1996-11">1996-11</date>
<idno type="stanalyst">PASCAL 97-0111568 AIP</idno>
<idno type="RBID">Pascal:97-0111568</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000921</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000A33</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000915</idno>
<idno type="wicri:doubleKey">0091-3286:1996:Hu J:automatic:reading:of</idno>
<idno type="wicri:Area/Main/Merge">002748</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Automatic reading of the white pages in a telephone directory</title>
<author><name sortKey="Hu, J" sort="Hu, J" uniqKey="Hu J" first="J." last="Hu">J. Hu</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Yan, H" sort="Yan, H" uniqKey="Yan H" first="H." last="Yan">H. Yan</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</s1>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country xml:lang="fr">Australie</country>
<wicri:regionArea>University of Sydney Department of Electrical Engineering New South Wales 2006</wicri:regionArea>
<wicri:noRegion>University of Sydney Department of Electrical Engineering New South Wales 2006, Australia</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Optical engineering</title>
<title level="j" type="abbreviated">Opt. eng.</title>
<idno type="ISSN">0091-3286</idno>
<imprint><date when="1996-11">1996-11</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Optical engineering</title>
<title level="j" type="abbreviated">Opt. eng.</title>
<idno type="ISSN">0091-3286</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Measuring methods</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>4230S</term>
<term>Méthode mesure</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">An optical character recognition (OCR) system for the reading of the white pages of Sydney′s telephone directories is described. The system is used to process each scanned page automatically. First, column segmentation, special symbol segmentation, text line segmentation, and character segmentation are performed. Second, a new structural method is developed to recognize each segmented character based on its skeleton decomposition and coding. Third, a postprocessing module is used to verify the confusing letters based on the text layout information and the contextual information. Experiments with test pages show an average recognition rate of 99.5% with a reliability rate of 99.85%. © 1996 Society of Photo-Optical Instrumentation Engineers.</div>
</front>
</TEI>
<affiliations><list><country><li>Australie</li>
</country>
</list>
<tree><country name="Australie"><noRegion><name sortKey="Hu, J" sort="Hu, J" uniqKey="Hu J" first="J." last="Hu">J. Hu</name>
</noRegion>
<name sortKey="Yan, H" sort="Yan, H" uniqKey="Yan H" first="H." last="Yan">H. Yan</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002748 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 002748 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Merge |type= RBID |clé= Pascal:97-0111568 |texte= Automatic reading of the white pages in a telephone directory }}
This area was generated with Dilib version V0.6.32. |