Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Automatic feature design for optical character recognition using an evolutionary search procedure.

Identifieur interne : 000086 ( PubMed/Checkpoint ); précédent : 000085; suivant : 000087

Automatic feature design for optical character recognition using an evolutionary search procedure.

Auteurs : F W Stentiford [Royaume-Uni]

Source :

RBID : pubmed:21869271

Abstract

An automatic evolutionary search is applied to the problem of feature extraction in an OCR application. A performance measure based on feature independence is used to generate features which do not appear to suffer from peaking effects [17]. Features are extracted from a training set of 30 600 machine printed 34 class alphanumeric characters derived from British mail. Classification results on the training set and a test set of 10 200 characters are reported for an increasing number of features. A 1.01 percent forced decision error rate is obtained on the test data using 316 features. The hardware implementation should be cheap and fast to operate. The performance compares favorably with current low cost OCR page readers.

PubMed: 21869271


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

pubmed:21869271

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Automatic feature design for optical character recognition using an evolutionary search procedure.</title>
<author>
<name sortKey="Stentiford, F W" sort="Stentiford, F W" uniqKey="Stentiford F" first="F W" last="Stentiford">F W Stentiford</name>
<affiliation wicri:level="2">
<nlm:affiliation>British Telecom Research Laboratories. Ipswich, Suffolk, England.</nlm:affiliation>
<country>Royaume-Uni</country>
<placeName>
<region type="country">Angleterre</region>
</placeName>
<wicri:cityArea>British Telecom Research Laboratories. Ipswich, Suffolk</wicri:cityArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="1985">1985</date>
<idno type="RBID">pubmed:21869271</idno>
<idno type="pmid">21869271</idno>
<idno type="wicri:Area/PubMed/Corpus">000091</idno>
<idno type="wicri:Area/PubMed/Curation">000091</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000091</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Automatic feature design for optical character recognition using an evolutionary search procedure.</title>
<author>
<name sortKey="Stentiford, F W" sort="Stentiford, F W" uniqKey="Stentiford F" first="F W" last="Stentiford">F W Stentiford</name>
<affiliation wicri:level="2">
<nlm:affiliation>British Telecom Research Laboratories. Ipswich, Suffolk, England.</nlm:affiliation>
<country>Royaume-Uni</country>
<placeName>
<region type="country">Angleterre</region>
</placeName>
<wicri:cityArea>British Telecom Research Laboratories. Ipswich, Suffolk</wicri:cityArea>
</affiliation>
</author>
</analytic>
<series>
<title level="j">IEEE transactions on pattern analysis and machine intelligence</title>
<idno type="ISSN">0162-8828</idno>
<imprint>
<date when="1985" type="published">1985</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">An automatic evolutionary search is applied to the problem of feature extraction in an OCR application. A performance measure based on feature independence is used to generate features which do not appear to suffer from peaking effects [17]. Features are extracted from a training set of 30 600 machine printed 34 class alphanumeric characters derived from British mail. Classification results on the training set and a test set of 10 200 characters are reported for an increasing number of features. A 1.01 percent forced decision error rate is obtained on the test data using 316 features. The hardware implementation should be cheap and fast to operate. The performance compares favorably with current low cost OCR page readers.</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Owner="NLM" Status="PubMed-not-MEDLINE">
<PMID Version="1">21869271</PMID>
<DateCreated>
<Year>2011</Year>
<Month>08</Month>
<Day>26</Day>
</DateCreated>
<DateCompleted>
<Year>2012</Year>
<Month>10</Month>
<Day>02</Day>
</DateCompleted>
<Article PubModel="Print">
<Journal>
<ISSN IssnType="Print">0162-8828</ISSN>
<JournalIssue CitedMedium="Print">
<Volume>7</Volume>
<Issue>3</Issue>
<PubDate>
<Year>1985</Year>
<Month>Mar</Month>
</PubDate>
</JournalIssue>
<Title>IEEE transactions on pattern analysis and machine intelligence</Title>
<ISOAbbreviation>IEEE Trans Pattern Anal Mach Intell</ISOAbbreviation>
</Journal>
<ArticleTitle>Automatic feature design for optical character recognition using an evolutionary search procedure.</ArticleTitle>
<Pagination>
<MedlinePgn>349-55</MedlinePgn>
</Pagination>
<Abstract>
<AbstractText>An automatic evolutionary search is applied to the problem of feature extraction in an OCR application. A performance measure based on feature independence is used to generate features which do not appear to suffer from peaking effects [17]. Features are extracted from a training set of 30 600 machine printed 34 class alphanumeric characters derived from British mail. Classification results on the training set and a test set of 10 200 characters are reported for an increasing number of features. A 1.01 percent forced decision error rate is obtained on the test data using 316 features. The hardware implementation should be cheap and fast to operate. The performance compares favorably with current low cost OCR page readers.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Stentiford</LastName>
<ForeName>F W</ForeName>
<Initials>FW</Initials>
<AffiliationInfo>
<Affiliation>British Telecom Research Laboratories. Ipswich, Suffolk, England.</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList>
<PublicationType UI="D016428">Journal Article</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo>
<Country>United States</Country>
<MedlineTA>IEEE Trans Pattern Anal Mach Intell</MedlineTA>
<NlmUniqueID>9885960</NlmUniqueID>
<ISSNLinking>0098-5589</ISSNLinking>
</MedlineJournalInfo>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="entrez">
<Year>2011</Year>
<Month>8</Month>
<Day>27</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed">
<Year>1985</Year>
<Month>3</Month>
<Day>1</Day>
<Hour>0</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>1985</Year>
<Month>3</Month>
<Day>1</Day>
<Hour>0</Hour>
<Minute>1</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pubmed">21869271</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
<affiliations>
<list>
<country>
<li>Royaume-Uni</li>
</country>
<region>
<li>Angleterre</li>
</region>
</list>
<tree>
<country name="Royaume-Uni">
<region name="Angleterre">
<name sortKey="Stentiford, F W" sort="Stentiford, F W" uniqKey="Stentiford F" first="F W" last="Stentiford">F W Stentiford</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PubMed/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000086 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Checkpoint/biblio.hfd -nk 000086 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PubMed
   |étape=   Checkpoint
   |type=    RBID
   |clé=     pubmed:21869271
   |texte=   Automatic feature design for optical character recognition using an evolutionary search procedure.
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Checkpoint/RBID.i   -Sk "pubmed:21869271" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Checkpoint/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024