Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Optical recognition of handwritten Chinese characters by hierarchical radical matching method

Identifieur interne : 001C88 ( Main/Merge ); précédent : 001C87; suivant : 001C89

Optical recognition of handwritten Chinese characters by hierarchical radical matching method

Auteurs : A. B. Wang [Taïwan] ; K. C. Fan

Source :

RBID : Pascal:01-0013821

Descripteurs français

English descriptors

Abstract

In this paper, a radical-based OCR system for the recognition of handwritten Chinese characters is proposed. In our approach, a recursive hierarchical scheme is developed to perform radical extraction first. Character features and radical features are then extracted for matching. Last, a hierarchical radical matching scheme is devised to identify the radicals embedded in an input Chinese character and recognize the input character accordingly. Experiments for radical extraction are conducted on 1856 characters. The successful rate of radical extraction is 92.5%. The average time for radical extraction is 0.65 second per character. Experiments for matching process are conducted on two sets: training set and testing set, each set includes 900 characters. The overall recognition rate in our experiments is 98.2 and 80.9% (for training set and testing set, respectively). The average recognition time of our hierarchical radical matching scheme is 0.274 s. There are totally 4716 radicals in 1800 characters. In average, one character consists of 2.62 radicals. Each character will match 7.28 radical templates in average. Thus, each radical will match 2.77 radical temples. The experimental results reveal that our proposed method is feasible, flexible, and effective.

Links toward previous steps (curation, corpus...)


Links to Exploration step

Pascal:01-0013821

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Optical recognition of handwritten Chinese characters by hierarchical radical matching method</title>
<author>
<name sortKey="Wang, A B" sort="Wang, A B" uniqKey="Wang A" first="A. B." last="Wang">A. B. Wang</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Private Takming Junior Coll of Commerce</s1>
<s3>TWN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Taïwan</country>
<wicri:noRegion>Private Takming Junior Coll of Commerce</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fan, K C" sort="Fan, K C" uniqKey="Fan K" first="K. C." last="Fan">K. C. Fan</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">01-0013821</idno>
<date when="2001">2001</date>
<idno type="stanalyst">PASCAL 01-0013821 EI</idno>
<idno type="RBID">Pascal:01-0013821</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000756</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000037</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000661</idno>
<idno type="wicri:doubleKey">0031-3203:2001:Wang A:optical:recognition:of</idno>
<idno type="wicri:Area/Main/Merge">001C88</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Optical recognition of handwritten Chinese characters by hierarchical radical matching method</title>
<author>
<name sortKey="Wang, A B" sort="Wang, A B" uniqKey="Wang A" first="A. B." last="Wang">A. B. Wang</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Private Takming Junior Coll of Commerce</s1>
<s3>TWN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Taïwan</country>
<wicri:noRegion>Private Takming Junior Coll of Commerce</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fan, K C" sort="Fan, K C" uniqKey="Fan K" first="K. C." last="Fan">K. C. Fan</name>
</author>
</analytic>
<series>
<title level="j" type="main">Pattern Recognition</title>
<title level="j" type="abbreviated">Pattern Recognit</title>
<idno type="ISSN">0031-3203</idno>
<imprint>
<date when="2001">2001</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Pattern Recognition</title>
<title level="j" type="abbreviated">Pattern Recognit</title>
<idno type="ISSN">0031-3203</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character sets</term>
<term>Hierarchical radical matching methods</term>
<term>Optical character recognition</term>
<term>Pattern matching</term>
<term>Theory</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Théorie</term>
<term>Concordance forme</term>
<term>Jeu caractère</term>
<term>Reconnaissance optique caractère</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper, a radical-based OCR system for the recognition of handwritten Chinese characters is proposed. In our approach, a recursive hierarchical scheme is developed to perform radical extraction first. Character features and radical features are then extracted for matching. Last, a hierarchical radical matching scheme is devised to identify the radicals embedded in an input Chinese character and recognize the input character accordingly. Experiments for radical extraction are conducted on 1856 characters. The successful rate of radical extraction is 92.5%. The average time for radical extraction is 0.65 second per character. Experiments for matching process are conducted on two sets: training set and testing set, each set includes 900 characters. The overall recognition rate in our experiments is 98.2 and 80.9% (for training set and testing set, respectively). The average recognition time of our hierarchical radical matching scheme is 0.274 s. There are totally 4716 radicals in 1800 characters. In average, one character consists of 2.62 radicals. Each character will match 7.28 radical templates in average. Thus, each radical will match 2.77 radical temples. The experimental results reveal that our proposed method is feasible, flexible, and effective.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Taïwan</li>
</country>
</list>
<tree>
<noCountry>
<name sortKey="Fan, K C" sort="Fan, K C" uniqKey="Fan K" first="K. C." last="Fan">K. C. Fan</name>
</noCountry>
<country name="Taïwan">
<noRegion>
<name sortKey="Wang, A B" sort="Wang, A B" uniqKey="Wang A" first="A. B." last="Wang">A. B. Wang</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001C88 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001C88 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     Pascal:01-0013821
   |texte=   Optical recognition of handwritten Chinese characters by hierarchical radical  matching method
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024