Coarse classification of Chinese characters via stroke clustering method
Internal identifier: 001D58 (Istex/Checkpoint); previous: 001D57; next: 001D59
Authors: Chin-Chuan Han [People's Republic of China]; Yao-Lung Tseng [People's Republic of China]; Kuo-Chin Fan [People's Republic of China, Taiwan]; An-Bang Wang [People's Republic of China]
Source:
- Pattern Recognition Letters [0167-8655]; 1995.
Abstract
In this paper, we propose a stroke clustering-based coarse classification mechanism for multi-font Chinese characters. The main purpose of the proposed method is to identify the associating type of an input character and to extract its constituent components. The K-means clustering algorithm is employed to cluster the thinned strokes. In addition, mis-clustered stroke modification techniques are developed to reassign the strokes mis-clustered by the K-means algorithm. Five fonts of 2500 frequently used Chinese characters are tested in our experiments. The average classification rate is 92.57%, which is very promising for coarse classification.
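The clustering step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the stroke features, the number of clusters, or the mis-clustered stroke corrections, so the midpoint feature, the choice k=2, the deterministic initialization, and the sample strokes below are all assumptions made for illustration.

```python
from math import dist

def kmeans(points, k, iterations=20):
    """Plain K-means on 2-D points; returns (labels, centers)."""
    # Assumption: deterministic start using the first k points as centers.
    centers = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest center.
        labels = [min(range(k), key=lambda c: dist(p, centers[c]))
                  for p in points]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append((sum(x for x, _ in members) / len(members),
                                    sum(y for _, y in members) / len(members)))
            else:
                new_centers.append(centers[c])  # empty cluster keeps its center
        if new_centers == centers:
            break  # converged: centers no longer move
        centers = new_centers
    return labels, centers

def stroke_midpoints(strokes):
    """Hypothetical feature: represent each thinned stroke by its midpoint."""
    return [((x0 + x1) / 2, (y0 + y1) / 2) for (x0, y0), (x1, y1) in strokes]

# Hypothetical strokes (start, end) of a character with a left and a right part.
strokes = [
    ((1, 1), (1, 9)),    # left part
    ((8, 2), (12, 2)),   # right part
    ((0, 5), (3, 5)),    # left part
    ((10, 1), (10, 9)),  # right part
    ((8, 8), (12, 8)),   # right part
]
labels, centers = kmeans(stroke_midpoints(strokes), k=2)
print(labels)   # [0, 1, 0, 1, 1]
print(centers)  # [(1.25, 5.0), (10.0, 5.0)]
```

In this toy case the two clusters separate the left-hand strokes from the right-hand ones. A real system would also need the post-clustering correction that the abstract mentions, which is not sketched here.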
DOI: 10.1016/0167-8655(95)00054-K
ISTEX:1810EFB0B5D295E915FC26E7B2C76832A476B30E
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Coarse classification of Chinese characters via stroke clustering method</title>
<author><name sortKey="Han, Chin Chuan" sort="Han, Chin Chuan" uniqKey="Han C" first="Chin-Chuan" last="Han">Chin-Chuan Han</name>
</author>
<author><name sortKey="Tseng, Yao Lung" sort="Tseng, Yao Lung" uniqKey="Tseng Y" first="Yao-Lung" last="Tseng">Yao-Lung Tseng</name>
</author>
<author><name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
</author>
<author><name sortKey="Wang, An Bang" sort="Wang, An Bang" uniqKey="Wang A" first="An-Bang" last="Wang">An-Bang Wang</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:1810EFB0B5D295E915FC26E7B2C76832A476B30E</idno>
<date when="1995" year="1995">1995</date>
<idno type="doi">10.1016/0167-8655(95)00054-K</idno>
<idno type="url">https://api.istex.fr/document/1810EFB0B5D295E915FC26E7B2C76832A476B30E/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000474</idno>
<idno type="wicri:Area/Istex/Curation">000467</idno>
<idno type="wicri:Area/Istex/Checkpoint">001D58</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Coarse classification of Chinese characters via stroke clustering method</title>
<author><name sortKey="Han, Chin Chuan" sort="Han, Chin Chuan" uniqKey="Han C" first="Chin-Chuan" last="Han">Chin-Chuan Han</name>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Tseng, Yao Lung" sort="Tseng, Yao Lung" uniqKey="Tseng Y" first="Yao-Lung" last="Tseng">Yao-Lung Tseng</name>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Taïwan</country>
</affiliation>
</author>
<author><name sortKey="Wang, An Bang" sort="Wang, An Bang" uniqKey="Wang A" first="An-Bang" last="Wang">An-Bang Wang</name>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Pattern Recognition Letters</title>
<title level="j" type="abbrev">PATREC</title>
<idno type="ISSN">0167-8655</idno>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="1995">1995</date>
<biblScope unit="volume">16</biblScope>
<biblScope unit="issue">10</biblScope>
<biblScope unit="page" from="1079">1079</biblScope>
<biblScope unit="page" to="1089">1089</biblScope>
</imprint>
<idno type="ISSN">0167-8655</idno>
</series>
<idno type="istex">1810EFB0B5D295E915FC26E7B2C76832A476B30E</idno>
<idno type="DOI">10.1016/0167-8655(95)00054-K</idno>
<idno type="PII">0167-8655(95)00054-K</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0167-8655</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">In this paper, we propose a stroke clustering-based coarse classification mechanism for multi-font Chinese characters. The main purpose of the proposed method is to identify the associating type of an input character and to extract its constituent components. The K-means clustering algorithm is employed to cluster the thinned strokes. In addition, mis-clustered stroke modification techniques are developed to reassign the strokes mis-clustered by the K-means algorithm. Five fonts of 2500 frequently used Chinese characters are tested in our experiments. The average classification rate is 92.57%, which is very promising for coarse classification.</div>
</front>
</TEI>
<affiliations><list><country><li>République populaire de Chine</li>
<li>Taïwan</li>
</country>
</list>
<tree><country name="République populaire de Chine"><noRegion><name sortKey="Han, Chin Chuan" sort="Han, Chin Chuan" uniqKey="Han C" first="Chin-Chuan" last="Han">Chin-Chuan Han</name>
</noRegion>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<name sortKey="Tseng, Yao Lung" sort="Tseng, Yao Lung" uniqKey="Tseng Y" first="Yao-Lung" last="Tseng">Yao-Lung Tseng</name>
<name sortKey="Wang, An Bang" sort="Wang, An Bang" uniqKey="Wang A" first="An-Bang" last="Wang">An-Bang Wang</name>
</country>
<country name="Taïwan"><noRegion><name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001D58 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Istex/Checkpoint/biblio.hfd -nk 001D58 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Istex |étape= Checkpoint |type= RBID |clé= ISTEX:1810EFB0B5D295E915FC26E7B2C76832A476B30E |texte= Coarse classification of Chinese characters via stroke clustering method }}
This area was generated with Dilib version V0.6.32.