Coarse classification of Chinese characters via stroke clustering method
Identifieur interne : 000467 ( Istex/Curation ); précédent : 000466; suivant : 000468Coarse classification of Chinese characters via stroke clustering method
Auteurs : Chin-Chuan Han [République populaire de Chine] ; Yao-Lung Tseng [République populaire de Chine] ; Kuo-Chin Fan [République populaire de Chine, Taïwan] ; An-Bang Wang [République populaire de Chine]Source :
- Pattern Recognition Letters [ 0167-8655 ] ; 1995.
Abstract
In this paper, we propose a stroke clustering-based coarse classification mechanism to classify the multi-fonts Chinese characters. The main purpose of the proposed method is to identify the associating type of an input character together with the extraction of its embedded composing components. In this paper, the K-mean clustering algorithm is employed to cluster the thinned strokes. Besides, mis-clustered stroke modification techniques are developed to rearrange the mis-clustered strokes generated by the K-mean algorithm. Five kinds of fonts for 2500 frequently used Chinese characters are tested in our experiments. The average classification rate is 92.57% which is very promising for coarse classification.
Url:
DOI: 10.1016/0167-8655(95)00054-K
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000474
Links to Exploration step
ISTEX:1810EFB0B5D295E915FC26E7B2C76832A476B30ELe document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Coarse classification of Chinese characters via stroke clustering method</title>
<author><name sortKey="Han, Chin Chuan" sort="Han, Chin Chuan" uniqKey="Han C" first="Chin-Chuan" last="Han">Chin-Chuan Han</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Tseng, Yao Lung" sort="Tseng, Yao Lung" uniqKey="Tseng Y" first="Yao-Lung" last="Tseng">Yao-Lung Tseng</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><mods:affiliation>E-mail: kcfan@ncuee.ncu.edu.tw</mods:affiliation>
<country wicri:rule="url">Taïwan</country>
</affiliation>
</author>
<author><name sortKey="Wang, An Bang" sort="Wang, An Bang" uniqKey="Wang A" first="An-Bang" last="Wang">An-Bang Wang</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:1810EFB0B5D295E915FC26E7B2C76832A476B30E</idno>
<date when="1995" year="1995">1995</date>
<idno type="doi">10.1016/0167-8655(95)00054-K</idno>
<idno type="url">https://api.istex.fr/document/1810EFB0B5D295E915FC26E7B2C76832A476B30E/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000474</idno>
<idno type="wicri:Area/Istex/Curation">000467</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Coarse classification of Chinese characters via stroke clustering method</title>
<author><name sortKey="Han, Chin Chuan" sort="Han, Chin Chuan" uniqKey="Han C" first="Chin-Chuan" last="Han">Chin-Chuan Han</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Tseng, Yao Lung" sort="Tseng, Yao Lung" uniqKey="Tseng Y" first="Yao-Lung" last="Tseng">Yao-Lung Tseng</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><mods:affiliation>E-mail: kcfan@ncuee.ncu.edu.tw</mods:affiliation>
<country wicri:rule="url">Taïwan</country>
</affiliation>
</author>
<author><name sortKey="Wang, An Bang" sort="Wang, An Bang" uniqKey="Wang A" first="An-Bang" last="Wang">An-Bang Wang</name>
<affiliation wicri:level="1"><mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan, R.O.C.</mods:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li 32054, Taiwan</wicri:regionArea>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Pattern Recognition Letters</title>
<title level="j" type="abbrev">PATREC</title>
<idno type="ISSN">0167-8655</idno>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="1995">1995</date>
<biblScope unit="volume">16</biblScope>
<biblScope unit="issue">10</biblScope>
<biblScope unit="page" from="1079">1079</biblScope>
<biblScope unit="page" to="1089">1089</biblScope>
</imprint>
<idno type="ISSN">0167-8655</idno>
</series>
<idno type="istex">1810EFB0B5D295E915FC26E7B2C76832A476B30E</idno>
<idno type="DOI">10.1016/0167-8655(95)00054-K</idno>
<idno type="PII">0167-8655(95)00054-K</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0167-8655</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">In this paper, we propose a stroke clustering-based coarse classification mechanism to classify the multi-fonts Chinese characters. The main purpose of the proposed method is to identify the associating type of an input character together with the extraction of its embedded composing components. In this paper, the K-mean clustering algorithm is employed to cluster the thinned strokes. Besides, mis-clustered stroke modification techniques are developed to rearrange the mis-clustered strokes generated by the K-mean algorithm. Five kinds of fonts for 2500 frequently used Chinese characters are tested in our experiments. The average classification rate is 92.57% which is very promising for coarse classification.</div>
</front>
</TEI>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000467 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Istex/Curation/biblio.hfd -nk 000467 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Istex |étape= Curation |type= RBID |clé= ISTEX:1810EFB0B5D295E915FC26E7B2C76832A476B30E |texte= Coarse classification of Chinese characters via stroke clustering method }}
![]() | This area was generated with Dilib version V0.6.32. | ![]() |