Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Peripheral and global features for use in coarse classification of Chinese characters

Identifieur interne : 001729 ( Istex/Corpus ); précédent : 001728; suivant : 001730

Peripheral and global features for use in coarse classification of Chinese characters

Auteurs : Kuo-Sen Chou ; Kuo-Chin Fan ; Tzu-I Fan

Source :

RBID : ISTEX:9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA

Abstract

In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.

Url:
DOI: 10.1016/S0031-3203(96)00090-8

Links to Exploration step

ISTEX:9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Peripheral and global features for use in coarse classification of Chinese characters</title>
<author>
<name sortKey="Chou, Kuo Sen" sort="Chou, Kuo Sen" uniqKey="Chou K" first="Kuo-Sen" last="Chou">Kuo-Sen Chou</name>
<affiliation>
<mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation>
<mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Fan, Tzu I" sort="Fan, Tzu I" uniqKey="Fan T" first="Tzu-I" last="Fan">Tzu-I Fan</name>
<affiliation>
<mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA</idno>
<date when="1997" year="1997">1997</date>
<idno type="doi">10.1016/S0031-3203(96)00090-8</idno>
<idno type="url">https://api.istex.fr/document/9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001729</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Peripheral and global features for use in coarse classification of Chinese characters</title>
<author>
<name sortKey="Chou, Kuo Sen" sort="Chou, Kuo Sen" uniqKey="Chou K" first="Kuo-Sen" last="Chou">Kuo-Sen Chou</name>
<affiliation>
<mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation>
<mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Fan, Tzu I" sort="Fan, Tzu I" uniqKey="Fan T" first="Tzu-I" last="Fan">Tzu-I Fan</name>
<affiliation>
<mods:affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Pattern Recognition</title>
<title level="j" type="abbrev">PR</title>
<idno type="ISSN">0031-3203</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1996">1996</date>
<biblScope unit="volume">30</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="483">483</biblScope>
<biblScope unit="page" to="489">489</biblScope>
</imprint>
<idno type="ISSN">0031-3203</idno>
</series>
<idno type="istex">9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA</idno>
<idno type="DOI">10.1016/S0031-3203(96)00090-8</idno>
<idno type="PII">S0031-3203(96)00090-8</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0031-3203</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.</div>
</front>
</TEI>
<istex>
<corpusName>elsevier</corpusName>
<author>
<json:item>
<name>Kuo-Sen Chou</name>
<affiliations>
<json:string>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</json:string>
</affiliations>
</json:item>
<json:item>
<name>Kuo-Chin Fan</name>
<affiliations>
<json:string>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</json:string>
</affiliations>
</json:item>
<json:item>
<name>Tzu-I Fan</name>
<affiliations>
<json:string>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</json:string>
</affiliations>
</json:item>
</author>
<subject>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Chinese character recognition</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Coarse classification</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Peripheral feature</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Global feature</value>
</json:item>
</subject>
<language>
<json:string>eng</json:string>
</language>
<abstract>In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.</abstract>
<qualityIndicators>
<score>5.799</score>
<pdfVersion>1.2</pdfVersion>
<pdfPageSize>576 x 821 pts</pdfPageSize>
<refBibsNative>true</refBibsNative>
<keywordCount>4</keywordCount>
<abstractCharCount>929</abstractCharCount>
<pdfWordCount>4155</pdfWordCount>
<pdfCharCount>24367</pdfCharCount>
<pdfPageCount>7</pdfPageCount>
<abstractWordCount>137</abstractWordCount>
</qualityIndicators>
<title>Peripheral and global features for use in coarse classification of Chinese characters</title>
<pii>
<json:string>S0031-3203(96)00090-8</json:string>
</pii>
<genre>
<json:string>research-article</json:string>
</genre>
<serie>
<volume>00</volume>
<pages>
<last>61</last>
<first>58</first>
</pages>
<genre></genre>
<language>
<json:string>unknown</json:string>
</language>
<title>Int. Conf. Comput. Process. Chinese Oriental Languages</title>
</serie>
<host>
<volume>30</volume>
<pii>
<json:string>S0031-3203(00)X0026-X</json:string>
</pii>
<pages>
<last>489</last>
<first>483</first>
</pages>
<issn>
<json:string>0031-3203</json:string>
</issn>
<issue>3</issue>
<genre>
<json:string>Journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<title>Pattern Recognition</title>
<publicationDate>1997</publicationDate>
</host>
<categories>
<wos>
<json:string>COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE</json:string>
<json:string>ENGINEERING, ELECTRICAL & ELECTRONIC</json:string>
</wos>
</categories>
<publicationDate>1996</publicationDate>
<copyrightDate>1997</copyrightDate>
<doi>
<json:string>10.1016/S0031-3203(96)00090-8</json:string>
</doi>
<id>9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA/fulltext/pdf</uri>
</json:item>
<json:item>
<original>true</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA/fulltext/txt</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a">Peripheral and global features for use in coarse classification of Chinese characters</title>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>ELSEVIER</publisher>
<availability>
<p>ELSEVIER</p>
</availability>
<date>1997</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a">Peripheral and global features for use in coarse classification of Chinese characters</title>
<author>
<persName>
<forename type="first">Kuo-Sen</forename>
<surname>Chou</surname>
</persName>
<affiliation>Author to whom correspondence should be addressed.</affiliation>
<affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</affiliation>
</author>
<author>
<persName>
<forename type="first">Kuo-Chin</forename>
<surname>Fan</surname>
</persName>
<affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</affiliation>
</author>
<author>
<persName>
<forename type="first">Tzu-I</forename>
<surname>Fan</surname>
</persName>
<affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</affiliation>
</author>
</analytic>
<monogr>
<title level="j">Pattern Recognition</title>
<title level="j" type="abbrev">PR</title>
<idno type="pISSN">0031-3203</idno>
<idno type="PII">S0031-3203(00)X0026-X</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1996"></date>
<biblScope unit="volume">30</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="483">483</biblScope>
<biblScope unit="page" to="489">489</biblScope>
</imprint>
</monogr>
<idno type="istex">9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA</idno>
<idno type="DOI">10.1016/S0031-3203(96)00090-8</idno>
<idno type="PII">S0031-3203(96)00090-8</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>1997</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.</p>
</abstract>
<textClass>
<keywords scheme="keyword">
<list>
<head>Keywords</head>
<item>
<term>Chinese character recognition</term>
</item>
<item>
<term>Coarse classification</term>
</item>
<item>
<term>Peripheral feature</term>
</item>
<item>
<term>Global feature</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="1996-06-25">Registration</change>
<change when="1996-05-31">Modified</change>
<change when="1996">Published</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Elsevier, elements deleted: tail">
<istex:xmlDeclaration>version="1.0" encoding="utf-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//ES//DTD journal article DTD version 4.5.2//EN//XML" URI="art452.dtd" name="istex:docType"></istex:docType>
<istex:document>
<converted-article version="4.5.2" docsubtype="fla">
<item-info>
<jid>PR</jid>
<aid>96000908</aid>
<ce:pii>S0031-3203(96)00090-8</ce:pii>
<ce:doi>10.1016/S0031-3203(96)00090-8</ce:doi>
<ce:copyright type="unknown" year="1997"></ce:copyright>
</item-info>
<head>
<ce:title>Peripheral and global features for use in coarse classification of Chinese characters</ce:title>
<ce:author-group>
<ce:author>
<ce:given-name>Kuo-Sen</ce:given-name>
<ce:surname>Chou</ce:surname>
<ce:cross-ref refid="COR1">
<ce:sup></ce:sup>
</ce:cross-ref>
<ce:cross-ref refid="AFF1">
<ce:sup></ce:sup>
</ce:cross-ref>
<ce:cross-ref refid="AFF2">
<ce:sup></ce:sup>
</ce:cross-ref>
</ce:author>
<ce:author>
<ce:given-name>Kuo-Chin</ce:given-name>
<ce:surname>Fan</ce:surname>
<ce:cross-ref refid="AFF1">
<ce:sup></ce:sup>
</ce:cross-ref>
</ce:author>
<ce:author>
<ce:given-name>Tzu-I</ce:given-name>
<ce:surname>Fan</ce:surname>
<ce:cross-ref refid="AFF1">
<ce:sup></ce:sup>
</ce:cross-ref>
</ce:author>
<ce:affiliation id="AFF1">
<ce:label>a</ce:label>
<ce:textfn>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</ce:textfn>
</ce:affiliation>
<ce:affiliation id="AFF2">
<ce:label>b</ce:label>
<ce:textfn>Telecommunication Laboratories, MOTC, 12 Lane 551 Min-Tsu Rd, Section 3, Yang-Mei, Taoyuan 326, Taiwan, R.O.C.</ce:textfn>
</ce:affiliation>
<ce:correspondence id="COR1">
<ce:label></ce:label>
<ce:text>Author to whom correspondence should be addressed.</ce:text>
</ce:correspondence>
</ce:author-group>
<ce:date-received day="3" month="6" year="1995"></ce:date-received>
<ce:date-revised day="31" month="5" year="1996"></ce:date-revised>
<ce:date-accepted day="25" month="6" year="1996"></ce:date-accepted>
<ce:abstract>
<ce:section-title>Abstract</ce:section-title>
<ce:abstract-sec>
<ce:simple-para>In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.</ce:simple-para>
</ce:abstract-sec>
</ce:abstract>
<ce:keywords>
<ce:section-title>Keywords</ce:section-title>
<ce:keyword>
<ce:text>Chinese character recognition</ce:text>
</ce:keyword>
<ce:keyword>
<ce:text>Coarse classification</ce:text>
</ce:keyword>
<ce:keyword>
<ce:text>Peripheral feature</ce:text>
</ce:keyword>
<ce:keyword>
<ce:text>Global feature</ce:text>
</ce:keyword>
</ce:keywords>
</head>
</converted-article>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo>
<title>Peripheral and global features for use in coarse classification of Chinese characters</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA">
<title>Peripheral and global features for use in coarse classification of Chinese characters</title>
</titleInfo>
<name type="personal">
<namePart type="given">Kuo-Sen</namePart>
<namePart type="family">Chou</namePart>
<affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</affiliation>
<description>Author to whom correspondence should be addressed.</description>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kuo-Chin</namePart>
<namePart type="family">Fan</namePart>
<affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tzu-I</namePart>
<namePart type="family">Fan</namePart>
<affiliation>Institute of Computer Science and Information Engineering, National Central University, Taiwan, R.O.C.</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="Full-length article"></genre>
<originInfo>
<publisher>ELSEVIER</publisher>
<dateIssued encoding="w3cdtf">1996</dateIssued>
<dateValid encoding="w3cdtf">1996-06-25</dateValid>
<dateModified encoding="w3cdtf">1996-05-31</dateModified>
<copyrightDate encoding="w3cdtf">1997</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.</abstract>
<subject>
<genre>Keywords</genre>
<topic>Chinese character recognition</topic>
<topic>Coarse classification</topic>
<topic>Peripheral feature</topic>
<topic>Global feature</topic>
</subject>
<relatedItem type="host">
<titleInfo>
<title>Pattern Recognition</title>
</titleInfo>
<titleInfo type="abbreviated">
<title>PR</title>
</titleInfo>
<genre type="Journal">journal</genre>
<originInfo>
<dateIssued encoding="w3cdtf">199703</dateIssued>
</originInfo>
<identifier type="ISSN">0031-3203</identifier>
<identifier type="PII">S0031-3203(00)X0026-X</identifier>
<part>
<date>199703</date>
<detail type="volume">
<number>30</number>
<caption>vol.</caption>
</detail>
<detail type="issue">
<number>3</number>
<caption>no.</caption>
</detail>
<extent unit="issue pages">
<start>353</start>
<end>535</end>
</extent>
<extent unit="pages">
<start>483</start>
<end>489</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA</identifier>
<identifier type="DOI">10.1016/S0031-3203(96)00090-8</identifier>
<identifier type="PII">S0031-3203(96)00090-8</identifier>
<recordInfo>
<recordContentSource>ELSEVIER</recordContentSource>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:catWosTEI uri="https://api.istex.fr/document/9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA/enrichments/catWos">
<teiHeader>
<profileDesc>
<textClass>
<classCode scheme="WOS">COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE</classCode>
<classCode scheme="WOS">ENGINEERING, ELECTRICAL & ELECTRONIC</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
</enrichments>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001729 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 001729 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA
   |texte=   Peripheral and global features for use in coarse classification of Chinese characters
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024