Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Recognition of handwritten Chinese characters via short line segments

Identifieur interne : 000623 ( Istex/Corpus ); précédent : 000622; suivant : 000624

Recognition of handwritten Chinese characters via short line segments

Auteurs : Hsi-Jian Lee ; Bin Chen

Source :

RBID : ISTEX:7EB1553EBE0C463E68131E0D8AFE470AC6E5D890

Abstract

A system to recognize handwritten Chinese characters is presented. In the first stage, short line segments are extracted, where a new efficient algorithm is proposed, based on accumulated chain codes, for line approximation. In the feature extraction stage, features of each character are computed, which are further defined by the features of its composing line segments. In the matching stage, dynamic programming is first used to calculate the similarity between a segment of the input character and a segment of the reference character, and then the similarity between the two characters is computed. In order to reduce the number of candidates, a coarse classification algorithm is proposed. The recognition rate for 150 characters is about 90%, excluding the error in coarse classification.

Url:
DOI: 10.1016/0031-3203(92)90052-K

Links to Exploration step

ISTEX:7EB1553EBE0C463E68131E0D8AFE470AC6E5D890

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Recognition of handwritten Chinese characters via short line segments</title>
<author>
<name sortKey="Lee, Hsi Jian" sort="Lee, Hsi Jian" uniqKey="Lee H" first="Hsi-Jian" last="Lee">Hsi-Jian Lee</name>
<affiliation>
<mods:affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Chen, Bin" sort="Chen, Bin" uniqKey="Chen B" first="Bin" last="Chen">Bin Chen</name>
<affiliation>
<mods:affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:7EB1553EBE0C463E68131E0D8AFE470AC6E5D890</idno>
<date when="1992" year="1992">1992</date>
<idno type="doi">10.1016/0031-3203(92)90052-K</idno>
<idno type="url">https://api.istex.fr/document/7EB1553EBE0C463E68131E0D8AFE470AC6E5D890/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000623</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Recognition of handwritten Chinese characters via short line segments</title>
<author>
<name sortKey="Lee, Hsi Jian" sort="Lee, Hsi Jian" uniqKey="Lee H" first="Hsi-Jian" last="Lee">Hsi-Jian Lee</name>
<affiliation>
<mods:affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Chen, Bin" sort="Chen, Bin" uniqKey="Chen B" first="Bin" last="Chen">Bin Chen</name>
<affiliation>
<mods:affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Pattern Recognition</title>
<title level="j" type="abbrev">PR</title>
<idno type="ISSN">0031-3203</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1991">1991</date>
<biblScope unit="volume">25</biblScope>
<biblScope unit="issue">5</biblScope>
<biblScope unit="page" from="543">543</biblScope>
<biblScope unit="page" to="552">552</biblScope>
</imprint>
<idno type="ISSN">0031-3203</idno>
</series>
<idno type="istex">7EB1553EBE0C463E68131E0D8AFE470AC6E5D890</idno>
<idno type="DOI">10.1016/0031-3203(92)90052-K</idno>
<idno type="PII">0031-3203(92)90052-K</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0031-3203</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">A system to recognize handwritten Chinese characters is presented. In the first stage, short line segments are extracted, where a new efficient algorithm is proposed, based on accumulated chain codes, for line approximation. In the feature extraction stage, features of each character are computed, which are further defined by the features of its composing line segments. In the matching stage, dynamic programming is first used to calculate the similarity between a segment of the input character and a segment of the reference character, and then the similarity between the two characters is computed. In order to reduce the number of candidates, a coarse classification algorithm is proposed. The recognition rate for 150 characters is about 90%, excluding the error in coarse classification.</div>
</front>
</TEI>
<istex>
<corpusName>elsevier</corpusName>
<author>
<json:item>
<name>Hsi-Jian Lee</name>
<affiliations>
<json:string>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bin Chen</name>
<affiliations>
<json:string>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</json:string>
</affiliations>
</json:item>
</author>
<subject>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Structural matching</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Dynamic programming</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Thinning</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Line approximation</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Optical character recognition</value>
</json:item>
</subject>
<language>
<json:string>eng</json:string>
</language>
<abstract>A system to recognize handwritten Chinese characters is presented. In the first stage, short line segments are extracted, where a new efficient algorithm is proposed, based on accumulated chain codes, for line approximation. In the feature extraction stage, features of each character are computed, which are further defined by the features of its composing line segments. In the matching stage, dynamic programming is first used to calculate the similarity between a segment of the input character and a segment of the reference character, and then the similarity between the two characters is computed. In order to reduce the number of candidates, a coarse classification algorithm is proposed. The recognition rate for 150 characters is about 90%, excluding the error in coarse classification.</abstract>
<qualityIndicators>
<score>6.464</score>
<pdfVersion>1.2</pdfVersion>
<pdfPageSize>548 x 792 pts</pdfPageSize>
<refBibsNative>true</refBibsNative>
<keywordCount>5</keywordCount>
<abstractCharCount>796</abstractCharCount>
<pdfWordCount>5682</pdfWordCount>
<pdfCharCount>31138</pdfCharCount>
<pdfPageCount>10</pdfPageCount>
<abstractWordCount>122</abstractWordCount>
</qualityIndicators>
<title>Recognition of handwritten Chinese characters via short line segments</title>
<pii>
<json:string>0031-3203(92)90052-K</json:string>
</pii>
<genre>
<json:string>research-article</json:string>
</genre>
<serie>
<pages>
<last>586</last>
<first>569</first>
</pages>
<genre></genre>
<language>
<json:string>unknown</json:string>
</language>
<title>Handbook of Pattern Recognition and Image Processing</title>
</serie>
<host>
<volume>25</volume>
<pii>
<json:string>S0031-3203(00)X0212-9</json:string>
</pii>
<pages>
<last>552</last>
<first>543</first>
</pages>
<issn>
<json:string>0031-3203</json:string>
</issn>
<issue>5</issue>
<genre>
<json:string>Journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<title>Pattern Recognition</title>
<publicationDate>1992</publicationDate>
</host>
<categories>
<wos>
<json:string>COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE</json:string>
<json:string>ENGINEERING, ELECTRICAL & ELECTRONIC</json:string>
</wos>
</categories>
<publicationDate>1991</publicationDate>
<copyrightDate>1992</copyrightDate>
<doi>
<json:string>10.1016/0031-3203(92)90052-K</json:string>
</doi>
<id>7EB1553EBE0C463E68131E0D8AFE470AC6E5D890</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/7EB1553EBE0C463E68131E0D8AFE470AC6E5D890/fulltext/pdf</uri>
</json:item>
<json:item>
<original>true</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/7EB1553EBE0C463E68131E0D8AFE470AC6E5D890/fulltext/txt</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/7EB1553EBE0C463E68131E0D8AFE470AC6E5D890/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/7EB1553EBE0C463E68131E0D8AFE470AC6E5D890/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a">Recognition of handwritten Chinese characters via short line segments</title>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>ELSEVIER</publisher>
<availability>
<p>ELSEVIER</p>
</availability>
<date>1992</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a">Recognition of handwritten Chinese characters via short line segments</title>
<author>
<persName>
<forename type="first">Hsi-Jian</forename>
<surname>Lee</surname>
</persName>
<affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</affiliation>
</author>
<author>
<persName>
<forename type="first">Bin</forename>
<surname>Chen</surname>
</persName>
<affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</affiliation>
</author>
</analytic>
<monogr>
<title level="j">Pattern Recognition</title>
<title level="j" type="abbrev">PR</title>
<idno type="pISSN">0031-3203</idno>
<idno type="PII">S0031-3203(00)X0212-9</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1991"></date>
<biblScope unit="volume">25</biblScope>
<biblScope unit="issue">5</biblScope>
<biblScope unit="page" from="543">543</biblScope>
<biblScope unit="page" to="552">552</biblScope>
</imprint>
</monogr>
<idno type="istex">7EB1553EBE0C463E68131E0D8AFE470AC6E5D890</idno>
<idno type="DOI">10.1016/0031-3203(92)90052-K</idno>
<idno type="PII">0031-3203(92)90052-K</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>1992</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>A system to recognize handwritten Chinese characters is presented. In the first stage, short line segments are extracted, where a new efficient algorithm is proposed, based on accumulated chain codes, for line approximation. In the feature extraction stage, features of each character are computed, which are further defined by the features of its composing line segments. In the matching stage, dynamic programming is first used to calculate the similarity between a segment of the input character and a segment of the reference character, and then the similarity between the two characters is computed. In order to reduce the number of candidates, a coarse classification algorithm is proposed. The recognition rate for 150 characters is about 90%, excluding the error in coarse classification.</p>
</abstract>
<textClass>
<keywords scheme="keyword">
<list>
<head>Keywords</head>
<item>
<term>Structural matching</term>
</item>
<item>
<term>Dynamic programming</term>
</item>
<item>
<term>Thinning</term>
</item>
<item>
<term>Line approximation</term>
</item>
<item>
<term>Optical character recognition</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="1991-09-10">Registration</change>
<change when="1991-07-15">Modified</change>
<change when="1991">Published</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Elsevier, elements deleted: tail">
<istex:xmlDeclaration>version="1.0" encoding="utf-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//ES//DTD journal article DTD version 4.5.2//EN//XML" URI="art452.dtd" name="istex:docType"></istex:docType>
<istex:document>
<converted-article version="4.5.2" docsubtype="fla">
<item-info>
<jid>PR</jid>
<aid>9290052K</aid>
<ce:pii>0031-3203(92)90052-K</ce:pii>
<ce:doi>10.1016/0031-3203(92)90052-K</ce:doi>
<ce:copyright type="unknown" year="1992"></ce:copyright>
</item-info>
<head>
<ce:title>Recognition of handwritten Chinese characters via short line segments</ce:title>
<ce:author-group>
<ce:author>
<ce:given-name>Hsi-Jian</ce:given-name>
<ce:surname>Lee</ce:surname>
</ce:author>
<ce:author>
<ce:given-name>Bin</ce:given-name>
<ce:surname>Chen</ce:surname>
</ce:author>
<ce:affiliation>
<ce:textfn>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</ce:textfn>
</ce:affiliation>
</ce:author-group>
<ce:date-received day="10" month="1" year="1991"></ce:date-received>
<ce:date-revised day="15" month="7" year="1991"></ce:date-revised>
<ce:date-accepted day="10" month="9" year="1991"></ce:date-accepted>
<ce:abstract>
<ce:section-title>Abstract</ce:section-title>
<ce:abstract-sec>
<ce:simple-para>A system to recognize handwritten Chinese characters is presented. In the first stage, short line segments are extracted, where a new efficient algorithm is proposed, based on accumulated chain codes, for line approximation. In the feature extraction stage, features of each character are computed, which are further defined by the features of its composing line segments. In the matching stage, dynamic programming is first used to calculate the similarity between a segment of the input character and a segment of the reference character, and then the similarity between the two characters is computed. In order to reduce the number of candidates, a coarse classification algorithm is proposed. The recognition rate for 150 characters is about 90%, excluding the error in coarse classification.</ce:simple-para>
</ce:abstract-sec>
</ce:abstract>
<ce:keywords>
<ce:section-title>Keywords</ce:section-title>
<ce:keyword>
<ce:text>Structural matching</ce:text>
</ce:keyword>
<ce:keyword>
<ce:text>Dynamic programming</ce:text>
</ce:keyword>
<ce:keyword>
<ce:text>Thinning</ce:text>
</ce:keyword>
<ce:keyword>
<ce:text>Line approximation</ce:text>
</ce:keyword>
<ce:keyword>
<ce:text>Optical character recognition</ce:text>
</ce:keyword>
</ce:keywords>
</head>
</converted-article>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo>
<title>Recognition of handwritten Chinese characters via short line segments</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA">
<title>Recognition of handwritten Chinese characters via short line segments</title>
</titleInfo>
<name type="personal">
<namePart type="given">Hsi-Jian</namePart>
<namePart type="family">Lee</namePart>
<affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bin</namePart>
<namePart type="family">Chen</namePart>
<affiliation>Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, Republic of China</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="Full-length article"></genre>
<originInfo>
<publisher>ELSEVIER</publisher>
<dateIssued encoding="w3cdtf">1991</dateIssued>
<dateValid encoding="w3cdtf">1991-09-10</dateValid>
<dateModified encoding="w3cdtf">1991-07-15</dateModified>
<copyrightDate encoding="w3cdtf">1992</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">A system to recognize handwritten Chinese characters is presented. In the first stage, short line segments are extracted, where a new efficient algorithm is proposed, based on accumulated chain codes, for line approximation. In the feature extraction stage, features of each character are computed, which are further defined by the features of its composing line segments. In the matching stage, dynamic programming is first used to calculate the similarity between a segment of the input character and a segment of the reference character, and then the similarity between the two characters is computed. In order to reduce the number of candidates, a coarse classification algorithm is proposed. The recognition rate for 150 characters is about 90%, excluding the error in coarse classification.</abstract>
<subject>
<genre>Keywords</genre>
<topic>Structural matching</topic>
<topic>Dynamic programming</topic>
<topic>Thinning</topic>
<topic>Line approximation</topic>
<topic>Optical character recognition</topic>
</subject>
<relatedItem type="host">
<titleInfo>
<title>Pattern Recognition</title>
</titleInfo>
<titleInfo type="abbreviated">
<title>PR</title>
</titleInfo>
<genre type="Journal">journal</genre>
<originInfo>
<dateIssued encoding="w3cdtf">199205</dateIssued>
</originInfo>
<identifier type="ISSN">0031-3203</identifier>
<identifier type="PII">S0031-3203(00)X0212-9</identifier>
<part>
<date>199205</date>
<detail type="volume">
<number>25</number>
<caption>vol.</caption>
</detail>
<detail type="issue">
<number>5</number>
<caption>no.</caption>
</detail>
<extent unit="issue pages">
<start>445</start>
<end>553</end>
</extent>
<extent unit="pages">
<start>543</start>
<end>552</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">7EB1553EBE0C463E68131E0D8AFE470AC6E5D890</identifier>
<identifier type="DOI">10.1016/0031-3203(92)90052-K</identifier>
<identifier type="PII">0031-3203(92)90052-K</identifier>
<recordInfo>
<recordContentSource>ELSEVIER</recordContentSource>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:catWosTEI uri="https://api.istex.fr/document/7EB1553EBE0C463E68131E0D8AFE470AC6E5D890/enrichments/catWos">
<teiHeader>
<profileDesc>
<textClass>
<classCode scheme="WOS">COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE</classCode>
<classCode scheme="WOS">ENGINEERING, ELECTRICAL & ELECTRONIC</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
</enrichments>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000623 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000623 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:7EB1553EBE0C463E68131E0D8AFE470AC6E5D890
   |texte=   Recognition of handwritten Chinese characters via short line segments
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024