OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Identification of font styles and typefaces in printed Korean documents

Internal identifier: 001867 (Main/Merge); previous: 001866; next: 001868

Authors: C. B. Jeong [South Korea]; H. K. Kwag [South Korea]; S. H. Kim [South Korea]; J. S. Kim [South Korea]; S. C. Park [South Korea]

Source: Lecture notes in computer science [0302-9743]; 2003

RBID : Pascal:04-0194678

Abstract

This paper proposes a system for the extraction of typographical attributes, such as font style and typeface, that can be used to improve the performance of OCR and keyword spotting technologies on printed Korean document images. Three typographical features have been devised and experimented with 7,200 Korean word images. The individual accuracies for font style identification and typeface identification are 97.2% and 99.1%, respectively.

Links to Exploration step

Pascal:04-0194678

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Identification of font styles and typefaces in printed Korean documents</title>
<author>
<name sortKey="Jeong, C B" sort="Jeong, C B" uniqKey="Jeong C" first="C. B." last="Jeong">C. B. Jeong</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kwag, H K" sort="Kwag, H K" uniqKey="Kwag H" first="H. K." last="Kwag">H. K. Kwag</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kim, S H" sort="Kim, S H" uniqKey="Kim S" first="S. H." last="Kim">S. H. Kim</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kim, J S" sort="Kim, J S" uniqKey="Kim J" first="J. S." last="Kim">J. S. Kim</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Park, S C" sort="Park, S C" uniqKey="Park S" first="S. C." last="Park">S. C. Park</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">04-0194678</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 04-0194678 INIST</idno>
<idno type="RBID">Pascal:04-0194678</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000555</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000235</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000553</idno>
<idno type="wicri:doubleKey">0302-9743:2003:Jeong C:identification:of:font</idno>
<idno type="wicri:Area/Main/Merge">001867</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Identification of font styles and typefaces in printed Korean documents</title>
<author>
<name sortKey="Jeong, C B" sort="Jeong, C B" uniqKey="Jeong C" first="C. B." last="Jeong">C. B. Jeong</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kwag, H K" sort="Kwag, H K" uniqKey="Kwag H" first="H. K." last="Kwag">H. K. Kwag</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kim, S H" sort="Kim, S H" uniqKey="Kim S" first="S. H." last="Kim">S. H. Kim</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kim, J S" sort="Kim, J S" uniqKey="Kim J" first="J. S." last="Kim">J. S. Kim</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Park, S C" sort="Park, S C" uniqKey="Park S" first="S. C." last="Park">S. C. Park</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
<imprint>
<date when="2003">2003</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Keyword</term>
<term>Korean</term>
<term>Optical character recognition</term>
<term>Printed character</term>
<term>Printed document</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Caractère imprimé</term>
<term>Document imprimé</term>
<term>Coréen</term>
<term>Mot clé</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper proposes a system for the extraction of typographical attributes, such as font style and typeface, that can be used to improve the performance of OCR and keyword spotting technologies on printed Korean document images. Three typographical features have been devised and experimented with 7,200 Korean word images. The individual accuracies for font style identification and typeface identification are 97.2% and 99.1%, respectively.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Corée du Sud</li>
</country>
</list>
<tree>
<country name="Corée du Sud">
<noRegion>
<name sortKey="Jeong, C B" sort="Jeong, C B" uniqKey="Jeong C" first="C. B." last="Jeong">C. B. Jeong</name>
</noRegion>
<name sortKey="Kim, J S" sort="Kim, J S" uniqKey="Kim J" first="J. S." last="Kim">J. S. Kim</name>
<name sortKey="Kim, S H" sort="Kim, S H" uniqKey="Kim S" first="S. H." last="Kim">S. H. Kim</name>
<name sortKey="Kwag, H K" sort="Kwag, H K" uniqKey="Kwag H" first="H. K." last="Kwag">H. K. Kwag</name>
<name sortKey="Park, S C" sort="Park, S C" uniqKey="Park S" first="S. C." last="Park">S. C. Park</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001867 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001867 | SxmlIndent | more
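
The record printed by these commands can be post-processed with any XML tool. As a minimal sketch, assuming Python's standard `xml.etree.ElementTree` rather than a Dilib utility, the following groups the indexing terms by scheme. The names `terms_by_scheme` and `FRAGMENT` are illustrative and not part of Dilib. The fragment is the `<textClass>` element copied from the record above; the full record uses `wicri:` and `inist:` prefixes whose namespace declarations do not appear on this page, so a namespace-aware parser would probably reject it unless those prefixes are declared or stripped first.

```python
import xml.etree.ElementTree as ET

# <textClass> element copied from the record above (illustrative constant).
FRAGMENT = """
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Keyword</term>
<term>Korean</term>
<term>Optical character recognition</term>
<term>Printed character</term>
<term>Printed document</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Caractère imprimé</term>
<term>Document imprimé</term>
<term>Coréen</term>
<term>Mot clé</term>
</keywords>
</textClass>
"""


def terms_by_scheme(xml_text):
    """Map each <keywords scheme="..."> value to its list of <term> texts."""
    root = ET.fromstring(xml_text.strip())
    return {
        keywords.get("scheme"): [term.text for term in keywords.findall("term")]
        for keywords in root.iter("keywords")
    }


if __name__ == "__main__":
    for scheme, terms in terms_by_scheme(FRAGMENT).items():
        print(scheme, terms)
```

Calling `terms_by_scheme(FRAGMENT)` returns a dictionary with the keys `KwdEn` and `Pascal`, each mapped to its six terms in document order.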

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     Pascal:04-0194678
   |texte=   Identification of font styles and typefaces in printed Korean documents
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024