Identification of font styles and typefaces in printed Korean documents
Identifieur interne : 001867 ( Main/Merge ); précédent : 001866; suivant : 001868Identification of font styles and typefaces in printed Korean documents
Auteurs : C. B. Jeong [Corée du Sud] ; H. K. Kwag [Corée du Sud] ; S. H. Kim [Corée du Sud] ; J. S. Kim [Corée du Sud] ; S. C. Park [Corée du Sud]Source :
- Lecture notes in computer science [ 0302-9743 ] ; 2003.
Descripteurs français
- Pascal (Inist)
English descriptors
- KwdEn :
Abstract
This paper proposes a system for the extraction of typographical attributes, such as font style and typeface, that can be used to improve the performance of OCR and keyword spotting technologies on printed Korean document images. Three typographical features have been devised and experimented with 7,200 Korean word images. The individual accuracies for font style identification and typeface identification are 97.2% and 99.1%, respectively.
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000555
- to stream PascalFrancis, to step Curation: 000235
- to stream PascalFrancis, to step Checkpoint: 000553
Links to Exploration step
Pascal:04-0194678Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Identification of font styles and typefaces in printed Korean documents</title>
<author><name sortKey="Jeong, C B" sort="Jeong, C B" uniqKey="Jeong C" first="C. B." last="Jeong">C. B. Jeong</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Kwag, H K" sort="Kwag, H K" uniqKey="Kwag H" first="H. K." last="Kwag">H. K. Kwag</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Kim, S H" sort="Kim, S H" uniqKey="Kim S" first="S. H." last="Kim">S. H. Kim</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Kim, J S" sort="Kim, J S" uniqKey="Kim J" first="J. S." last="Kim">J. S. Kim</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Park, S C" sort="Park, S C" uniqKey="Park S" first="S. C." last="Park">S. C. Park</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">04-0194678</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 04-0194678 INIST</idno>
<idno type="RBID">Pascal:04-0194678</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000555</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000235</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000553</idno>
<idno type="wicri:doubleKey">0302-9743:2003:Jeong C:identification:of:font</idno>
<idno type="wicri:Area/Main/Merge">001867</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Identification of font styles and typefaces in printed Korean documents</title>
<author><name sortKey="Jeong, C B" sort="Jeong, C B" uniqKey="Jeong C" first="C. B." last="Jeong">C. B. Jeong</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Kwag, H K" sort="Kwag, H K" uniqKey="Kwag H" first="H. K." last="Kwag">H. K. Kwag</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Kim, S H" sort="Kim, S H" uniqKey="Kim S" first="S. H." last="Kim">S. H. Kim</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Kim, J S" sort="Kim, J S" uniqKey="Kim J" first="J. S." last="Kim">J. S. Kim</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Park, S C" sort="Park, S C" uniqKey="Park S" first="S. C." last="Park">S. C. Park</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Computer Science, Chonnam National University, 300 YongBong-dong</s1>
<s2>BukGu, Gwangju 500-757</s2>
<s3>KOR</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Corée du Sud</country>
<wicri:noRegion>BukGu, Gwangju 500-757</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
<imprint><date when="2003">2003</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Character recognition</term>
<term>Keyword</term>
<term>Korean</term>
<term>Optical character recognition</term>
<term>Printed character</term>
<term>Printed document</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Caractère imprimé</term>
<term>Document imprimé</term>
<term>Coréen</term>
<term>Mot clé</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This paper proposes a system for the extraction of typographical attributes, such as font style and typeface, that can be used to improve the performance of OCR and keyword spotting technologies on printed Korean document images. Three typographical features have been devised and experimented with 7,200 Korean word images. The individual accuracies for font style identification and typeface identification are 97.2% and 99.1%, respectively.</div>
</front>
</TEI>
<affiliations><list><country><li>Corée du Sud</li>
</country>
</list>
<tree><country name="Corée du Sud"><noRegion><name sortKey="Jeong, C B" sort="Jeong, C B" uniqKey="Jeong C" first="C. B." last="Jeong">C. B. Jeong</name>
</noRegion>
<name sortKey="Kim, J S" sort="Kim, J S" uniqKey="Kim J" first="J. S." last="Kim">J. S. Kim</name>
<name sortKey="Kim, S H" sort="Kim, S H" uniqKey="Kim S" first="S. H." last="Kim">S. H. Kim</name>
<name sortKey="Kwag, H K" sort="Kwag, H K" uniqKey="Kwag H" first="H. K." last="Kwag">H. K. Kwag</name>
<name sortKey="Park, S C" sort="Park, S C" uniqKey="Park S" first="S. C." last="Park">S. C. Park</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001867 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001867 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Merge |type= RBID |clé= Pascal:04-0194678 |texte= Identification of font styles and typefaces in printed Korean documents }}
This area was generated with Dilib version V0.6.32. |