OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Machine recognition of printed Kannada text

Internal identifier: 000607 (PascalFrancis/Checkpoint); previous: 000606; next: 000608

Authors: B. Vijay Kumar [India]; A. G. Ramakrishnan [India]

Source: Lecture notes in computer science (ISSN 0302-9743), vol. 2423, 2002, pp. 37-48

RBID : Pascal:03-0248638

French descriptors

English descriptors

Abstract

This paper presents the design of a full-fledged OCR system for printed Kannada text. The machine recognition of Kannada characters is difficult due to similarity in the shapes of different characters, script complexity and non-uniqueness in the representation of diacritics. The document image is subjected to line segmentation, word segmentation and zone detection. From the zonal information, base characters, vowel modifiers and consonant conjuncts are separated. A knowledge-based approach is employed for recognising the base characters. Various features are employed for recognising the characters. These include the coefficients of the Discrete Cosine Transform, Discrete Wavelet Transform and Karhunen-Loève Transform. These features are fed to different classifiers. Structural features are used in the subsequent levels to discriminate confused characters. The use of structural features increases the recognition rate from 93% to 98%. Apart from the classical pattern classification technique of nearest neighbour, Artificial Neural Network (ANN) based classifiers like Back Propagation and Radial Basis Function (RBF) Networks have also been studied. The ANN classifiers are trained in supervised mode using the transform features. The highest recognition rate of 99% is obtained with RBF using second-level approximation coefficients of Haar wavelets as the features on presegmented base characters.
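
To make the feature pipeline in the abstract more concrete, here is a minimal sketch of second-level Haar approximation coefficients fed to a nearest-neighbour classifier. It is not the authors' implementation: the 8x8 toy glyphs, the labels `left_bar` and `top_bar`, and all function names are assumptions made for illustration, and the nearest-neighbour rule stands in for the RBF network that the abstract credits with the 99% result.

```python
def haar_approx(image):
    """One level of the 2-D Haar transform, keeping only the approximation band.

    Each output value is the sum of a 2x2 block divided by 2, i.e. the
    orthonormal Haar low-pass filter applied along rows and then columns.
    """
    rows, cols = len(image), len(image[0])
    return [
        [
            (image[r][c] + image[r][c + 1]
             + image[r + 1][c] + image[r + 1][c + 1]) / 2.0
            for c in range(0, cols - 1, 2)
        ]
        for r in range(0, rows - 1, 2)
    ]


def haar_features(image, levels=2):
    """Flatten the approximation coefficients after `levels` decompositions."""
    band = image
    for _ in range(levels):
        band = haar_approx(band)
    return [value for row in band for value in row]


def nearest_neighbour(features, prototypes):
    """Return the label of the prototype closest in squared Euclidean distance."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(prototypes, key=lambda item: sq_dist(features, item[1]))[0]


# Hypothetical 8x8 binary glyphs standing in for presegmented base characters.
LEFT_BAR = [[1 if c < 2 else 0 for c in range(8)] for r in range(8)]
TOP_BAR = [[1 if r < 2 else 0 for c in range(8)] for r in range(8)]

prototypes = [
    ("left_bar", haar_features(LEFT_BAR)),
    ("top_bar", haar_features(TOP_BAR)),
]

# A left bar with one stray pixel, to mimic scanning noise.
noisy = [row[:] for row in LEFT_BAR]
noisy[7][7] = 1

print(nearest_neighbour(haar_features(noisy), prototypes))
```

Two decomposition levels reduce each 8x8 glyph to four coefficients, which is why a single stray pixel barely moves the feature vector. Real character images would be larger and would need size normalisation first, a step this sketch leaves out.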


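Affiliations: Department of Electrical Engineering, Indian Institute of Science, Bangalore 560012, India (both authors)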



The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Machine recognition of printed Kannada text</title>
<author>
<name sortKey="Vijay Kumar, B" sort="Vijay Kumar, B" uniqKey="Vijay Kumar B" first="B." last="Vijay Kumar">B. Vijay Kumar</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering, Indian Institute of Science</s1>
<s2>Bangalore 560012</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Bangalore 560012</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Ramakrishnan, A G" sort="Ramakrishnan, A G" uniqKey="Ramakrishnan A" first="A. G." last="Ramakrishnan">A. G. Ramakrishnan</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering, Indian Institute of Science</s1>
<s2>Bangalore 560012</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Bangalore 560012</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">03-0248638</idno>
<date when="2002">2002</date>
<idno type="stanalyst">PASCAL 03-0248638 INIST</idno>
<idno type="RBID">Pascal:03-0248638</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000621</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000170</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000607</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Machine recognition of printed Kannada text</title>
<author>
<name sortKey="Vijay Kumar, B" sort="Vijay Kumar, B" uniqKey="Vijay Kumar B" first="B." last="Vijay Kumar">B. Vijay Kumar</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering, Indian Institute of Science</s1>
<s2>Bangalore 560012</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Bangalore 560012</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Ramakrishnan, A G" sort="Ramakrishnan, A G" uniqKey="Ramakrishnan A" first="A. G." last="Ramakrishnan">A. G. Ramakrishnan</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Electrical Engineering, Indian Institute of Science</s1>
<s2>Bangalore 560012</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Bangalore 560012</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
<imprint>
<date when="2002">2002</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Discrete cosine transforms</term>
<term>Discrete transformation</term>
<term>Haar function</term>
<term>Image processing</term>
<term>Image segmentation</term>
<term>Karnataka</term>
<term>Neural network</term>
<term>Optical character recognition</term>
<term>Pattern classification</term>
<term>Pattern recognition</term>
<term>Radial basis function</term>
<term>Wavelet transformation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Transformation ondelette</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Fonction Haar</term>
<term>Reconnaissance forme</term>
<term>Fonction base radiale</term>
<term>Segmentation image</term>
<term>Réseau neuronal</term>
<term>Transformation discrète</term>
<term>Traitement image</term>
<term>Transformation cosinus discrète</term>
<term>Classification forme</term>
<term>Karnataka</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper presents the design of a full-fledged OCR system for printed Kannada text. The machine recognition of Kannada characters is difficult due to similarity in the shapes of different characters, script complexity and non-uniqueness in the representation of diacritics. The document image is subjected to line segmentation, word segmentation and zone detection. From the zonal information, base characters, vowel modifiers and consonant conjuncts are separated. A knowledge-based approach is employed for recognising the base characters. Various features are employed for recognising the characters. These include the coefficients of the Discrete Cosine Transform, Discrete Wavelet Transform and Karhunen-Loève Transform. These features are fed to different classifiers. Structural features are used in the subsequent levels to discriminate confused characters. The use of structural features increases the recognition rate from 93% to 98%. Apart from the classical pattern classification technique of nearest neighbour, Artificial Neural Network (ANN) based classifiers like Back Propagation and Radial Basis Function (RBF) Networks have also been studied. The ANN classifiers are trained in supervised mode using the transform features. The highest recognition rate of 99% is obtained with RBF using second-level approximation coefficients of Haar wavelets as the features on presegmented base characters.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0302-9743</s0>
</fA01>
<fA05>
<s2>2423</s2>
</fA05>
<fA08 i1="01" i2="1" l="ENG">
<s1>Machine recognition of printed Kannada text</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>DAS 2002 : document analysis systems V : Princeton NJ, 19-21 August 2002</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>VIJAY KUMAR (B.)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>RAMAKRISHNAN (A. G.)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>LOPRESTI (Daniel)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>JIANYING HU</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="03" i2="1">
<s1>KASHI (Ramanujan)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Department of Electrical Engineering, Indian Institute of Science</s1>
<s2>Bangalore 560012</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA14>
<fA20>
<s1>37-48</s1>
</fA20>
<fA21>
<s1>2002</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA26 i1="01">
<s0>3-540-44068-2</s0>
</fA26>
<fA43 i1="01">
<s1>INIST</s1>
<s2>16343</s2>
<s5>354000108470940040</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2003 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>5 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>03-0248638</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Lecture notes in computer science</s0>
</fA64>
<fA66 i1="01">
<s0>DEU</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>This paper presents the design of a full-fledged OCR system for printed Kannada text. The machine recognition of Kannada characters is difficult due to similarity in the shapes of different characters, script complexity and non-uniqueness in the representation of diacritics. The document image is subjected to line segmentation, word segmentation and zone detection. From the zonal information, base characters, vowel modifiers and consonant conjuncts are separated. A knowledge-based approach is employed for recognising the base characters. Various features are employed for recognising the characters. These include the coefficients of the Discrete Cosine Transform, Discrete Wavelet Transform and Karhunen-Loève Transform. These features are fed to different classifiers. Structural features are used in the subsequent levels to discriminate confused characters. The use of structural features increases the recognition rate from 93% to 98%. Apart from the classical pattern classification technique of nearest neighbour, Artificial Neural Network (ANN) based classifiers like Back Propagation and Radial Basis Function (RBF) Networks have also been studied. The ANN classifiers are trained in supervised mode using the transform features. The highest recognition rate of 99% is obtained with RBF using second-level approximation coefficients of Haar wavelets as the features on presegmented base characters.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001D02C03</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Transformation ondelette</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Wavelet transformation</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Transformación ondita</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Reconnaissance caractère</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Character recognition</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Reconocimiento carácter</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Reconnaissance optique caractère</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Optical character recognition</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Reconocimiento óptico de caracteres</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Fonction Haar</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Haar function</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Función Haar</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Reconnaissance forme</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Pattern recognition</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Reconocimiento patrón</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Fonction base radiale</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Radial basis function</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Función radial base</s0>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="1" l="FRE">
<s0>Segmentation image</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="1" l="ENG">
<s0>Image segmentation</s0>
<s5>07</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE">
<s0>Réseau neuronal</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG">
<s0>Neural network</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA">
<s0>Red neuronal</s0>
<s5>08</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>Transformation discrète</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG">
<s0>Discrete transformation</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA">
<s0>Transformación discreta</s0>
<s5>09</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE">
<s0>Traitement image</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG">
<s0>Image processing</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA">
<s0>Procesamiento imagen</s0>
<s5>10</s5>
</fC03>
<fC03 i1="11" i2="3" l="FRE">
<s0>Transformation cosinus discrète</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="3" l="ENG">
<s0>Discrete cosine transforms</s0>
<s5>11</s5>
</fC03>
<fC03 i1="12" i2="3" l="FRE">
<s0>Classification forme</s0>
<s5>12</s5>
</fC03>
<fC03 i1="12" i2="3" l="ENG">
<s0>Pattern classification</s0>
<s5>12</s5>
</fC03>
<fC03 i1="13" i2="X" l="FRE">
<s0>Karnataka</s0>
<s2>NG</s2>
<s5>13</s5>
</fC03>
<fC03 i1="13" i2="X" l="ENG">
<s0>Karnataka</s0>
<s2>NG</s2>
<s5>13</s5>
</fC03>
<fC03 i1="13" i2="X" l="SPA">
<s0>Karnataka</s0>
<s2>NG</s2>
<s5>13</s5>
</fC03>
<fC07 i1="01" i2="X" l="FRE">
<s0>Inde</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="01" i2="X" l="ENG">
<s0>India</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="01" i2="X" l="SPA">
<s0>India</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="02" i2="X" l="FRE">
<s0>Asie</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="02" i2="X" l="ENG">
<s0>Asia</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="02" i2="X" l="SPA">
<s0>Asia</s0>
<s2>NG</s2>
</fC07>
<fN21>
<s1>160</s1>
</fN21>
<fN82>
<s1>PSI</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>IAPR workshop on document analysis systems</s1>
<s2>5</s2>
<s3>Princeton NJ USA</s3>
<s4>2002-08-19</s4>
</fA30>
</pR>
</standard>
</inist>
<affiliations>
<list>
<country>
<li>Inde</li>
</country>
</list>
<tree>
<country name="Inde">
<noRegion>
<name sortKey="Vijay Kumar, B" sort="Vijay Kumar, B" uniqKey="Vijay Kumar B" first="B." last="Vijay Kumar">B. Vijay Kumar</name>
</noRegion>
<name sortKey="Ramakrishnan, A G" sort="Ramakrishnan, A G" uniqKey="Ramakrishnan A" first="A. G." last="Ramakrishnan">A. G. Ramakrishnan</name>
</country>
</tree>
</affiliations>
</record>
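
As a rough illustration of how a record like the one above could be read programmatically, the following sketch uses Python's standard `xml.etree.ElementTree`. The dump uses the `wicri:` and `inist:` prefixes without declaring them, so the sketch wraps the record in an element that binds them to placeholder URIs; those URIs, the `wrapper` element, and the function name `parse_record` are assumptions, and the embedded record is a trimmed copy kept short for readability.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the record; only a few elements are kept for brevity.
RECORD = """<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Machine recognition of printed Kannada text</title>
<author>
<name sortKey="Vijay Kumar, B" first="B." last="Vijay Kumar">B. Vijay Kumar</name>
<affiliation wicri:level="1">
<country>Inde</country>
</affiliation>
</author>
<author>
<name sortKey="Ramakrishnan, A G" first="A. G." last="Ramakrishnan">A. G. Ramakrishnan</name>
</author>
</titleStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Haar function</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
</TEI>
</record>"""


def parse_record(record_xml):
    """Extract title, author names and English keywords from one record."""
    # Bind the undeclared prefixes to placeholder URIs (assumed values, not
    # the real namespaces) so that the parser accepts the fragment.
    wrapped = (
        '<wrapper xmlns:wicri="urn:example:wicri" '
        'xmlns:inist="urn:example:inist">' + record_xml + "</wrapper>"
    )
    root = ET.fromstring(wrapped)
    title_stmt = root.find(".//titleStmt")
    return {
        "title": title_stmt.findtext("title"),
        "authors": [n.text for n in title_stmt.findall("author/name")],
        "keywords": [
            t.text for t in root.findall(".//keywords[@scheme='KwdEn']/term")
        ],
    }


info = parse_record(RECORD)
print(info["title"])
print(info["authors"])
print(info["keywords"])
```

Restricting the author lookup to `titleStmt` avoids counting each author twice, since the full record repeats the same names under `sourceDesc`.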

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000607 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Checkpoint/biblio.hfd -nk 000607 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Checkpoint
   |type=    RBID
   |clé=     Pascal:03-0248638
   |texte=   Machine recognition of printed Kannada text
}}
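
If many such links have to be produced, the template call above could be generated from record fields. The sketch below only reproduces the text layout shown on this page; the function name `explor_lien` and its argument names are assumptions, and nothing here is part of Dilib or the Wicri tooling.

```python
def explor_lien(wiki, area, flux, etape, type_, cle, texte):
    """Render an {{Explor lien}} call with the parameter names used above."""
    fields = [
        ("wiki", wiki),
        ("area", area),
        ("flux", flux),
        ("étape", etape),
        ("type", type_),
        ("clé", cle),
        ("texte", texte),
    ]
    lines = ["{{Explor lien"]
    for name, value in fields:
        # Pad "name=" to a fixed width so the values line up as on the page.
        lines.append("   |" + (name + "=").ljust(9) + value)
    lines.append("}}")
    return "\n".join(lines)


link = explor_lien(
    wiki="Ticri/CIDE",
    area="OcrV1",
    flux="PascalFrancis",
    etape="Checkpoint",
    type_="RBID",
    cle="Pascal:03-0248638",
    texte="Machine recognition of printed Kannada text",
)
print(link)
```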


This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024